We developed a new open-source command-line tool, COGNATE (Comparative Gene Annotation Characterizer), which takes a genome assembly and its annotation of protein-coding genes and derives a detailed description of the corresponding gene and genome structure parameters. Additionally, we revised the standard definitions of gene and genome structures and provide the definitions used by COGNATE as a working draft for further reference. Complete parameter lists and summary statistics are inferred using this set of definitions to enable downstream analyses and to provide an overview of the genome and gene repertoire characteristics. COGNATE is written in Perl and freely available at the ZFMK homepage (https://www.zfmk.de/en/COGNATE) and on GitHub (https://github.com/ZFMK/COGNATE).
COGNATE allows genome assemblies and structural elements to be compared on multiple levels (e.g., scaffold or contig sequence, gene). Because all parameters are derived from a single set of definitions, results from different analyses become directly comparable. COGNATE can thus provide the needed standardization of both genome and gene structure parameter disclosure and data acquisition for future comparative analyses. With the establishment of comprehensive descriptive standards and the extensive availability of genomes, an encompassing database will become possible.