Figure 2 | Biology Direct

Figure 2

From: Thousands of missed genes found in bacterial genomes and their analysis with COMBREX

Assignment of COMBREX support levels to the hypothetical/named missed genes using ComBlast. Each missed gene is assigned a COMBREX support level based on sequence homology and on its assignment to gene clusters in COMBREX. A missed gene has the strong COMBREX support level for being a true protein-coding gene if it is conserved or associated with at least one of the following: an experimentally validated function, a known 3D structure, a purified protein, a protein domain, or an EC number. It has the fair COMBREX support level if it has a sufficient number of homologs or is associated with a predicted function. The remaining named missed genes, which were confirmed by sequence homology to at least one gene with a non-hypothetical protein annotation, have the weak COMBREX support level. The remaining hypothetical genes have insufficient evidence and are therefore not counted in the statistics of missed genes in this paper. See the text for a more detailed description of the different levels.
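
The tiers described in this caption can be read as an ordered decision rule, sketched below in Python. This is only an illustration of the caption's logic, not the ComBlast implementation: the class, field, and function names are hypothetical, the five strong-evidence flags are a simplification of "conserved or associated with", and the homolog threshold is left as a required parameter because the caption does not state what counts as "a sufficient number".

```python
from dataclasses import dataclass
from enum import Enum


class SupportLevel(Enum):
    STRONG = "strong"
    FAIR = "fair"
    WEAK = "weak"
    INSUFFICIENT = "insufficient"


@dataclass
class MissedGeneEvidence:
    # Hypothetical field names mirroring the evidence types named in the caption.
    has_validated_function: bool = False
    has_3d_structure: bool = False
    has_purified_protein: bool = False
    has_protein_domain: bool = False
    has_ec_number: bool = False
    homolog_count: int = 0
    has_predicted_function: bool = False
    matches_named_gene: bool = False  # homolog with non-hypothetical annotation


def assign_support_level(ev: MissedGeneEvidence, *, min_homologs: int) -> SupportLevel:
    """Return the first (strongest) tier whose condition is met.

    min_homologs is an assumption supplied by the caller; the caption only
    says "a sufficient number of homologs".
    """
    if any((ev.has_validated_function, ev.has_3d_structure,
            ev.has_purified_protein, ev.has_protein_domain, ev.has_ec_number)):
        return SupportLevel.STRONG
    if ev.homolog_count >= min_homologs or ev.has_predicted_function:
        return SupportLevel.FAIR
    if ev.matches_named_gene:
        return SupportLevel.WEAK
    return SupportLevel.INSUFFICIENT


def is_counted(level: SupportLevel) -> bool:
    # Genes with insufficient evidence are excluded from the missed-gene statistics.
    return level is not SupportLevel.INSUFFICIENT
```

Checking the tiers from strongest to weakest means a gene that satisfies several conditions receives the highest applicable level, which matches the caption's use of "the other" and "the rest" for the lower tiers.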