Skip to main content

Table 1 Characteristics of the vertebrate nuclear regions that exhibit the greatest level of homology with the gau region

From: Probable presence of an ubiquitous cryptic mitochondrial gene on the antisense strand of the cytochrome oxidase I gene

Vertebrate species

Chrom. number

Chromosome position of the gauORF (101 codons)

Sequence of the possible start codon

Positions of the stop codons

% identities of nuclear nucleic acid sequences/mt-gauregion Number of indel(s)/mt-gauregion

% identities of nuclear nucleic acid sequence/mt-cox1gene Number of indel(s)/mt-cox1gene

Transcriptional data of the gaunuclear region according to Ensembl.org

Human Homo sapiens

1

557002-556700

ATG

94 (TAG) 101 (TAG)

97.0%

no indel

98.6%

1 indel

In the cDNA (Acc. n° ENST00000391564) of an unprocessed pseudogene (no protein product).

Orientation of gau vs this transcript: -

Human Homo sapiens

14

32023072-32023374

ATA

94 (TAG) 101 (TAG)

90.8% no indel

67.2%

29 indels

In the intron of the cDNA (Acc. n° ENST00000280979) encoding the Protein Kinase A-anchoring protein 6) (PRKA6).

Orientation vs transcript: +

Chimpanzee Pan troglodytes

2a

51807478-51807780

ATA

52(TGA) 94(TAG) 101 (TAA)

82.5% no indel

62.7%

30 indels

In the intron of 5 cDNAs of the same gene (Acc. n° ENSPTRT00000012096, ENSPTRT00000058923, ENSPTRT00000058924, ENSPTRT00000064068, ENSPTRT00000066428, ENSPTRT00000067456) encoding unknown protein.

Orientation of gau vs these transcripts: -

Chimpanzee Pan troglodytes

8

47845028-47844726

ATA

92(TGA) 94(TAA) 101 (TAG)

88.8%

no indel

80.9%

6 indels

Non transcripted region

Orangutan Pongo pygmaeus

2a

60553759-60553457

ATA

101 (TAA)

83.5%

no indel

64.9%

37 indels

In the intron of the cDNA (Acc. n° ENSPPYT00000014418) encoding the Neurexin-1-alpha Precursor (Neurexin I-alpha).

Orientation of gau vs this transcript: -

Macaque Macaca mulatta

1

108934996-108934700

ATA

no

95.2%

no indel

95.2% no indel

(partial sequence)

Non transcripted region

Macaque Macaca mulatta

2

12317948-123179188

ATA

no

96.4%

77.5% 19 indels (partial sequence)

Non transcripted region

Macaque Macaca mulatta

6(a)

30942119-30941823

ATA

no

95.7% no indel

68.9% indels (partial sequence)

Non transcripted region

Macaque Macaca mulatta

6(b)

50452034-50451738

ATA

no

97.0%

67.3% 31 indels (partial sequence)

In the intron of two cDNAs of the same gene (Acc. n° ENSMMUT00000002475, ENSMMUT00000002476) encoding the Integrin alpha-1 Precursor.

Orientation of gau vs these transcripts: -

Horse Equus caballus

27

5204837-5205139

ATT

50 (TGA)

87.8% no indel

88.9% no indel

Non transcripted region

Dog Canis familiaris

16

9457729-9458031

ATG

no

83.5% no indel

74.7% 15 indels (partial sequence)

Non transcripted region

Cow Bos taurus

10

4584422-4584126

ATA

no

88.1%

87.6% 4 indels

In the intron of a cDNA (Acc. n° ENSBTAT00000020753) encoding an unknown protein.

Orientation of gau vs this transcript: +

Mouse Mus musculus

2

22444482-22444784

ATT

50(TGA) 79(TGA) 101 (TAG)

97.7% no indel

97.6% no indel

In the intron of the cDNA (Acc. n° ENSMUST00000044749) encoding the Myosin IIIA.

Orientation of gau vs this transcript: +

  1. Sequences extracted from ensemble.org. The deduced amino acid sequences are shown in Figure 2.