Stop codons in bacteria are not selectively equivalent
© Povolotskaya et al.; licensee BioMed Central Ltd. 2012
Received: 13 May 2012
Accepted: 22 August 2012
Published: 13 September 2012
The evolution and genomic stop codon frequencies have not been rigorously studied with the exception of coding of non-canonical amino acids. Here we study the rate of evolution and frequency distribution of stop codons in bacterial genomes.
We show that in bacteria stop codons evolve slower than synonymous sites, suggesting the action of weak negative selection. However, the frequency of stop codons relative to genomic nucleotide content indicated that this selection regime is not straightforward. The frequency of TAA and TGA stop codons is GC-content dependent, with TAA decreasing and TGA increasing with GC-content, while TAG frequency is independent of GC-content. Applying a formal, analytical model to these data we found that the relationship between stop codon frequencies and nucleotide content cannot be explained by mutational biases or selection on nucleotide content. However, with weak nucleotide content-dependent selection on TAG, -0.5 < Nes < 1.5, the model fits all of the data and recapitulates the relationship between TAG and nucleotide content. For biologically plausible rates of mutations we show that, in bacteria, TAG stop codon is universally associated with lower fitness, with TAA being the optimal for G-content < 16% while for G-content > 16% TGA has a higher fitness than TAG.
Our data indicate that TAG codon is universally suboptimal in the bacterial lineage, such that TAA is likely to be the preferred stop codon for low GC content while the TGA is the preferred stop codon for high GC content. The optimization of stop codon usage may therefore be useful in genome engineering or gene expression optimization applications.
This article was reviewed by Michail Gelfand, Arcady Mushegian and Shamil Sunyaev. For the full reviews, please go to the Reviewers’ Comments section.
Translation termination is a crucial step in protein synthesis that, in most organisms, is triggered by three stop codons; TAA, TGA and TAG. These three stop codons are thought to be functionally equivalent in the broad sense of effective translation termination. Additional functions, such as coding for extra amino acids, effects only a tiny fraction of all codons , and these stop codons can be interchanged [2, 3] or even lost [4–8] without obvious functional consequences. Indeed, one of the motivations in a recent experimental study of genome-wide codon replacement in selecting to substitute all TAG stop codons in Escherichia coli, rather than making synonymous substitutions, was the rationale that synonymous “codon utilization bias has been shown to affect translation efficiency”  suggesting that in the author’s opinion stop codon substitution may have fewer functional consequences than synonymous substitution. Thus, at present there is broad consensus that three stop codons are functionally equivalent and interchanging stop codons is not expected to have functional or selective consequences. In that case substitutions between different stop codons should be neutral, such that the rate of evolution between stop codons should be broadly equivalent to the synonymous rate of evolution and the stop codon frequency should be governed by similar selective and mutational forces that govern nucleotide usage in synonymous sites.
Stop codon evolution and frequency
There are two predictions of the synonymous usage of stop codons: stop codon evolution should occur at a rate equivalent to that of synonymous evolution and stop codon frequency should mirror that of synonymous codons, such that AT-rich genomes should show a higher frequency of TAA. The three stop codons are interchangeable through one, or two, transitions of G - > A or A - > G. Thus, when comparing the rate of evolution of the stop codons it is best to use the same transition G < − > A, which occurs between some two-fold synonymous sites: glutamine, glutamic acid and lysine. Similarly, when comparing stop codon frequency it is more appropriate to use G-content at such two-fold sites than genome-wide or four-fold synonymous GC-content.
First, we compared the rate of stop codon evolution (K stop ) to synonymous evolution in 11 pairs of bacterial genomes. We found that stop codon evolution, which involves only the G < − > A transitions, is ~1.7 times slower than the rate of synonymous changes in G < − > A two-fold sites, K GA (K stop /K GA = 0.58 ± 0.19, SD). However, the difference is not large, such that K stop is closer to K GA than K N is to K S (K n /K s = 0.09 ± 0.04, SD) indicating that evolution of stop codons is affected by the action of weak selection or mutational biases. While the observation of K stop < K AG is indicative of negative selection acting on substitutions between stop codons, it is by itself not conclusive. It is likely that some form of negative selection is acting on synonymous sites, which in some circumstances increases the rate of evolution , thus, K stop < K AG may be a consequence of negative selection on synonymous sites [16, 17] and additional data are necessary to corroborate the possibility of selection acting on stop codons.
The independence of TAG on guanine frequency at first glance has a simple explanation, that TAA and TGA stop codons are functionally equivalent while the TAG stop codon performs a different function and almost never evolves into the other two codons. However, this simple explanation for these data is readily refuted by the observation that the rate of TAG stop codon evolution is non-zero and is comparable with the rate of evolution of the other two codons (0.50 ± 0.42, 0.86 ± 0.37, 0.43 ± 0.13 for TAA, TGA and TAG, respectively, with SD), the experimental evidence that TAG can be easily changed without profound consequences  and the observation that TAG frequency is the same for all functional categories (Additional file 1: Figure S1). Thus, the lack of a response of TAG to guanine frequency cannot be explained by strong evolutionary conservation of the TAG stop codon in specific genes. Similarly, this effect does not appear to be caused by different propensities of stop codons in overlapping genes (Additional file 2: Figure S2). These data are suggestive of a nontrivial system, such that despite the apparent lack of change of TAG frequency with guanine frequency the rate of TAG codon evolution is not close to zero.
Model of stop codon evolution
We consider finesses of every allele to be different, with the selection shaping G-content of the genome and selection acting on TAG (Figure 2). We assume both selective forces s 1 and s 2 to be small (~1/N e ) and thus the term s 1 *s 2 in the expression for the fitness of TAG (1-s 1 )*(1-s 2 ) is negligible. Another feature of this model is that the rate of mutation A < − > G in the stop codons is identical to the rate of mutation A < − > G in two fold synonymous sites (Figure 2). Overall, there are no reasons why these assumptions are not expected to hold in bacterial genomes so that our model should provide a reasonable approximation of frequencies and selection, if any, of stop codons.
Thus, if there is no selectional pressure the expected frequencies of TAG and TGA are equal and, therefore, a model without any selection cannot fit our data (Figure 1).
Next, we investigated the impact of selection S 1 which shapes G-content. Three parameters, μ 1 , μ 2 and S 1 act as one effective parameter in the expressions of stop codon frequencies: from (3). Thus, selection on G-content, S 1 , affects only G-content itself and does not change the form of the relationship between G frequency and stop codon usage as is evident from expressions (4).
In order to estimate the strength of selection acting on TAG we solve the system of equations (4) for the selection coefficient S 2 :
Using a population genetics model modified to describe stop codon and guanine frequencies we demonstrated that stop codon usage can be explained when selection is acting specifically on TAG. The predicted selection regime on TAG, S 2 , has three properties: it is relatively weak, with N e s between −0.5 and 1.5, nucleotide content dependent and is positive when G-content <16% and negative when G-content is >16%. The predicted selection strength is weak, on the order of 1/N e , which is not strong enough to severely restrict the rate of evolution of stop codons. Indeed, such weak selection on individual alleles can be overpowered by genetic drift, which may result in the large variability of stop codon frequencies in our data (Figure 1B). Alternatively, the observed variability of stop codon frequencies relative to the average expectation (compare Figure 1A and 1B) may be due to slight changes in selection pressure on TAG and the rates of A < − > G mutation between different species (Figure 3A).
The G-content dependence of the selection follows from the roughly constant TAG frequency relative to G-content. Yet at this point, there are no known molecular mechanisms that may explain why TAG stop codon has different selective consequences depending on nucleotide content. One possibility, however, is the dependence of translation termination efficiency on the nucleotide context in the vicinity of the TAG stop codon. Bacteria generally code for two release factors (RF), RF1 that recognizes TAA and TAG stop codons and RF2 that recognizes TAA and TGA . Thus, the prediction of the context-dependence hypothesis is that the efficiency of RF1 is GC-context dependent while RF2 functions independent of nucleotide context. Empirical evidence may be necessary to confirm or refute this hypothesis, however, given the relatively weak nature of the selection the differences in translation termination efficiency may be too small to be easily detected in the laboratory. The possibility of the molecular mechanism involving elongation termination factors, however, is left necessarily uncertain by conflicting data from other species. Eukaryota, that have only one release factor for all three stop codons [22, 23], and chloroplast genomes that have retained orthologs of both release factors , show a clear increase of TAG frequency with higher GC-content (Additional file 3: Figures S3 and Additional file 4: Figure S4). Clearly, further experimental work is likely necessary to elucidate the molecular mechanisms behind selection on TAG stop codon in bacteria.
Within the framework of our model it is possible to compare the fitness impacts of different stop codons depending on genomic nucleotide content. Regardless of selection on G-content itself (S 1 ) the difference in fitness between TAG and TGA stop codon is defined by S 2 (Figure 2). Thus, regardless of the value of S 1 our data signify that TAG stop codon is always less fit that the TGA stop codon for G-content >16%. Comparing the relative fitness of TAG and TAA, however, involves both S1 and S2, with their sum being the difference in relative fitness of these two stop codons (Figure 2). Within the model, G-content depends on relative rates of mutation A < − > G and S 1 , and we cannot disentangle the contribution of mutation (and ) versus selection (S 1 ) so we cannot analytically estimate the value of S 1 . However, we can define the range of values of these parameters for specific G-content.
Is there any evidence that G > A can be five or ten times faster than A > G mutations? Mutational biases against GC-content that have been measured were shown to be always less than tenfold in favor of AT-content and less than fivefold for 151 out of a total of 154 species considered in two separate studies (25, 26). Similarly, weak selection acting on GC-content has been postulated by several researchers (25,26). Given this evidence it is unlikely that the observed GC-content can be explained solely by G < − > A mutational biases and, therefore, S 1 is positive and > > 0.5 for G-content ≈ 5% and >0 for G-content ≈ 16%. Thus, for G-content < 16% the TAG stop codon is expected to be less fit than the TAA stop codon.
The relative fitness of TAG to TAA and TGA stop codons can thus be described as follows. When G-content is >16% TAG has lower fitness than TGA. As long as S 1 > −S 2 for G-content <16% then TAG has lower fitness than TAA in bacterial genomes with G-content < 16%. Because S 1 > −S 2 is expected to hold for G-content < 16% given the mutation parameters observed in nature [25, 26] it follows that TAG is a striking example of a global suboptimal codon, such that the substitution of TAG into either TAA or TGA for any bacterial species would lead to an increase of fitness. The use of suboptimal synonymous codons in bacteria is a well-documented phenomena, however, the exact codons that are suboptimal differ substantially between different species (see  for review). To our knowledge, the observation that one codon with synonymous function to other codons is always worse in such a large group of organisms, bacteria, is the first example of a global sub-optimality of the genetic table. The sub-optimal organization of the genetic table revealed here provides a striking counterexample to the remarkable optimization of the genetic code with respect to error minimization [28–30].
All available complete bacterial genomes were downloaded from NCBI website and 736 of those that utilize the standard genetic code were used for the analysis (See Additional file 5). Plasmid sequences were excluded. All available pairs of closely related genomes from the ATGC database , of which there were 11 pairs with 0.03 < K S < 0.22 were used to measure A < − > G synonymous transition rates (K AG ) and rates of stop codon evolution (K stop ). Orthologues were constructed using two-directional best BLAST  hit approach and aligned using MUSCLE . To obtain K AG we looked at the number of synonymous differences between three pairs of codons: CAA and CAG, AAA and AAG and GAA and GAG. The expected number of substitutions occurred was estimated using Jukes-Cantor model . The same method was applied to estimate the number of substitutions between stop codons with the only difference that the number of synonymous sites for TAA codon is twice as high as the number of synonymous sites for TAG and TGA codons. In order to obtain rates of TAG codon evolution the substitutions have to be polarized and for that the third organism was added to the 11 pairs of the genomes such that the synonymous distance between sister species 0.02 < K S < 0.15 and between sister species and outgroup 0.04 < K S < 0.62. Substitutions were polarized using simple parsimony approach.
To show that a distribution of selection coefficients for the same stop codon across different genes can only increase the differences between average selection coefficients of stop codons we proved the following conjecture. A given frequency of TAG codon in the genome can be explained by an equal strength of selection acting on all TAG codons in the genome () or a distribution of selection coefficients across different codons with an expected value of the distribution (). For any given observed frequency of the TAG codon in the genome , such that the average strength of selection in a distribution is larger when different codons are under different selection pressures. We consider the case where selection on each TAG stop codon is a discrete random variable which assumes the value S i with the probability p i . In this case we use S i as discrete values of a distribution of selection coefficients on TAG stop codons in different genes in the same genome, while S 1 and S 2 were used as fixed values of the selection coefficients for all genes across a single genome. In this case for any selection S i the expected number of the sites under this selection is N i = p i *N stop , the frequency of TAG is and the number of TAG stop codon is . The observed frequency of TAG in the genome is and the value of selection S 0 acting on TAG sites is estimated from the formula . Taking into account that the second derivative of f, , if S ≥ 1nfG, the Jensen’s inequality holds, or and . The only condition for this inequality to hold is S ≥ 1nfG, which is a reasonable assumption taking into account the fact that out of 736 genomes analyzed S0 ≥ 1nfG for 734 (Additional file 6: Figure S5).
Reviewer 1: Dr Mikhail Gelfand, Institute for Information Transmission Problems, RAS, Bolshoi Karetny per. 19, Moscow 127994, Russia and Faculty of Bioengineering and Bioinformatics, Moscow State University, Vorobievy Gory 1-73, Moscow 119992, Russia.firstname.lastname@example.org
The authors present a model explaining the following observation: while the use of the UGA stop codon depends on G-content, the UAG frequency is almost constant in genomes with highly diverse G-content. While I see no problems with the observations and the model, I have some editorial comments and questions.
The authors state several times – starting with the very first sentence of the abstract – that the usage of stop codons has not been rigorously studied. This is not correct. In the 90’s, several papers considered the usage of stop codons and its dependence on the local context, including tandem stops and tetranucleotides involving stop-codons. I think these papers should be mentioned.
Author response: Indeed, the term “usage” in this context is not very precise. We acknowledge that there have been studies of stop codon usage in the local context, that is to say that some stop codons have a preferred local context, however, in this manuscript we discuss only the evolution and genomic frequencies of the three different stop codons, which to our knowledge has not been rigorously considered previously. We cite some of the relevant literature and use the word “frequency” which we believe is not as ambiguous as “usage” in this context.
How the 11 studied genome pairs were selected?
Author response: We selected all genome triplets with 0.03 < KS < 0.22 that were available in the ATGC database. We now report this in the Methods section.
Is the G/A content the same in the 3rd codon position in all codon pairs? If not, why this is a good parameter?
Author response: There are three pairs of two-fold degenerated codon families: AAG/A, GAG/A, CAG/A. G-content at the third position of every pair is indeed highly correlated with overall G-content (see the figure below).
Dependency between G content in the third position of two-fold degenerated codon families and overall G content for AAG/A (blue), GAG/A (red), CAG/A (green).
And in any case, what are the reasons to suspect that the selection regime in the amino-acid-encoding codons is the same as in the stops (the former may depend on concentrations of tRNAs and the codon-anticodon interactions; the latter, on interactions with the release factors). What about the A/G choice in the four-fold codon families?
Author response: Indeed, we have created the model based on this assumption because it allowed us to reduce the number of parameters and make the system of equations solvable. However, we can also show that this assumption does not affect our main result that the TAG codon is selectively disadvantageous. Specifically, from system of equations (4) it follows that exp(S 2 ) = f TGA /f TAG . Thus, we can solve for the selective impact of TAG (S 2 ) solely based on the frequencies of TAG and TGA without making the assumption that the selective regime is the same in stop and amino acid codons. Since S2 is positive for almost the entire range of G content it follows that the TAG codon provides a selective disadvantage relative to the TGA codon. Unfortunately, we cannot estimate S 2 by comparing the frequencies of TAG and TAA codons because we cannot independently estimate the component of fTAA from (4). We now present the new estimate of S 2 in Figure 5 and the main text.
The reasoning in page 6 is not clearly presented, and misprints add to the confusion. How is formula S2 = ln ((fG(1-fTAG))/fTAG) used? Do I understand it correctly that the next formula S2 = ln (3.6fG + 0.4) results from a fit to observations (comparison of genome pairs)? – I think, this should be explained more explicitly.
Author response: Yes, this is what we mean, and we rewrote this section to hopefully make this clearer.
By the way, the two formulas for S2, theoretical and observed ones, yield a dependence between fG and fTAG – does it hold?
Author response: Yes, there is a slight dependence as can be seen from Figure 1.
Finally, reference to equation ( 5 ) in the preceding paragraph should be about equation ( 4 ), and the sentence “S2 has a clear G-content dependence is well approximated…” probably should be “S2 has a clear G-content dependence that is well approximated…” .
Author response: If the referee means this sentence “Thus, selection on G-content,, affects only G-content itself and does not change the form of the relationship between G frequency and stop codon usage as is evident from expressions (4).” then we mean that in the system of equations (4) G-content (f(taa,tga,tag) does not depend on S1. The other typo is corrected.
Polarization of substitutions using parsimony may be dangerous if there is selection towards a specific, preferred nucleotide: in some cases two parallel nonpreferred-to-preferred substitutions may occur, and they will be interpreted as a single preferred-to-nonpreferred substitution, hence skewing the substitution statistics.
Author response: This is true, however, these data has been obtained for a number of species with different GC-content and low sequence divergence. Therefore, we believe that it is unlikely that the use of parsimony have produced a systematic error of substantial effect that jeopardizes our conclusions.
Reviewer 2: Dr. Arcady Mushegian, Stowers Institute for Medical Research, Kansas City, Missouri, United States of America and Department of Microbiology, Kansas University Medical Center, Kansas City, Kansas, United States of America.email@example.com
The manuscript by Povolotskaya et al. puts forward a simple model of nucleotide substitutions in the stop codons in bacteria, and tests it against the genome-wide data. One of the main conclusions is that TAG may be globally suboptimal, with each of the remaining two codons turning out more fit under different values of GC content.
One biological explanation of these data may be in the phenomenon of overlapping ORFs in bacterial operons. TAG is the only codon that does not accommodate a minimal overlap, whereas TAA can give one kind of stop-start codon overlap (TAATG) and TGA even two kinds (ATGA and TGATG). Perhaps if the authors restricted their sample to the termination codons in the last (or only) genes in operons, they would see much less difference between fitness of those two and TAG?
Author response: The idea that the observed pattern of stop codon frequency in bacterial genomes is explained by gene overlap has occurred to us as well. However, we observe the same relationship between G-content and stop codon frequency in overlapping and non-overlapping genes. We now report these data in a new figure that is Additional file 2 Figure S2 in the new version of the manuscript. We have considered only tail-to-tail overlaps due to a much higher certainty of stop codon annotation compared to the uncertainty in the annotation of many start codons.
Reviewer 3: Dr. Shamil Sunyaev, Dr. Shamil Sunyaev, Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, 77 Ave. Louis Pasteur, Boston MA 02115, USA. firstname.lastname@example.org
This manuscript presents an analysis of stop codon usage in bacterial species.
The authors report that TAG codon is un-preferred in most bacterial species and that its frequency does not depend on GC content. They suggest presence of weak selection against TAG codon due to unknown mechanism. One potential mechanism may involve dependency of efficiency of one of the release factors on GC content. I find the results of great interest. I only have two minor technical comments.
1) The analysis is based on Bulmer equations, which hold only if evolution is mutation limited. It would be great to briefly discuss applicability of this model to a wide variety of bacterial species.
Author response: Bulmer’s model assumes that the fate of a new mutation is decided independently of other mutations, that is to say that generally only one mutation is segregating in the population at the same time. This is certainly true if we consider only mutations in stop codons. In most bacterial genomes there are 2–5 thousand protein coding genes making it rather unlikely that more than one stop codon polymorphism is segregating at the same time.
2) Approximation of selection coefficient against TAG codon as a sum of contributions due to selection against GC content (S1) and selection against this specific codon (S2) ignores the S1*S2 term. It is OK if both selective forces are assumed to be small. It would be great if this assumption would be spelled out.
Author response: The referee is absolutely correct, we assume that both of the selective forces are small. We have added an explicit statement to this effect in the text.
We thank Elena Alkalaeva and Peter Kolosov for insightful discussion and Brian Charlesworth for a critical reading of our manuscript. The work has been supported by a Plan Nacional grant from the Spanish Ministry of Science and Innovation, EMBO Young Investigator and Howard Hughes Medical Institute International Early Career Scientist awards.
- Lobanov AV, Turanov AA, Hatfield DL, Gladyshev VN: Dual functions of codons in the genetic code. Crit Rev Biochem Mol Biol. 2010, 45: 257-265. 10.3109/10409231003786094.PubMedPubMed CentralView ArticleGoogle Scholar
- Vakhrusheva AA, Kazanov MD, Mironov AA, Bazykin GA: Evolution of prokaryotic genes by shift of stop codons. J Mol Evol. 2011, 72: 138-146. 10.1007/s00239-010-9408-1.PubMedView ArticleGoogle Scholar
- Isaacs FJ, et al: Precise manipulation of chromosomes in vivo enables genome-wide codon replacement. Science. 2011, 333: 348-353. 10.1126/science.1205822.PubMedView ArticleGoogle Scholar
- Barrell BG, Bankier AT, Drouin J: A different genetic code in human mitochondria. Nature. 1979, 282: 189-194. 10.1038/282189a0.PubMedView ArticleGoogle Scholar
- Yamao F, et al: UGA is read as tryptophan in Mycoplasma capricolum. Proc Natl Acad Sci USA. 1985, 82: 2306-2309. 10.1073/pnas.82.8.2306.PubMedPubMed CentralView ArticleGoogle Scholar
- Eisen JA, et al: Macronuclear genome sequence of the ciliate Tetrahymena thermophila, a model eukaryote. PLoS Biol. 2006, 4: e286-10.1371/journal.pbio.0040286.PubMedPubMed CentralView ArticleGoogle Scholar
- Aury JM, et al: Global trends of whole-genome duplications revealed by the ciliate Paramecium tetraurelia. Nature. 2006, 444: 171-178. 10.1038/nature05230.PubMedView ArticleGoogle Scholar
- Turanov AA, et al: Genetic code supports targeted insertion of two amino acids by one codon. Science. 2009, 323: 259-261. 10.1126/science.1164748.PubMedPubMed CentralView ArticleGoogle Scholar
- Poole ES, Brown CM, Tate WP: The identity of the base following the stop codon determines the efficiency of in vivo translational termination in Escherichia coli. EMBO J. 1995, 14: 151-158.PubMedPubMed CentralGoogle Scholar
- Tate WP, et al: The translational stop signal: codon with a context, or extended factor recognition element?. Biochimie. 1996, 78: 945-952. 10.1016/S0300-9084(97)86716-8.PubMedView ArticleGoogle Scholar
- Pavlov MY, et al: A direct estimation of the context effect on the efficiency of termination. J Mol Biol. 1998, 284: 579-590. 10.1006/jmbi.1998.2220.PubMedView ArticleGoogle Scholar
- Namy O, Hatin I, Rousset JP: Impact of the six nucleotides downstream of the stop codon on translation termination. EMBO Rep. 2001, 2: 787-793. 10.1093/embo-reports/kve176.PubMedPubMed CentralView ArticleGoogle Scholar
- Cridge AG, et al: Comparison of characteristics and function of translation termination signals between and within prokaryotic and eukaryotic organisms. Nucleic Acids Res. 2006, 34: 1959-1973. 10.1093/nar/gkl074.PubMedPubMed CentralView ArticleGoogle Scholar
- Wong TY, et al: Role of premature stop codons in bacterial evolution. J Bacteriol. 2008, 190: 6718-6725. 10.1128/JB.00682-08.PubMedPubMed CentralView ArticleGoogle Scholar
- Bulmer M: The selection-mutation-drift theory of synonymous codon usage. Genetics. 1991, 129: 897-907.PubMedPubMed CentralGoogle Scholar
- McVean GAT, Charlesworth B: A population genetic model for the evolution of synonymous codon usage: patterns and predictions. Genet Res. 1999, 74: 145-158. 10.1017/S0016672399003912.View ArticleGoogle Scholar
- Kondrashov FA, Ogurtsov AY, Kondrashov AS: Selection in favor of nucleotides G and C diversifies evolution rates and levels of polymorphism at mammalian synonymous sites. J Theor Biol. 2006, 240: 616-626. 10.1016/j.jtbi.2005.10.020.PubMedView ArticleGoogle Scholar
- Cutler RW, Chantawannakul P: Synonymous codon usage bias dependent on local nucleotide context in the class Deinococci. J Mol Evol. 2008, 67: 301-314. 10.1007/s00239-008-9152-y.PubMedView ArticleGoogle Scholar
- Kondrashov FA, Kondrashov AS: Measurements of spontaneous rates of mutations in the recent past and the near future. Philos Trans R Soc Lond B Biol Sci. 2010, 365: 1169-1176. 10.1098/rstb.2009.0286.PubMedPubMed CentralView ArticleGoogle Scholar
- Sharp PM, Bulmer M: Selective differences among translation termination codons. Gene. 1988, 63: 141-145. 10.1016/0378-1119(88)90553-7.PubMedView ArticleGoogle Scholar
- Scolnick E, Tompkins R, Caskey T, Nirenberg M: Release factors differing in specificity for terminator codons. Proc Natl Acad Sci USA. 1968, 61: 768-774. 10.1073/pnas.61.2.768.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhouravleva G, et al: Termination of translation in eukaryotes is governed by two interacting polypeptide chain release factors, eRF1 and eRF3. EMBO J. 1995, 14: 4065-4072.PubMedPubMed CentralGoogle Scholar
- Dontsova M, et al: Translation termination factor aRF1 from the archaeon Methanococcus jannaschii is active with eukaryotic ribosomes. FEBS Lett. 2000, 472: 213-216. 10.1016/S0014-5793(00)01466-6.PubMedView ArticleGoogle Scholar
- Manuell A, Beligni MV, Yamaguchi K, Mayfield SP: Regulation of chloroplast translation: interactions of RNA elements, RNA-binding proteins and the plastid ribosome. Biochem Soc Trans. 2004, 32: 601-605. 10.1042/BST0320601.PubMedView ArticleGoogle Scholar
- Hershberg R, Petrov DA: Evidence that mutation is universally biased towards AT in bacteria. PLoS Genet. 2010, 6: e1001115-10.1371/journal.pgen.1001115.PubMedPubMed CentralView ArticleGoogle Scholar
- Hildebrand F, Meyer A, Eyre-Walker A: Evidence of selection upon genomic GC-content in bacteria. PLoS Genet. 2010, 6: e1001107-10.1371/journal.pgen.1001107.PubMedPubMed CentralView ArticleGoogle Scholar
- Plotkin JB, Kudla G: Synonymous but not the same: the causes and consequences of codon bias. Nat Rev Genet. 2011, 12: 32-42. 10.1038/nrg2899.PubMedPubMed CentralView ArticleGoogle Scholar
- Freeland SJ, Hurst LD: The genetic code is one in a million. J Mol Evol. 1998, 47: 238-248. 10.1007/PL00006381.PubMedView ArticleGoogle Scholar
- Jestin JL, Kempf A: Optimization models and the structure of the genetic code. J Mol Evol. 2009, 69: 452-457. 10.1007/s00239-009-9287-5.PubMedView ArticleGoogle Scholar
- Novozhilov AS, Koonin EV: Exceptional error minimization in putative primordial genetic codes. Biol Direct. 2009, 4: 44-10.1186/1745-6150-4-44.PubMedPubMed CentralView ArticleGoogle Scholar
- Novichkov PS, Ratnere I, Wolf YI, Koonin EV, Dubchak I: ATGC: a database of orthologous genes from closely related prokaryotic genomes and a research platform for microevolution of prokaryotes. Nucleic Acids Res. 2009, 37: D448-D454. 10.1093/nar/gkn684.PubMedPubMed CentralView ArticleGoogle Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410.PubMedView ArticleGoogle Scholar
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.PubMedPubMed CentralView ArticleGoogle Scholar
- Jukes TH, Cantor CR: Evolution of Protein Molecules. 1969, New York: Academic, 21-132.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.