Stop codons in bacteria are not selectively equivalent

Background The evolution and genomic stop codon frequencies have not been rigorously studied with the exception of coding of non-canonical amino acids. Here we study the rate of evolution and frequency distribution of stop codons in bacterial genomes. Results We show that in bacteria stop codons evolve slower than synonymous sites, suggesting the action of weak negative selection. However, the frequency of stop codons relative to genomic nucleotide content indicated that this selection regime is not straightforward. The frequency of TAA and TGA stop codons is GC-content dependent, with TAA decreasing and TGA increasing with GC-content, while TAG frequency is independent of GC-content. Applying a formal, analytical model to these data we found that the relationship between stop codon frequencies and nucleotide content cannot be explained by mutational biases or selection on nucleotide content. However, with weak nucleotide content-dependent selection on TAG, -0.5 < Nes < 1.5, the model fits all of the data and recapitulates the relationship between TAG and nucleotide content. For biologically plausible rates of mutations we show that, in bacteria, TAG stop codon is universally associated with lower fitness, with TAA being the optimal for G-content < 16% while for G-content > 16% TGA has a higher fitness than TAG. Conclusions Our data indicate that TAG codon is universally suboptimal in the bacterial lineage, such that TAA is likely to be the preferred stop codon for low GC content while the TGA is the preferred stop codon for high GC content. The optimization of stop codon usage may therefore be useful in genome engineering or gene expression optimization applications. Reviewers This article was reviewed by Michail Gelfand, Arcady Mushegian and Shamil Sunyaev. For the full reviews, please go to the Reviewers’ Comments section.


Background
Translation termination is a crucial step in protein synthesis that, in most organisms, is triggered by three stop codons; TAA, TGA and TAG. These three stop codons are thought to be functionally equivalent in the broad sense of effective translation termination. Additional functions, such as coding for extra amino acids, effects only a tiny fraction of all codons [1], and these stop codons can be interchanged [2,3] or even lost [4][5][6][7][8] without obvious functional consequences. Indeed, one of the motivations in a recent experimental study of genomewide codon replacement in selecting to substitute all TAG stop codons in Escherichia coli, rather than making synonymous substitutions, was the rationale that synonymous "codon utilization bias has been shown to affect translation efficiency" [3] suggesting that in the author's opinion stop codon substitution may have fewer functional consequences than synonymous substitution. Thus, at present there is broad consensus that three stop codons are functionally equivalent and interchanging stop codons is not expected to have functional or selective consequences. In that case substitutions between different stop codons should be neutral, such that the rate of evolution between stop codons should be broadly equivalent to the synonymous rate of evolution and the stop codon frequency should be governed by similar selective and mutational forces that govern nucleotide usage in synonymous sites.
The hypothesis of selective equivalence of stop codons has not been rigorously tested and, contrary to the general expectation, there are data that suggest that stop codon may not be entirely synonymous. Firstly, translation termination efficiency may be nucleotide context dependent [9][10][11][12][13]. Second, TAG and TGA stop codon frequencies in bacterial genomes with different GC-contents are strikingly different (see Figure 1 in [14]), such that TGA frequency increases with genomic GC-content while TAG is GC-content independent. Here, we study stop codon frequency and evolution in bacterial genomes to gain an understanding of whether or not stop codons are used indiscriminately without any fitness costs. We compare rates of stop codon evolution to the rate of synonymous evolution and apply a simple population genetics model formulated by Bulmer [15] to stop codon frequency and nucleotide content in bacterial genomes.

Stop codon evolution and frequency
There are two predictions of the synonymous usage of stop codons: stop codon evolution should occur at a rate equivalent to that of synonymous evolution and stop codon frequency should mirror that of synonymous codons, such that AT-rich genomes should show a higher frequency of TAA. The three stop codons are interchangeable through one, or two, transitions of G -> A or A -> G. Thus, when comparing the rate of evolution of the stop codons it is best to use the same transition G < − > A, which occurs between some two-fold synonymous sites: glutamine, glutamic acid and lysine. Similarly, when comparing stop codon frequency it is more appropriate to use G-content at such two-fold sites than genome-wide or four-fold synonymous GC-content.
First, we compared the rate of stop codon evolution (K stop ) to synonymous evolution in 11 pairs of bacterial genomes. We found that stop codon evolution, which involves only the G < − > A transitions, is~1.7 times slower than the rate of synonymous changes in G < − > A two-fold sites, K GA (K stop /K GA = 0.58 ± 0.19, SD). However, the difference is not large, such that K stop is closer to K GA than K N is to K S (K n /K s = 0.09 ± 0.04, SD) indicating that evolution of stop codons is affected by the action of weak selection or mutational biases. While the observation of K stop < K AG is indicative of negative selection acting on substitutions between stop codons, it is by itself not conclusive. It is likely that some form of negative selection is acting on synonymous sites, which in some circumstances increases the rate of evolution [15], thus, K stop < K AG may be a consequence of negative selection on synonymous sites [16,17] and additional data are necessary to corroborate the possibility of selection acting on stop codons.
Second, we considered the dependence of the stop codon frequency on guanine content in G/A two-fold degenerate sites of 736 bacterial genomes (Figure 1), following the results of Wong and colleagues (Figure 1  from 14). The lack of a clear correlation between TAG usage and frequency of guanine is particularly striking in comparison to the expected behavior of TGA stop codon. Moreover, it is apparent that TAG stop codon is rarely very frequent in the genomes with the average expected frequency of around 20%, although this frequency is slightly lower in very A-rich genomes ( Figure 1B).
The independence of TAG on guanine frequency at first glance has a simple explanation, that TAA and TGA stop codons are functionally equivalent while the TAG stop codon performs a different function and almost never evolves into the other two codons. However, this simple explanation for these data is readily refuted by the observation that the rate of TAG stop codon evolution is non-zero and is comparable with the rate of evolution of the other two codons (0.50 ± 0.42, 0.86 ± 0.37, 0.43 ± 0.13 for TAA, TGA and TAG, respectively, with SD), the experimental evidence that TAG can be easily changed without profound consequences [3] and the observation that TAG frequency is the same for all functional categories (Additional file 1: Figure S1). Thus, the lack of a response of TAG to guanine frequency cannot be explained by strong evolutionary conservation of the TAG stop codon in specific genes. Similarly, this effect does not appear to be caused by different propensities of stop codons in overlapping genes (Additional file 2: Figure S2). These data are suggestive of a nontrivial system, such that despite the apparent lack of change of TAG frequency with guanine frequency the rate of TAG codon evolution is not close to zero.

Model of stop codon evolution
To understand the possible causes of the apparent paradox that in bacteria all three stop codons show substantial rates of evolution while the frequency of the TAG stop codons remains at~20% independently of the nucleotide content we developed a simple formal model of stop codon and guanine genomic frequency. We applied a model developed by Bulmer [15] for synonymous codon usage and solved it explicitly for the genomic frequency of stop codons, with rates of mutation between them and selection for each stop codon as parameters in the model. This theoretical framework assumes that substitutions are rare and two substitutions rarely segregate at the same time, which fits well to stop codon evolution given the relative rarity of stop codons in bacterial genomes. In this model we use guanine frequency in two fold synonymous sites instead of GC-content of the genome as an independent variable. Two rates of mutation G-> A and A-> G are an explicit part of the model of stop codon evolution ( Figure 2) and exactly the same mutations are found in G < − > A two-fold synonymous sites, making it possible to model G-content while using the same mutation rates.
Consider stochastic mutation-selection-drift equilibrium at a locus with three alleles: TAA, TAG and TGA. Assuming for any mutational rate μ, μN e ≪ 1, where N e is the effective population size, a specific certain allele is fixed most of the time and the frequency of this allele f is the fraction of time when it is fixed. The flux of switches to an allele with small selective advantage (s) is φ ¼ μS 1Àe ÀS f (eq. 7 from [15]), where S ¼ 2N e s . Equilibrium allele frequencies can be obtained by solving the following system of linear equations: We consider finesses of every allele to be different, with the selection S 1 shaping G-content of the genome and selection S 2 acting on TAG ( Figure 2). We assume both selective forces s 1 and s 2 to be small (~1/N e ) and thus the term s 1 *s 2 in the expression for the fitness of TAG (1-s 1 )*(1-s 2 ) is negligible. Another feature of this model is that the rate of mutation A < − > G in the stop codons is identical to the rate of mutation A < − > G in two fold synonymous sites ( Figure 2). Overall, there are no reasons why these assumptions are not expected to hold in bacterial genomes so that our model should provide a reasonable approximation of frequencies and selection, if any, of stop codons.
Within the framework of this model we can infer the fluxes between stop codons as: and system (1) can be solved analytically: Since S 1 is the selection on G content and mutational rates A < − > G in stop codons are the same as the mutational rates A < − > G in two fold synonymous sites, the equilibrium frequency of G in two fold synonymous sites is the solution of the following system: where The expressions for the frequencies of stop codons could thus be rewritten as: (See figure on previous page.) Figure 1 The relationship between stop codon frequencies in 736 bacterial genomes and G content in GA-type twofold synonymous codons. The pattern is shown for all genomes (A) and as an average measure for bins of 10% of G-content with SD (B).

Figure 2
The model, with rates of mutation and selection coefficients on nucleotide content and stop codons.
Next, we investigate the behavior of this model by starting with its simplest possible modification. Such modification is done by setting parameters to the value of zero, which allows us to trace the impact of each parameter. First, we investigated the model without any selection where all stop codon confer equal fitness and G frequency is determined solely by mutational pressure (S 1 = S 2 = 0). In this case Thus, if there is no selectional pressure the expected frequencies of TAG and TGA are equal and, therefore, a model without any selection cannot fit our data ( Figure 1).
Next, we investigated the impact of selection S 1 which shapes G-content. Three parameters, μ 1 , μ 2 and S 1 act as one effective parameter in the expressions of stop codon frequencies: μ 2 (3). Thus, selection on G-content, S 1 , affects only G-content itself and does not change the form of the relationship between G frequency and stop codon usage as is evident from expressions (4).
In order to estimate the strength of selection acting on TAG we solve the system of equations (4) for the selection coefficient S 2 : Now we can estimate the value of S 2 based on the observed frequencies of TAG, solving the equation (5) for the selection coefficient: Both G-content f G and frequency of TAG f TAG are measured directly and for every genome we calculate the predicted value S 2 using expression (5) ( Figure 3A). S 2 has a clear G-content dependence, which can be approximated by S 2 % ln 3:6f G þ 0:4 ð Þ(5). The predicted value of S 2 changes between −0.5 for A-rich and 1.5 for G-rich genomes, respectively ( Figure 3B). Using this approximation of S 2 we obtain the following expressions for stop codon frequencies: These expressions recapitulate the relationship between the frequency of TAG and nucleotide content ( Figure 4). Thus, the observed frequencies of stop codons in bacterial genomes can be explained only if stop codon are not selectively equivalent, with weak negative selection acting on TAG codon for G-content >16% and weakly positive selection for these two codon when G-content <16%.
In our model we assumed the same selection regime shaping G-content in coding regions and in stop codons. However, the selective disadvantage of S 2 holds when this assumption is removed from the model. Specifically, as could be seen from (4), exp(S 2 ) = f TGA /f TAG , such that we can solve for S2 only based on the comparison of (See figure on previous page.) Figure 3 The estimated selection coefficients on TAG, S2, for individual genome measurements (blue) and the average for bins of 10% in magenta (A). The average of the estimated values of S2 (black points) and the red line approximating the average estimated values as S2~ln (3.6fG + 0.4) (B).
TAG and TGA frequencies. The predicted values of S 2 based on this formula are similar to the predicted values based on formula (5) ( Figure 5) except for the genomes of low G content (<16%). This, the predicted selection coefficient S 2 is positive for nearly all ranges of G content, indicating that the TGA stop codon provides a selective advantage in comparison with the TAG stop codon.

Discussion
The relationship between stop codon frequency and Gcontent, or GC-content as reported previously [14], is one of the most striking and unambiguous patterns in bacterial genome composition ( Figure 1). Here, we have developed a simple model that captures all of the major observations of stop codon distribution across bacterial genomes. However, as with many theoretical treatments our  model necessarily makes several simplifying assumptions. First, we ignore the effects of neighboring nucleotides while, in principle, co-evolution of the nucleotide immediately after the stop codon may be effecting TAG frequency more than that of TGA and TAA frequencies through either contextually-dependent mutational effects [18,19] or by affecting the efficiency of stop-codon recognition [9][10][11][12]. However, the lack of a relationship of TAG frequency and G content does not depend on the nucleotide immediately after the stop codon ( Figure 6) indicating that the nucleotide context is not an important factor in explaining the pattern of stop codon usage ( Figure 1). Second, the same stop codons can be subject to different selection pressures in different genes due to difference in the levels of expression [20] or other factors. To alleviate the fears that a more general model, one that takes into account the distribution of selection coefficients in a genome, would substantially alter our conclusions we have analyzed the effect of assuming a distribution of selection coefficients. We have shown that if a given frequency of the TAG stop codon is explained by a distribution of selection coefficients then the expected value of the average selection of such a distribution would have to be greater or equal to a selection coefficient that is uniform across all TAG codons and leads to the same TAG frequency in the genome (see Methods). In other words, differences in the strength of selection across different TAG codons make our argument stronger that, on average, the TAG stop codon is unpreferred.
Using a population genetics model modified to describe stop codon and guanine frequencies we demonstrated that stop codon usage can be explained when selection is acting specifically on TAG. The predicted selection regime on TAG, S 2 , has three properties: it is relatively weak, with N e s between −0.5 and 1.5, nucleotide content dependent and is positive when G-content <16% and negative when G-content is >16%. The predicted selection strength is weak, on the order of 1/N e , which is not strong enough to severely restrict the rate of evolution of stop codons. Indeed, such weak selection on individual alleles can be overpowered by genetic drift, which may result in the large variability of stop codon frequencies in our data ( Figure 1B). Alternatively, the observed variability of stop codon frequencies relative to the average expectation (compare Figure 1A and 1B) may be due to slight changes in selection pressure on TAG and the rates of A < − > G mutation between different species (Figure 3A).
The G-content dependence of the selection follows from the roughly constant TAG frequency relative to G- content. Yet at this point, there are no known molecular mechanisms that may explain why TAG stop codon has different selective consequences depending on nucleotide content. One possibility, however, is the dependence of translation termination efficiency on the nucleotide context in the vicinity of the TAG stop codon. Bacteria generally code for two release factors (RF), RF1 that recognizes TAA and TAG stop codons and RF2 that recognizes TAA and TGA [21]. Thus, the prediction of the context-dependence hypothesis is that the efficiency of RF1 is GC-context dependent while RF2 functions independent of nucleotide context. Empirical evidence may be necessary to confirm or refute this hypothesis, however, given the relatively weak nature of the selection the differences in translation termination efficiency may be too small to be easily detected in the laboratory. The possibility of the molecular mechanism involving elongation termination factors, however, is left necessarily uncertain by conflicting data from other species. Eukaryota, that have only one release factor for all three stop codons [22,23], and chloroplast genomes that have retained orthologs of both release factors [24], show a clear increase of TAG frequency with higher GCcontent (Additional file 3: Figures S3 and Additional file 4: Figure S4). Clearly, further experimental work is likely necessary to elucidate the molecular mechanisms behind selection on TAG stop codon in bacteria.
Within the framework of our model it is possible to compare the fitness impacts of different stop codons depending on genomic nucleotide content. Regardless of selection on G-content itself (S 1 ) the difference in fitness between TAG and TGA stop codon is defined by S 2 (Figure 2). Thus, regardless of the value of S 1 our data signify that TAG stop codon is always less fit that the TGA stop codon for G-content >16%. Comparing the relative fitness of TAG and TAA, however, involves both S 1 and S 2 , with their sum being the difference in relative fitness of these two stop codons (Figure 2). Within the model, G-content depends on relative rates of mutation A < − > G and S 1 , and we cannot disentangle the contribution of mutation ( μ 1 and μ 2 ) versus selection (S 1 ) so we cannot analytically estimate the value of S 1 . However, we can define the range of values of these parameters for specific G-content.
To identify the plausible range of S 1 values in bacterial genomes we investigate the values of μ 2 =μ 1 and S 1 for Gcontent of 16%. We find that in order for the G < − > A twofold sites to maintain 16% G-content either the rate of G > A mutation must be at least five times larger than the rate of A > G mutations or, S 1 must be positive (Figure 7). Thus, for genomes with~16% G-content TAG can be the stop codon with the highest fitness only if G > A rate of mutation is five times higher than the rate of A > G mutation. For G-content of 5% TAA would confer higher fitness than TAG if S 1 < 0.5. When S 1 = 0 the ratio of G < − > A mutations ( μ 2 =μ 1 ) must bẽ 19, while a smaller ratio implies a positive S 1 (Figure 7). When S 1 = 0.5 the ratio μ 2 =μ 1 must be at least 11, Figure 7 The relationship between the ratio μ 2 /μ 1 , and S 1 for G-content of 5% (red) and 16% (blue) with the values of the parameters at which there is no selective difference between TAG and TAA (S 1 -S 2 = 0) are indicated by straight lines.
implying that if μ 2 =μ 1 < 11 then S 1 > 0.5 and TAA is more fit than TAG (Figure 7). Is there any evidence that G > A can be five or ten times faster than A > G mutations? Mutational biases against GC-content that have been measured were shown to be always less than tenfold in favor of ATcontent and less than fivefold for 151 out of a total of 154 species considered in two separate studies (25,26). Similarly, weak selection acting on GC-content has been postulated by several researchers (25,26). Given this evidence it is unlikely that the observed GCcontent can be explained solely by G < − > A mutational biases and, therefore, S 1 is positive and > > 0.5 for Gcontent % 5% and >0 for G-content % 16%. Thus, for Gcontent < 16% the TAG stop codon is expected to be less fit than the TAA stop codon.

Conclusions
The relative fitness of TAG to TAA and TGA stop codons can thus be described as follows. When G-content is >16% TAG has lower fitness than TGA. As long as S 1 > −S 2 for G-content <16% then TAG has lower fitness than TAA in bacterial genomes with G-content < 16%. Because S 1 > −S 2 is expected to hold for G-content < 16% given the mutation parameters observed in nature [25,26] it follows that TAG is a striking example of a global suboptimal codon, such that the substitution of TAG into either TAA or TGA for any bacterial species would lead to an increase of fitness. The use of suboptimal synonymous codons in bacteria is a well-documented phenomena, however, the exact codons that are suboptimal differ substantially between different species (see [27] for review). To our knowledge, the observation that one codon with synonymous function to other codons is always worse in such a large group of organisms, bacteria, is the first example of a global sub-optimality of the genetic table. The sub-optimal organization of the genetic table revealed here provides a striking counterexample to the remarkable optimization of the genetic code with respect to error minimization [28][29][30].

Methods
All available complete bacterial genomes were downloaded from NCBI website and 736 of those that utilize the standard genetic code were used for the analysis (See Additional file 5). Plasmid sequences were excluded. All available pairs of closely related genomes from the ATGC database [31], of which there were 11 pairs with 0.03 < K S < 0.22 were used to measure A < − > G synonymous transition rates (K AG ) and rates of stop codon evolution (K stop ). Orthologues were constructed using twodirectional best BLAST [32] hit approach and aligned using MUSCLE [33]. To obtain K AG we looked at the number of synonymous differences between three pairs of codons: CAA and CAG, AAA and AAG and GAA and GAG. The expected number of substitutions occurred was estimated using Jukes-Cantor model [34]. The same method was applied to estimate the number of substitutions between stop codons with the only difference that the number of synonymous sites for TAA codon is twice as high as the number of synonymous sites for TAG and TGA codons. In order to obtain rates of TAG codon evolution the substitutions have to be polarized and for that the third organism was added to the 11 pairs of the genomes such that the synonymous distance between sister species 0.02 < K S < 0.15 and between sister species and outgroup 0.04 < K S < 0.62. Substitutions were polarized using simple parsimony approach.
To show that a distribution of selection coefficients for the same stop codon across different genes can only increase the differences between average selection coefficients of stop codons we proved the following conjecture. A given frequency of TAG codon in the genome can be explained by an equal strength of selection acting on all TAG codons in the genome (S 0 ) or a distribution of selection coefficients across different codons with an expected value of the distribution (S À ). For any given observed frequency of the TAG codon in the genome S 0 ≤S À , such that the average strength of selection in a distribution is larger when different codons are under different selection pressures. We consider the case where selection on each TAG stop codon is a discrete random variable which assumes the value S i with the probability p i . In this case we use S i as discrete values of a distribution of selection coefficients on TAG stop codons in different genes in the same genome, while S 1 and S 2 were used as fixed values of the selection coefficients for all genes across a single genome. In this case for any selection S i the expected number of the sites under this selection is N i = p i *N stop , the frequency of TAG is f TAG and the value of selection S 0 acting on TAG sites is estimated from the formula f TAG ¼ Taking into account that the second derivative of f, The only condition for this inequality to hold is S ≥ 1nf G , And in any case, what are the reasons to suspect that the selection regime in the amino-acid-encoding codons is the same as in the stops (the former may depend on concentrations of tRNAs and the codon-anticodon interactions; the latter, on interactions with the release factors). What about the A/G choice in the four-fold codon families?
Author response: Indeed, we have created the model based on this assumption because it allowed us to reduce the number of parameters and make the system of equations solvable. However, we can also show that this assumption does not affect our main result that the TAG codon is selectively disadvantageous. Specifically, from system of equations (4) it follows that exp (S 2 ) = f TGA /f TAG . Thus, we can solve for the selective impact of TAG (S 2 ) solely based on the frequencies of TAG and TGA without making the assumption that the selective regime is the same in stop and amino acid codons. Since S2 is positive for almost the entire range of G content it follows that the TAG codon provides a selective disadvantage relative to the TGA codon. Unfortunately, we cannot estimate S 2 by comparing the frequencies of TAG and TAA codons because we cannot independently estimate the μ 2 μ 1 expðS 1 Þ component of f TAA from (4). We now present the new estimate of S 2 in Figure 5 and the main text. The reasoning in page 6 is not clearly presented, and misprints add to the confusion. How is formula S2 = ln ((fG(1-fTAG))/fTAG) used? Do I understand it correctly that the next formula S2 = ln (3.6fG + 0.4) results from a fit to observations (comparison of genome pairs)? -I think, this should be explained more explicitly.
Author response: Yes, this is what we mean, and we rewrote this section to hopefully make this clearer. By the way, the two formulas for S2, theoretical and observed ones, yield a dependence between fG and fTAGdoes it hold?
Author response: Yes, there is a slight dependence as can be seen from Figure 1. Finally, reference to equation (5) in the preceding paragraph should be about equation (4), and the sentence "S2 has a clear G-content dependence is well approximated. . ." probably should be "S2 has a clear G-content dependence that is well approximated. . ." .
Author response: If the referee means this sentence "Thus, selection on G-content,, affects only G-content itself and does not change the form of the relationship between G frequency and stop codon usage as is evident from expressions (4)." then we mean that in the system of equations (4) G-content (f(taa,tga,tag) does not depend on S1. The other typo is corrected. Polarization of substitutions using parsimony may be dangerous if there is selection towards a specific, preferred nucleotide: in some cases two parallel nonpreferred-to-preferred substitutions may occur, and they will be interpreted as a single preferred-to-nonpreferred substitution, hence skewing the substitution statistics.
Author response: This is true, however, these data has been obtained for a number of species with different GC-content and low sequence divergence. Therefore, we believe that it is unlikely that the use of parsimony have produced a systematic error of substantial effect that jeopardizes our conclusions. The manuscript by Povolotskaya et al. puts forward a simple model of nucleotide substitutions in the stop codons in bacteria, and tests it against the genome-wide data. One of the main conclusions is that TAG may be globally suboptimal, with each of the remaining two codons turning out more fit under different values of GC content. One biological explanation of these data may be in the phenomenon of overlapping ORFs in bacterial operons. TAG is the only codon that does not accommodate a minimal overlap, whereas TAA can give one kind of stop-start codon overlap (TAATG) and TGA even two kinds (ATGA and TGATG). Perhaps if the authors restricted their sample to the termination codons in the last (or only) genes in operons, they would see much less difference between fitness of those two and TAG?
Author response: The idea that the observed pattern of stop codon frequency in bacterial genomes is explained by gene overlap has occurred to us as well. However, we observe the same relationship between Gcontent and stop codon frequency in overlapping and non-overlapping genes. We now report these data in a new figure that is Additional file 2 Figure S2 in the new version of the manuscript. We have considered only tail-to-tail overlaps due to a much higher certainty of stop codon annotation compared to the uncertainty in the annotation of many start codons. This manuscript presents an analysis of stop codon usage in bacterial species. The authors report that TAG codon is un-preferred in most bacterial species and that its frequency does not depend on GC content. They suggest presence of weak selection against TAG codon due to unknown mechanism. One potential mechanism may involve dependency of efficiency of one of the release factors on GC content. I find the results of great interest. I only have two minor technical comments. 1) The analysis is based on Bulmer equations, which hold only if evolution is mutation limited. It would be great to briefly discuss applicability of this model to a wide variety of bacterial species.
Author response: Bulmer's model assumes that the fate of a new mutation is decided independently of other mutations, that is to say that generally only one mutation is segregating in the population at the same time. This is certainly true if we consider only mutations in stop codons. In most bacterial genomes there are 2-5 thousand protein coding genes making it rather unlikely that more than one stop codon polymorphism is segregating at the same time.
2) Approximation of selection coefficient against TAG codon as a sum of contributions due to selection against GC content (S1) and selection against this specific codon (S2) ignores the S1*S2 term. It is OK if both selective forces are assumed to be small. It would be great if this assumption would be spelled out.
Author response: The referee is absolutely correct, we assume that both of the selective forces are small. We have added an explicit statement to this effect in the text.