Why eukaryotic cells use introns to enhance gene expression: Splicing reduces transcription-associated mutagenesis by inhibiting topoisomerase I cutting activity
Biology Direct volume 6, Article number: 24 (2011)
The costs and benefits of spliceosomal introns in eukaryotes have not been established. One recognized effect of intron splicing is the enhancement of gene expression. However, the mechanism underlying this splicing-mediated enhancement has not been defined. Previous studies have shown that intron splicing is a time-consuming process, indicating that splicing may not reduce the time required for transcription and processing of spliced pre-mRNA molecules; rather, it might facilitate the later rounds of transcription. Because the densities of active RNA polymerase II on most genes are less than one molecule per gene, direct interactions between the splicing apparatus and transcriptional complexes (from the later rounds of transcription) are infrequent, and thus unlikely to account for splicing-mediated gene expression enhancement.
Presentation of the hypothesis
The serine/arginine-rich protein SF2/ASF can inhibit the DNA topoisomerase I activity that removes negative supercoiling of DNA generated by transcription. Consequently, splicing could make genes more receptive to RNA polymerase II during the later rounds of transcription, and thus affect the frequency of gene transcription. Compared with genes whose transcription is enhanced by strong promoters, intron-containing genes experience a lower frequency of cut-and-paste processes. The cleavage and religation of DNA strands by DNA topoisomerase I was recently shown to account for transcription-associated mutagenesis. Therefore, intron-mediated enhancement of gene expression could reduce transcription-associated genome instability.
Testing the hypothesis
Experimentally test whether transcription-associated mutagenesis is lower in intron-containing genes than in intronless genes. Use bioinformatic analysis to check whether exons flanking lost introns have higher frequencies of short deletions.
Implications of the hypothesis
The mechanism of intron-mediated enhancement proposed here may also explain the positive correlation observed between intron size and gene expression levels in unicellular organisms, and the greater number of intron-containing genes in higher organisms.
This article was reviewed by Dr Arcady Mushegian, Dr Igor B Rogozin (nominated by Dr I King Jordan) and Dr Alexey S Kondrashov. For the full reviews, please go to the Reviewer's Reports section.
Splicing could enhance later rounds of transcription
Spliceosomal introns are a landmark feature of eukaryotic nuclear genes. However, their costs and benefits have not been fully interpreted [1–12]. One recognized effect of introns is their enhancement of gene expression. Introns and/or their splicing have been found to enhance almost every step of gene expression, from transcription to translation [13–21]. For example, intron-containing transgenes in mice are transcribed 10- to 100-fold more efficiently than the same genes lacking introns . In humans and the yeast Saccharomyces cerevisiae, intron-containing genes produce more copies of RNA than intronless genes [18, 23]; consistent with this, removal of the introns from three essential genes in yeast significantly lowered their transcription levels . Similarly, highly expressed genes were found to have higher intron densities (number of introns per kilobase of coding sequence) than weakly expressed genes in the human genome . Comparison of the densities of active RNA polymerase II molecules present on genes between intron-containing genes and intronless genes in S. cerevisiae also showed that introns could enhance transcription (Table 1). The enhancing effects of introns on the posttranscriptional stages of gene expression are commonly attributed to proteins recruited to the mRNA during splicing [13–15, 19]. By contrast, there is still no consensus on how introns and/or their splicing can increase transcription efficiency.
One possibility is that introns contain motifs that stimulate the elongation complex during transcription. It is well known that there is poor conservation of intronic sequences between most organisms. Even splicing signals such as branch sites are only loosely defined. This makes it very hard to imagine the existence of common enhancing signals in the introns present in different genes and different organisms. In the introns of Arabidopsis thaliana, Rose et al.  found loosely defined motifs that may be responsible for a gene expression enhancing effect. However, Akua et al.  showed that splicing is critical for the enhancing effect of the leader intron of the Arabidopsis AtMHX gene. Without splicing, the intron sequence displayed only low-level enhancement . Recently, Morello et al.  constructed rice mutants that had decreased splicing efficiency, but retained the loosely defined motifs identified by Rose et al. . Analysis of the mutant genes showed that the enhancement of gene expression depended heavily on the efficiency of intron splicing . These observations indicate that splicing is the main contributing factor for intron-mediated enhancement of transcription.
Splicing has been shown to have extensive interactions with transcription processes and other pre-mRNA processing events . So splicing may enhance gene expression through any of these processes, from stimulating the later rounds of transcription to facilitating the polyadenylation of the spliced pre-mRNAs.
In addition to the evidence for enhancing effects on gene expression, there is also evidence indicating that splicing is the rate-limiting step in nascent mRNA production . Intron splicing takes 5 to 10 min [28, 29]; during this time, RNA polymerase II advances 19 to 38 kb towards the 3' end of a gene . This distance is far longer than almost all 3'-terminal exons in unicellular organisms. Consistent with this, RNA polymerase II has been found to pause and wait for splicing to occur before finishing transcription [30–32]. Rapidly regulated genes have also been consistently found to contain few introns [33, 34]. Therefore, the presence of an intron in a gene is unlikely to reduce the time required to produce an mRNA. Another possible explanation for the splicing-mediated enhancement of gene expression is that splicing mainly enhances the later rounds of transcription.
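The distance range quoted above implies a particular elongation rate that the text does not state. The short Python check below back-calculates it, assuming the 19 to 38 kb range was obtained by multiplying the 5 to 10 min splicing time by a single constant rate; the variable names are illustrative only.

```python
# Back-calculate the elongation rate implied by the figures quoted in the text.
# Assumption (not stated in the article): distance = rate * time, with one
# constant elongation rate used for both ends of the range.
splicing_time_min = (5.0, 10.0)   # splicing duration quoted in the text
distance_kb = (19.0, 38.0)        # polymerase travel quoted in the text

implied_rates = [d / t for d, t in zip(distance_kb, splicing_time_min)]

for t, d, r in zip(splicing_time_min, distance_kb, implied_rates):
    print(f"{t:.0f} min -> {d:.0f} kb implies {r:.1f} kb/min")
```

Both ends of the range give the same rate, which suggests the quoted distances are simply the splicing time scaled by one elongation rate.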
Indirect interaction between splicing factors and later rounds of transcription
One way for splicing to enhance the later rounds of transcription is for some components of the splicing machinery to directly interact with the RNA polymerases or transcription factors operating during the later rounds of transcription [13, 35]. For such interactions and transcriptional enhancement to happen, splicing of a pre-mRNA molecule must be unfinished when the later rounds of transcription initiate. As recently established, RNA polymerases do not transcribe the 3' end of genes before finishing intron splicing [30–32]. That is, for splicing factors to interact directly with the later rounds of transcription, at least two RNA polymerase molecules should be attached to the same gene. For a ribosomal RNA gene, it is very common to have multiple RNA polymerases attached at one time and multiple transcripts synthesized simultaneously . However, in protein-coding genes, it is uncommon to have multiple active RNA polymerase II molecules recruited. In S. cerevisiae, only 0.13% of genes (6 among 4,670 analyzed genes) have >2 RNA polymerase molecules/gene, and only 0.86% of genes (40 among 4,670 analyzed genes) have >1 RNA polymerase molecules/gene . All 6 genes with >2 RNA polymerase molecules/gene are intronless, and only 6 of the 40 genes that have >1 RNA polymerase molecules/gene contain introns. That is, only about 2% (6 of 296) of the intron-containing genes in S. cerevisiae have >1 RNA polymerase molecules/gene (genome data from , accessed on Nov 24, 2010). Please note that the densities of polymerase II obtained by chromatin immunoprecipitation in some other studies  may be globally higher than those in the work cited here . As discussed by Pelechano et al. , the RNA polymerase II densities reported in those studies probably included a fraction of inactive RNA polymerase II molecules, and therefore may not represent an accurate map of transcriptionally active RNA polymerase II in the yeast genome.
Hence, in yeast, the enhancement of the later rounds of transcription by most of the introns is not likely to be mediated by direct interactions between splicing factors and later rounds of transcription.
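The proportions in the preceding paragraph follow directly from the gene counts it quotes. The Python sketch below recomputes them; the variable names are hypothetical, and the last ratio simply divides the 6 intron-containing genes with >1 polymerase by the 296 intron-containing genes, as the text does.

```python
# Recompute the S. cerevisiae proportions from the counts quoted in the text.
# Variable names are illustrative; the counts themselves come from the article.
analyzed_genes = 4670          # genes with polymerase density data
genes_gt2_pol = 6              # genes with >2 RNA polymerase molecules/gene
genes_gt1_pol = 40             # genes with >1 RNA polymerase molecules/gene
intron_genes_total = 296       # intron-containing genes in the genome
intron_genes_gt1_pol = 6       # intron-containing genes among the 40


def percent(part, whole):
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole


pct_gt2 = percent(genes_gt2_pol, analyzed_genes)
pct_gt1 = percent(genes_gt1_pol, analyzed_genes)
pct_intron_gt1 = percent(intron_genes_gt1_pol, intron_genes_total)

print(f">2 polymerases/gene: {pct_gt2:.2f}% of analyzed genes")
print(f">1 polymerase/gene:  {pct_gt1:.2f}% of analyzed genes")
print(f">1 polymerase/gene:  {pct_intron_gt1:.1f}% of intron-containing genes")
```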
Although genome-wide data on the densities of RNA polymerase II are not yet available for other species, we can roughly estimate them from steady-state mRNA abundance and mRNA stability. In yeast, the median abundance of mRNAs is 1.38 copies/cell ; the median half-life of mRNAs is about 20 min  and the median RNA polymerase II density on genes is 0.078 molecules/kb . By contrast, mammalian cells have lower copy numbers of more stable mRNAs. In mice, the median mRNA abundance levels vary from 0.36 to 0.79 copies/cell among different cell types , or about 1/2.4 of the mRNA abundance in yeast. The median half-life of mouse mRNA is at least 274 min [42, 43], i.e. at least 13.7 times the median half-life of yeast mRNA. Because the steady-state abundance of an mRNA is proportional to its production rate multiplied by its half-life, we can estimate that the production rate of nascent transcripts in a mouse cell is at most 1/32.88 (that is, 1/(2.4 × 13.7)) of that in a yeast cell. Assuming that yeasts and mammals do not differ significantly in their transcriptional elongation rates, the median RNA polymerase II density in mouse genes is 2.37 × 10⁻³ molecules/kb. Given the 18.6 kb median size of mouse genes (data from Ensembl Release 60), 2.37 × 10⁻³ molecules/kb corresponds to 0.044 molecules/gene, a value that is lower than the 0.096 molecules/gene in yeast. From this we can estimate that genes that have recruited multiple active RNA polymerase II molecules are probably also infrequent in mice.
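The estimate in the preceding paragraph can be retraced step by step. The Python sketch below does so under two assumptions that are inferred rather than stated in the text: that the factor of 2.4 comes from the midpoint of the 0.36 to 0.79 copies/cell range, and that steady-state abundance scales with production rate multiplied by half-life. All variable names are illustrative.

```python
# Retrace the mouse RNA polymerase II density estimate from the quoted medians.
yeast_abundance = 1.38                  # median mRNA copies/cell, yeast
mouse_abundance_range = (0.36, 0.79)    # median mRNA copies/cell, mouse cell types
yeast_half_life_min = 20.0              # median mRNA half-life, yeast
mouse_half_life_min = 274.0             # median mRNA half-life, mouse (lower bound)
yeast_density_per_kb = 0.078            # median Pol II molecules/kb, yeast
mouse_median_gene_kb = 18.6             # median mouse gene size

# Assumption: the article's "1/2.4" uses the midpoint of the mouse range.
mouse_abundance_mid = sum(mouse_abundance_range) / 2
abundance_ratio = yeast_abundance / mouse_abundance_mid        # about 2.4
half_life_ratio = mouse_half_life_min / yeast_half_life_min    # 13.7

# At steady state, abundance is proportional to production rate * half-life,
# so the production rate is proportional to abundance / half-life.
production_fold_lower = abundance_ratio * half_life_ratio      # about 32.88

# Assumption from the text: equal elongation rates in yeast and mouse, so
# polymerase density scales with the production rate.
mouse_density_per_kb = yeast_density_per_kb / production_fold_lower
mouse_density_per_gene = mouse_density_per_kb * mouse_median_gene_kb

print(f"production rate in mouse: 1/{production_fold_lower:.2f} of yeast")
print(f"mouse density: {mouse_density_per_kb:.2e} molecules/kb")
print(f"mouse density: {mouse_density_per_gene:.3f} molecules/gene")
```

Because the mouse half-life is a lower bound, the resulting densities are best read as upper bounds on the mouse values.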
In both yeast and mice, therefore, direct interactions between splicing factors and the later rounds of transcription are probably infrequent events. The splicing-mediated enhancement of gene expression might be attributed to the indirect interactions occurring between splicing factors and later rounds of transcription. If future studies show that multiple active RNA polymerase II molecules on a single gene are common and direct interactions between splicing factors and later rounds of transcription are not infrequent, indirect enhancement of the later rounds of transcription by splicing would contribute less than we envision here. Nevertheless, indirect enhancement of the later rounds of transcription mediated by splicing should be explored if there is evidence for it.
In the following sections, we first propose a possible mechanism for splicing-mediated indirect enhancement of the later rounds of transcription, and then explore potential answers to the following questions: Is there any difference between the enhancement of gene expression by introns and strong promoters? What are the costs and benefits underlying the enhancement of gene expression by intron splicing?
Presentation of the hypothesis
Splicing makes genes less twisted and thus more accessible
It has been documented that transcription generates positive supercoiling ahead of the transcriptional assembly and negative supercoiling behind the assembly if the DNA is topologically closed [44–52]. In eukaryotes, topoisomerase I removes the negative supercoiling generated during transcription [49–55]. Extremely negatively supercoiled DNA could be observed in the transcriptionally active genes of mutants lacking topoisomerase I [45, 46].
In eukaryotic cells, topoisomerase I has another function: acting as a kinase to phosphorylate serine/arginine-rich (SR) proteins such as SF2/ASF [56, 57]. When topoisomerase I is associated with hypophosphorylated SF2/ASF, its negative supercoiling removal activity is inhibited [58, 59]. In addition, the substrate of SR protein phosphorylation, ATP, can also inhibit topoisomerase I mediated DNA cleavage . Because phosphorylation of SR proteins is required for efficient splice-site recognition and the assembly of spliceosomes [61–63], we propose the following scenario: during intron splicing, topoisomerase I phosphorylates SR proteins, and while it is engaged in this way its negative supercoiling removal activity is inhibited. As a result, the negative supercoiling generated during transcription is removed at a much lower efficiency. Consequently, intron splicing changes the transcribed gene into a less twisted state (Figure 1A). By contrast, in intronless genes, the negative supercoiling generated by transcription is removed efficiently by topoisomerase I, and so the gene reverts to its original topological status after transcription (Figure 1B). The binding of proteins to less twisted DNA is thermodynamically favored, and thus the separation of the two strands is facilitated [49, 50, 64, 65].
In summary, we propose that intron splicing inhibits the topoisomerase I negative supercoiling removal activity, which consequently facilitates later rounds of transcription. Consistent with this hypothesis, the intron-containing genes of S. cerevisiae have more active RNA polymerase II molecules attached to them and higher nascent transcription rates than intronless genes (Table 1).
Both splicing and strong promoters could enhance transcription
In rice, it has been shown that the presence of an efficiently spliced intron could compensate for the reduced transcription level resulting from a weak promoter . Among the 124 cytoplasmic ribosomal protein genes in S. cerevisiae, 94 are intron-containing and 30 are intronless (data from , accessed on Nov 24, 2010). Analysis of their mRNA abundance levels did not reveal any significant differences between intron-containing ribosomal protein genes and intronless ribosomal protein genes . Apparently, intronless ribosomal protein genes have their own strategies to enhance their transcription levels. The most likely strategy in this case is to have stronger promoters. Hence, eukaryotic cells could elevate their gene transcription levels by having introns or by having strong promoters.
Benefits of splicing: avoiding the dark side of topoisomerase I
If eukaryotic cells could elevate their transcription levels simply by having strong promoters, why do they bother to use splicing, which is a complex and energy-expensive process ? The most likely answer is that splicing must be beneficial to eukaryotic cells. Although many benefits of introns and intron splicing have been suggested [5–10], here we propose a new one based on the scenario proposed in previous sections of this paper.
To remove negative supercoiling in DNA, topoisomerase I has to generate breaks in one of its strands. This process poses a potential threat to genome integrity. In many unfavorable conditions, this threat is magnified to cause genome instability . Nitiss et al.  over-expressed yeast topoisomerase I and found that the genome became hypersensitive to methyl methanesulfonate and other DNA-damaging agents. For many years, high transcriptional rates have been found to be associated with genetic instability [70, 71]. Recently, two groups consistently found that deletion of topoisomerase I could completely eliminate transcription-associated short DNA deletions [72, 73].
If splicing could inhibit topoisomerase I DNA cleavage and religation activity, eukaryotic cells could avoid the dark side of topoisomerase I, while maintaining a high level of transcriptional activity. However, cells lacking introns and splicing activity do not necessarily exhibit obvious growth rate defects. Indeed, the deletion of most introns has been found to have no significant effects on cell growth, and, in the laboratory setting at least, introns appear to be nonessential . If our hypothesis is correct, intron deletion would increase the risk of genome instability, making it unlikely to be an evolutionarily favored strategy.
Testing the hypothesis
If our hypothesis is correct, splicing inhibits topoisomerase I DNA cleavage activity, thus reducing the frequency of transcription-associated short DNA deletions. That is, transcription-associated mutagenesis would be much lower in intron-containing than in intronless genes. Experimental approaches that block intron recognition and splicing should therefore increase transcription-associated mutagenesis in intron-containing genes, and observing such an increase would support the hypothesis.
Because topoisomerase-I-associated damage causes mainly short DNA deletions [72, 73], this would result in the exons flanking lost introns becoming shorter over evolutionary time. Bioinformatic analysis of the frequency of short DNA deletions in the exons flanking lost introns may provide evidence for this hypothesis.
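One way the bioinformatic comparison described above might be set up is as a 2 × 2 contingency test: exons flanking lost introns versus control exons, each split into those with and without a short deletion. The Python sketch below implements a one-sided Fisher's exact test with the standard library only. It is a minimal illustration, not the authors' method; the function name, the choice of test and the example counts are all assumptions, and the counts are made-up placeholders rather than data.

```python
from math import comb


def fisher_one_sided_greater(a, b, c, d):
    """One-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Rows: exons flanking lost introns, control exons.
    Columns: with a short deletion, without a short deletion.
    Returns P(X >= a) under the hypergeometric null, i.e. the probability of
    seeing at least `a` deletion-bearing exons in the first row by chance.
    """
    row1 = a + b
    col1 = a + c
    n = a + b + c + d
    total = comb(n, col1)
    upper = min(row1, col1)
    tail = sum(comb(row1, k) * comb(n - row1, col1 - k)
               for k in range(a, upper + 1))
    return tail / total


# Placeholder counts for illustration only (NOT real data).
flanking_with_del, flanking_without_del = 12, 88
control_with_del, control_without_del = 5, 95

p_value = fisher_one_sided_greater(flanking_with_del, flanking_without_del,
                                   control_with_del, control_without_del)
print(f"one-sided p-value (placeholder counts): {p_value:.4f}")
```

A small p-value from real counts would indicate an excess of short deletions in exons flanking lost introns, which is the direction the hypothesis predicts; choosing appropriate control exons (for example, matched for expression level and length) would matter as much as the test itself.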
Implications of the hypothesis
Beneficial since early eukaryotes
In lower organisms with small introns, it was believed that introns contained all the information required for accurate splicing, a mechanism called intron definition . By contrast, exon sequences play major roles in the recognition of intron/exon structures in organisms with long introns (termed exon definition). In Drosophila melanogaster, both short and long introns are typically found. Fox-Walsh et al.  demonstrated that intron definition becomes less efficient as intron size increases. The threshold for cessation of recognition across introns is 200 to 250 nt. Indeed, in some unicellular organisms, long introns are not unusual. For example, there are 143 introns >200 nt and 139 introns >250 nt in S. cerevisiae (data from , accessed on Nov 24, 2010) and 290 introns >200 nt and 173 introns >250 nt in Schizosaccharomyces pombe (data from , accessed on Jan 11, 2011). In S. pombe, an SR protein named Srp2p was reported to attach to exonic sequences and promote recognition and splicing of introns that had weak intronic splicing signals . In S. cerevisiae, an SR-like protein called Npl3 is required for efficient splicing of many pre-mRNAs . In addition, there is evidence that SR proteins also participate in intron definition in human cells . Therefore, SR and SR-like proteins are likely to exist in most, if not all, eukaryotes, with the purpose of facilitating the splicing of weak introns [62, 81, 82]. If weak splicing signals and SR and SR-like splicing facilitators are ancestral , we could argue that the splicing-mediated enhancement of gene transcription might have been beneficial since the early evolution of eukaryotic cells.
There is still no evidence, however, that the SR-like protein Npl3 inhibits the negative supercoiling removal activity of topoisomerase I in S. cerevisiae. This is a gap in our hypothesis.
Long introns: weak in splicing thus efficient in transcriptional enhancement
Another insight from the results of Fox-Walsh et al.  is that long introns are weak, and thus require more help from exonic splicing signals. Long intron splicing is more likely to require the recruitment of SR or SR-like proteins. According to our hypothesis, long introns should be more efficient in transcriptional enhancement than short introns. Hence, we would expect there to be a positive correlation between intron size and gene expression levels. This is, in fact, what has been widely observed in unicellular organisms [18, 33, 84, 85]. Early studies also supported the same trend in plants [33, 86, 87]. However, a later study in the plants A. thaliana and Oryza sativa showed that genes with longer introns were weakly expressed , a trend that is consistently observed in animals [2, 87, 89, 90]. A recent, more detailed analysis of four multicellular organisms (Homo sapiens, Caenorhabditis elegans, D. melanogaster and A. thaliana) revealed an approximate bell-shaped relationship between intron size and gene expression levels . With increasing expression levels, introns first become longer, but eventually become shorter . Besides the SR protein SF2/ASF, some other splicing-related proteins are also found to interact with human topoisomerase I . For example, PSF/p54nrb activates topoisomerase I to remove negative supercoiling . The splicing apparatuses of multicellular organisms are more complex than those of unicellular organisms like yeast . It is therefore reasonable to assume that unknown interactions between splicing and transcription exist in higher organisms. The evolution of intron size in higher organisms is unlikely to be neatly explained by a single factor such as that proposed in this paper.
The cost of introns
If introns only confer the benefits proposed by ourselves and others [5–10], the loss of introns would be selected against during evolution. In cases where the cost of an intron exceeds its benefit(s), loss of the intron would be positively selected for. And if the cost(s) only just balance the benefit(s), intron loss may be fixed in evolution by random drift. Many cases of intron loss have been documented in evolution [94–105]. So, if our hypothesis is correct, introns and/or their splicing should also confer a considerable cost to an organism. It has been shown that intron splicing is a time-consuming process [27–29], and so introns are selected against in rapidly regulated genes [3, 33]. Crucially, transcription and intron splicing consume energy. Thus, in organisms with very large populations, like S. cerevisiae, the energetic cost of a long intron in a highly expressed gene is a burden visible to natural selection .
Dr Arcady Mushegian, Stowers Institute of Medical Research, USA
The origin of eukaryotic introns is most likely explained from the mechanistic point of view by group II intron invasion from the mitochondrial ancestor, and from the population point of view by weak purifying selection in populations with small Ne. What promotes intron persistence in all eukaryotes, aside from small Ne, is an open question. The authors argue that a factor here is the ability of one of the SR proteins to inhibit topo I activity, thus reducing mutation rate. This is an interesting hypothesis compatible with some of the observed data on correlation between intron length, expression strength, polymerase occupancy, etc. I request, however, that the authors state more explicitly their opinion on when in the course of evolution this inhibition arose - do I understand it correctly that it had to be an ancient property, and if so, has this been borne out by pinpointing the origin of the SR factor in question, or by showing that this is a general property of many SR factors, not the serendipitous advantage of this particular one?
Authors' response: This is a very important question raised about our hypothesis. We would also like to be able to see the potential benefit(s) that might have driven the origin and evolution of spliceosomal introns. Unfortunately, we are unable to speculate further about this at this time, because very little is known about the origin and early evolution of spliceosomal introns and SR proteins. Further evidence is required to offer a more explicit opinion.
Dr Igor B Rogozin, NCBI/NLM/NIH, USA (nominated by Dr I King Jordan)
The paper discusses various issues related to the positive correlation between intron size and gene expression which is observed in unicellular organisms and some multicellular organisms. This correlation is not particularly strong and has various explanations; for example, longer introns may be more efficiently spliced out, or maybe splicing is important for an efficient transport of mRNA. The authors suggested their own hypothesis:
"If splicing could inhibit topoisomerase I DNA cleavage and religation activity, eukaryotic cells could avoid the dark side of topoisomerase I, while maintaining a high level of transcriptional activity. However, cells lacking introns and splicing activity do not necessarily exhibit obvious growth rate defects. Indeed, the deletion of most introns has been found to have no significant effects on cell growth, and, in the laboratory setting at least, introns appear to be nonessential . If our hypothesis is correct, intron deletion would increase the risk of genome instability, making it an unlikely evolutionary favored strategy."
I think that by "the genetic risk" the authors mean the increased rate of spontaneous mutations. In general, I do not think that the transcription-associated mutagenesis is different from other sources of spontaneous mutations. I do not see any connection between the transcription-coupled mutagenesis/repair and introns. Some unicellular eukaryotes have a few introns, prokaryotes without introns (self-splicing introns) are doing just fine. I do not think that they are under any "genetic risk". Thus the transcription-coupled mutagenesis/repair is unlikely to be an important factor in evolution of the exon/intron structure.
Authors' response: We consider that our hypothesis may provide some insight about the correlation between intron size and gene expression levels. The main question we want to address is the correlation observed between the presence of an intron and the effect it might have on the gene expression level; such a correlation has been found in many genome-wide bioinformatic analyses and transgenic analyses [18, 22–24, 107–111].
We also do not know if any connection between transcription-coupled mutagenesis/repair and introns exists. But, in the light of such a hypothesis, we would seek to investigate this further.
"Some unicellular eukaryotes have a few introns, prokaryotes without introns (self-splicing introns) are doing just fine." There are two possible explanations here. The first is the widely held opinion that introns are slightly deleterious, and so the presence/absence of introns depends mainly on the efficiency of natural selection. The second is that introns are abundant in some organisms (like humans) and some genes from intron-rare organisms (like the ribosomal proteins genes of S. cerevisiae) because of the distinctiveness of these organisms and these genes. These organisms may be less able to tolerate genetic risk than others. And these genes (e.g., ribosomal protein coding genes and other evolutionarily conserved genes) may be less tolerant of genetic risks than other genes. In S. cerevisiae, only 3.1% of the nuclear genes contain introns, but the majority (75.8%) of cytoplasmic ribosomal protein genes have introns (genome data from , accessed on Nov 24, 2010). In addition, Dr. Rogozin and colleagues have reported that evolutionarily conserved genes tend to have more introns . Certainly, these observations are consistent with our hypothesis, but not proof per se. However, there is, to the best of our knowledge, no convincing evidence to reject the hypothesis that introns are retained in some organisms and some genes because of their intolerance of genetic risk.
The authors suggested two ways to test the hypothesis:
"If our hypothesis is correct, splicing inhibits topoisomerase I DNA cleavage activity, thus reducing the frequency of transcription-associated short DNA deletions. That is, transcription-associated mutagenesis would be much lower in intron-containing than intronless genes. Experimental approaches that block intron recognition and splicing would strengthen the validity of transcription-associated mutagenesis.
Because topoisomerase-I-associated damage causes mainly short DNA deletions [72, 73], this would result in the exons flanking lost introns becoming shorter over evolutionary time. Bioinformatic analysis of the frequency of short DNA deletions in the exons flanking lost introns may provide evidence for this hypothesis."
However, the authors did not try to find any support for the hypothesis. I think that if the authors did not do the suggested analyses by themselves, nobody is going to do it.
Authors' response: We thank Dr. Rogozin for reminding us of this. We did not want to write a research article with a very long introduction, but preferred to formulate a hypothesis and then (after some studies) write a concise research article on the subject.
I suggest to readers of this paper to consider it as a review paper rather than a hypothesis paper.
Authors' response: We do not completely disagree with this suggestion. In fact, we ourselves have often derived more benefit from reading the background and introduction sections than the hypothesis section of some hypothesis papers.
Dr Alexey S Kondrashov, Department of Ecology and Evolutionary Biology, The University of Michigan, USA
This reviewer provided no comments for publication.
Roy SW, Gilbert W: The evolution of spliceosomal introns: patterns, puzzles and progress. Nat Rev Genet. 2006, 7: 211-221.
Castillo-Davis CI, Mekhedov SL, Hartl DL, Koonin EV, Kondrashov FA: Selection for short introns in highly expressed genes. Nat Genet. 2002, 31: 415-418.
Chen J, Sun M, Hurst LD, Carmichael GG, Rowley JD: Human antisense genes have unusually short introns: evidence for selection for rapid transcription. Trends Genet. 2005, 21: 203-207. 10.1016/j.tig.2005.02.003.
Jeffares DC, Mourier T, Penny D: The biology of intron gain and loss. Trends Genet. 2006, 22: 16-22. 10.1016/j.tig.2005.10.006.
Fedorova L, Fedorov A: Introns in gene evolution. Genetica. 2003, 118: 123-131. 10.1023/A:1024145407467.
Lynch M: Intron evolution as a population-genetic process. Proc Natl Acad Sci USA. 2002, 99: 6118-6123. 10.1073/pnas.092595699.
Fedorova L, Fedorov A: Puzzles of the human genome: Why do we need our introns?. Curr Genomics. 2005, 6: 589-595. 10.2174/138920205775811416.
Duret L: Why do genes have introns? Recombination might add a new piece to the puzzle. Trends Genet. 2001, 17: 172-175. 10.1016/S0168-9525(01)02236-3.
Forsdyke DR: Are introns in-series error-detecting sequences?. J Theor Biol. 1981, 93: 861-866. 10.1016/0022-5193(81)90344-1.
Niu DK: Protecting exons from deleterious R-loops: a potential advantage of having introns. Biol Direct. 2007, 2: 11. 10.1186/1745-6150-2-11.
Martin W, Koonin EV: Introns and the origin of nucleus-cytosol compartmentalization. Nature. 2006, 440: 41-45. 10.1038/nature04531.
Koonin EV: The origin of introns and their role in eukaryogenesis: A compromise solution to the introns-early versus introns-late debate?. Biol Direct. 2006, 1: 22. 10.1186/1745-6150-1-22.
Le Hir H, Nott A, Moore MJ: How introns influence and enhance eukaryotic gene expression. Trends Biochem Sci. 2003, 28: 215-220. 10.1016/S0968-0004(03)00052-5.
Wang HF, Feng L, Niu DK: Relationship between mRNA stability and intron presence. Biochem Biophys Res Commun. 2007, 354: 203-208. 10.1016/j.bbrc.2006.12.184.
Zhao C, Hamilton T: Introns regulate the rate of unstable mRNA decay. J Biol Chem. 2007, 282: 20230-20237. 10.1074/jbc.M700180200.
Rose AB, Elfersi T, Parra G, Korf I: Promoter-proximal introns in Arabidopsis thaliana are enriched in dispersed signals that elevate gene expression. Plant Cell. 2008, 20: 543-551. 10.1105/tpc.107.057190.
Lynch M, Kewalramani A: Messenger RNA surveillance and the evolutionary proliferation of introns. Mol Biol Evol. 2003, 20: 563-571. 10.1093/molbev/msg068.
Juneau K, Miranda M, Hillenmeyer ME, Nislow C, Davis RW: Introns regulate RNA and protein abundance in yeast. Genetics. 2006, 174: 511-518. 10.1534/genetics.106.058560.
Nott A, Le Hir H, Moore MJ: Splicing enhances translation in mammalian cells: an additional function of the exon junction complex. Genes Dev. 2004, 18: 210-222. 10.1101/gad.1163204.
Skoko N, Baralle M, Tisminetzky S, Buratti E: InTRONs in Biotech. Mol Biotechnol. 2011, 1-8.
Zhu J, He F, Wang D, Liu K, Huang D, Xiao J, Wu J, Hu S, Yu J: A novel role for minimal introns: Routing mRNAs to the cytosol. PLoS ONE. 2010, 5: e10144. 10.1371/journal.pone.0010144.
Brinster RL, Allen JM, Behringer RR, Gelinas RE, Palmiter RD: Introns increase transcriptional efficiency in transgenic mice. Proc Natl Acad Sci USA. 1988, 85: 836-840. 10.1073/pnas.85.3.836.
Shabalina SA, Ogurtsov AY, Spiridonov AN, Novichkov PS, Spiridonov NA, Koonin EV: Distinct patterns of expression and evolution of intronless and intron-containing mammalian genes. Mol Biol Evol. 2010, 27: 1745-1749. 10.1093/molbev/msq086.
Comeron JM: Selective and mutational patterns associated with gene expression in humans: Influences on synonymous composition and intron presence. Genetics. 2004, 167: 1293-1304. 10.1534/genetics.104.026351.
Akua T, Berezin I, Shaul O: The leader intron of AtMHX can elicit, in the absence of splicing, low-level intron-mediated enhancement that depends on the internal intron sequence. BMC Plant Biol. 2010, 10: 93-10.1186/1471-2229-10-93.
Morello L, Giani S, Troina F, Breviario D: Testing the IMEter on rice introns and other aspects of intron-mediated enhancement of gene expression. J Exp Bot. 2011, 62: 533-544. 10.1093/jxb/erq273.
Patel AA, McCarthy M, Steitz JA: The splicing of U12-type introns can be a rate-limiting step in gene expression. EMBO J. 2002, 21: 3804-3815. 10.1093/emboj/cdf297.
Singh J, Padgett RA: Rates of in situ transcription and splicing in large human genes. Nat Struct Mol Biol. 2009, 16: 1128-1133. 10.1038/nsmb.1666.
Takashima Y, Ohtsuka T, Gonzalez A, Miyachi H, Kageyama R: Intronic delay is essential for oscillatory expression in the segmentation clock. Proc Natl Acad Sci USA. 2011, 108: 3300-3305. 10.1073/pnas.1014418108.
Alexander RD, Innocente SA, Barrass JD, Beggs JD: Splicing-dependent RNA polymerase pausing in yeast. Mol Cell. 2010, 40: 582-593. 10.1016/j.molcel.2010.11.005.
Andersen PK, Jensen TH: A pause to splice. Mol Cell. 2010, 40: 503-505. 10.1016/j.molcel.2010.11.019.
Carrillo Oesterreich F, Preibisch S, Neugebauer KM: Global analysis of nascent RNA reveals transcriptional pausing in terminal exons. Mol Cell. 2010, 40: 571-581. 10.1016/j.molcel.2010.11.004.
Jeffares DC, Penkett CJ, Bahler J: Rapidly regulated genes are intron poor. Trends Genet. 2008, 24: 375-378. 10.1016/j.tig.2008.05.006.
Riabenko EA, Tonevitsky EA, Tonevitsky AG, Grigoriev AI: Structural peculiarities of human genes which expression increases in response to stress. Am J Biomed Sci. 2011, 3: 90-94.
Kwek KY, Murphy S, Furger A, Thomas B, O'Gorman W, Kimura H, Proudfoot NJ, Akoulitchev A: U1 snRNA associates with TFIIH and regulates transcriptional initiation. Nat Struct Biol. 2002, 9: 800-805.
Klumpp S, Hwa T: Traffic patrol in the transcription of ribosomal RNA. RNA Biol. 2009, 6: 392-394. 10.4161/rna.6.4.8952.
Pelechano V, Chávez S, Pérez-Ortín JE: A complete set of nascent transcription rates for yeast genes. PLoS ONE. 2010, 5: e15442-10.1371/journal.pone.0015442.
Saccharomyces Genome Database. [http://downloads.yeastgenome.org/]
Steinmetz EJ, Warren CL, Kuehner JN, Panbehi B, Ansari AZ, Brow DA: Genome-wide distribution of yeast RNA polymerase II and its control by Sen1 helicase. Mol Cell. 2006, 24: 735-746. 10.1016/j.molcel.2006.10.023.
Wang Y, Liu CL, Storey JD, Tibshirani RJ, Herschlag D, Brown PO: Precision and functional specificity in mRNA decay. Proc Natl Acad Sci USA. 2002, 99: 5860-5865. 10.1073/pnas.092538799.
Carter MG, Sharov AA, VanBuren V, Dudekula DB, Carmack CE, Nelson C, Ko MSH: Transcript copy number estimation using a mouse whole-genome oligonucleotide microarray. Genome Biol. 2005, 6: R61-10.1186/gb-2005-6-7-r61.
Friedel CC, Dolken L, Ruzsics Z, Koszinowski UH, Zimmer R: Conserved principles of mammalian transcriptional regulation revealed by RNA half-life. Nucleic Acids Res. 2009, 37: e115-10.1093/nar/gkp542.
Sharova LV, Sharov AA, Nedorezov T, Piao Y, Shaik N, Ko MSH: Database for mRNA half-life of 19 977 genes obtained by DNA microarray analysis of pluripotent and differentiating mouse embryonic stem cells. DNA Res. 2009, 16: 45-58. 10.1093/dnares/dsn030.
Liu LF, Wang JC: Supercoiling of the DNA template during transcription. Proc Natl Acad Sci USA. 1987, 84: 7024-7027. 10.1073/pnas.84.20.7024.
Brill SJ, Sternglanz R: Transcription-dependent DNA supercoiling in yeast DNA topoisomerase mutants. Cell. 1988, 54: 403-411. 10.1016/0092-8674(88)90203-6.
Giaever GN, Wang JC: Supercoiling of intracellular DNA can occur in eukaryotic cells. Cell. 1988, 55: 849-856. 10.1016/0092-8674(88)90140-7.
Wu HY, Shyy S, Wang JC, Liu LF: Transcription generates positively and negatively supercoiled domains in the template. Cell. 1988, 53: 433-440. 10.1016/0092-8674(88)90163-8.
Tsao YP, Wu HY, Liu LF: Transcription-driven supercoiling of DNA: Direct biochemical evidence from in vitro studies. Cell. 1989, 56: 111-118. 10.1016/0092-8674(89)90989-6.
Wang JC: Untangling the Double Helix: DNA Entanglement and the Action of the DNA Topoisomerases. 2009, New York: Cold Spring Harbor Laboratory Press
Bates AD, Maxwell A: DNA Topology. 2005, Oxford: Oxford University Press, 2nd edition
Wang JC: Cellular roles of DNA topoisomerases: A molecular perspective. Nat Rev Mol Cell Biol. 2002, 3: 430-440. 10.1038/nrm831.
Champoux JJ: DNA topoisomerases: Structure, function, and mechanism. Annu Rev Biochem. 2001, 70: 369-413. 10.1146/annurev.biochem.70.1.369.
El Hage A, French SL, Beyer AL, Tollervey D: Loss of topoisomerase I leads to R-loop-mediated transcriptional blocks during ribosomal RNA synthesis. Genes Dev. 2010, 24: 1546-1558. 10.1101/gad.573310.
Pommier Y: Topoisomerase I inhibitors: camptothecins and beyond. Nat Rev Cancer. 2006, 6: 789-802. 10.1038/nrc1977.
Mondal N, Zhang Y, Jonsson Z, Dhar SK, Kannapiran M, Parvin JD: Elongation by RNA polymerase II on chromatin templates requires topoisomerase activity. Nucleic Acids Res. 2003, 31: 5016-5024. 10.1093/nar/gkg705.
Rossi F, Labourier E, Forne T, Divita G, Derancourt J, Riou JF, Antoine E, Cathala G, Brunel C, Tazi J: Specific phosphorylation of SR proteins by mammalian DNA topoisomerase I. Nature. 1996, 381: 80-82. 10.1038/381080a0.
Soret J, Gabut M, Dupon C, Kohlhagen G, Stevenin J, Pommier Y, Tazi J: Altered serine/arginine-rich protein phosphorylation and exonic enhancer-dependent splicing in mammalian cells lacking topoisomerase 1. Cancer Res. 2003, 63: 8203-8211.
Kowalska-Loth B, Girstun A, Piekielko A, Staron K: SF2/ASF protein inhibits camptothecin-induced DNA cleavage by human topoisomerase I. Eur J Biochem. 2002, 269: 3504-3510. 10.1046/j.1432-1033.2002.03037.x.
Andersen FF, Tange TO, Reinert LS, Olesen JR, Andersen KE, Westergaard O, Kjems J, Knudsen BR: The RNA splicing factor ASF/SF2 inhibits human topoisomerase I mediated DNA relaxation. J Mol Biol. 2002, 322: 677-686. 10.1016/S0022-2836(02)00815-X.
Chen HJ, Hwang J: Binding of ATP to human DNA topoisomerase I resulting in an alteration of the conformation of the enzyme. Eur J Biochem. 1999, 265: 367-375. 10.1046/j.1432-1327.1999.00741.x.
Mermoud JE, Cohen PT, Lamond AI: Regulation of mammalian spliceosome assembly by a protein phosphorylation mechanism. EMBO J. 1994, 13: 5679-5688.
Shepard P, Hertel K: The SR protein family. Genome Biol. 2009, 10: 242-10.1186/gb-2009-10-10-242.
Reddy ASN, Golovkin M, Fluhr R: Regulation of splicing by protein phosphorylation. Nuclear Pre-mRNA Processing in Plants. Edited by: Reddy ASN, Golovkin M. 2008, Berlin Heidelberg: Springer, 326: 119-138. 10.1007/978-3-540-76776-3_7. [Current Topics in Microbiology and Immunology]
Kouzine F, Sanford S, Elisha-Feil Z, Levens D: The functional response of upstream DNA to dynamic supercoiling in vivo. Nat Struct Mol Biol. 2008, 15: 146-154. 10.1038/nsmb.1372.
Kouzine F, Levens D: Supercoil-driven DNA structures regulate genetic transactions. Front Biosci. 2007, 12: 4409-4423. 10.2741/2398.
Zhang J, Vingron M, Roepcke S: Characteristic differences between the promoters of intron-containing and intronless ribosomal protein genes in yeast. BMC Res Notes. 2008, 1: 109-10.1186/1756-0500-1-109.
Valadkhan S, Jaladat Y: The spliceosomal proteome: At the heart of the largest cellular ribonucleoprotein machine. Proteomics. 2010, 10: 4128-4141. 10.1002/pmic.201000354.
Froelich-Ammon SJ, Osheroff N: Topoisomerase poisons: Harnessing the dark side of enzyme mechanism. J Biol Chem. 1995, 270: 21429-21432. 10.1074/jbc.270.37.21429.
Nitiss JL, Nitiss KC, Rose A, Waltman JL: Overexpression of type I topoisomerases sensitizes yeast cells to DNA damage. J Biol Chem. 2001, 276: 26708-26714. 10.1074/jbc.M102674200.
Datta A, Jinks-Robertson S: Association of increased spontaneous mutation-rates with high-levels of transcription in yeast. Science. 1995, 268: 1616-1619. 10.1126/science.7777859.
Aguilera A, Gomez-Gonzalez B: Genome instability: a mechanistic view of its causes and consequences. Nat Rev Genet. 2008, 9: 204-217. 10.1038/nrg2268.
Lippert MJ, Kim N, Cho JE, Larson RP, Schoenly NE, O'Shea SH, Jinks-Robertson S: Role for topoisomerase 1 in transcription-associated mutagenesis in yeast. Proc Natl Acad Sci USA. 2011, 108: 698-703. 10.1073/pnas.1012363108.
Takahashi T, Burguiere-Slezak G, Van der Kemp PA, Boiteux S: Topoisomerase 1 provokes the formation of short deletions in repeated sequences upon high transcription in Saccharomyces cerevisiae. Proc Natl Acad Sci USA. 2011, 108: 692-697. 10.1073/pnas.1012582108.
Parenteau J, Durand M, Veronneau S, Lacombe AA, Morin G, Guerin V, Cecez B, Gervais-Bird J, Koh CS, Brunelle D, et al: Deletion of many yeast introns reveals a minority of genes that require splicing for function. Mol Biol Cell. 2008, 19: 1932-1941. 10.1091/mbc.E07-12-1254.
Berget SM: Exon recognition in vertebrate splicing. J Biol Chem. 1995, 270: 2411-2414.
Fox-Walsh KL, Dou YM, Lam BJ, Hung SP, Baldi PF, Hertel KJ: The architecture of pre-mRNAs affects mechanisms of splice-site pairing. Proc Natl Acad Sci USA. 2005, 102: 16176-16181. 10.1073/pnas.0508489102.
Wood V, Gwilliam R, Rajandream MA, Lyne M, Lyne R, Stewart A, Sgouros J, Peat N, Hayles J, Baker S, et al: The genome sequence of Schizosaccharomyces pombe. Nature. 2002, 415: 871-880. 10.1038/nature724.
Webb CJ, Romfo CM, van Heeckeren WJ, Wise JA: Exonic splicing enhancers in fission yeast: functional conservation demonstrates an early evolutionary origin. Genes Dev. 2005, 19: 242-254. 10.1101/gad.1265905.
Kress TL, Krogan NJ, Guthrie C: A single SR-like protein, Npl3, promotes pre-mRNA splicing in budding yeast. Mol Cell. 2008, 32: 727-734. 10.1016/j.molcel.2008.11.013.
Ellis JD, Lleres D, Denegri M, Lamond AI, Caceres JF: Spatial mapping of splicing factor complexes involved in exon and intron definition. J Cell Biol. 2008, 181: 921-934. 10.1083/jcb.200710051.
Ram O, Ast G: SR proteins: a foot on the exon before the transition from intron to exon definition. Trends Genet. 2007, 23: 5-7. 10.1016/j.tig.2006.10.002.
Plass M, Agirre E, Reyes D, Camara F, Eyras E: Co-evolution of the branch site and SR proteins in eukaryotes. Trends Genet. 2008, 24: 590-594. 10.1016/j.tig.2008.10.004.
Roy SW, Irimia M: Splicing in the eukaryotic ancestor: form, function and dysfunction. Trends Ecol Evol. 2009, 24: 447-455. 10.1016/j.tree.2009.04.005.
Lanier W, Moustafa A, Bhattacharya D, Comeron JM: EST analysis of Ostreococcus lucimarinus, the most compact eukaryotic genome, shows an excess of introns in highly expressed genes. PLoS ONE. 2008, 3: e2171-10.1371/journal.pone.0002171.
Vinogradov AE: Intron length and codon usage. J Mol Evol. 2001, 52: 2-5.
Ren XY, Vorst O, Fiers MWEJ, Stiekema WJ, Nap JP: In plants, highly expressed genes are the least compact. Trends Genet. 2006, 22: 528-532. 10.1016/j.tig.2006.08.008.
Li SW, Feng L, Niu DK: Selection for the miniaturization of highly expressed genes. Biochem Biophys Res Commun. 2007, 360: 586-592. 10.1016/j.bbrc.2007.06.085.
Yang H: In plants, expression breadth and expression level distinctly and non-linearly correlate with gene structure. Biol Direct. 2009, 4: 45-10.1186/1745-6150-4-45.
Urrutia AO, Hurst LD: The signature of selection mediated by expression on human genes. Genome Res. 2003, 13: 2260-2264. 10.1101/gr.641103.
Rao YS, Wang ZF, Chai XW, Wu GZ, Zhou M, Nie QH, Zhang XQ: Selection for the compactness of highly expressed genes in Gallus gallus. Biol Direct. 2010, 5: 35.
Carmel L, Koonin EV: A universal nonmonotonic relationship between gene compactness and expression levels in multicellular eukaryotes. Genome Biol Evol. 2009, 382-390.
Czubaty A, Girstun A, Kowalska-Loth B, Trzcinska AA, Purta E, Winczura A, Grajkowski W, Staron K: Proteomic analysis of complexes formed by human topoisomerase I. Biochim Biophys Acta. 2005, 1749: 133-141.
Straub T, Grue P, Uhse A, Lisby M, Knudsen BR, Tange TO, Westergaard O, Boege F: The RNA-splicing factor PSF/p54nrb controls DNA-topoisomerase I activity by a direct interaction. J Biol Chem. 1998, 273: 26261-26264. 10.1074/jbc.273.41.26261.
Roy SW, Gilbert W: Rates of intron loss and gain: Implications for early eukaryotic evolution. Proc Natl Acad Sci USA. 2005, 102: 5773-5778. 10.1073/pnas.0500383102.
Roy SW, Penny D: Smoke without fire: most reported cases of intron gain in nematodes instead reflect intron losses. Mol Biol Evol. 2006, 23: 2259-2262. 10.1093/molbev/msl098.
Carmel L, Wolf YI, Rogozin IB, Koonin EV: Three distinct modes of intron dynamics in the evolution of eukaryotes. Genome Res. 2007, 17: 1034-1044. 10.1101/gr.6438607.
Coulombe-Huntington J, Majewski J: Characterization of intron loss events in mammals. Genome Res. 2007, 17: 23-32.
Coulombe-Huntington J, Majewski J: Intron loss and gain in Drosophila. Mol Biol Evol. 2007, 24: 2842-2850.
Roy SW, Penny D: Patterns of intron loss and gain in plants: Intron loss-dominated evolution and genome-wide comparison of O. sativa and A. thaliana. Mol Biol Evol. 2007, 24: 171-181.
Roy SW, Penny D: Widespread intron loss suggests retrotransposon activity in ancient apicomplexans. Mol Biol Evol. 2007, 24: 1926-1933. 10.1093/molbev/msm102.
Stajich JE, Dietrich FS, Roy SW: Comparative genomic analysis of fungal genomes reveals intron-rich ancestors. Genome Biol. 2007, 8: R223-10.1186/gb-2007-8-10-r223.
Csuros M, Rogozin IB, Koonin EV: Extremely intron-rich genes in the alveolate ancestors inferred with a flexible maximum-likelihood approach. Mol Biol Evol. 2008, 25: 903-911. 10.1093/molbev/msn039.
Mitrovich QM, Tuch BB, De La Vega FM, Guthrie C, Johnson AD: Evolution of yeast noncoding RNAs reveals an alternative mechanism for widespread intron loss. Science. 2010, 330: 838-841. 10.1126/science.1194554.
Zhang LY, Yang YF, Niu DK: Evaluation of models of the mechanisms underlying intron loss and gain in Aspergillus fungi. J Mol Evol. 2010, 71: 364-373. 10.1007/s00239-010-9391-6.
Raible F, Tessmar-Raible K, Osoegawa K, Wincker P, Jubin C, Balavoine G, Ferrier D, Benes V, de Jong P, Weissenbach J, et al: Vertebrate-type intron-rich genes in the marine annelid Platynereis dumerilii. Science. 2005, 310: 1325-1326. 10.1126/science.1119089.
Huang YF, Niu DK: Evidence against the energetic cost hypothesis for the short introns in highly expressed genes. BMC Evol Biol. 2008, 8: 154-10.1186/1471-2148-8-154.
Buchman AR, Berg P: Comparison of intron-dependent and intron-independent gene expression. Mol Cell Biol. 1988, 8: 4395-4405.
Duncker BP, Davies PL, Walker VK: Introns boost transgene expression in Drosophila melanogaster. Mol Gen Genet. 1997, 254: 291-296. 10.1007/s004380050418.
Callis J, Fromm M, Walbot V: Introns increase gene expression in cultured maize cells. Genes Dev. 1987, 1: 1183-1200. 10.1101/gad.1.10.1183.
Charron M, Chern LY, Wright WW: The cathepsin L first intron stimulates gene expression in rat Sertoli cells. Biol Reprod. 2007, 76: 813-824. 10.1095/biolreprod.106.057851.
Rose AB: Intron-mediated regulation of gene expression. Curr Top Microbiol Immunol. 2008, 326: 277-290. 10.1007/978-3-540-76776-3_15.
Carmel L, Rogozin IB, Wolf YI, Koonin EV: Evolutionarily conserved genes preferentially accumulate introns. Genome Res. 2007, 17: 1045-1050. 10.1101/gr.5978207.
Acknowledgements and Funding
This paper was supported by the National Natural Science Foundation of China (Grant No. 31071112) and Beijing Normal University. We thank the above reviewers for their comments.
The authors declare that they have no competing interests.
DKN conceived the hypothesis and wrote the original draft; YFY collected the genome and expression data and modified the manuscript; both authors read and approved the final text.
Niu, DK., Yang, YF. Why eukaryotic cells use introns to enhance gene expression: Splicing reduces transcription-associated mutagenesis by inhibiting topoisomerase I cutting activity. Biol Direct 6, 24 (2011). https://doi.org/10.1186/1745-6150-6-24
- Unicellular Organism
- Ribosomal Protein Gene
- Intron Splice
- Intron Size
- Intronless Gene