- Discovery notes
- Open Access
Proteorhodopsin genes in giant viruses
© Yutin and Koonin; licensee BioMed Central Ltd. 2012
- Received: 30 August 2012
- Accepted: 1 October 2012
- Published: 4 October 2012
Viruses with large genomes encode numerous proteins that do not directly participate in virus biogenesis but rather modify key functional systems of infected cells. We report that a distinct group of giant viruses infecting unicellular eukaryotes that includes Organic Lake Phycodnaviruses and Phaeocystis globosa virus encode predicted proteorhodopsins that have not been previously detected in viruses. Search of metagenomic sequence data shows that putative viral proteorhodopsins are extremely abundant in marine environments. Phylogenetic analysis suggests that giant viruses acquired proteorhodopsins via horizontal gene transfer from proteorhodopsin-encoding protists although the actual donor(s) could not be presently identified. The pattern of conservation of the predicted functionally important amino acid residues suggests that viral proteorhodopsin homologs function as sensory rhodopsins. We hypothesize that viral rhodopsins modulate light-dependent signaling, in particular phototaxis, in infected protists.
This article was reviewed by Igor B. Zhulin and Laksminarayan M. Iyer. For the full reviews, see the Reviewers’ reports section.
- Unicellular Eukaryote
- Environmental Sequence
- Spectral Tuning
- Giant Virus
- Sensory Rhodopsin
Many if not all viruses encode proteins that counter-act host defense or more generally affect the functions of cellular systems, presumably tweaking them in a manner that favors virus reproduction. Even viruses with small genomes, for example picornaviruses, typically encode a ‘security protein’ that modifies the host translation system in favor of viral RNA translation . Viruses with larger genomes encode multiple proteins with dedicated functions in the modulation of virus-host interaction at different levels rather than direct roles in virus reproduction [2–5]. A striking example is the presence in numerous cyanophages of genes encoding multiple proteins involved in photosynthesis including complete photosystems I and II [6–8]. These phage-encoded proteins apparently support photosynthesis in infected cyanobacteria and hence promote phage reproduction [9, 10]. Here we report the presence in genomes of giant viruses infecting marine unicellular eukaryotes of genes that encode another light-dependent energy-transduction system, proteorhodopsin. We investigate the origin of these genes and discuss their possible roles in the cellular functions of infected protists.
Bacteriorhodopsins encoded in the genomes of Organic Lake Phycodnaviruses and Phaeocystsis globosavirus and their abundant homologs in marine environments
In the course of comprehensive comparative genomic analysis of the giant viruses in the families Mimiviridae and Phycodnaviridae, our attention was caught by 5 viral proteins [one from the nearly complete genome of Organic Lake Phycodnavirus (OLPV) 2, three from fragments of other OLPV genomes , and one from the more distantly related Phaeocystis globosa virus (PGV)] that showed significant sequence similarity to proteorhodopsins from marine bacteria, in particular the most abundant bacterium in the ocean, Pelagibacter ubique. The sequences of these proteins were up to 28% identical to proteorhodopsins (expectation value <e-05); all viral proteins with similarity to proteorhodopsins are currently annotated as ‘hypothetical proteins’ in GenBank although for two of the OLPV proteins the similarity to bacteriorhodopsins is pointed out in a note. Proteorhodopsins represent a distinct, comparatively simple phototrophic system that is of crucial importance in marine ecology [12–14]. These proteins belong to the broader family of bacteriorhodopsins (or Type I rhodopsins) that originally were discovered in Halobacteria (Euryarchaeota) and subsequently identified in diverse bacteria as well as protists and fungi . To our knowledge, proteorhodopsins (or any other rhodopsin superfamily members) have not been previously detected in viruses, so we were interested in a detailed analysis of these sequences.
Origin of the viral proteorhodopsins
Notably, Phaeocystis globosa, the protist host of PGV, encodes two closely related rhodopsins. However, these rhodopsins confidently group within the eukaryotic branch of Proteorhodopsin Group II (Additional file 4) and accordingly are not the ancestors of the proteorhodopsins of PGV or other viruses. In the tree shown in Figure 2, the viral rhodopsins join the proteorhodopsin clade at its base which at face value seems to suggest ancient acquisition of the proteorhodopsin gene by ancestral giant viruses. However, we cannot rule out that some of the environmental sequences in the ‘viral’ clade actually come from planctonic protists and represent the (still uncharacterized) source(s) of the rhodopsin genes in giant viruses.
Implications for virus-host interaction in giant viruses
It appears likely that proteorhodopsins of giant viruses modulate phototrophic process in the infected protists. Although proteorhodopsins originally were discovered and characterized in bacteria  and subsequently in mesophilic Archaea , more recently, members of this family have been identified in several dinoflagellates [21–23]. Notably, in the marine dinoflagellate Oxyrrhis marina, proteorhodopsin is the most highly expressed nuclear protein, suggesting a major physiological role(s) . Database searches also indicate the presence of two closely related proteorhodopsins in the prasinophyte P. globosa (Figure 1 and Additional file 4). There are no experimental data on the functions of proteorhodopsins in these unicellular eukaryotes. However, by analogy with the well characterized bacterial proteorhodopsins , it appears likely that those of the eukaryotic proteorhodopsins that possess the proton donor carboxylate function as light-driven proton pumps involved in ATP synthesis, particularly under oligotrophic conditions, whereas those that lack the proton donor perform sensory functions, in particular in phototaxis [21, 25]. By this token, the proteorhodopsins of P. globosa are predicted to possess proton-pumping activity (see Figure 1; the second paralogous sequence from P. globosa is nearly identical and is not shown).
Viral proteorhodopsins that are predicted to function as sensory rhodopsins could affect signaling and in particular phototaxis in the infected protists, perhaps stimulating relocation of the infected protists to areas that are rich in nutrients required for virus reproduction. Complete sequencing of the genome of P. globosa and the still unidentified hosts of OLPV (most likely, also prasinophytes ) will show whether the putative viral sensory rhodopsins complement a pre-existing host function or confer a functionality that is new to the host. Given that P. globosa is a dominant component of marine phytoplankton and that its population dynamics is substantially affected by viruses , viral proteorhodopsin homologs described here, regardless of their exact role(s) that remains to be elucidated experimentally, could be major players in the ocean ecology.
Proteorhodopsin homologs encoded by giant viruses belong to a distinct proteorhodopsin subfamily that additionally includes numerous uncharacterized sequences from marine environments that are likely to be of virus and/or eukaryotic origin. The viruses probably acquired proteorhodopsin genes from unicellular eukaryotic hosts although the identity of the donors remains unknown. These proteins are predicted to perform light-dependent sensory functions, potentially altering the behavior of the infected protist host, e.g. by inducing phototaxis and perhaps stimulating the host relocation to nutrient-rich areas.
Protein sequences were retrieved from the non-redundant database at the National Center for Biotechnology Information (NIH, Bethesda). Reference sequences for halo-, bacterio-, xeno, and sensory rhodopsins were taken from . The non-redundant protein sequence database was searched using the PSI-BLAST program , with default parameters and the predicted viral proteorhodopsin sequences used as queries. The reported results reflect searchers performed on 13-15/08/2012. Marine metagenomics blast hits were clustered before the alignment by blastclust (http://www.ncbi.nlm.nih.gov/Web/Newsltr/Spring04/blastlab.html); a representative (the longest) sequence from each cluster was taken. Protein sequences were aligned using the MUSCLE program with default parameters ; columns containing a large fraction of gaps (greater than 30%) and non-homogenous columns defined as described previously  were removed from the alignment. The resulting 160-column alignment was used to construct a maximum likelihood phylogenetic tree using the FastTree program with default parameters (JTT evolutionary model, discrete gamma model with 20 rate categories) . Transmembrane helices in proteins were predicted using the TMHMM program .
Reviewer 1: Dr. Igor B. Zhulin, Oakridge National Laboratory and the University of Tennessee
The paper by Yutin and Koonin reports a discovery of proteorhodopsin genes in marine viruses. This is a very interesting finding expanding the repertoire of genes that viruses might carry in order to modify host’s metabolism. As different organisms developed various forms of light-harvesting devices, and at least some of them have been already found in viruses (photosystems I and II), it does not come as a total surprise, and supports the notion that improving host’s conditions promotes phage reproduction. When conditions are right, proteorhodopsin can be a very useful plug-and-play device for energy generation.
The paper is brief, clearly written and goes straight to the point. I do not have any particular comments or concerns rather than it would be usefulto indicate the timing of database searches since NR changes so rapidly and parameters for PSI-BLAST and MUSCLE (presumably, default, but still…) – all in the Methods section
Authors’ response: The details proposed to be included were included.
Reviewer 2: Dr. Laksminarayan M. Iyer, National Center for Biotechnology Information, NIH
The study details the presence, and analyzes the origins, of proteorhodopsins in certain marine NCLDV viruses. These proteins add to the small, yet interesting, list of laterally acquired genes in large dsDNA viruses that are predicted to alter the response of the infected host to various environmental inputs. The precise biology of how the viral proteorhodopsins contribute to the fitness of the virus and the host should elicit interest among experimental biologists. The analysis can be easily reproduced and the writing is lucid. I only have a minor comment. On the issue of the provenance of environmental sequences most closely related to the viral ones, the authors could consider performing a gene neighborhood analysis, where possible, to see if any predominant associations emerge that might provide clues to the origins of these sequences.
Authors’ response: Regrettably, the majority of the environmental sequences are too short for this type of analysis. Those few sequences that were long enough failed to yield useful clues.
The authors thank Oded Beja and Valerian Dolja for critical reading of the manuscript and useful suggestions. The authors’ research is supported by the US Department of Health and Human Services intramural funds (to National Library of Medicine).
- Agol VI, Gmyl AP: Viral security proteins: counteracting host defences. Nat Rev Microbiol. 2010, 8 (12): 867-878. 10.1038/nrmicro2452.PubMedView ArticleGoogle Scholar
- Wei H, Zhou MM: Viral-encoded enzymes that target host chromatin functions. Biochim Biophys Acta. 2010, 1799 (3–4): 296-301.PubMedPubMed CentralView ArticleGoogle Scholar
- de Souza RF, Iyer LM, Aravind L: Diversity and evolution of chromatin proteins encoded by DNA viruses. Biochim Biophys Acta. 2010, 1799 (3–4): 302-318.PubMedPubMed CentralView ArticleGoogle Scholar
- Werden SJ, Rahman MM, McFadden G: Poxvirus host range genes. Adv Virus Res. 2008, 71: 135-171.PubMedView ArticleGoogle Scholar
- Bugert JJ, Darai G: Poxvirus homologues of cellular genes. Virus Genes. 2000, 21 (1–2): 111-133.PubMedView ArticleGoogle Scholar
- Alperovitch-Lavy A, Sharon I, Rohwer F, Aro EM, Glaser F, Milo R, Nelson N, Beja O: Reconstructing a puzzle: existence of cyanophages containing both photosystem-I and photosystem-II gene suites inferred from oceanic metagenomic datasets. Environ Microbiol. 2011, 13 (1): 24-32. 10.1111/j.1462-2920.2010.02304.x.PubMedView ArticleGoogle Scholar
- Sharon I, Alperovitch A, Rohwer F, Haynes M, Glaser F, Atamna-Ismaeel N, Pinter RY, Partensky F, Koonin EV, Wolf YI, et al: Photosystem I gene cassettes are present in marine virus genomes. Nature. 2009, 461 (7261): 258-262. 10.1038/nature08284.PubMedPubMed CentralView ArticleGoogle Scholar
- Sullivan MB, Lindell D, Lee JA, Thompson LR, Bielawski JP, Chisholm SW: Prevalence and evolution of core photosystem II genes in marine cyanobacterial viruses and their hosts. PLoS Biol. 2006, 4 (8): e234-10.1371/journal.pbio.0040234.PubMedPubMed CentralView ArticleGoogle Scholar
- Lindell D, Jaffe JD, Johnson ZI, Church GM, Chisholm SW: Photosynthesis genes in marine viruses yield proteins during host infection. Nature. 2005, 438 (7064): 86-89. 10.1038/nature04111.PubMedView ArticleGoogle Scholar
- Bragg JG, Chisholm SW: Modeling the fitness consequences of a cyanophage-encoded photosynthesis gene. PLoS One. 2008, 3 (10): e3550-10.1371/journal.pone.0003550.PubMedPubMed CentralView ArticleGoogle Scholar
- Yau S, Lauro FM, DeMaere MZ, Brown MV, Thomas T, Raftery MJ, Andrews-Pfannkoch C, Lewis M, Hoffman JM, Gibson JA, et al: Virophage control of antarctic algal host-virus dynamics. Proc Natl Acad Sci U S A. 2011, 108 (15): 6163-6168. 10.1073/pnas.1018221108.PubMedPubMed CentralView ArticleGoogle Scholar
- Beja O, Spudich EN, Spudich JL, Leclerc M, DeLong EF: Proteorhodopsin phototrophy in the ocean. Nature. 2001, 411 (6839): 786-789. 10.1038/35081051.PubMedView ArticleGoogle Scholar
- Beja O, Aravind L, Koonin EV, Suzuki MT, Hadd A, Nguyen LP, Jovanovich SB, Gates CM, Feldman RA, Spudich JL, et al: Bacterial rhodopsin: evidence for a new type of phototrophy in the sea. Science. 2000, 289 (5486): 1902-1906. 10.1126/science.289.5486.1902.PubMedView ArticleGoogle Scholar
- Moran MA, Miller WL: Resourceful heterotrophs make the most of light in the coastal ocean. Nat Rev Microbiol. 2007, 5 (10): 792-800. 10.1038/nrmicro1746.PubMedView ArticleGoogle Scholar
- Spudich JL, Yang CS, Jung KH, Spudich EN: Retinylidene proteins: structures and functions from archaea to humans. Annu Rev Cell Dev Biol. 2000, 16: 365-392. 10.1146/annurev.cellbio.16.1.365.PubMedView ArticleGoogle Scholar
- Rusch DB, Halpern AL, Sutton G, Heidelberg KB, Williamson S, Yooseph S, Wu D, Eisen JA, Hoffman JM, Remington K, et al: The Sorcerer II Global Ocean Sampling expedition: northwest Atlantic through eastern tropical Pacific. PLoS Biol. 2007, 5 (3): e77-10.1371/journal.pbio.0050077.PubMedPubMed CentralView ArticleGoogle Scholar
- Man D, Wang W, Sabehi G, Aravind L, Post AF, Massana R, Spudich EN, Spudich JL, Beja O: Diversification and spectral tuning in marine proteorhodopsins. EMBO J. 2003, 22 (8): 1725-1731. 10.1093/emboj/cdg183.PubMedPubMed CentralView ArticleGoogle Scholar
- Atamna-Ismaeel N, Finkel OM, Glaser F, Sharon I, Schneider R, Post AF, Spudich JL, von Mering C, Vorholt JA, Iluz D, et al: Microbial rhodopsins on leaf surfaces of terrestrial plants. Environ Microbiol. 2012, 14 (1): 140-146. 10.1111/j.1462-2920.2011.02554.x.PubMedPubMed CentralView ArticleGoogle Scholar
- Jung KH: The distinct signaling mechanisms of microbial sensory rhodopsins in Archaea, Eubacteria and Eukarya. Photochem Photobiol. 2007, 83 (1): 63-69. 10.1562/2006-03-20-IR-853.PubMedView ArticleGoogle Scholar
- Frigaard NU, Martinez A, Mincer TJ, DeLong EF: Proteorhodopsin lateral gene transfer between marine planktonic Bacteria and Archaea. Nature. 2006, 439 (7078): 847-850. 10.1038/nature04435.PubMedView ArticleGoogle Scholar
- Slamovits CH, Okamoto N, Burri L, James ER, Keeling PJ: A bacterial proteorhodopsin proton pump in marine eukaryotes. Nat Commun. 2011, 2: 183-PubMedView ArticleGoogle Scholar
- Lin S, Zhang H, Zhuang Y, Tran B, Gill J: Spliced leader-based metatranscriptomic analyses lead to recognition of hidden genomic features in dinoflagellates. Proc Natl Acad Sci U S A. 2010, 107 (46): 20033-20038. 10.1073/pnas.1007246107.PubMedPubMed CentralView ArticleGoogle Scholar
- Okamoto OK, Hastings JW: Novel dinoflagellate clock-related genes identified through microarray analysis. J Phycol. 2003, 39: 519-526. 10.1046/j.1529-8817.2003.02170.x.View ArticleGoogle Scholar
- Fuhrman JA, Schwalbach MS, Stingl U: Proteorhodopsins: an array of physiological roles?. Nat Rev Microbiol. 2008, 6 (6): 488-494.PubMedGoogle Scholar
- DeLong EF, Beja O: The light-driven proton pump proteorhodopsin enhances bacterial survival during tough times. PLoS Biol. 2010, 8 (4): e1000359-10.1371/journal.pbio.1000359.PubMedPubMed CentralView ArticleGoogle Scholar
- Brussard CPG, Bratbak G, Baudoux AC, Ruardij P: Phaeocystis and its interaction with viruses. Biogeochemistry. 2007, 83: 201-215. 10.1007/s10533-007-9096-0.View ArticleGoogle Scholar
- Ugalde JA, Podell S, Narasingarao P, Allen EE: Xenorhodopsins, an enigmatic new class of microbial rhodopsins horizontally transferred between archaea and bacteria. Biol Direct. 2011, 6: 52-10.1186/1745-6150-6-52.PubMedPubMed CentralView ArticleGoogle Scholar
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.PubMedPubMed CentralView ArticleGoogle Scholar
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32 (5): 1792-1797. 10.1093/nar/gkh340.PubMedPubMed CentralView ArticleGoogle Scholar
- Yutin N, Makarova KS, Mekhedov SL, Wolf YI, Koonin EV: The deep archaeal roots of eukaryotes. Mol Biol Evol. 2008, 25 (8): 1619-1630. 10.1093/molbev/msn108.PubMedPubMed CentralView ArticleGoogle Scholar
- Price MN, Dehal PS, Arkin AP: FastTree 2–approximately maximum-likelihood trees for large alignments. PLoS One. 2010, 5 (3): e9490-10.1371/journal.pone.0009490.PubMedPubMed CentralView ArticleGoogle Scholar
- Krogh A, Larsson B, von Heijne G, Sonnhammer EL: Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 2001, 305 (3): 567-580. 10.1006/jmbi.2000.4315.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.