Mitochondrial pathogenic mutations are population-specific
© Breen and Kondrashov; licensee BioMed Central Ltd. 2010
Received: 13 December 2010
Accepted: 31 December 2010
Published: 31 December 2010
Surveying deleterious variation in human populations is crucial for our understanding, diagnosis and potential treatment of human genetic pathologies. A number of recent genome-wide analyses focused on the prevalence of segregating deleterious alleles in the nuclear genome. However, such studies have not been conducted for the mitochondrial genome.
We present a systematic survey of polymorphisms in the human mitochondrial genome, including those predicted to be deleterious and those that correspond to known pathogenic mutations. Analyzing 4458 completely sequenced mitochondrial genomes we characterize the genetic diversity of different types of single nucleotide polymorphisms (SNPs) in African (L haplotypes) and non-African (M and N haplotypes) populations. We find that the overall level of polymorphism is higher in the mitochondrial compared to the nuclear genome, although the mitochondrial genome appears to be under stronger selection as indicated by proportionally fewer nonsynonymous than synonymous substitutions. The African mitochondrial genomes show higher heterozygosity, a greater number of polymorphic sites and higher frequencies of polymorphisms for synonymous, benign and damaging polymorphism than non-African genomes. However, African genomes carry significantly fewer SNPs that have been previously characterized as pathogenic compared to non-African genomes.
Finding SNPs classified as pathogenic to be the only category of polymorphisms that are more abundant in non-African genomes is best explained by a systematic ascertainment bias that favours the discovery of pathogenic polymorphisms segregating in non-African populations. This further suggests that, contrary to the common disease-common variant hypothesis, pathogenic mutations are largely population-specific and different SNPs may be associated with the same disease in different populations. Therefore, to obtain a comprehensive picture of the deleterious variability in the human population, as well as to improve the diagnostics of individuals carrying African mitochondrial haplotypes, it is necessary to survey different populations independently.
This article was reviewed by Dr Mikhail Gelfand, Dr Vasily Ramensky (nominated by Dr Eugene Koonin) and Dr David Rand (nominated by Dr Laurence Hurst).
The discovery of genetic variants associated with human diseases is widely anticipated to be one of the stepping stones leading to an era of personalized medicine. Hundreds or even thousands of deleterious alleles segregate in the human population [1–6] and contribute to a vast diversity of disease conditions [7, 8]. While most of them are individually only slightly deleterious  taken together an average genome carries several lethal equivalents [6, 9]. In principle, correlating genetic variants to disease phenotypes in a sample of the human population can reveal those variants that are likely to contribute to disease. This is now routinely attempted by genome-wide association studies (GWAS) [10–15] or deleterious alleles predicted computationally on the back of large-scale sequencing efforts [2, 4]. However, the success of GWAS in particular is dependent on pathogenic polymorphisms segregating at relatively high frequency in the population [12, 13, 16, 17].
If common diseases are caused by common variants then the polymorphisms implicated by GWAS are likely to contribute to disease not only in the sample from the study but also in a relatively large fraction of individuals with the disease phenotype in the entire human population. However, if common diseases in the human population are caused by many rare variants then the probability of discovery of these variants is low [12, 13] and different populations are likely to carry different variants associated with one disease. Since a majority of GWAS are performed within specific human populations [11, 18, 19] it is currently unclear if the disease variants identified by a study as major contributors to a specific disease in one population also contribute to the same pathology in a different population.
To study the population-specific distributions of SNPs we performed a comparison of variation encoded in the mitochondrial genome in African and non-African populations. We focused on the mitochondrial genome for three reasons. First, the mitochondrial genome contributes to dozens of genetic pathologies [8, 20], second, it has not been subject to a genome-wide survey of segregating deleterious polymorphism and third, the diversity of available completely sequenced mitochondrial genomes allowed us to consider genomes from different populations independently.
All polymorphism data from African and non-African populations.
Number of variable sites
Number of minor alleles
- Four fold
- Coding regions
π for different types of sites.
0.010 ± 1 × 10-5
0.0054 ± 1 × 10-6
< 1 × 10-5
- Four fold
0.0089 ± 1 × 10-5
0.0044 ± 8 × 10-7
< 1 × 10-5
0.0011 ± 2 × 10-6
0.00087 ± 1 × 10-7
< 1 × 10-5
Average frequency of minor alleles.
African (401 genomes)
Non-African (Monte-Carlo sampling of 401 genomes)
0.027 ± 0.003
0.0040 ± 0.0004
0.011 ± 4 × 10-6
7.9 × 10-6
- Four fold
0.029 ± 0.004
0.0039 ± 0.0005
0.011 ± 4 × 10-6
2.4 × 10-6
0.023 ± 0.005
0.0033 ± 0.0006
0.011 ± 4 × 10-6
1.5 × 10-6
0.026 ± 0.006
0.0042 ± 0.0008
0.012 ± 5 × 10-6
1.0 × 10-6
0.0089 ± 0.009
0.00082 ± 9 × 10-5
0.0038 ± 3 × 10-6
1.7 × 10-3
0.0077 ± 0.003
0.0037 ± 0.001
0.0084 ± 8 × 10-6
1.0 × 10-6
- Coding regions
0.0074 ± 0.003
0.0044 ± 0.001
0.0095 ± 1 × 10-5
1.0 × 10-6
Differences in the level of polymorphism among the African and non-African population have been studied extensively for the nuclear genome [27–31]. Thus, some of the data reported here, and their interpretation, are analogous to those reported for the nuclear genome. In agreement with previously published data [27–31] the African genomes showed higher nucleotide diversity, π, at all classes of sites compared with non-African genomes (Table 2), with this difference being less pronounced for nonsynonymous SNPs (nSNPs). The nucleotide diversity obtained for the mitochondrial genes was higher than that for equivalent sites in the nuclear genome with mitochondrial synonymous diversity (πs) ~2.5 fold and mitochondrial nonsynonymous diversity (πn) ~6.5-8.5 fold higher than the estimates from the nuclear genome (Table 1 from ref. ), which is consistent with a higher rate of mutation in the mitochondrial genome  that allowed nucleotide diversity to accumulate faster after the recent population expansion [27–31]. The larger difference between πs and πn indicates that nonsynonymous sites are under stronger selection in the mitochondrial genome [33, 34]. However, while the difference in mitochondrial and nuclear πs was similar for African and non-African populations the difference for πn was lower for the non-African population (8.5 fold in the African and 6.5 for non-African genomes), indicating that negative selection against mitochondrial nonsynonymous alleles has been relaxed in the non-African population.
Average number of minor alleles per genome.
African (401 genomes)
Non-African (4057 genomes)
21.5 ± 0.5
9.7 ± 0.1
2.2 × 10-6
- Four fold
12.2 ± 0.3
5.0 ± 0.06
2.2 × 10-6
7.3 ± 0.2
3.9 ± 0.04
2.2 × 10-6
6.8 ± 0.2
3.6 ± 0.03
2.2 × 10-6
0.53 ± 0.04
0.25 ± 0.009
2.2 × 10-6
0.21 ± 0.02
0.31 ± 0.008
1.3 × 10-6
- Coding Regions
0.14 ± 0.02
0.24 ± 0.007
1.6 × 10-3
Levels of polymorphism are influenced by mutation, selection and genetic drift. All three of these factors necessarily need to be invoked to explain all of the observations mentioned above. First, the higher πn and πs in the mitochondrial relative to the nuclear genome is consistent with a higher rate of mutation in the mitochondrial genome . Second, the larger difference between πn and πs in the mitochondrial genome relative to the nuclear genome indicates that nonsynonymous sites in the organelle are under stronger negative selection. Finally, a largely similar difference in πn and πs and in the average number of minor alleles per genome between the African and non-African populations indicates that genetic drift has been a stronger factor in shaping the difference between the levels of polymorphism in African and non-African populations than differences in selection pressure. However, a slight difference in the strength of negative selection in the African versus non-African population is also consistent with these and nuclear data .
At first glance the higher number and frequency of pathogenic SNPs in genomes from the non-African population can be explained by a relaxation of selection in the Out-of-Africa population [4, 28, 35]. However, three lines of evidence suggest that this is unlikely. Firstly, the opposite trend of SNPs in the damaging category (Table 3 and 4) suggests that, overall, the difference in strength of selection between the African and non-African populations is relatively minor. Second, data from the nuclear genome confirm our results that there is only a minor difference in selection between the African and non-African populations . Finally, such subtle changes in selection pressure between African and non-African populations are expected to affect slightly deleterious alleles to a much larger extent than strongly deleterious alleles [4, 36]. The pathogenic SNPs almost certainly belong to a more deleterious category of SNPs than all damaging SNPs and, therefore, relaxed selection cannot account for the observed differences in these two categories of SNPs between the African and the non-African populations.
A wave of GWAS followed the suggestion that common diseases are caused by common pathogenic variants [16, 17]. The present data show that knowledge of specific pathogenic variants from one population does not lead to a proportional discovery of pathogenic mutations in another, at least in the mitochondrial genome. Thus, it is likely that to advance the scope of personalized medicine the identification of pathogenic variants, especially in relation to GWAS, must be performed independently across all of human populations. Also, GWAS of specific human populations are likely to have more power for detecting disease-causing variants than studies with a sample of a mixture of humans from the total population.
Our survey of the genome-wide variability in the mitochondrial human genome revealed three distinct patterns. First, selection against nonsynonymous alleles is stronger in the mitochondrial genome than in the nuclear one. Thus, the higher nucleotide diversity in the mitochondrial genome is likely explained by a higher mutation rate and not relaxation of selection. Second, a similar difference in the nucleotide density of all classes of SNPs implies that genetic drift is at present a stronger factor than selection in shaping differences in variability of the mitochondrial genome between African and non-African populations. Finally, the higher density of pathogenic SNPs in the non-African population is likely to be a result of an ascertainment bias in favour of discovering common pathogenic SNPs in the non-African population. Given the non-African focus of many GWAS [11, 18, 19] it is likely that this bias also affects our understanding of human pathologies with a nuclear-based genetic component.
Dr. Mikhail Gelfand, Department of Bioinformatics, Institute of Information Transfer Problems
The authors present interesting, if straightforward, analysis, and the paper may be published more or less "as is", provided misprints and minor inaccuracies are corrected.
The only serious problem is the use of PolyPhen for the identification of damaging mutations. The PolyPhen analysis is based to a large degree on distant comparisons. But as the authors themselves have shown in one of their recent papers, a mutation that is damaging in a protein may well be observed in a distant protein. Hence PolyPhen should underestimate the underestimate the number of damaging mutations.
This is a serious issue in the use of PolyPhen and this is the reason why we used only primate orthologs to call pathogenicity of SNPs in human genes. We believe, and it appears that the referee is in agreement with us, that this represents a much more careful approach that just the default use of PolyPhen.
The other problem is that it is not obvious that it is correct to treat the African population as a homogeneous one. In fact, the non-African variation could be expected to be smaller simply because non-Africans are descendants of one branch of Africans.
Yes, the overall variation in the non-African population is much lower that in the African one. The salt of our analysis is that when all variation is considered the African population is the most variable one, almost independent of the type of variation (synonymous, non-synonymous, etc). However, when variants that correspond to known pathogenic mutations are considered then the situation is reversed, the non-African population contains a larger number of such variants compared to the African population. The most parsimonious explanation for this pattern is that pathogenic mutations are to some extent population-specific and that there is a higher ascertainment of them in the non-African population.
Dr. Vasily Ramensky, UCLA Center for Neurobehavioral Genetics, (nominated by Dr. Eugene Koonin)
I have read the revised manuscript and would like to suggest publishing the current version provided that some minor typographic corrections are made. I appreciate the changes to the manuscript that I believe make the results of your work more straightforward and comprehensible.
We thank the referee for taking the time to go through two rounds of the review process and for the helpful suggestions to improve our manuscript.
Dr. David Rand, Department of Ecology and Evolutionary Biology, Brown University (nominated by Dr. Laurence Hurst)
This reviewer provided no comments for publication.
We thank Ivan Adzhubei and Shamil Sunyaev for extensive assistance with PolyPhen 2 and insightful discussion. We thank the Spanish Ministry of Science and Innovation, Plan Nacional Program grant BFU2009-09271 for funding.
- Sunyaev SR, Lathe WC, Ramensky VE, Bork P: SNP frequencies in human genes an excess of rare alleles and differing modes of selection. Trends Genet. 2000, 16: 335-337. 10.1016/S0168-9525(00)02058-8.PubMedView ArticleGoogle Scholar
- Sunyaev S, Ramensky V, Koch I, Lathe W, Kondrashov AS, Bork P: Prediction of deleterious human alleles. Hum Mol Genet. 2001, 10: 591-597. 10.1093/hmg/10.6.591.PubMedView ArticleGoogle Scholar
- Yampolsky LY, Kondrashov FA, Kondrashov AS: Distribution of the strength of selection against amino acid replacements in human proteins. Hum Mol Genet. 2005, 14: 3191-3201. 10.1093/hmg/ddi350.PubMedView ArticleGoogle Scholar
- Lohmueller KE, Indap AR, Schmidt S, Boyko AR, Hernandez RD, Hubisz MJ, Sninsky JJ, White TJ, Sunyaev SR, Nielsen R, Clark AG, Bustamante CD: Proportionally more deleterious genetic variation in European than in African populations. Nature. 2008, 451: 994-997. 10.1038/nature06611.PubMedPubMed CentralView ArticleGoogle Scholar
- Boyko AR, Williamson SH, Indap AR, Degenhardt JD, Hernandez RD, Lohmueller KE, Adams MD, Schmidt S, Sninsky JJ, Sunyaev SR, White TJ, Nielsen R, Clark AG, Bustamante CD: Assessing the evolutionary impact of amino acid mutations in the human genome. PLoS Genet. 2008, 4: e1000083-10.1371/journal.pgen.1000083.PubMedPubMed CentralView ArticleGoogle Scholar
- Kondrashov AS: Contamination of the genome by very slightly deleterious mutations: why have we not died 100 times over?. J Theor Biol. 1995, 175: 583-594. 10.1006/jtbi.1995.0167.PubMedView ArticleGoogle Scholar
- Elliott HR, Samuels DC, Eden JA, Relton CL, Chinnery PF: Pathogenic mitochondrial DNA mutations are common in the general population. Am J Hum Genet. 2008, 83: 254-260. 10.1016/j.ajhg.2008.07.004.PubMedPubMed CentralView ArticleGoogle Scholar
- Taylor RW, Turnbull DM: Mitochondrial DNA mutations in human disease. Nat Rev Genet. 2005, 6: 389-402. 10.1038/nrg1606.PubMedPubMed CentralView ArticleGoogle Scholar
- Morton NE, Crow JF, Muller HJ: An estimate of the mutational damage in man from data on consanguineous marriages. Proc Natl Acad Sci USA. 1956, 42: 855-863. 10.1073/pnas.42.11.855.PubMedPubMed CentralView ArticleGoogle Scholar
- Donnelly P: Progress and challenges in genome-wide association studies in humans. Nature. 2008, 456: 728-731. 10.1038/nature07631.PubMedView ArticleGoogle Scholar
- Teo YY, Small KS, Kwiatkowski DP: Methodological challenges of genome-wide association analysis in Africa. Nat Rev Genet. 2010, 11: 149-160. 10.1038/nrg2731.PubMedPubMed CentralView ArticleGoogle Scholar
- Iles MM: What can genome-wide association studies tell us about the genetics of common disease?. PLoS Genet. 2008, 4: e33-10.1371/journal.pgen.0040033.PubMedPubMed CentralView ArticleGoogle Scholar
- Pearson TA, Manolio TA: How to interpret a genome-wide association study. JAMA. 2008, 299: 1335-1344. 10.1001/jama.299.11.1335.PubMedView ArticleGoogle Scholar
- Ku CS, Loy EY, Pawitan Y, Chia KS: The pursuit of genome-wide association studies: where are we now?. J Hum Genet. 2010, 55: 195-206. 10.1038/jhg.2010.19.PubMedView ArticleGoogle Scholar
- Raule N, Sevini F, Santoro A, Altilia S, Franceschi C: Association studies on human mitochondrial DNA: methodological aspects and results in the most common age-related diseases. Mitochondrion. 2007, 7: 29-38. 10.1016/j.mito.2006.11.013.PubMedView ArticleGoogle Scholar
- Collins FS, Guyer MS, Charkravarti A: Variations on a theme: cataloging human DNA sequence variation. Science. 1997, 278: 1580-1581. 10.1126/science.278.5343.1580.PubMedView ArticleGoogle Scholar
- Reich DE, Lander ES: On the allelic spectrum of human disease. Trends Genet. 2001, 17: 502-510. 10.1016/S0168-9525(01)02410-6.PubMedView ArticleGoogle Scholar
- Tishkoff SA, Williams SM: Genetic analysis of African populations: human evolution and complex disease. Nat Rev Genet. 2002, 3: 611-621. 10.1038/ni0702-611.PubMedView ArticleGoogle Scholar
- Sirugo G, Hennig BJ, Adeyemo AA, Matimba A, Newport MJ, Ibrahim ME, Ryckman KK, Tacconelli A, Mariani-Costantini R, Novelli G, Soodyall H, Rotimi CN, Ramesar RS, Tishkoff SA, Williams SM: Genetic studies of African populations: an overview on disease susceptibility and response to vaccines and therapeutics. Hum Genet. 2008, 123: 557-598. 10.1007/s00439-008-0511-y.PubMedView ArticleGoogle Scholar
- Di Donato S: Multisystem manifestations of mitochondrial disorders. J Neurol. 2009, 256: 693-710. 10.1007/s00415-009-5028-3.PubMedView ArticleGoogle Scholar
- Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW: GenBank. Nucleic Acids Res. 2009, 37: D26-D31. 10.1093/nar/gkn723.PubMedPubMed CentralView ArticleGoogle Scholar
- Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24: 1596-1599. 10.1093/molbev/msm092.PubMedView ArticleGoogle Scholar
- Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR: A method and server for predicting damaging missense mutations. Nat Methods. 2010, 7: 248-249. 10.1038/nmeth0410-248.PubMedPubMed CentralView ArticleGoogle Scholar
- Nei M, Li WH: Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc Natl Acad Sci USA. 1979, 76: 5269-5273. 10.1073/pnas.76.10.5269.PubMedPubMed CentralView ArticleGoogle Scholar
- Ruiz-Pesini E, Lott MT, Procaccio V, Poole JC, Brandon MC, Mishmar D, Yi C, Kreuziger J, Baldi P, Wallace DC: An enhanced MITOMAP with a global mtDNA mutational phylogeny. Nucleic Acids Res. 2007, 35: D823-D828. 10.1093/nar/gkl927.PubMedPubMed CentralView ArticleGoogle Scholar
- Petros JA, Baumann AK, Ruiz-Pesini E, Amin MB, Sun CQ, Hall J, Lim S, Issa MM, Flanders WD, Hosseini SH, Marshall FF, Wallace DC: mtDNA mutations increase tumorigenicity in prostate cancer. Proc Natl Acad Sci USA. 2005, 102: 719-724. 10.1073/pnas.0408894102.PubMedPubMed CentralView ArticleGoogle Scholar
- Cann RL, Stoneking M, Wilson AC: Mitochondrial DNA and human evolution. Nature. 1987, 325: 31-36. 10.1038/325031a0.PubMedView ArticleGoogle Scholar
- Fay JC, Wu CI: A human population bottleneck can account for the discordance between patterns of mitochondrial versus nuclear DNA variation. Mol Biol Evol. 1999, 16: 1003-1005.PubMedView ArticleGoogle Scholar
- Ingman M, Kaessmann H, Pääbo S, Gyllensten U: Mitochondrial genome variation and the origin of modern humans. Nature. 2000, 408: 708-713. 10.1038/35047064.PubMedView ArticleGoogle Scholar
- Marth G, Schuler G, Yeh R, Davenport R, Agarwala R, Church D, Wheelan S, Baker J, Ward M, Kholodov M, Phan L, Czabarka E, Murvai J, Cutler D, Wooding S, Rogers A, Chakravarti A, Harpending HC, Kwok PY, Sherry ST: Sequence variations in the public human genome data reflect a bottlenecked population history. Proc Natl Acad Sci USA. 2003, 100: 376-381. 10.1073/pnas.222673099.PubMedPubMed CentralView ArticleGoogle Scholar
- Gonder MK, Mortensen HM, Reed FA, de Sousa A, Tishkoff SA: Whole-mtDNA genome sequence analysis of ancient African lineages. Mol Biol Evol. 2007, 24: 757-768. 10.1093/molbev/msl209.PubMedView ArticleGoogle Scholar
- Howell N, Smejkal CB, Mackey DA, Chinnery PF, Turnbull DM, Herrnstadt C: The pedigree rate of sequence divergence in the human mitochondrial genome: there is a difference between phylogenetic and pedigree rates. Am J Hum Genet. 2003, 72: 659-670. 10.1086/368264.PubMedPubMed CentralView ArticleGoogle Scholar
- Weinreich DM, Rand DM: Contrasting patterns of nonneutral evolution in proteins encoded in nuclear and mitochondrial genomes. Genetics. 2000, 156: 385-399.PubMedPubMed CentralGoogle Scholar
- Stewart JB, Freyer C, Elson JL, Wredenberg A, Cansu Z, Trifunovic A, Larsson NG: Purifying selection in transmission of mammalian mitochondrial DNA. PLoS Biol. 2008, 6: e10-10.1371/journal.pbio.0060010.PubMedPubMed CentralView ArticleGoogle Scholar
- Campbell MC, Tishkoff SA: African genetic diversity: implications for human demographic history, modern human origins, and complex disease mapping. Annu Rev Genomics Hum Genet. 2008, 9: 403-433. 10.1146/annurev.genom.9.081307.164258.PubMedPubMed CentralView ArticleGoogle Scholar
- Crow JF, Kimura M: An Introduction to Population Genetics Theory. 1970, New York. Harper and RowGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.