Assessing the evolutionary impact of amino acid mutations in the human genome
- PMID: 18516229
- PMCID: PMC2377339
- DOI: 10.1371/journal.pgen.1000083
Abstract
Quantifying the distribution of fitness effects among newly arising mutations in the human genome is key to resolving important debates in medical and evolutionary genetics. Here, we present a method for inferring this distribution using Single Nucleotide Polymorphism (SNP) data from a population with non-stationary demographic history (such as that of modern humans). Application of our method to 47,576 coding SNPs found by direct resequencing of 11,404 protein-coding genes in 35 individuals (20 European Americans and 15 African Americans) allows us to assess the relative contribution of demographic and selective effects to patterning amino acid variation in the human genome. We find evidence of an ancient population expansion in the sample with African ancestry and a relatively recent bottleneck in the sample with European ancestry. After accounting for these demographic effects, we find strong evidence for great variability in the selective effects of new amino acid-replacing mutations. In both populations, the patterns of variation are consistent with a leptokurtic distribution of selection coefficients (e.g., gamma or log-normal) peaked near neutrality. Specifically, we predict that 27-29% of amino acid-changing (nonsynonymous) mutations are neutral or nearly neutral (|s|<0.01%), 30-42% are moderately deleterious (0.01%<|s|<1%), and nearly all the remainder are highly deleterious or lethal (|s|>1%). Our results are consistent with 10-20% of amino acid differences between humans and chimpanzees having been fixed by positive selection, with the remainder of differences being neutral or nearly neutral. Our analysis also predicts that many of the alleles identified via whole-genome association mapping may be selectively neutral or (formerly) positively selected, implying that deleterious genetic variation affecting disease phenotype may be missed by this widely used approach for mapping genes underlying complex traits.
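As a rough illustration of how a gamma-distributed selection coefficient translates into the three categories quoted in the abstract, the sketch below computes the share of mutations with |s| below 0.01%, between 0.01% and 1%, and above 1%. The shape and scale values are hypothetical placeholders chosen only for illustration, not the estimates from the paper, and the function names `reg_lower_gamma` and `dfe_bin_fractions` are likewise assumptions. This is not the authors' inference method, which fits the distribution to SNP data while accounting for demography; it only shows the final binning step.

```python
import math


def reg_lower_gamma(a, x):
    """Regularized lower incomplete gamma P(a, x): CDF of Gamma(shape=a, scale=1)."""
    if x <= 0.0:
        return 0.0
    # Common prefactor x^a * exp(-x) / Gamma(a), computed in log space.
    prefactor = math.exp(-x + a * math.log(x) - math.lgamma(a))
    if x < a + 1.0:
        # Power series, which converges quickly for small x.
        term = 1.0 / a
        total = term
        denom = a
        for _ in range(10000):
            denom += 1.0
            term *= x / denom
            total += term
            if term < total * 1e-16:
                break
        return prefactor * total
    # Continued fraction for the upper tail (modified Lentz method).
    tiny = 1e-300
    b = x + 1.0 - a
    c = 1.0 / tiny
    d = 1.0 / b
    h = d
    for i in range(1, 10000):
        an = -i * (i - a)
        b += 2.0
        d = an * d + b
        if abs(d) < tiny:
            d = tiny
        c = b + an / c
        if abs(c) < tiny:
            c = tiny
        d = 1.0 / d
        delta = d * c
        h *= delta
        if abs(delta - 1.0) < 1e-15:
            break
    return 1.0 - prefactor * h


def dfe_bin_fractions(shape, scale, cuts=(1e-4, 1e-2)):
    """Fractions of mutations in each |s| bin when |s| ~ Gamma(shape, scale).

    The default cuts are the abstract's thresholds: |s| = 0.01% and |s| = 1%.
    """
    lo, hi = cuts
    p_lo = reg_lower_gamma(shape, lo / scale)
    p_hi = reg_lower_gamma(shape, hi / scale)
    return {
        "nearly_neutral": p_lo,      # |s| < 0.01%
        "moderate": p_hi - p_lo,     # 0.01% < |s| < 1%
        "strong": 1.0 - p_hi,        # |s| > 1%
    }


if __name__ == "__main__":
    # Hypothetical parameters: a shape well below 1 gives a leptokurtic
    # distribution peaked near neutrality. These are NOT the paper's estimates.
    fractions = dfe_bin_fractions(shape=0.2, scale=0.15)
    for name, value in fractions.items():
        print(f"{name}: {value:.3f}")
```

The incomplete gamma function is written out by hand so the sketch needs only the standard library. If SciPy is available, a library gamma CDF could serve the same purpose.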
Conflict of interest statement
The authors have declared that no competing interests exist.