A composite-likelihood approach for detecting directional selection from DNA sequence data
- PMID: 15879513
- PMCID: PMC1451173
- DOI: 10.1534/genetics.104.035097
Abstract
We present a novel composite-likelihood-ratio test (CLRT) for detecting genes and genomic regions that are subject to recurrent natural selection (either positive or negative). The method uses the likelihood functions of Hartl et al. (1994) for inference in a Wright-Fisher genic selection model and corrects for nonindependence among sites by application of coalescent simulations with recombination. Here, we (1) characterize the distribution of the CLRT statistic (Lambda) as a function of the population recombination rate (R = 4N_e r); (2) explore the effects of bias in estimation of R on the size (type I error) of the CLRT; (3) explore the robustness of the model to population growth, bottlenecks, and migration; (4) explore the power of the CLRT under varying levels of mutation, selection, and recombination; (5) explore the discriminatory power of the test in distinguishing negative selection from population growth; and (6) evaluate the performance of maximum composite-likelihood estimation (MCLE) of the selection coefficient. We find that the test has excellent power to detect weak negative selection and moderate power to detect positive selection. Moreover, the test is quite robust to bias in the estimate of local recombination rate, but not to certain demographic scenarios such as population growth or a recent bottleneck. Last, we demonstrate that the MCLE of the selection parameter has little bias for weak negative selection and has downward bias for positively selected mutations.
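To make the idea of a composite-likelihood-ratio statistic concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation. It assumes the standard Poisson random field form of the site frequency spectrum under genic selection, with scaled selection coefficient gamma = 2*N_e*s, and it conditions on the observed segregating sites. Sites are multiplied together as if independent, which is what makes the likelihood "composite". The function names, the grid search, and the example counts are all hypothetical. The abstract states that the null distribution of Lambda is obtained from coalescent simulations with recombination; that calibration step is not shown here, so the value returned below should not be compared against a chi-square table.

```python
import math


def sfs_weight(i, n, gamma, steps=2000):
    """Relative expected number of sites with i derived alleles among n sequences.

    Assumed Poisson random field form, gamma = 2*N_e*s (an assumption of this sketch).
    """
    if abs(gamma) < 1e-8:
        return 1.0 / i  # neutral limit of the integral below
    denom = 1.0 - math.exp(-2.0 * gamma)
    h = 1.0 / steps
    total = 0.0
    for k in range(steps):  # midpoint rule over allele frequency x in (0, 1)
        x = (k + 0.5) * h
        sel = (1.0 - math.exp(-2.0 * gamma * (1.0 - x))) / denom
        total += x ** (i - 1) * (1.0 - x) ** (n - i - 1) * sel
    return math.comb(n, i) * total * h


def sfs_probs(n, gamma):
    """Probability that a segregating site has i = 1..n-1 derived alleles."""
    weights = [sfs_weight(i, n, gamma) for i in range(1, n)]
    z = sum(weights)
    return [w / z for w in weights]


def composite_loglik(counts, gamma):
    """Composite log-likelihood: sites are treated as independent draws."""
    n = len(counts) + 1
    probs = sfs_probs(n, gamma)
    return sum(c * math.log(p) for c, p in zip(counts, probs) if c > 0)


def clrt(counts, grid):
    """Return (Lambda, MCLE of gamma) using a simple grid search over gamma."""
    logliks = {g: composite_loglik(counts, g) for g in grid}
    mcle = max(logliks, key=logliks.get)
    lam = 2.0 * (logliks[mcle] - composite_loglik(counts, 0.0))
    return lam, mcle


if __name__ == "__main__":
    # Hypothetical unfolded site frequency spectrum for n = 10 sequences:
    # counts[i - 1] is the number of sites with i derived alleles.
    counts = [30, 8, 4, 2, 1, 1, 0, 1, 0]
    grid = [g / 2 for g in range(-20, 21)]  # gamma from -10 to 10 in steps of 0.5
    lam, gamma_hat = clrt(counts, grid)
    print("Lambda =", lam, "MCLE gamma =", gamma_hat)
```

A grid search is used only to keep the sketch dependency-free; a numerical optimizer would be the more usual choice. In practice the observed Lambda would be compared against values computed the same way on data simulated under neutrality at the estimated recombination rate.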