Ch15 Population Genetics / Summary + Internet Resources + References
Summary
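The field of human population genetics has advanced considerably over the past few decades. Most of the theoretical and methodological progress has been driven by two factors: a large increase in computing power and the accumulation of large amounts of high-resolution genetic data, now typified by whole genome sequences. As a result, it is now possible to infer many details of population history, including population bottlenecks and expansions, migration events, and natural selection on Mendelian and polygenic traits. Because human demographic history is extremely complex, some of the more subtle aspects of human genetic evolution, such as soft selective sweeps and repeated admixture events, remain challenging to resolve. Nevertheless, steady progress is being made on these problems. As data, methods, and discoveries continue to accumulate, population genetics will undoubtedly play an increasingly important role in our understanding of human evolution, health, and disease.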
Internet Resources
Ancestry and admixture estimation and dating
www.internationalgenome.org
reich.hms.harvard.edu/software
software.genetics.ucla.edu/admixture
samtools.github.io/bcftools
med.stanford.edu/tanglab/software/frappe.html
paintmychromosomes.com
www.stats.ox.ac.uk/~myers/software.html
web.stanford.edu/group/pritchardlab/software.html
vcftools.github.io/index.html
Detection of natural selection
www.broadinstitute.org/cms/cms-composite-multiple-signals
www.megasoftware.net
- Singleton density score (SDS)
web.stanford.edu/group/pritchardlab/software.html
- Selscan (integrated haplotype score [iHS] and cross-population extended haplotype homozygosity [XP-EHH])
hernandezlab.ucsf.edu/software
Population history inference
bitbucket.org/gutenkunstlab/dadi
- Multiple sequentially Markovian coalescent (MSMC)
github.com/stschiff/msmc
github.com/stschiff/msmc2/releases
github.com/stschiff/msmc-tools
- Pairwise sequentially Markovian coalescent (PSMC)
github.com/lh3/psmc
www.htslib.org
mathgen.stats.ox.ac.uk/genetics_software/shapeit/shapeit.html
Population structure analysis
www.hsph.harvard.edu/alkes-price/software
paintmychromosomes.com
zzz.bwh.harvard.edu/plink
zzz.bwh.harvard.edu/plink/plink2.shtml
web.stanford.edu/group/pritchardlab/structure.html