Ch17 Translational Bioinformatics / Summary + Internet Resources + References
Summary
转化生物信息学(translational bioinformatics)的未来十分光明,并且仍将持续发展。随着我们通过高通量实验不断加深对疾病成因的认识,我们开始发展用于早期检测和诊断的新方法、新的干预手段,以及将结果反馈给患者的新工具。我们已经非常擅长在临床中进行基因组测序和解读。这推动了许多工具的发展,用于预测和优先排序致病性或功能性变异以及疾病相关基因。通过将我们丰富的遗传学知识与基于 EHRs(electronic health records,电子健康记录)、行为数据、患者报告数据等构建的新型复杂患者表型模型相连接,我们已经开始更好地建立基因型与表型之间的联系。这项工作推动了围绕患者数据计算的大量活动。一些新兴项目包括药物再利用(drug repurposing)工作、社区挑战赛与开放科学,以及有用的临床标志物发现。
此外,借助 EHR 系统中的新技术,例如标准 APIs,我们能够将这些发现实现为数据分析方法和推荐方法,并通过健康记录或依附于健康记录开发的应用程序,把相关信息呈现给患者或医疗服务提供者,从而直接影响临床照护。
Internet Resources
References
Adams, S.A. and Petersen, C. (2016). Precision medicine: opportunities, possibilities, and
challenges for patients and providers. J. Am. Med. Inf. Assoc. 23 (4): 787–790.
Adzhubei, I., Jordan, D.M., and Sunyaev, S.R. (2013). Predicting functional effect of human
missense mutations using PolyPhen-2. Curr. Protoc. Hum. Genet. 76: 7.20.1–7.20.41.
Anonymous (1996). Revisions of the International classification of diseases (ICD-9 and ICD-10):
impact on health statistics. Epidemiol. Bull. 17 (2): 1–5.
Ashley, E.A. (2015). The precision medicine initiative: a new national effort. JAMA 313 (21):
2119–2120.
Aubert, G. and Lansdorp, P.M. (2008). Telomeres and aging. Physiol. Rev. 88 (2): 557–579.
Ball, M.P., Bobe, J.R., Chou, M.F. et al. (2014). Harvard personal genome project: lessons from
participatory public research. Genome Med. 6 (2): 10.
Bloomfield, R.A. Jr., Polo-Wood, F., Mandel, J.C., and Mandl, K.D. (2017). Opening the Duke
electronic health record to apps: implementing SMART on FHIR. Int. J. Med. Inf. 99: 1–10.
Bollati, V., Schwartz, J., Wright, R. et al. (2009). Decline in genomic DNA methylation through
aging in a cohort of elderly subjects. Mech. Ageing Dev. 130 (4): 234–239.
Boycott, K.M., Vanstone, M.R., Bulman, D.E., and MacKenzie, A.E. (2013). Rare-disease genetics
in the era of next-generation sequencing: discovery to translation. Nat. Rev. Genet. 14 (10):
681–691.
Bromberg, Y., Yachdav, G., and Rost, B. (2008). SNAP predicts effect of mutations on protein
function. Bioinformatics 24 (20): 2397–2398.
Cai, B., Li, B., Kiga, N. et al. (2017). Matching phenotypes to whole genomes: lessons learned from
four iterations of the personal genome project community challenges. Hum. Mutat. 38 (9):
1266–1276.
Callier, S.L., Abudu, R., Mehlman, M.J. et al. (2016). Ethical, legal, and social implications of
personalized genomic medicine research: current literature and suggestions for the future.
Bioethics 30 (9): 698–705.
Carey, D.J., Fetterolf, S.N., Davis, F.D. et al. (2016). The Geisinger MyCode community health
initiative: an electronic health record-linked biobank for precision medicine research. Genet.
Med. 18 (9): 906–913.
Cariaso, M. and Lennon, G. (2012). SNPedia: a wiki supporting personal genome annotation,
interpretation and analysis. Nucleic Acids Res. 40 (Database issue): D1308–D1312.
Chiaramello, E., Pinciroli, F., Bonalumi, A. et al. (2016). Use of “off-the-shelf” information
extraction algorithms in clinical informatics: a feasibility study of MetaMap annotation of
Italian medical notes. J. Biomed. Inf. 63: 22–32.
Collins, F.S. (1999). Shattuck Lecture – medical and societal consequences of the Human Genome
Project. N. Engl. J. Med. 341 (1): 28–37.
Collins, A., Lonjou, C., and Morton, N.E. (1999). Genetic epidemiology of single-nucleotide
polymorphisms. Proc. Natl. Acad. Sci. U.S.A. 96 (26): 15173–15177.
Cooper, G.M. and Shendure, J. (2011). Needles in stacks of needles: finding disease-causal variants
in a wealth of genomic data. Nat. Rev. Genet. 12 (9): 628–640.
Corsello, S.M., Bittker, J.A., Liu, Z. et al. (2017). The drug repurposing hub: a next-generation drug
library and information resource. Nat. Med. 23 (4): 405–408.
Cronin, R.M., Field, J.R., Bradford, Y. et al. (2014). Phenome-wide association studies
demonstrating pleiotropy of genetic variants within FTO with and without adjustment for body
mass index. Front. Genet. 5: 250.
Denny, J.C., Ritchie, M.D., Basford, M.A. et al. (2010). PheWAS: demonstrating the feasibility of a
phenome-wide scan to discover gene-disease associations. Bioinformatics 26 (9): 1205–1210.
Devarakonda, S., Sevusu, P., Liu, H. et al. (2013). Real-time air quality monitoring through mobile
sensing in metropolitan areas. In: Proceedings of the 2nd ACM SIGKDD International Workshop
on Urban Computing, Chicago, IL (11 August 2013). New York, NY: ACM.
Dudley, J.T., Deshpande, T., and Butte, A.J. (2011). Exploiting drug-disease relationships for
computational drug repositioning. Briefings Bioinf. 12 (4): 303–311.
Edwards, N.J., Oberti, M., Thangudu, R.R. et al. (2015). The CPTAC data portal: a resource for
cancer proteomics research. J. Proteome Res. 14 (6): 2707–2713.
Eysenbach, G. (2009). Infodemiology and infoveillance: framework for an emerging set of public
health informatics methods to analyze search, communication and publication behavior on the
Internet. J. Med. Internet Res. 11 (1): e11.
Fleurence, R.L., Curtis, L.H., Califf, R.M. et al. (2014). Launching PCORnet, a national
patient-centered clinical research network. J. Am. Med. Inf. Assoc. 21 (4): 578–582.
Flores, M., Glusman, G., Brogaard, K. et al. (2013). P4 medicine: how systems medicine will
transform the healthcare sector and society. Per. Med. 10 (6): 565–576.
Forbes, S.A., Beare, D., Boutselakis, H. et al. (2017). COSMIC: somatic cancer genetics at
high-resolution. Nucleic Acids Res. 45 (Database issue): D777–D783.
Gini, R., Schuemie, M., Brown, J. et al. (2016). Data extraction and management in networks of
observational health care databases for scientific research: a comparison of EU-ADR, OMOP,
mini-sentinel and MATRICE strategies. EGEMS (Wash DC) 4 (1): 1189.
Hamburg, M.A. and Collins, F.S. (2010). The path to personalized medicine. N. Engl. J. Med. 363
(4): 301–304.
Hipp, J.A., Adlakha, D., Eyler, A.A. et al. (2013). Emerging technologies: webcams and
crowd-sourcing to identify active transportation. Am. J. Prev. Med. 44 (1): 96.
Ioannidis, N.M., Rothstein, J.H., Pejaver, V. et al. (2016). REVEL: an ensemble method for
predicting the pathogenicity of rare missense variants. Am. J. Hum. Genet. 99 (4): 877–885.
Jarchum, I. and Jones, S. (2015). DREAMing of benchmarks. Nat. Biotechnol. 33 (1): 49–50.
Jensen, P.B., Jensen, L.J., and Brunak, S. (2012). Mining electronic health records: towards better
research applications and clinical care. Nat. Rev. Genet. 13 (6): 395–405.
Jha, A.K., DesRoches, C.M., Campbell, E.G. et al. (2009). Use of electronic health records in U.S.
hospitals. N. Engl. J. Med. 360 (16): 1628–1638.
Jorde, L.B. and Wooding, S.P. (2004). Genetic variation, classification and “race”. Nat. Genet. 36 (11
Suppl): S28–S33.
Kohler, S., Vasilevsky, N.A., Engelstad, M. et al. (2017). The human phenotype ontology in 2017.
Nucleic Acids Res. 45 (D1): D865–D876.
Laffitte, E. and Revuz, J. (2004). Thalidomide: an old drug with new clinical applications. Expert
Opin. Drug Saf. 3 (1): 47–56.
Landrum, M.J., Lee, J.M., Benson, M. et al. (2016). ClinVar: public archive of interpretations of
clinically relevant variants. Nucleic Acids Res. 44 (D1): D862–D868.
Lazer, D., Kennedy, R., King, G., and Vespignani, A. (2014). The parable of Google Flu: traps in big
data analysis. Science 343 (6176): 1203–1205.
Li, B., Krishnan, V.G., Mort, M.E. et al. (2009). Automated inference of molecular mechanisms of
disease from amino acid substitutions. Bioinformatics 25 (21): 2744–2750.
Liu, X., Wu, C., Li, C., and Boerwinkle, E. (2016). dbNSFP v3.0: a one-stop database of functional
predictions and annotations for human nonsynonymous and splice-site SNVs. Hum. Mutat. 37
(3): 235–241.
Lovasi, G.S., O’Neil-Dunne, J.P., Lu, J.W. et al. (2013). Urban tree canopy and asthma, wheeze,
rhinitis, and allergic sensitization to tree pollen in a New York City birth cohort. Environ.
Health Perspect. 121 (4): 494.
Lugo-Martinez, J., Pejaver, V., Pagel, K.A. et al. (2016). The loss and gain of functional amino acid
residues is a common mechanism causing human inherited disease. PLoS Comput. Biol. 12 (8):
e1005091.
Mandel, J.C., Kreda, D.A., Mandl, K.D. et al. (2016). SMART on FHIR: a standards-based,
interoperable apps platform for electronic health records. J. Am. Med. Inf. Assoc. 23 (5): 899–908.
Mooney, S.D. and Klein, T.E. (2002). The functional importance of disease-associated mutation.
BMC Bioinf. 3: 24.
Mooney, S.J. and Pejaver, V. (2017). Big data in public health: terminology, machine learning, and
privacy. Annu. Rev. Public Health 39: 95–112.
Mooney, S.J., Grady, S.T., Sotoodehnia, N. et al. (2016a). In the wrong place with the wrong SNP:
the association between stressful neighborhoods and cardiac arrest within Beta-2-adrenergic
receptor variants. Epidemiology 27 (5): 656–662.
Mooney, S.J., DiMaggio, C.J., Lovasi, G.S. et al. (2016b). Use of Google Street View to assess
environmental contributions to pedestrian injury. Am. J. Public Health 106 (3): 462–469.
Mort, M., Sterne-Weiler, T., Li, B. et al. (2014). MutPred splice: machine learning-based prediction
of exonic variants that disrupt splicing. Genome Biol. 15 (1): R19.
Moult, J., Fidelis, K., Kryshtafovych, A., and Tramontano, A. (2011). Critical assessment of
methods of protein structure prediction (CASP)--round IX. Proteins 79 (Suppl 10): 1–5.
Mungall, C.J., McMurry, J.A., Köhler, S. et al. (2017). The Monarch initiative: an integrative data
and analytic platform connecting phenotypes to genotypes across species. Nucleic Acids Res. 45
(D1): D712–D722.
Murdoch, T.B. and Detsky, A.S. (2013). The inevitable application of big data to health care. JAMA
309 (13): 1351–1352.
Musen, M.A., Noy, N.F., Shah, N.H. et al. (2012). The National Center for Biomedical Ontology. J.
Am. Med. Inf. Assoc. 19 (2): 190–195.
Ng, P.C. and Henikoff, S. (2003). SIFT: predicting amino acid changes that affect protein function.
Nucleic Acids Res. 31 (13): 3812–3814.
Nikfarjam, A., Sarker, A., O’Connor, K. et al. (2015). Pharmacovigilance from social media:
mining adverse drug reaction mentions using sequence labeling with word embedding cluster
features. J. Am. Med. Inf. Assoc. 22 (3): 671–681.
Nishimura, A.A., Shirts, B.H., Dorschner, M.O. et al. (2015). Development of clinical decision
support alerts for pharmacogenomic incidental findings from exome sequencing. Genet. Med.
17 (11): 939–942.
Norris, K. and Nissenson, A.R. (2008). Race, gender, and socioeconomic disparities in CKD in the
United States. J. Am. Soc. Nephrol. 19 (7): 1261–1270.
Oliver, J.M. and McGuire, A.L. (2011). Exploring the ELSI universe: critical issues in the evolution
of human genomic research. Genome Med. 3 (6): 38.
Overby, C.L., Devine, E.B., Abernethy, N. et al. (2015). Making pharmacogenomic-based
prescribing alerts more effective: a scenario-based pilot study with physicians. J. Biomed. Inf. 55:
249–259.
Pathak, J., Kho, A.N., and Denny, J.C. (2013). Electronic health records-driven phenotyping:
challenges, recent advances, and perspectives. J. Am. Med. Inf. Assoc. 20 (e2): e206–e211.
Prociow, P., Wac, K., and Crowe, J. (2012). Mobile psychiatry: towards improving the care for
bipolar disorder. Int. J. Ment. Health Syst. 6 (1): 5.
Radivojac, P., Peng, K., Clark, W.T. et al. (2008). An integrated approach to inferring gene-disease
associations in humans. Proteins 72 (3): 1030–1037.
Radivojac, P., Clark, W.T., Oron, T.R. et al. (2013). A large-scale evaluation of computational
protein function prediction. Nat. Methods 10 (3): 221–227.
Relling, M.V. and Klein, T.E. (2011). CPIC: clinical pharmacogenetics implementation consortium
of the pharmacogenomics research network. Clin. Pharmacol. Ther. 89 (3): 464–467.
Richards, S., Aziz, N., Bale, S. et al. (2015). Standards and guidelines for the interpretation of
sequence variants: a joint consensus recommendation of the American College of Medical
Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 17 (5):
405–424.
Ritchie, G.R.S., Dunham, I., Zeggini, E., and Flicek, P. (2014). Functional annotation of noncoding
sequence variants. Nat. Methods 11 (3): 294–296.
Rudisill, T.M., Zhu, M., Davidov, D. et al. (2016). Medication use and the risk of motor vehicle
collision in West Virginia drivers 65 years of age and older: a case-crossover study. BMC Res.
Notes 9: 166.
Saez-Rodriguez, J., Costello, J.C., Friend, S.H. et al. (2016). Crowdsourcing biomedical research:
leveraging communities as innovation engines. Nat. Rev. Genet. 17 (8): 470–486.
Santillana, M., Zhang, D.W., Althouse, B.M., and Ayers, J.W. (2014). What can digital disease
detection learn from (an external revision to) Google Flu Trends? Am. J. Prev. Med. 47 (3):
341–347.
Smith, B., Ashburner, M., Rosse, C. et al. (2007). The OBO foundry: coordinated evolution of
ontologies to support biomedical data integration. Nat. Biotechnol. 25 (11): 1251–1255.
Stenson, P.D., Mort, M., Ball, E.V. et al. (2017). The Human Gene Mutation Database: towards a
comprehensive repository of inherited mutation data for medical research, genetic diagnosis
and next-generation sequencing studies. Hum. Genet. 136 (6): 665–677.
Thorn, C.F., Klein, T.E., and Altman, R.B. (2010). Pharmacogenomics and bioinformatics:
PharmGKB. Pharmacogenomics 11 (4): 501–505.
Tranchevent, L.C., Ardeshirdavani, A., ElShal, S. et al. (2016). Candidate gene prioritization with
Endeavour. Nucleic Acids Res. 44 (W1): W117–W121.
Vuokko, R., Mäkelä-Bengs, P., Hyppönen, H., and Doupi, P. (2015). Secondary use of structured
patient data: interim results of a systematic review. Stud. Health Technol. Inf. 210: 291–295.
Wang, Z. and Moult, J. (2003). Three-dimensional structural location and molecular functional
effects of missense SNPs in the T cell receptor Vbeta domain. Proteins 53 (3): 748–757.
Wang, X., Wang, G., Shen, C. et al. (2008). Using RNase sequence specificity to refine the
identification of RNA-protein binding regions. BMC Genomics 9 (Suppl 1): S17.
Wang, K., Li, M., and Hakonarson, H. (2010). ANNOVAR: functional annotation of genetic
variants from high-throughput sequencing data. Nucleic Acids Res. 38 (16): e164.
Warde-Farley, D., Donaldson, S.L., Comes, O. et al. (2010). The GeneMANIA prediction server:
biological network integration for gene prioritization and predicting gene function. Nucleic
Acids Res. 38 (Web Server issue): W214–W220.
Wilson, V.L. and Jones, P.A. (1983). DNA methylation decreases in aging but not in immortal cells.
Science 220 (4601): 1055–1057.
Wishart, D.S., Feunang, Y.D., Guo, A.C. et al. (2018). DrugBank 5.0: a major update to the
DrugBank database for 2018. Nucleic Acids Res. 46 (D1): D1074–D1082.
Xin, J., Mark, A., Afrasiabi, C. et al. (2016). High-performance web services for querying gene and
variant annotation. Genome Biol. 17 (1): 91.