Ch14 Metabolomics / Summary + Further Reading + References
小结
代谢组学领域融合了基础生物学与分析化学的独特组合,并辅以大量的生物信息学、化学信息学和统计学方法。代谢组学方法已促成多项重要的生物医学发现(Wang et al. 2011a, b),并为更多发现打开了大门(Wishart 2016),这一事实使得这些方法在生命科学研究者中日益普及。事实上,过去十年间,代谢组学在规模、范围和技术水平上都有了显著发展。因此,对已开发的众多生物信息学/化学信息学工具、资源和技术进行详细描述,轻而易举就能写满好几本书。本章仅旨在作为一个易于入门的窗口,让那些有志从事代谢组学研究并希望使用生物信息学或化学信息学工具的人能够更好地了解现有的资源、可行的方向以及仍待解决的问题。
因特网资源
延伸阅读
Dunn, W.B., Bailey, N.J., and Johnson, H.E. (2005). Measuring the metabolome: current analytical technologies. Analyst 130: 606–625. 一篇关于代谢组学中不同分析技术的优秀综述。虽然论文发表时间较早,但其解释深入浅出、易于理解。此类文献永不过时。
Kind, T. and Fiehn, O. (2010). Advances in structure elucidation of small molecules using mass spectrometry. Bioanal. Rev. 2: 23–60. 一篇关于质谱如何以及应当如何用于代谢物表征的非常全面的综述。涵盖的许多主题都阐述得极为详尽。作者是备受推崇的质谱学家,开创了现代代谢组学中使用的许多技术和思想。
Wishart, D.S. (2016). Emerging applications of metabolomics in drug discovery and precision medicine. Nat. Rev. Drug Discov. 15: 473–484. 介绍代谢组学在医学应用中如何被(以及正在被)使用。重点介绍了过去10年间代谢组学领域产生的一些更重要、更有趣的生物医学发现,并展望了代谢组学的未来发展方向。
Xia, J. and Wishart, D.S. (2016). Using MetaboAnalyst 3.0 for comprehensive metabolomics data analysis. Curr. Protoc. Bioinf. 55: 14.10.1–14.10.93. 一份非常详细的分步说明(配有大量截图),描述了 MetaboAnalyst 中所有工具、技巧和窍门。任何希望从事代谢组学研究和使用 MetaboAnalyst 的人的必读文献。
参考文献
Allen, F., Pon, A., Wilson, M. et al. (2014). CFM-ID: a web server for annotation, spectrum prediction and metabolite identification from tandem mass spectra. Nucleic Acids Res. 42 (Web Server issue): W94–W99.
Allen, F., Greiner, R., and Wishart, D.S. (2015). Competitive fragmentation modeling of ESI-MS/MS spectra for putative metabolite identification. Metabolomics 11: 98–110.
Allen, F., Pon, A., Greiner, R., and Wishart, D. (2016). Computational prediction of electron ionization mass spectra to assist in GC/MS compound identification. Anal. Chem. 88: 7689–7697.
Barupal, D.K., Haldiya, P.K., Wohlgemuth, G. et al. (2012). MetaMapp: mapping and visualizing metabolomic data by integrating information from biochemical pathways and chemical and mass spectral similarity. BMC Bioinf. 13: 99.
Bassini, A. and Cameron, L.C. (2014). Sportomics: building a new concept in metabolic studies and exercise science. Biochem. Biophys. Res. Commun. 445: 708–716.
Bienfait, B. and Ertl, P. (2013). JSME: a free molecule editor in JavaScript. J. Cheminform. 5: 24.
Böcker, S., Letzel, M.C., Lipták, Z., and Pervukhin, A. (2009). SIRIUS: decomposing isotope patterns for metabolite identification. Bioinformatics 25: 218–224.
Brown, S.A. (2016). Circadian metabolism: from mechanisms to metabolomics and medicine. Trends Endocrinol. Metab. 27: 415–426.
Croft, D., O'Kelly, G., Wu, G. et al. (2011). Reactome: a database of reactions, pathways and biological processes. Nucleic Acids Res. 39 (Database issue): D691–D697.
da Silva, R.R., Dorrestein, P.C., and Quinn, R.A. (2015). Illuminating the dark matter in metabolomics. Proc. Natl. Acad. Sci. U.S.A. 112: 12549–12550.
Dalby, A., Nourse, J.G., Hounshell, W.D. et al. (1992). Description of several chemical structure file formats used by computer programs developed at Molecular Design Limited. J. Chem. Inf. Comput. Sci. 32: 244–255.
Davidson, R.L., Weber, R.J., Liu, H. et al. (2016). Galaxy-M: a Galaxy workflow for processing and analyzing direct infusion and liquid chromatography mass spectrometry-based metabolomics data. GigaScience 5: 10.
Demir, E., Cary, M.P., Paley, S. et al. (2010). The BioPAX community standard for pathway data sharing. Nat. Biotechnol. 28: 935–942.
Deutsch, E.W. (2008). mzML: a single, unifying data format for mass spectrometer output. Proteomics 14: 2776–2777.
Dunn, W.B., Bailey, N.J., and Johnson, H.E. (2005). Measuring the metabolome: current analytical technologies. Analyst 130: 606–625.
Durant, J.L., Leland, B.A., Henry, D.R., and Nourse, J.G. (2002). Reoptimization of MDL keys for use in drug discovery. J. Chem. Inf. Comput. Sci. 42: 1273–1280.
Ertl, P. (2010). Molecular structure input on the web. J. Cheminform. 2: 1.
Fahy, E., Sud, M., Cotter, D., and Subramaniam, S. (2007). LIPID MAPS online tools for lipid research. Nucleic Acids Res. 35 (Web Server issue): W606–W612.
Fiehn, O. (2002). Metabolomics – the link between genotypes and phenotypes. Plant Mol. Biol. 48: 155–171.
Grapov, D., Wanichthanarak, K., and Fiehn, O. (2015). MetaMapR: pathway independent metabolomic network analysis incorporating unknowns. Bioinformatics 31: 2757–2760.
Guo, A.C., Jewison, T., Wilson, M. et al. (2013). ECMDB: the E. coli Metabolome Database. Nucleic Acids Res. 41 (Database issue): D625–D630.
Hanson, R.M., Prilusky, J., Renjian, Z. et al. (2013). JSmol and the next-generation web-based representation of 3D molecular structure as applied to Proteopedia. Isr. J. Chem. 53: 207–216.
Hanwell, M.D., Curtis, D.E., Lonie, D.C. et al. (2012). Avogadro: an advanced semantic chemical editor, visualization, and analysis platform. J. Cheminform. 4: 17.
Hao, J., Liebeke, M., Astle, W. et al. (2014). Bayesian deconvolution and quantification of metabolites in complex 1D NMR spectra using BATMAN. Nat. Protoc. 9: 1416–1427.
Hastings, J., de Matos, P., Dekker, A. et al. (2013). The ChEBI reference database and ontology for biologically relevant chemistry: enhancements for 2013. Nucleic Acids Res. 41 (Database issue): D456–D463.
Haug, K., Salek, R.M., Conesa, P. et al. (2013). MetaboLights – an open-access general-purpose repository for metabolomics studies and associated meta-data. Nucleic Acids Res. 41 (Database issue): D781–D786.
Heller, S.R., McNaught, A., Pletnev, I. et al. (2015). InChI, the IUPAC international chemical identifier. J. Cheminform. 7: 23.
Holmes, E., Wilson, I.D., and Nicholson, J.K. (2008). Metabolic phenotyping in health and disease. Cell 134: 714–717.
Hucka, M., Finney, A., Sauro, H.M. et al. (2003). The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics 19: 524–531.
Jewison, T., Knox, C., Neveu, V. et al. (2012). YMDB: the yeast metabolome database. Nucleic Acids Res. 40 (Database issue): D815–D820.
Jewison, T., Su, Y., Disfany, F.M. et al. (2014). SMPDB 2.0: big improvements to the small molecule pathway database. Nucleic Acids Res. 42 (Database issue): D478–D484.
Kanehisa, M., Goto, S., Sato, Y. et al. (2014). Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic Acids Res. 42 (Database issue): D199–D205.
Karnovsky, A., Weymouth, T., Hull, T. et al. (2012). Metscape 2 bioinformatics tool for the analysis and visualization of metabolomics and gene expression data. Bioinformatics 28: 373–380.
Karp, P.D., Riley, M., Saier, M. et al. (2000). The EcoCyc and MetaCyc databases. Nucleic Acids Res. 28: 56–59.
Kastenmüller, G., Römisch-Margl, W., Wägele, B. et al. (2011). metaP-server: a web-based metabolomics data analysis tool. J. Biomed. Biotechnol. 2011 https://doi.org/10.1155/2011/839862.
Katajamaa, M., Miettinen, J., and Oresic, M. (2006). MZmine: toolbox for processing and visualization of mass spectrometry based molecular profile data. Bioinformatics 22: 634–636.
Kelder, T., van Iersel, M.P., Hanspers, K. et al. (2012). WikiPathways: building research communities on biological pathways. Nucleic Acids Res. 40 (Database issue): D1301–D1307.
Kessler, N., Neuweger, H., Bonte, A. et al. (2013). MeltDB 2.0-advances of the metabolomics software system. Bioinformatics 29: 2452–2459.
Kim, S., Kim, J., Yun, E.J., and Kim, K.H. (2016). Food metabolomics: from farm to human. Curr. Opin. Biotechnol. 37: 16–23.
Kind, T. and Fiehn, O. (2007). Seven Golden Rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry. BMC Bioinf. 8: 105.
Kind, T. and Fiehn, O. (2010). Advances in structure elucidation of small molecules using mass spectrometry. Bioanal. Rev. 2: 23–60.
Kind, T., Tsugawa, H., Cajka, T. et al. (2017). Identification of small molecules using accurate mass MS/MS search. Mass Spectrom. Rev. https://doi.org/10.1002/mas.21535.
Kopka, J., Schauer, N., Krueger, S. et al. (2005). GMD@CSB.DB: the Golm metabolome database. Bioinformatics 21: 1635–1638.
Krause, S., Willighagen, E., and Steinbeck, C. (2000). JChemPaint – using the collaborative forces of the internet to develop a free editor for 2D chemical structures. Molecules 5: 93–98.
Kuhn, S., Helmus, T., Lancashire, R.J. et al. (2007). Chemical markup, XML, and the World Wide Web. 7. CMLSpect, an XML vocabulary for spectral data. J. Chem. Inf. Model. 47: 2015–2034.
Lancashire, R.J. (2007). The JSpecView Project: an Open Source Java viewer and converter for JCAMP-DX, and XML spectral data files. Chem. Cent. J. 1: 31.
Levy, P.A. (2010). An overview of newborn screening. J. Dev. Behav. Pediatr. 31: 622–631.
Lewis, N.E., Nagarajan, H., and Palsson, B.O. (2012). Constraining the metabolic genotype-phenotype relationship using a phylogeny of in silico methods. Nat. Rev. Microbiol. 10: 291–305.
Lu, H., Liang, Y., Dunn, W.B. et al. (2008). Comparative evaluation of software for deconvolution of metabolomics data based on GC-TOF-MS. Trends Anal. Chem. 27: 215–227.
Markley, J.L., Ulrich, E.L., Berman, H.M. et al. (2008). BioMagResBank (BMRB) as a partner in the Worldwide Protein Data Bank (wwPDB): new policies affecting biomolecular NMR depositions. J. Biomol. NMR. 40: 153–155.
McDonald, R.S. and Wilks, P.A. (1988). JCAMP-DX: a standard form for exchange of infrared spectra in computer-readable form. Appl. Spectrosc. 42: 151–162.
Melamud, E., Vastag, L., and Rabinowitz, J.D. (2010). Metabolomic analysis and visualization engine for LC-MS data. Anal. Chem. 82: 9818–9826.
Nakamura, K., Shimura, N., Otabe, Y. et al. (2013). KNApSAcK-3D: a three-dimensional structure database of plant metabolites. Plant Cell Physiol. 54: e4.
Naz, S., Moreira dos Santos, D.C., García, A., and Barbas, C. (2014). Analytical protocols based on LC-MS, GC-MS and CE-MS for nontargeted metabolomics of biological tissues. Bioanalysis 6: 1657–1677.
Niedermeyer, T.H. (2016). Annotating and interpreting linear and cyclic peptide tandem mass spectra. Methods Mol. Biol. 1401: 199–207.
O'Boyle, N.M., Banck, M., James, C.A. et al. (2011). Open Babel: an open chemical toolbox. J. Cheminform. 3: 33.
Psychogios, N., Hau, D.D., Peng, J. et al. (2011). The human serum metabolome. PLoS One 6 (2): e16957.
Ravanbakhsh, S., Liu, P., Bjorndahl, T.C. et al. (2015). Accurate, fully-automated NMR spectral profiling for metabolomics. PLoS One 10: e0124219.
Ruppin, E., Papin, J.A., de Figueiredo, L.F., and Schuster, S. (2010). Metabolic reconstruction, constraint-based analysis and game theory to probe genome-scale metabolic networks. Curr. Opin. Biotechnol. 21: 502–510.
Schober, D., Jacob, D., Wilson, M. et al. (2018). nmrML: an open standard for the description, storage and exchange on NMR data. Anal. Chem. 90: 649–656.
Smith, C.A., Want, E.J., O'Maille, G. et al. (2006). XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. Anal. Chem. 78: 779–787.
Stein, S.E. (1999). An integrated method for spectrum extraction and compound identification from gas chromatography/mass spectrometry data. J. Am. Soc. Mass Spectrom. 10: 770–781.
Steinbeck, C. and Kuhn, S. (2004). NMRShiftDB – compound identification and structure elucidation support through a free community-built web database. Phytochemistry 65: 2711–2717.
Steinbeck, C., Hoppe, C., Kuhn, S. et al. (2006). Recent developments of the chemistry development kit (CDK) an open-source java library for chemo- and bioinformatics. Curr. Pharm. Des. 12: 2111–2120.
Sud, M., Fahy, E., Cotter, D. et al. (2016). Metabolomics Workbench: an international repository for metabolomics data and metadata, metabolite standards, protocols, tutorials and training, and analysis tools. Nucleic Acids Res. 44 (D1): D463–D470.
Sumner, L.W., Amberg, A., Barrett, D. et al. (2007). Proposed minimum reporting standards for chemical analysis. Metabolomics 3: 211–221.
Swainston, N., Smallbone, K., Hefzi, H. et al. (2016). Recon 2.2: from reconstruction to model of human metabolism. Metabolomics 12: 109.
Tautenhahn, R., Cho, K., Uritboonthai, W. et al. (2012). An accelerated workflow for untargeted metabolomics using the METLIN database. Nat. Biotechnol. 30: 826–828.
Tsugawa, H., Cajka, T., Kind, T. et al. (2015). MS-DIAL: data-independent MS/MS deconvolution for comprehensive metabolome analysis. Nat. Methods 12: 523–526.
van Iersel, M.P., Villéger, A.C., Czauderna, T. et al. (2012). Software support for SBGN maps: SBGN-ML and LibSBGN. Bioinformatics 28: 2016–2021.
Viant, M.R. (2008). Recent developments in environmental metabolomics. Mol. Biosyst. 4: 980–986.
Wang, T.J., Larson, M.G., Vasan, R.S. et al. (2011a). Metabolite profiles and the risk of developing diabetes. Nat. Med. 17: 448–453.
Wang, Z., Klipfell, E., Bennett, B.J. et al. (2011b). Gut flora metabolism of phosphatidylcholine promotes cardiovascular disease. Nature 472: 57–63.
Weininger, D. (1988). SMILES 1. Introduction and encoding rules. J. Chem. Inf. Comput. Sci. 28: 31–38.
Westbrook, J.D. and Fitzgerald, P.M. (2003). The PDB format, mmCIF, and other data formats. Methods Biochem. Anal. 44: 161–179.
Wheeler, D.L., Barrett, T., Benson, D.A. et al. (2006). Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 34 (Database issue): D173–D180.
Wild, C.P. (2005). Complementing the genome with an "exposome": the outstanding challenge of environmental exposure measurement in molecular epidemiology. Cancer Epidemiol. Biomarkers Prev. 14: 1847–1850.
Williams, A.J. (2008). Public chemical compound databases. Curr. Opin. Drug Discov. Devel. 11: 393–404.
Wishart, D.S. (2005). Metabolomics: the principles and potential applications to transplantation. Am. J. Transplant. 5: 2814–2820.
Wishart, D.S. (2008). Quantitative metabolomics using NMR. Trends Anal. Chem. 27: 228–237.
Wishart, D.S. (2011). Advances in metabolite identification. Bioanalysis 3: 1769–1782.
Wishart, D.S. (2016). Emerging applications of metabolomics in drug discovery and precision medicine. Nat. Rev. Drug Discov. 15: 473–484.
Wishart, D.S., Tzur, D., Knox, C. et al. (2007). HMDB: the human metabolome database. Nucleic Acids Res. 35 (Database issue): D521–D526.
Wishart, D.S., Arndt, D., Pon, A. et al. (2015). T3DB: the toxic exposome database. Nucleic Acids Res. 43 (Database issue): D928–D934.
Worley, B. and Powers, R. (2014). MVAPACK: a complete data handling package for NMR metabolomics. ACS Chem. Biol. 9: 1138–1144.
Xia, J. and Wishart, D.S