Statistical analysis in metabolic phenotyping

Nat Protoc. 2021 Sep;16(9):4299-4326.

doi: 10.1038/s41596-021-00579-1. Epub 2021 Jul 28.

Benjamin J Blaise et al. Nat Protoc. 2021 Sep.

Abstract

Metabolic phenotyping is an important tool in translational biomedical research. The advanced analytical technologies commonly used for phenotyping, including mass spectrometry (MS) and nuclear magnetic resonance (NMR) spectroscopy, generate complex data requiring tailored statistical analysis methods. Detailed protocols have been published for data acquisition by liquid NMR, solid-state NMR, ultra-performance liquid chromatography (LC-)MS and gas chromatography (GC-)MS on biofluids or tissues and their preprocessing. Here we propose an efficient protocol (guidelines and software) for statistical analysis of metabolic data generated by these methods. Code for all steps is provided, and no prior coding skill is necessary. We offer efficient solutions for the different steps required within the complete phenotyping data analytics workflow: scaling, normalization, outlier detection, multivariate analysis to explore and model study-related effects, selection of candidate biomarkers, validation, multiple testing correction and performance evaluation of statistical models. We also provide a statistical power calculation algorithm and safeguards to ensure robust and meaningful experimental designs that deliver reliable results. We exemplify the protocol with a two-group classification study and data from an epidemiological cohort; however, the protocol can be easily modified to cover a wider range of experimental designs or incorporate different modeling approaches. This protocol describes a minimal set of analyses needed to rigorously investigate typical datasets encountered in metabolic phenotyping.

© 2021. The Author(s), under exclusive licence to Springer Nature Limited.
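
As an illustration of three of the workflow steps named in the abstract (normalization, scaling and multiple testing correction), the following is a minimal Python sketch. It is not the code distributed with the protocol. The specific methods chosen here (total-area normalization, unit-variance scaling and the Benjamini-Hochberg procedure) are assumptions made for the example, since the abstract names only the step categories, and the function names are hypothetical.

```python
import numpy as np


def total_area_normalize(X):
    """Divide each sample (row) by its total intensity (assumed method)."""
    X = np.asarray(X, dtype=float)
    return X / X.sum(axis=1, keepdims=True)


def unit_variance_scale(X):
    """Mean-center each variable (column) and divide by its sample SD."""
    X = np.asarray(X, dtype=float)
    sd = X.std(axis=0, ddof=1)
    sd[sd == 0] = 1.0  # constant variables stay at zero instead of dividing by 0
    return (X - X.mean(axis=0)) / sd


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    p = np.asarray(pvalues, dtype=float)
    m = p.size
    order = np.argsort(p)
    # p_(i) * m / i for the i-th smallest p-value
    scaled = p[order] * m / np.arange(1, m + 1)
    # enforce monotonicity, working down from the largest p-value
    adjusted = np.minimum.accumulate(scaled[::-1])[::-1]
    out = np.empty(m)
    out[order] = np.clip(adjusted, 0.0, 1.0)
    return out


if __name__ == "__main__":
    # Toy intensity matrix: 4 samples (rows) x 3 variables (columns).
    X = np.array([[1.0, 2.0, 3.0],
                  [2.0, 4.0, 7.0],
                  [1.5, 2.5, 3.5],
                  [3.0, 5.0, 9.0]])
    Xs = unit_variance_scale(total_area_normalize(X))
    print(Xs.round(3))
    print(benjamini_hochberg([0.01, 0.04, 0.03, 0.005]))
```

Normalization is applied per sample and scaling per variable, so the order of the two calls matters. Other choices (for example probabilistic quotient normalization or Pareto scaling) would slot into the same two positions.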
