Quantification and deconvolution of asymmetric LC-MS peaks using the bi-Gaussian mixture model and statistical model selection - PubMed (original) (raw)
Quantification and deconvolution of asymmetric LC-MS peaks using the bi-Gaussian mixture model and statistical model selection
Tianwei Yu et al. BMC Bioinformatics. 2010.
Abstract
Background: Liquid chromatography-mass spectrometry (LC-MS) is one of the major techniques for the quantification of metabolites in complex biological samples. Peak modeling is one of the key components in LC-MS data pre-processing.
Results: To quantify asymmetric peaks with high noise level, we developed an estimation procedure using the bi-Gaussian function. In addition, to accurately quantify partially overlapping peaks, we developed a deconvolution method using the bi-Gaussian mixture model combined with statistical model selection.
Conclusions: Using extensive simulations and real data, we demonstrated the advantage of the bi-Gaussian mixture model over the Gaussian mixture model and the method of kernel smoothing combined with signal summation in peak quantification and deconvolution. The method is implemented in the R package apLCMS: http://www.sph.emory.edu/apLCMS/.
Figures
Figure 1
The characteristics of the bi-Gaussian function. (a) the four parameters that define the bi-Guassian function; (b) The function A(τ)-B(τ) used in our estimation. Different σ1/σ2 ratios are plotted.
Figure 2
Comparison of the rate of successfully selecting the correct number of components between the bi-Gaussian mixture model and the Gaussian mixture model. Each sub-plot corresponds to a different degree of asymmetry, as shown in the titles of the sub-plots (ratios between the right- and left- standard deviations). Each dot represents a simulated situation. The values were obtained by averaging the results from 100 simulations. The color represents the level of overlaps between the simulated peaks. The size of the dot represents the amount of noise added to the data. The fill of the dot represents the percentage of values missing in the ion trace.
Figure 3
Comparison of the accuracy in peak size quantification between the bi-Gaussian mixture model and the Gaussian mixture model. Each sub-plot corresponds to a different degree of asymmetry, as shown in the titles of the sub-plots (ratios between the right- and left- standard deviations). Each dot represents a simulated situation. The values were obtained by averaging the results from 100 simulations. The color represents the level of overlaps between the simulated peaks. The size of the dot represents the amount of noise added to the data. The fill of the dot represents the percentage of values missing in the ion trace.
Figure 4
Comparison of the accuracy in peak size quantification between the bi-Gaussian mixture model and the method of kernel smoother combined with signal summation. Each sub-plot corresponds to a different degree of asymmetry, as shown in the titles of the sub-plots (ratios between the right- and left- standard deviations). Each dot represents a simulated situation. The values were obtained by averaging the results from 100 simulations. The color represents the level of overlaps between the simulated peaks. The size of the dot represents the amount of noise added to the data. The fill of the dot represents the percentage of values missing in the ion trace.
Figure 5
Comparison of the fit of the bi-Gaussian mixture model and the Gaussian mixture model to real asymmetric peaks. (a) The ion trace at m/z = 446.8913. (b) The ion trace at m/z = 301.1409. Colored curves: fitted components; black curve: summation of the signal from all the components. Upper-panel: the bi-Gaussian mixture fit; lower-panel: the Gaussian mixture fit.
Similar articles
- apLCMS--adaptive processing of high-resolution LC/MS data.
Yu T, Park Y, Johnson JM, Jones DP. Yu T, et al. Bioinformatics. 2009 Aug 1;25(15):1930-6. doi: 10.1093/bioinformatics/btp291. Epub 2009 May 4. Bioinformatics. 2009. PMID: 19414529 Free PMC article. - Improving peak detection in high-resolution LC/MS metabolomics data using preexisting knowledge and machine learning approach.
Yu T, Jones DP. Yu T, et al. Bioinformatics. 2014 Oct 15;30(20):2941-8. doi: 10.1093/bioinformatics/btu430. Epub 2014 Jul 7. Bioinformatics. 2014. PMID: 25005748 Free PMC article. - Hybrid feature detection and information accumulation using high-resolution LC-MS metabolomics data.
Yu T, Park Y, Li S, Jones DP. Yu T, et al. J Proteome Res. 2013 Mar 1;12(3):1419-27. doi: 10.1021/pr301053d. Epub 2013 Feb 12. J Proteome Res. 2013. PMID: 23362826 Free PMC article. - LC-MS Spectra Processing.
Matthiesen R. Matthiesen R. Methods Mol Biol. 2020;2051:59-77. doi: 10.1007/978-1-4939-9744-2_2. Methods Mol Biol. 2020. PMID: 31552624 Review. - Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification.
Wolahan SM, Hirt D, Glenn TC. Wolahan SM, et al. In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 25. In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 25. PMID: 26269925 Free Books & Documents. Review.
Cited by
- Inequivalent Solvation Effects on the N 1s Levels of Self-Associated Melamine Molecules in Aqueous Solution.
Ponzi A, Rosa M, Kladnik G, Unger I, Ciavardini A, Di Nardi L, Viola E, Nicolas C, Došlić N, Goldoni A, Lanzilotto V. Ponzi A, et al. J Phys Chem B. 2023 Apr 6;127(13):3016-3025. doi: 10.1021/acs.jpcb.3c00327. Epub 2023 Mar 27. J Phys Chem B. 2023. PMID: 36972466 Free PMC article. - Cardiovascular Risk and Resilience Among Black Adults: Rationale and Design of the MECA Study.
Islam SJ, Kim JH, Topel M, Liu C, Ko YA, Mujahid MS, Sims M, Mubasher M, Ejaz K, Morgan-Billingslea J, Jones K, Waller EK, Jones D, Uppal K, Dunbar SB, Pemu P, Vaccarino V, Searles CD, Baltrus P, Lewis TT, Quyyumi AA, Taylor H. Islam SJ, et al. J Am Heart Assoc. 2020 May 5;9(9):e015247. doi: 10.1161/JAHA.119.015247. Epub 2020 Apr 28. J Am Heart Assoc. 2020. PMID: 32340530 Free PMC article. - HappyTools: A software for high-throughput HPLC data processing and quantitation.
Jansen BC, Hafkenscheid L, Bondt A, Gardner RA, Hendel JL, Wuhrer M, Spencer DIR. Jansen BC, et al. PLoS One. 2018 Jul 6;13(7):e0200280. doi: 10.1371/journal.pone.0200280. eCollection 2018. PLoS One. 2018. PMID: 29979768 Free PMC article. - Application of network smoothing to glycan LC-MS profiling.
Klein J, Carvalho L, Zaia J. Klein J, et al. Bioinformatics. 2018 Oct 15;34(20):3511-3518. doi: 10.1093/bioinformatics/bty397. Bioinformatics. 2018. PMID: 29790907 Free PMC article. - A new peak detection algorithm for MALDI mass spectrometry data based on a modified Asymmetric Pseudo-Voigt model.
Wijetunge CD, Saeed I, Boughton BA, Roessner U, Halgamuge SK. Wijetunge CD, et al. BMC Genomics. 2015;16 Suppl 12(Suppl 12):S12. doi: 10.1186/1471-2164-16-S12-S12. Epub 2015 Dec 9. BMC Genomics. 2015. PMID: 26680279 Free PMC article.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources