Progressive peak clustering in GC-MS Metabolomic experiments applied to Leishmania parasites (original) (raw)
Abstract
Motivation: A common problem in the emerging field of metabolomics is the consolidation of signal lists derived from metabolic profiling of different cell/tissue/fluid states where a number of replicate experiments was collected on each state. We describe an approach for the consolidation of peak lists based on hierarchical clustering, first within each set of replicate experiments, and then between the sets of replicate experiments. The problems of finding the dendrogram tree cutoff which gives the optimal number of peak clusters and the effect of different clustering methods were addressed. When applied to GC-MS metabolic profiling data acquired on Leishmania mexicana this approach resulted in robust data matrices which completely separated the wild-type and two mutant parasite lines based on their metabolic profile. * To whom correspondence should be addressed. (metabolites) and provide a quantitative measure of the concentration of individual analytes. A common problem in metabolic profiling is the correlation of signal peaks from two or more cell/tissue states (i.e. wild-type vs mutant, healthy vs diseased etc). For example, if two cell/tissue states A and B are examined, with observed signal peaks A 1 , A 2 , ... A N and B 1 , B 2 , ... B M , it is important to establish the correspondence between these signals, i.e. which signals in A and B refer to the same analyte or metabolite. Furthermore, some signals in A may not have the corresponding signal in B due to metabolites produced in one state but not the other. The correspondence between A 1 , A 2 , ... A N and B 1 , B 2 , ... B M leads directly to the data matrix, whose rows represent individual experiments and columns represent unique analytes observed in the two sets of experiments. The generalization to more than two experiments is straightforward. Metabolic profiling experiments are relatively rapid and inexpensive, with typically multiple replicates recorded for each cell/sample state . Multiple replicate experiments facilitate robust statistical analysis, and have also been used to explore the inherent biological variability in metabolomic studies . If replicate experiments are to be explicitly included in the data matrix, the correspondence between signals observed in different replicates must be established. By taking the previous example of two cell states A and B, and assuming that P replicate experiments of A and Q replicate experiments of B were performed, the two sets of experiments may be represented by a series of signals A 11 , A 12 , .
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.
References (20)
- Allen, J., Davey, H.M., Broadhurst, D., Heald, J.K., Rowland, J.J., Oliver, S.G. and Kell, D.B. (2003) High-throughput classification of yeast mu- tants for functional genomics using metabolic footprinting, Nat Bio- technol, 21, 692-696.
- Burchmore, R.J., Rodriguez-Contreras, D., McBride, K., Merkel, P., Bar- rett, M.P., Modi, G., Sacks, D. and Landfear, S.M. (2003) Genetic char- acterization of glucose transporter function in Leishmania mexicana, Proc Natl Acad Sci U S A, 100, 3901-3906.
- Castrillo, J.I., Hayes, A., Mohammed, S., Gaskell, S.J. and Oliver, S.G. (2003) An optimized protocol for metabolome analysis in yeast using direct infusion electrospray mass spectrometry, Phytochemistry, 62, 929-937.
- Choi, Y.H., Kim, H.K., Hazekamp, A., Erkelens, C., Lefeber, A.W. and Verpoorte, R. (2004) Metabolomic differentiation of Cannabis sativa cultivars using 1H NMR spectroscopy and principal component analy- sis, J Nat Prod, 67, 953-957.
- Davis, A.J., Murray, H.W. and Handman, E. (2004) Drugs against leishma- niasis: a synergy of technology and partnerships, Trends Parasitol, 20, 73-76.
- Duran, A.L., Yang, J., Wang, L. and Sumner, L.W. (2003) Metabolomics spectral formatting, alignment and conversion tools (MSFACTs), Bioin- formatics, 19, 2283-2293.
- Fernie, A.R., Trethewey, R.N., Krotzky, A.J. and Willmitzer, L. (2004) Metabolite profiling: from diagnostics to systems biology, Nature Re- views: Molecular Cell Biology, 5, 1-7.
- Fiehn, O. (2002) Metabolomics--the link between genotypes and pheno- types, Plant Mol Biol, 48, 155-171.
- Fiehn, O., Kopka, J., Dormann, P., Altmann, T., Trethewey, R.N. and Willmitzer, L. (2000) Metabolite profiling for plant functional genom- ics, Nat Biotechnol, 18, 1157-1161.
- Garami, A. and Ilg, T. (2001) The role of phosphomannose isomerase in Leishmania mexicana glycoconjugate synthesis and virulence, J Biol Chem, 276, 6566-6575.
- Goodacre, R., Vaidyanathan, S., Bianchi, G. and Kell, D.B. (2002) Meta- bolic profiling using direct infusion electrospray ionisation mass spec- trometry for the characterisation of olive oils, Analyst, 127, 1457-1462.
- Goodacre, R., Vaidyanathan, S., Dunn, W.B., Harrigan, G.G. and Kell, D.B. (2004) Metabolomics by numbers: acquiring and understanding global metabolite data, Trends Biotechnol, 22, 245-252.
- Guillo, C., Barlow, D., Perrett, D. and Hanna-Brown, M. (2004) Micellar electrokinetic capillary chromatography and data alignment analysis: a new tool in urine profiling, J Chromatogr A, 1027, 203-212.
- Raamsdonk, L.M., Teusink, B., Broadhurst, D., Zhang, N., Hayes, A., Walsh, M.C., Berden, J.A., Brindle, K.M., Kell, D.B., Rowland, J.J., Westerhoff, H.V., van Dam, K. and Oliver, S.G. (2001) A functional genomics strategy that uses metabolome data to reveal the phenotype of silent mutations, Nat Biotechnol, 19, 45-50.
- Roessner, U., Luedemann, A., Brust, D., Fiehn, O., Linke, T., Willmitzer, L. and Fernie, A. (2001) Metabolic profiling allows comprehensive phenotyping of genetically or environmentally modified plant systems, Plant Cell, 13, 11-29.
- Roessner, U., Wagner, C., Kopka, J., Trethewey, R.N. and Willmitzer, L. (2000) Technical advance: simultaneous analysis of metabolites in po- tato tuber by gas chromatography-mass spectrometry, Plant J, 23, 131- 142.
- Sato, S., Soga, T., Nishioka, T. and Tomita, M. (2004) Simultaneous de- termination of the main metabolites in rice leaves using capillary elec- trophoresis mass spectrometry and capillary electrophoresis diode array detection, Plant J, 40, 151-163.
- Sumner, L.W., Mendes, P. and Dixon, R.A. (2003) Plant metabolomics: large-scale phytochemistry in the functional genomics era, Phytochem- istry, 62, 817-836.
- Tolstikov, V.V., Lommen, A., Nakanishi, K., Tanaka, N. and Fiehn, O. (2003) Monolithic silica-based capillary reversed-phase liquid chroma- tography/electrospray mass spectrometry for plant metabolomics, Anal Chem, 75, 6737-6740.
- Urbanczyk-Wochniak, E., Luedemann, A., Kopka, J., Selbig, J., Roessner- Tunali, U., Willmitzer, L. and Fernie, A.R. (2003) Parallel analysis of transcript and metabolic profiles: a new approach in systems biology, EMBO Rep, 4, 989-993.