Michael Wagner - Academia.edu (original) (raw)
Uploads
Papers by Michael Wagner
PROTEOMICS, 2003
We report our results in classifying protein matrix‐assisted laser desorption/ionization‐time of ... more We report our results in classifying protein matrix‐assisted laser desorption/ionization‐time of flight mass spectra obtained from serum samples into diseased and healthy groups. We discuss in detail five of the steps in preprocessing the mass spectral data for biomarker discovery, as well as our criterion for choosing a small set of peaks for classifying the samples. Cross‐validation studies with four selected proteins yielded misclassification rates in the 10–15% range for all the classification methods. Three of these proteins or protein fragments are down‐regulated and one up‐regulated in lung cancer, the disease under consideration in this data set. When cross‐validation studies are performed, care must be taken to ensure that the test set does not influence the choice of the peaks used in the classification. Misclassification rates are lower when both the training and test sets are used to select the peaks used in classification versus when only the training set is used. This ...
PROTEOMICS, 2003
We report our results in classifying protein matrix‐assisted laser desorption/ionization‐time of ... more We report our results in classifying protein matrix‐assisted laser desorption/ionization‐time of flight mass spectra obtained from serum samples into diseased and healthy groups. We discuss in detail five of the steps in preprocessing the mass spectral data for biomarker discovery, as well as our criterion for choosing a small set of peaks for classifying the samples. Cross‐validation studies with four selected proteins yielded misclassification rates in the 10–15% range for all the classification methods. Three of these proteins or protein fragments are down‐regulated and one up‐regulated in lung cancer, the disease under consideration in this data set. When cross‐validation studies are performed, care must be taken to ensure that the test set does not influence the choice of the peaks used in the classification. Misclassification rates are lower when both the training and test sets are used to select the peaks used in classification versus when only the training set is used. This ...