Self-taught learning for classification of mass spectrometry data: a case study of colorectal cancer

Alexandrov, Theodore

Konferenzbeitrag

Self-taught learning for classification of mass spectrometry data: a case study of colorectal cancer

Dokumententyp

Text/Conference Paper

Dateien

45.pdf (384.2 KB)

Datum

2009

Autor:innen

Alexandrov, Theodore

Quelle

German conference on bioinformatics 2009

Regular Research Papers

Verlag

Gesellschaft für Informatik e.V.

Zusammenfassung

Mass spectrometry is an important technique for chemical profiling and is a major tool in proteomics, a discipline interested in large-scale studies of proteins expressed by an organism. In this paper we propose using a sparse coding algorithm for classification of mass spectrometry serum protein profiles of colorectal cancer patients and healthy individuals following the so-called self-taught learning approach. Being applied to the dataset of 112 spectra of length 4731 bins, the sparse coding algorithm represents each of them by means of less then ten prototype spectra. The classification of spectra is done as in our previous study on the same dataset [ADM+09], using Support Vector Machines evaluated by means of the double cross-validation. However, the classifiers take as input not discrete wavelet coefficients but the sparse coding coefficients. Comparing the classification results with reference results, we show that providing the same total recognition rate, the sparse coding-based procedure leads to higher generalization performance. Moreover, we propose using the sparse coding coefficients for clustering of mass spectra and demonstrate that this approach allows one to highlight differences between the cancer spectra.

Alexandrov, Theodore (2009): Self-taught learning for classification of mass spectrometry data: a case study of colorectal cancer. German conference on bioinformatics 2009. Bonn: Gesellschaft für Informatik e.V.. PISSN: 1617-5468. ISBN: 978-3-88579-251-2. pp. 45-54. Regular Research Papers. Halle-Wittenberg. 28th to 30th September 2009

Sammlungen

P157 - GCB 2009 - German Conference on Bioinformatics 2009

Komplettanzeige

Self-taught learning for classification of mass spectrometry data: a case study of colorectal cancer

Volltext URI

Dokumententyp

Dateien

Zusatzinformation

Datum

Autor:innen

Zeitschriftentitel

ISSN der Zeitschrift

Bandtitel

Quelle

Verlag

Zusammenfassung

Beschreibung

Schlagwörter

Zitierform

DOI

Tags

Sammlungen