Machine learning-based classification of diffuse large B-cell lymphoma patients by their protein expression profiles

Sally J. Deeb, Stefka Tyanova, Michael Hummel, Marc Schmidt-Supprian, Juergen Cox, Matthias Mann

Research output: Contribution to journalArticlepeer-review

69 Scopus citations


Characterization of tumors at the molecular level has improved our knowledge of cancer causation and progression. Proteomic analysis of their signaling pathways promises to enhance our understanding of cancer aberrations at the functional level, but this requires accurate and robust tools. Here, we develop a state of the art quantitative mass spectrometric pipeline to characterize formalin-fixed paraffin-embedded tissues of patients with closely related subtypes of diffuse large B-cell lymphoma. We combined a super-SILAC approach with label-free quantification (hybrid LFQ) to address situations where the protein is absent in the super-SILAC standard but present in the patient samples. Shotgun proteomic analysis on a quadrupole Orbitrap quantified almost 9,000 tumor proteins in 20 patients. The quantitative accuracy of our approach allowed the segregation of diffuse large B-cell lymphoma patients according to their cell of origin using both their global protein expression patterns and the 55-protein signature obtained previously from patientderived cell lines (Deeb, S. J., D'Souza, R. C., Cox, J., Schmidt-Supprian, M., and Mann, M. (2012) Mol. Cell. Proteomics 11, 77-89). Expression levels of individual segregationdriving proteins as well as categories such as extracellular matrix proteins behaved consistently with known trends between the subtypes. We used machine learning (support vector machines) to extract candidate proteins with the highest segregating power. A panel of four proteins (PALD1, MME, TNFAIP8, and TBC1D4) is predicted to classify patients with low error rates. Highly ranked proteins from the support vector analysis revealed differential expression of core signaling molecules between the subtypes, elucidating aspects of their pathobiology.

Original languageEnglish
Pages (from-to)2947-2960
Number of pages14
JournalMolecular and Cellular Proteomics
Issue number11
StatePublished - Nov 2015


Dive into the research topics of 'Machine learning-based classification of diffuse large B-cell lymphoma patients by their protein expression profiles'. Together they form a unique fingerprint.

Cite this