TY - JOUR
T1 - Rescoring Peptide Spectrum Matches
T2 - Boosting Proteomics Performance by Integrating Peptide Property Predictors Into Peptide Identification
AU - Kalhor, Mostafa
AU - Lapin, Joel
AU - Picciani, Mario
AU - Wilhelm, Mathias
N1 - Publisher Copyright:
Copyright © 2024 The Authors. Published by Elsevier Inc. All rights reserved.
PY - 2024/7/1
Y1 - 2024/7/1
N2 - Rescoring of peptide spectrum matches originating from database search engines enabled by peptide property predictors is exceeding the performance of peptide identification from traditional database search engines. In contrast to the peptide spectrum match scores calculated by traditional database search engines, rescoring peptide spectrum matches generates scores based on comparing observed and predicted peptide properties, such as fragment ion intensities and retention times. These newly generated scores enable a more efficient discrimination between correct and incorrect peptide spectrum matches. This approach was shown to lead to substantial improvements in the number of confidently identified peptides, facilitating the analysis of challenging datasets in various fields such as immunopeptidomics, metaproteomics, proteogenomics, and single-cell proteomics. In this review, we summarize the key elements leading up to the recent introduction of multiple data-driven rescoring pipelines. We provide an overview of relevant post-processing rescoring tools, introduce prominent data-driven rescoring pipelines for various applications, and highlight limitations, opportunities, and future perspectives of this approach and its impact on mass spectrometry-based proteomics.
AB - Rescoring of peptide spectrum matches originating from database search engines enabled by peptide property predictors is exceeding the performance of peptide identification from traditional database search engines. In contrast to the peptide spectrum match scores calculated by traditional database search engines, rescoring peptide spectrum matches generates scores based on comparing observed and predicted peptide properties, such as fragment ion intensities and retention times. These newly generated scores enable a more efficient discrimination between correct and incorrect peptide spectrum matches. This approach was shown to lead to substantial improvements in the number of confidently identified peptides, facilitating the analysis of challenging datasets in various fields such as immunopeptidomics, metaproteomics, proteogenomics, and single-cell proteomics. In this review, we summarize the key elements leading up to the recent introduction of multiple data-driven rescoring pipelines. We provide an overview of relevant post-processing rescoring tools, introduce prominent data-driven rescoring pipelines for various applications, and highlight limitations, opportunities, and future perspectives of this approach and its impact on mass spectrometry-based proteomics.
KW - artificial intelligence
KW - computational proteomics
KW - data-driven rescoring
KW - machine learning
KW - peptide identification
KW - peptide property prediction
KW - rescoring
UR - http://www.scopus.com/inward/record.url?scp=85199933573&partnerID=8YFLogxK
U2 - 10.1016/j.mcpro.2024.100798
DO - 10.1016/j.mcpro.2024.100798
M3 - Review article
C2 - 38871251
AN - SCOPUS:85199933573
SN - 1535-9476
VL - 23
SP - 100798
JO - Molecular and Cellular Proteomics
JF - Molecular and Cellular Proteomics
IS - 7
ER -