TY - JOUR
T1 - Peptide Property Prediction for Mass Spectrometry Using AI
T2 - An Introduction to State of the Art Models
AU - Angelis, Jesse
AU - Schröder, Eva Ayla
AU - Xiao, Zixuan
AU - Gabriel, Wassim
AU - Wilhelm, Mathias
N1 - Publisher Copyright:
© 2025 The Author(s). Proteomics published by Wiley-VCH GmbH.
PY - 2025
Y1 - 2025
N2 - This review explores state of the art machine learning and deep learning models for peptide property prediction in mass spectrometry-based proteomics, including, but not limited to, models for predicting digestibility, retention time, charge state distribution, collisional cross section, fragmentation ion intensities, and detectability. The combination of these models enables not only the in silico generation of spectral libraries but also finds many additional use cases in the design of targeted assays or data-driven rescoring. This review serves as both an introduction for newcomers and an update for experienced researchers aiming to develop accessible and reproducible models for peptide property predictions. Key limitations of the current models, including difficulties in handling diverse post-translational modifications and instrument variability, highlight the need for large-scale, harmonized datasets, and standardized evaluation metrics for benchmarking.
AB - This review explores state of the art machine learning and deep learning models for peptide property prediction in mass spectrometry-based proteomics, including, but not limited to, models for predicting digestibility, retention time, charge state distribution, collisional cross section, fragmentation ion intensities, and detectability. The combination of these models enables not only the in silico generation of spectral libraries but also finds many additional use cases in the design of targeted assays or data-driven rescoring. This review serves as both an introduction for newcomers and an update for experienced researchers aiming to develop accessible and reproducible models for peptide property predictions. Key limitations of the current models, including difficulties in handling diverse post-translational modifications and instrument variability, highlight the need for large-scale, harmonized datasets, and standardized evaluation metrics for benchmarking.
KW - deep learning
KW - machine learning
KW - mass spectrometry
KW - peptide property prediction
KW - proteomics
UR - http://www.scopus.com/inward/record.url?scp=105002241745&partnerID=8YFLogxK
U2 - 10.1002/pmic.202400398
DO - 10.1002/pmic.202400398
M3 - Review article
AN - SCOPUS:105002241745
SN - 1615-9853
JO - Proteomics
JF - Proteomics
ER -