TY - JOUR
T1 - AI-Assisted Processing Pipeline to Boost Protein Isoform Detection
AU - The, Matthew
AU - Picciani, Mario
AU - Jensen, Cecilia
AU - Gabriel, Wassim
AU - Kuster, Bernhard
AU - Wilhelm, Mathias
N1 - Publisher Copyright:
© 2024. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2024
Y1 - 2024
N2 - Proteomics, the study of proteins within biological systems, has seen remarkable advancements in recent years, with protein isoform detection emerging as one of the next major frontiers. One of the primary challenges is achieving the necessary peptide and protein coverage to confidently differentiate isoforms as a result of the protein inference problem and protein false discovery rate estimation challenge in large data. In this chapter, we describe the application of artificial intelligence-assisted peptide property prediction for database search engine rescoring by Oktoberfest, an approach that has proven effective, particularly for complex samples and extensive search spaces, which can greatly increase peptide coverage. Further, it illustrates a method for increasing isoform coverage by the PickedGroupFDR approach that is designed to excel when applied on large data. Real-world examples are provided to illustrate the utility of the tools in the context of rescoring, protein grouping, and false discovery rate estimation. By implementing these cutting-edge techniques, researchers can achieve a substantial increase in both peptide and isoform coverage, thus unlocking the potential of protein isoform detection in their studies and shedding light on their roles and functions in biological processes.
AB - Proteomics, the study of proteins within biological systems, has seen remarkable advancements in recent years, with protein isoform detection emerging as one of the next major frontiers. One of the primary challenges is achieving the necessary peptide and protein coverage to confidently differentiate isoforms as a result of the protein inference problem and protein false discovery rate estimation challenge in large data. In this chapter, we describe the application of artificial intelligence-assisted peptide property prediction for database search engine rescoring by Oktoberfest, an approach that has proven effective, particularly for complex samples and extensive search spaces, which can greatly increase peptide coverage. Further, it illustrates a method for increasing isoform coverage by the PickedGroupFDR approach that is designed to excel when applied on large data. Real-world examples are provided to illustrate the utility of the tools in the context of rescoring, protein grouping, and false discovery rate estimation. By implementing these cutting-edge techniques, researchers can achieve a substantial increase in both peptide and isoform coverage, thus unlocking the potential of protein isoform detection in their studies and shedding light on their roles and functions in biological processes.
KW - Deep learning
KW - Isoforms
KW - Mass spectrometry
KW - Peptide identification
KW - Prosit
KW - Rescoring
UR - http://www.scopus.com/inward/record.url?scp=85198568293&partnerID=8YFLogxK
U2 - 10.1007/978-1-0716-4007-4_10
DO - 10.1007/978-1-0716-4007-4_10
M3 - Article
C2 - 38995541
AN - SCOPUS:85198568293
SN - 1064-3745
VL - 2836
SP - 157
EP - 181
JO - Methods in molecular biology (Clifton, N.J.)
JF - Methods in molecular biology (Clifton, N.J.)
ER -