TY - JOUR
T1 - Multicenter evaluation of label-free quantification in human plasma on a high dynamic range benchmark set
AU - Distler, Ute
AU - Yoo, Han Byul
AU - Kardell, Oliver
AU - Hein, Dana
AU - Sielaff, Malte
AU - Scherer, Marian
AU - Jozefowicz, Anna M.
AU - Leps, Christian
AU - Gomez-Zepeda, David
AU - von Toerne, Christine
AU - Merl-Pham, Juliane
AU - Barth, Teresa K.
AU - Tüshaus, Johanna
AU - Giesbertz, Pieter
AU - Müller, Torsten
AU - Kliewer, Georg
AU - Aljakouch, Karim
AU - Helm, Barbara
AU - Unger, Henry
AU - Frey, Dario L.
AU - Helm, Dominic
AU - Schwarzmüller, Luisa
AU - Popp, Oliver
AU - Qin, Di
AU - Wudy, Susanne I.
AU - Sinn, Ludwig Roman
AU - Mergner, Julia
AU - Ludwig, Christina
AU - Imhof, Axel
AU - Kuster, Bernhard
AU - Lichtenthaler, Stefan F.
AU - Krijgsveld, Jeroen
AU - Klingmüller, Ursula
AU - Mertins, Philipp
AU - Coscia, Fabian
AU - Ralser, Markus
AU - Mülleder, Michael
AU - Hauck, Stefanie M.
AU - Tenzer, Stefan
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Human plasma is routinely collected during clinical care and constitutes a rich source of biomarkers for diagnostics and patient stratification. Liquid chromatography-mass spectrometry (LC-MS)-based proteomics is a key method for plasma biomarker discovery, but the high dynamic range of plasma proteins poses significant challenges for MS analysis and data processing. To benchmark the quantitative performance of neat plasma analysis, we introduce a multispecies sample set based on a human tryptic plasma digest containing varying low level spike-ins of yeast and E. coli tryptic proteome digests, termed PYE. By analysing the sample set on state-of-the-art LC-MS platforms across twelve different sites in data-dependent (DDA) and data-independent acquisition (DIA) modes, we provide a data resource comprising a total of 1116 individual LC-MS runs. Centralized data analysis shows that DIA methods outperform DDA-based approaches regarding identifications, data completeness, accuracy, and precision. DIA achieves excellent technical reproducibility, as demonstrated by coefficients of variation (CVs) between 3.3% and 9.8% at protein level. Comparative analysis of different setups clearly shows a high overlap in identified proteins and proves that accurate and precise quantitative measurements are feasible across multiple sites, even in a complex matrix such as plasma, using state-of-the-art instrumentation. The collected dataset, including the PYE sample set and strategy presented, serves as a valuable resource for optimizing the accuracy and reproducibility of LC-MS and bioinformatic workflows for clinical plasma proteome analysis.
AB - Human plasma is routinely collected during clinical care and constitutes a rich source of biomarkers for diagnostics and patient stratification. Liquid chromatography-mass spectrometry (LC-MS)-based proteomics is a key method for plasma biomarker discovery, but the high dynamic range of plasma proteins poses significant challenges for MS analysis and data processing. To benchmark the quantitative performance of neat plasma analysis, we introduce a multispecies sample set based on a human tryptic plasma digest containing varying low level spike-ins of yeast and E. coli tryptic proteome digests, termed PYE. By analysing the sample set on state-of-the-art LC-MS platforms across twelve different sites in data-dependent (DDA) and data-independent acquisition (DIA) modes, we provide a data resource comprising a total of 1116 individual LC-MS runs. Centralized data analysis shows that DIA methods outperform DDA-based approaches regarding identifications, data completeness, accuracy, and precision. DIA achieves excellent technical reproducibility, as demonstrated by coefficients of variation (CVs) between 3.3% and 9.8% at protein level. Comparative analysis of different setups clearly shows a high overlap in identified proteins and proves that accurate and precise quantitative measurements are feasible across multiple sites, even in a complex matrix such as plasma, using state-of-the-art instrumentation. The collected dataset, including the PYE sample set and strategy presented, serves as a valuable resource for optimizing the accuracy and reproducibility of LC-MS and bioinformatic workflows for clinical plasma proteome analysis.
UR - https://www.scopus.com/pages/publications/105017681619
U2 - 10.1038/s41467-025-64501-z
DO - 10.1038/s41467-025-64501-z
M3 - Article
C2 - 41038884
AN - SCOPUS:105017681619
SN - 2041-1723
VL - 16
JO - Nature Communications
JF - Nature Communications
IS - 1
M1 - 8774
ER -