TY - JOUR
T1 - Development and validation of deep learning classifiers to detect Epstein-Barr virus and microsatellite instability status in gastric cancer
T2 - a retrospective multicentre cohort study
AU - Muti, Hannah Sophie
AU - Heij, Lara Rosaline
AU - Keller, Gisela
AU - Kohlruss, Meike
AU - Langer, Rupert
AU - Dislich, Bastian
AU - Cheong, Jae Ho
AU - Kim, Young Woo
AU - Kim, Hyunki
AU - Kook, Myeong Cherl
AU - Cunningham, David
AU - Allum, William H.
AU - Langley, Ruth E.
AU - Nankivell, Matthew G.
AU - Quirke, Philip
AU - Hayden, Jeremy D.
AU - West, Nicholas P.
AU - Irvine, Andrew J.
AU - Yoshikawa, Takaki
AU - Oshima, Takashi
AU - Huss, Ralf
AU - Grosser, Bianca
AU - Roviello, Franco
AU - d'Ignazio, Alessia
AU - Quaas, Alexander
AU - Alakus, Hakan
AU - Tan, Xiuxiang
AU - Pearson, Alexander T.
AU - Luedde, Tom
AU - Ebert, Matthias P.
AU - Jäger, Dirk
AU - Trautwein, Christian
AU - Gaisa, Nadine Therese
AU - Grabsch, Heike I.
AU - Kather, Jakob Nikolas
N1 - Publisher Copyright:
© 2021 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 license
PY - 2021/10
Y1 - 2021/10
N2 - Background: Response to immunotherapy in gastric cancer is associated with microsatellite instability (or mismatch repair deficiency) and Epstein-Barr virus (EBV) positivity. We therefore aimed to develop and validate deep learning-based classifiers to detect microsatellite instability and EBV status from routine histology slides. Methods: In this retrospective, multicentre study, we collected tissue samples from ten cohorts of patients with gastric cancer from seven countries (South Korea, Switzerland, Japan, Italy, Germany, the UK and the USA). We trained a deep learning-based classifier to detect microsatellite instability and EBV positivity from digitised, haematoxylin and eosin stained resection slides without annotating tumour containing regions. The performance of the classifier was assessed by within-cohort cross-validation in all ten cohorts and by external validation, for which we split the cohorts into a five-cohort training dataset and a five-cohort test dataset. We measured the area under the receiver operating curve (AUROC) for detection of microsatellite instability and EBV status. Microsatellite instability and EBV status were determined to be detectable if the lower bound of the 95% CI for the AUROC was above 0·5. Findings: Across the ten cohorts, our analysis included 2823 patients with known microsatellite instability status and 2685 patients with known EBV status. In the within-cohort cross-validation, the deep learning-based classifier could detect microsatellite instability status in nine of ten cohorts, with AUROCs ranging from 0·597 (95% CI 0·522–0·737) to 0·836 (0·795–0·880) and EBV status in five of eight cohorts, with AUROCs ranging from 0·819 (0·752–0·841) to 0·897 (0·513–0·966). Training a classifier on the pooled training dataset and testing it on the five remaining cohorts resulted in high classification performance with AUROCs ranging from 0·723 (95% CI 0·676–0·794) to 0·863 (0·747–0·969) for detection of microsatellite instability and from 0·672 (0·403–0·989) to 0·859 (0·823–0·919) for detection of EBV status. Interpretation: Classifiers became increasingly robust when trained on pooled cohorts. After prospective validation, this deep learning-based tissue classification system could be used as an inexpensive predictive biomarker for immunotherapy in gastric cancer. Funding: German Cancer Aid and German Federal Ministry of Health.
AB - Background: Response to immunotherapy in gastric cancer is associated with microsatellite instability (or mismatch repair deficiency) and Epstein-Barr virus (EBV) positivity. We therefore aimed to develop and validate deep learning-based classifiers to detect microsatellite instability and EBV status from routine histology slides. Methods: In this retrospective, multicentre study, we collected tissue samples from ten cohorts of patients with gastric cancer from seven countries (South Korea, Switzerland, Japan, Italy, Germany, the UK and the USA). We trained a deep learning-based classifier to detect microsatellite instability and EBV positivity from digitised, haematoxylin and eosin stained resection slides without annotating tumour containing regions. The performance of the classifier was assessed by within-cohort cross-validation in all ten cohorts and by external validation, for which we split the cohorts into a five-cohort training dataset and a five-cohort test dataset. We measured the area under the receiver operating curve (AUROC) for detection of microsatellite instability and EBV status. Microsatellite instability and EBV status were determined to be detectable if the lower bound of the 95% CI for the AUROC was above 0·5. Findings: Across the ten cohorts, our analysis included 2823 patients with known microsatellite instability status and 2685 patients with known EBV status. In the within-cohort cross-validation, the deep learning-based classifier could detect microsatellite instability status in nine of ten cohorts, with AUROCs ranging from 0·597 (95% CI 0·522–0·737) to 0·836 (0·795–0·880) and EBV status in five of eight cohorts, with AUROCs ranging from 0·819 (0·752–0·841) to 0·897 (0·513–0·966). Training a classifier on the pooled training dataset and testing it on the five remaining cohorts resulted in high classification performance with AUROCs ranging from 0·723 (95% CI 0·676–0·794) to 0·863 (0·747–0·969) for detection of microsatellite instability and from 0·672 (0·403–0·989) to 0·859 (0·823–0·919) for detection of EBV status. Interpretation: Classifiers became increasingly robust when trained on pooled cohorts. After prospective validation, this deep learning-based tissue classification system could be used as an inexpensive predictive biomarker for immunotherapy in gastric cancer. Funding: German Cancer Aid and German Federal Ministry of Health.
UR - http://www.scopus.com/inward/record.url?scp=85116012388&partnerID=8YFLogxK
U2 - 10.1016/S2589-7500(21)00133-3
DO - 10.1016/S2589-7500(21)00133-3
M3 - Article
C2 - 34417147
AN - SCOPUS:85116012388
SN - 2589-7500
VL - 3
SP - e654-e664
JO - The Lancet Digital Health
JF - The Lancet Digital Health
IS - 10
ER -