TY - JOUR
T1 - Bridging Data Silos in Oncology with Modular Software for Federated Analysis on Fast Healthcare Interoperability Resources
T2 - Multisite Implementation Study
AU - Ziegler, Jasmin
AU - Erpenbeck, Marcel Pascal
AU - Fuchs, Timo
AU - Saibold, Anna
AU - Volkmer, Paul Christian
AU - Schmidt, Guenter
AU - Eicher, Johanna
AU - Pallaoro, Peter
AU - De Souza Falguera, Renata
AU - Aubele, Fabio
AU - Hagedorn, Marlien
AU - Vansovich, Ekaterina
AU - Raffler, Johannes
AU - Ringshandl, Stephan
AU - Kerscher, Alexander
AU - Maurer, Julia Karolin
AU - Kühnel, Brigitte
AU - Schenkirsch, Gerhard
AU - Kampf, Marvin
AU - Kapsner, Lorenz A.
AU - Ghanbarian, Hadieh
AU - Spengler, Helmut
AU - Soto-Rey, Iñaki
AU - Albashiti, Fady
AU - Hellwig, Dirk
AU - Ertl, Maximilian
AU - Fette, Georg
AU - Kraska, Detlef
AU - Boeker, Martin
AU - Prokosch, Hans Ulrich
AU - Gulden, Christian
N1 - Publisher Copyright:
© Jasmin Ziegler, Marcel Pascal Erpenbeck, Timo Fuchs, Anna Saibold, Paul-Christian Volkmer, Guenter Schmidt, Johanna Eicher, Peter Pallaoro, Renata De Souza Falguera, Fabio Aubele, Marlien Hagedorn, Ekaterina Vansovich, Johannes Raffler, Stephan Ringshandl, Alexander Kerscher, Julia Karolin Maurer, Brigitte Kühnel, Gerhard Schenkirsch, Marvin Kampf, Lorenz A Kapsner, Hadieh Ghanbarian, Helmut Spengler, Iñaki Soto-Rey, Fady Albashiti, Dirk Hellwig, Maximilian Ertl, Georg Fette, Detlef Kraska, Martin Boeker, Hans-Ulrich Prokosch, Christian Gulden.
PY - 2025
Y1 - 2025
N2 - Background: Real-world data (RWD) from sources like administrative claims, electronic health records, and cancer registries offer insights into patient populations beyond the tightly regulated environment of randomized controlled trials. To leverage this and to advance cancer research, 6 university hospitals in Bavaria have established a joint research IT infrastructure. Objective: This study aimed to outline the design, implementation, and deployment of a modular data transformation pipeline that transforms oncological RWD into a Health Level 7 (HL7) Fast Healthcare Interoperability Resources (FHIR) format and then into a tabular format in preparation for a federated analysis (FA) across the 6 Bavarian Cancer Research Center university hospitals. Methods: To harness RWD effectively, we designed a pipeline to convert the oncological basic dataset (oBDS) into HL7 FHIR format and prepare it for FA. The pipeline handles diverse IT infrastructures and systems while maintaining privacy by keeping data decentralized for analysis. To assess the functionality and validity of our implementation, we defined a cohort to address two specific medical research questions. We evaluated our findings by comparing the results of the FA with reports from the Bavarian Cancer Registry and the original data from local tumor documentation systems. Results: We conducted an FA of 17,885 cancer cases from 2021/2022. Breast cancer was the most common diagnosis at 3 sites, prostate cancer ranked in the top 2 at 4 sites, and malignant melanoma was notably prevalent. Gender-specific trends showed larynx and esophagus cancers were more common in males, while breast and thyroid cancers were more frequent in females. Discrepancies between the Bavarian Cancer Registry and our data, such as higher rates of malignant melanoma (3400/63,771, 5.3% vs 1921/17,885, 10.7%) and lower representation of colorectal cancers (8100/63,771, 12.7% vs 1187/17,885, 6.6%) likely result from differences in the time periods analyzed (2019 vs 2021/2022) and the scope of data sources used. The Bavarian Cancer Registry reports approximately 3 times more cancer cases than the 6 university hospitals alone. Conclusions: The modular pipeline successfully transformed oncological RWD across 6 hospitals, and the federated approach preserved privacy while enabling comprehensive analysis. Future work will add support for recent oBDS versions, automate data quality checks, and integrate additional clinical data. Our findings highlight the potential of federated health data networks and lay the groundwork for future research that can leverage high-quality RWD, aiming to contribute valuable knowledge to the field of cancer research.
AB - Background: Real-world data (RWD) from sources like administrative claims, electronic health records, and cancer registries offer insights into patient populations beyond the tightly regulated environment of randomized controlled trials. To leverage this and to advance cancer research, 6 university hospitals in Bavaria have established a joint research IT infrastructure. Objective: This study aimed to outline the design, implementation, and deployment of a modular data transformation pipeline that transforms oncological RWD into a Health Level 7 (HL7) Fast Healthcare Interoperability Resources (FHIR) format and then into a tabular format in preparation for a federated analysis (FA) across the 6 Bavarian Cancer Research Center university hospitals. Methods: To harness RWD effectively, we designed a pipeline to convert the oncological basic dataset (oBDS) into HL7 FHIR format and prepare it for FA. The pipeline handles diverse IT infrastructures and systems while maintaining privacy by keeping data decentralized for analysis. To assess the functionality and validity of our implementation, we defined a cohort to address two specific medical research questions. We evaluated our findings by comparing the results of the FA with reports from the Bavarian Cancer Registry and the original data from local tumor documentation systems. Results: We conducted an FA of 17,885 cancer cases from 2021/2022. Breast cancer was the most common diagnosis at 3 sites, prostate cancer ranked in the top 2 at 4 sites, and malignant melanoma was notably prevalent. Gender-specific trends showed larynx and esophagus cancers were more common in males, while breast and thyroid cancers were more frequent in females. Discrepancies between the Bavarian Cancer Registry and our data, such as higher rates of malignant melanoma (3400/63,771, 5.3% vs 1921/17,885, 10.7%) and lower representation of colorectal cancers (8100/63,771, 12.7% vs 1187/17,885, 6.6%) likely result from differences in the time periods analyzed (2019 vs 2021/2022) and the scope of data sources used. The Bavarian Cancer Registry reports approximately 3 times more cancer cases than the 6 university hospitals alone. Conclusions: The modular pipeline successfully transformed oncological RWD across 6 hospitals, and the federated approach preserved privacy while enabling comprehensive analysis. Future work will add support for recent oBDS versions, automate data quality checks, and integrate additional clinical data. Our findings highlight the potential of federated health data networks and lay the groundwork for future research that can leverage high-quality RWD, aiming to contribute valuable knowledge to the field of cancer research.
KW - cancer registries
KW - electronic health records
KW - federated analysis
KW - HL7 FHIR
KW - interoperability
KW - observational research network
KW - oncology
KW - real-world data
KW - real-world evidence
UR - http://www.scopus.com/inward/record.url?scp=105002812247&partnerID=8YFLogxK
U2 - 10.2196/65681
DO - 10.2196/65681
M3 - Article
AN - SCOPUS:105002812247
SN - 1438-8871
VL - 27
JO - Journal of Medical Internet Research
JF - Journal of Medical Internet Research
M1 - e65681 | p. 1
ER -