
Federated Learning on Transcriptomic Data: Model Quality and Performance Trade-Offs

A. Hannemann, J. Ewald, L. Seeger, and E. Buchmann. Computational Science – ICCS 2024: 24th International Conference, Malaga, Spain, July 2–4, 2024, Proceedings, Part IV, pages 279–293. Berlin, Heidelberg: Springer-Verlag, 2024.
DOI: 10.1007/978-3-031-63772-8_26

Abstract

Machine learning on large-scale genomic or transcriptomic data is important for many novel health applications. For example, precision medicine tailors medical treatments to patients on the basis of individual biomarkers, cellular and molecular states, etc. However, the data required is sensitive, voluminous, heterogeneous, and typically distributed across locations where dedicated machine learning hardware is not available. Due to privacy and regulatory reasons, it is also problematic to aggregate all data at a trusted third party. Federated learning is a promising solution to this dilemma, because it enables decentralized, collaborative machine learning without exchanging raw data. In this paper, we perform comparative experiments with the federated learning frameworks TensorFlow Federated and Flower. Our test case is the training of disease prognosis and cell type classification models. We train the models with distributed transcriptomic data, considering both data heterogeneity and architectural heterogeneity. We measure model quality, robustness against privacy-enhancing noise, computational performance and resource overhead. Each of the federated learning frameworks has different strengths. However, our experiments confirm that both frameworks can readily build models on transcriptomic data, without transferring personal raw data to a third party with abundant computational resources.
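The idea of training collaboratively without exchanging raw data can be illustrated with a minimal sketch of federated averaging, a common aggregation rule in federated learning. The abstract does not state which aggregation rule or model the authors used, so everything here is an assumption for illustration: the toy one-dimensional linear model, the function names (local_update, federated_average, run_rounds), the learning rate, and the data are all hypothetical, and the code uses neither TensorFlow Federated nor Flower. Each client fits the model on its own data, and only the resulting weights are sent for averaging.

```python
from typing import List, Sequence, Tuple

Sample = Tuple[float, float]  # (x, y) pair held privately by one client


def local_update(weights: Sequence[float], data: Sequence[Sample],
                 lr: float = 0.4, epochs: int = 5) -> List[float]:
    """Gradient descent on one client's data for the toy model y = w*x + b."""
    w, b = weights
    n = len(data)
    for _ in range(epochs):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return [w, b]


def federated_average(client_weights: Sequence[Sequence[float]],
                      client_sizes: Sequence[int]) -> List[float]:
    """Average the clients' weights, weighted by local dataset size."""
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(dim)
    ]


def run_rounds(clients: Sequence[Sequence[Sample]], rounds: int = 200) -> List[float]:
    """Server loop: broadcast weights, collect local updates, average them."""
    global_weights = [0.0, 0.0]
    sizes = [len(data) for data in clients]
    for _ in range(rounds):
        # Only model weights leave a client; the samples never do.
        updates = [local_update(global_weights, data) for data in clients]
        global_weights = federated_average(updates, sizes)
    return global_weights


if __name__ == "__main__":
    # Two hypothetical sites whose samples all lie on the line y = 2x + 1.
    site_a = [(0.0, 1.0), (0.5, 2.0)]
    site_b = [(1.0, 3.0), (0.25, 1.5), (0.75, 2.5)]
    print(run_rounds([site_a, site_b]))
```

In this toy setting both sites hold data from the same underlying line, so the averaged model approaches the slope and intercept that a centrally trained model would find. With heterogeneous data, which the paper explicitly considers, the local updates can pull in different directions and the outcome depends on the setup.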

Links and resources

BibTeX key: hannemann2024federated
entry type: inproceedings
address: Berlin, Heidelberg
booktitle: Computational Science – ICCS 2024: 24th International Conference, Malaga, Spain, July 2–4, 2024, Proceedings, Part IV
year: 2024
pages: 279–293
publisher: Springer-Verlag
isbn: 978-3-031-63771-1
numpages: 15
location: Malaga, Spain
DOI: 10.1007/978-3-031-63772-8_26
url: https://doi.org/10.1007/978-3-031-63772-8_26

Cite this publication

%0 Conference Paper
%1 hannemann2024federated
%A Hannemann, Anika
%A Ewald, Jan
%A Seeger, Leo
%A Buchmann, Erik
%B Computational Science – ICCS 2024: 24th International Conference, Malaga, Spain, July 2–4, 2024, Proceedings, Part IV
%C Berlin, Heidelberg
%D 2024
%I Springer-Verlag
%K area_responsibleai topic_lifescience Cell Classification, Disease Federated Learning, Prognosis Type
%P 279–293
%R 10.1007/978-3-031-63772-8_26
%T Federated Learning on Transcriptomic Data: Model Quality and Performance Trade-Offs
%U https://doi.org/10.1007/978-3-031-63772-8_26
%X Machine learning on large-scale genomic or transcriptomic data is important for many novel health applications. For example, precision medicine tailors medical treatments to patients on the basis of individual biomarkers, cellular and molecular states, etc. However, the data required is sensitive, voluminous, heterogeneous, and typically distributed across locations where dedicated machine learning hardware is not available. Due to privacy and regulatory reasons, it is also problematic to aggregate all data at a trusted third party. Federated learning is a promising solution to this dilemma, because it enables decentralized, collaborative machine learning without exchanging raw data. In this paper, we perform comparative experiments with the federated learning frameworks TensorFlow Federated and Flower. Our test case is the training of disease prognosis and cell type classification models. We train the models with distributed transcriptomic data, considering both data heterogeneity and architectural heterogeneity. We measure model quality, robustness against privacy-enhancing noise, computational performance and resource overhead. Each of the federated learning frameworks has different strengths. However, our experiments confirm that both frameworks can readily build models on transcriptomic data, without transferring personal raw data to a third party with abundant computational resources.
%@ 978-3-031-63771-1

@inproceedings{hannemann2024federated,
  abstract = {Machine learning on large-scale genomic or transcriptomic data is important for many novel health applications. For example, precision medicine tailors medical treatments to patients on the basis of individual biomarkers, cellular and molecular states, etc. However, the data required is sensitive, voluminous, heterogeneous, and typically distributed across locations where dedicated machine learning hardware is not available. Due to privacy and regulatory reasons, it is also problematic to aggregate all data at a trusted third party. Federated learning is a promising solution to this dilemma, because it enables decentralized, collaborative machine learning without exchanging raw data. In this paper, we perform comparative experiments with the federated learning frameworks TensorFlow Federated and Flower. Our test case is the training of disease prognosis and cell type classification models. We train the models with distributed transcriptomic data, considering both data heterogeneity and architectural heterogeneity. We measure model quality, robustness against privacy-enhancing noise, computational performance and resource overhead. Each of the federated learning frameworks has different strengths. However, our experiments confirm that both frameworks can readily build models on transcriptomic data, without transferring personal raw data to a third party with abundant computational resources.},
  added-at = {2024-11-12T13:56:18.000+0100},
  address = {Berlin, Heidelberg},
  author = {Hannemann, Anika and Ewald, Jan and Seeger, Leo and Buchmann, Erik},
  biburl = {https://puma.scadsai.uni-leipzig.de/bibtex/2684c365faa7e050e1ff6a7d0a46147d9/scadsfct},
  booktitle = {Computational Science – ICCS 2024: 24th International Conference, Malaga, Spain, July 2–4, 2024, Proceedings, Part IV},
  doi = {10.1007/978-3-031-63772-8_26},
  interhash = {81a47c7fff4876b0ea260bb5c590c587},
  intrahash = {684c365faa7e050e1ff6a7d0a46147d9},
  isbn = {978-3-031-63771-1},
  keywords = {area_responsibleai topic_lifescience Cell Classification, Disease Federated Learning, Prognosis Type},
  location = {Malaga, Spain},
  numpages = {15},
  pages = {279–293},
  publisher = {Springer-Verlag},
  timestamp = {2024-11-22T15:56:46.000+0100},
  title = {Federated Learning on Transcriptomic Data: Model Quality and Performance Trade-Offs},
  url = {https://doi.org/10.1007/978-3-031-63772-8_26},
  year = 2024
}
