Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology

N. Ghaffari Laleh, H. Muti, C. Loeffler, A. Echle, O. Saldanha, F. Mahmood, M. Lu, C. Trautwein, R. Langer, B. Dislich, R. Buelow, H. Grabsch, H. Brenner, J. Chang-Claude, E. Alwers, T. Brinker, F. Khader, D. Truhn, N. Gaisa, P. Boor, M. Hoffmeister, V. Schulz, und J. Kather. Med. Image Anal., 79 (102474): 102474 (Juli 2022)

Zusammenfassung

Artificial intelligence (AI) can extract visual information from histopathological slides and yield biological insight and clinical biomarkers. Whole slide images are cut into thousands of tiles and classification problems are often weakly-supervised: the ground truth is only known for the slide, not for every single tile. In classical weakly-supervised analysis pipelines, all tiles inherit the slide label while in multiple-instance learning (MIL), only bags of tiles inherit the label. However, it is still unclear how these widely used but markedly different approaches perform relative to each other. We implemented and systematically compared six methods in six clinically relevant end-to-end prediction tasks using data from N=2980 patients for training with rigorous external validation. We tested three classical weakly-supervised approaches with convolutional neural networks and vision transformers (ViT) and three MIL-based approaches with and without an additional attention module. Our results empirically demonstrate that histological tumor subtyping of renal cell carcinoma is an easy task in which all approaches achieve an area under the receiver operating curve (AUROC) of above 0.9. In contrast, we report significant performance differences for clinically relevant tasks of mutation prediction in colorectal, gastric, and bladder cancer. In these mutation prediction tasks, classical weakly-supervised workflows outperformed MIL-based weakly-supervised methods for mutation prediction, which is surprising given their simplicity. This shows that new end-to-end image analysis pipelines in computational pathology should be compared to classical weakly-supervised methods. Also, these findings motivate the development of new methods which combine the elegant assumptions of MIL with the empirically observed higher performance of classical weakly-supervised approaches. We make all source codes publicly available at https://github.com/KatherLab/HIA, allowing easy application of all methods to any similar task.

Zitieren Sie diese Publikation

%0 Journal Article %1 Ghaffari_Laleh2022-zd %A Ghaffari Laleh, Narmin %A Muti, Hannah Sophie %A Loeffler, Chiara Maria Lavinia %A Echle, Amelie %A Saldanha, Oliver Lester %A Mahmood, Faisal %A Lu, Ming Y %A Trautwein, Christian %A Langer, Rupert %A Dislich, Bastian %A Buelow, Roman D %A Grabsch, Heike Irmgard %A Brenner, Hermann %A Chang-Claude, Jenny %A Alwers, Elizabeth %A Brinker, Titus J %A Khader, Firas %A Truhn, Daniel %A Gaisa, Nadine T %A Boor, Peter %A Hoffmeister, Michael %A Schulz, Volkmar %A Kather, Jakob Nikolas %D 2022 %I Elsevier BV %J Med. Image Anal. %K Computational Convolutional Learning; Multiple-Instance Vision Weakly-supervised deep learning neural topic_lifescience transformers; zno artificial intelligence pathology networks %N 102474 %P 102474 %T Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology %V 79 %X Artificial intelligence (AI) can extract visual information from histopathological slides and yield biological insight and clinical biomarkers. Whole slide images are cut into thousands of tiles and classification problems are often weakly-supervised: the ground truth is only known for the slide, not for every single tile. In classical weakly-supervised analysis pipelines, all tiles inherit the slide label while in multiple-instance learning (MIL), only bags of tiles inherit the label. However, it is still unclear how these widely used but markedly different approaches perform relative to each other. We implemented and systematically compared six methods in six clinically relevant end-to-end prediction tasks using data from N=2980 patients for training with rigorous external validation. We tested three classical weakly-supervised approaches with convolutional neural networks and vision transformers (ViT) and three MIL-based approaches with and without an additional attention module. Our results empirically demonstrate that histological tumor subtyping of renal cell carcinoma is an easy task in which all approaches achieve an area under the receiver operating curve (AUROC) of above 0.9. In contrast, we report significant performance differences for clinically relevant tasks of mutation prediction in colorectal, gastric, and bladder cancer. In these mutation prediction tasks, classical weakly-supervised workflows outperformed MIL-based weakly-supervised methods for mutation prediction, which is surprising given their simplicity. This shows that new end-to-end image analysis pipelines in computational pathology should be compared to classical weakly-supervised methods. Also, these findings motivate the development of new methods which combine the elegant assumptions of MIL with the empirically observed higher performance of classical weakly-supervised approaches. We make all source codes publicly available at https://github.com/KatherLab/HIA, allowing easy application of all methods to any similar task.

@article{Ghaffari_Laleh2022-zd, abstract = {Artificial intelligence (AI) can extract visual information from histopathological slides and yield biological insight and clinical biomarkers. Whole slide images are cut into thousands of tiles and classification problems are often weakly-supervised: the ground truth is only known for the slide, not for every single tile. In classical weakly-supervised analysis pipelines, all tiles inherit the slide label while in multiple-instance learning (MIL), only bags of tiles inherit the label. However, it is still unclear how these widely used but markedly different approaches perform relative to each other. We implemented and systematically compared six methods in six clinically relevant end-to-end prediction tasks using data from N=2980 patients for training with rigorous external validation. We tested three classical weakly-supervised approaches with convolutional neural networks and vision transformers (ViT) and three MIL-based approaches with and without an additional attention module. Our results empirically demonstrate that histological tumor subtyping of renal cell carcinoma is an easy task in which all approaches achieve an area under the receiver operating curve (AUROC) of above 0.9. In contrast, we report significant performance differences for clinically relevant tasks of mutation prediction in colorectal, gastric, and bladder cancer. In these mutation prediction tasks, classical weakly-supervised workflows outperformed MIL-based weakly-supervised methods for mutation prediction, which is surprising given their simplicity. This shows that new end-to-end image analysis pipelines in computational pathology should be compared to classical weakly-supervised methods. Also, these findings motivate the development of new methods which combine the elegant assumptions of MIL with the empirically observed higher performance of classical weakly-supervised approaches. We make all source codes publicly available at https://github.com/KatherLab/HIA, allowing easy application of all methods to any similar task.}, added-at = {2024-09-10T11:54:51.000+0200}, author = {Ghaffari Laleh, Narmin and Muti, Hannah Sophie and Loeffler, Chiara Maria Lavinia and Echle, Amelie and Saldanha, Oliver Lester and Mahmood, Faisal and Lu, Ming Y and Trautwein, Christian and Langer, Rupert and Dislich, Bastian and Buelow, Roman D and Grabsch, Heike Irmgard and Brenner, Hermann and Chang-Claude, Jenny and Alwers, Elizabeth and Brinker, Titus J and Khader, Firas and Truhn, Daniel and Gaisa, Nadine T and Boor, Peter and Hoffmeister, Michael and Schulz, Volkmar and Kather, Jakob Nikolas}, biburl = {https://puma.scadsai.uni-leipzig.de/bibtex/229c62bb89ae3442fc97e9e69d17b9bdf/scadsfct}, interhash = {61a83fed8c79c11853767f2ca90c54d7}, intrahash = {29c62bb89ae3442fc97e9e69d17b9bdf}, journal = {Med. Image Anal.}, keywords = {Computational Convolutional Learning; Multiple-Instance Vision Weakly-supervised deep learning neural topic_lifescience transformers; zno artificial intelligence pathology networks}, language = {en}, month = jul, number = 102474, pages = 102474, publisher = {Elsevier BV}, timestamp = {2025-07-29T12:31:09.000+0200}, title = {Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology}, volume = 79, year = 2022 }

PUMA

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology

Zusammenfassung

Links und Ressourcen

Tags

Zitieren Sie diese Publikation

Mehr Zitationsstile

Suchen auf

Metadaten

Kommentare und Rezensionen
(0)

PUMA

KopierenLöschenDiese Publikation zur Ablage hinzufügenCommunity-EintragVersionsverlauf dieses EintragsURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology

Zusammenfassung

Links und Ressourcen

Tags

Zitieren Sie diese Publikation

Mehr Zitationsstile

Suchen auf

Metadaten

Kommentare und Rezensionen (0)

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology

Kommentare und Rezensionen
(0)