Reproducibility in deep learning algorithms for digital pathology applications: a case study using the CAMELYON16 datasets

dc.contributor.authorLi, Weizhe
dc.contributor.authorChen, Weijie
dc.date.accessioned2025-10-22T19:57:48Z
dc.date.issued2021-02-15
dc.descriptionSPIE MEDICAL IMAGING February 15-20, 2021, Online, California, United States
dc.description.abstractReproducibility as a cornerstone of science has in recent years attracted significant attention in a variety of applications that involve big data and complex algorithms. Artificial Intelligence (AI) applications, in particular, have been referred by some authors as having a “reproducibility crisis”. The application of deep learning algorithms in digital pathology, which has been demonstrated in some studies to achieve pathologist-level performance, involves both big data and complex algorithms. It is important to identify technical factors influencing AI performance for such studies to be reproducible. In this work, we conducted a reproducibility study using the public datasets shared by the CAMELYON16 challenge, which aimed to develop and assess algorithms for the detection of breast cancer metastasis using whole-slide images (WSIs) of lymph node sections. We used the two-stage classification framework of the CAMELYON16 challenge’s top-performing algorithm, i.e., a convolutional neural network (Inception V1) to generate heatmaps followed by feature extraction from the heatmaps and a random forest classifier for classification. We investigated the effects of variations in training/testing procedures on the performance of AI/ML algorithms including color augmentation methods, color normalization methods, and the resolution of heatmaps. Our results showed that, despite the differences in color augmentation and color normalization methods, the utilization of these techniques improved the classification performance by an AUC value of 0.07 compared to without using them, which is consistent with the CAMELYON16 findings. We concluded that sufficient details of technical description should be provided for a study to be fully reproducible.
dc.description.sponsorshipThe authors thank Dr. Mike Mikailov for his help in using the FDA’s HPC facility in this work.
dc.description.urihttps://www.spiedigitallibrary.org/conference-proceedings-of-spie/11603/1160318/Reproducibility-in-deep-learning-algorithms-for-digital-pathology-applications/10.1117/12.2581996.full
dc.format.extent10 pages
dc.genreconference papers and proceedings
dc.genreposters
dc.genrepresentations (communicative events)
dc.identifierdoi:10.13016/m2h1gr-1ji9
dc.identifier.citationLi, Weizhe, and Weijie Chen. “Reproducibility in Deep Learning Algorithms for Digital Pathology Applications: A Case Study Using the CAMELYON16 Datasets.” Medical Imaging 2021: Digital Pathology 11603 (February 2021): 323–32. https://doi.org/10.1117/12.2581996.
dc.identifier.urihttps://doi.org/10.1117/12.2581996
dc.identifier.urihttp://hdl.handle.net/11603/40497
dc.language.isoen
dc.publisherSPIE
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.rightsThis work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.
dc.rightsPublic Domain
dc.rights.urihttps://creativecommons.org/publicdomain/mark/1.0/
dc.subjectUMBC High Performance Computing Facility (HPCF)
dc.titleReproducibility in deep learning algorithms for digital pathology applications: a case study using the CAMELYON16 datasets
dc.typeText

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
1160318(1).pdf
Size:
697.16 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
Reproducibilityindeeplearningalgorithmsfordigitalpathologyapplicationsacase.pdf
Size:
50.43 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
1160341poster.pdf
Size:
1.17 MB
Format:
Adobe Portable Document Format