Active Semi-Supervised Learning via Bayesian Experimental Design for Lung Cancer Classification Using Low Dose Computed Tomography Scans

dc.contributor.authorNguyen, Phuong
dc.contributor.authorRathod, Ankita
dc.contributor.authorChapman, David
dc.contributor.authorPrathapan, Smriti
dc.contributor.authorMenon, Sumeet
dc.contributor.authorMorris, Michael
dc.contributor.authorYesha, Yelena
dc.date.accessioned2023-04-12T17:33:04Z
dc.date.available2023-04-12T17:33:04Z
dc.date.issued2023-03-15
dc.description.abstractWe introduce an active, semisupervised algorithm that utilizes Bayesian experimental design to address the shortage of annotated images required to train and validate Artificial Intelligence (AI) models for lung cancer screening with computed tomography (CT) scans. Our approach incorporates active learning with semisupervised expectation maximization to emulate the human in the loop for additional ground truth labels to train, evaluate, and update the neural network models. Bayesian experimental design is used to intelligently identify which unlabeled samples need ground truth labels to enhance the model’s performance. We evaluate the proposed Active Semi-supervised Expectation Maximization for Computer aided diagnosis (CAD) tasks (ASEM-CAD) using three public CT scans datasets: the National Lung Screening Trial (NLST), the Lung Image Database Consortium (LIDC), and Kaggle Data Science Bowl 2017 for lung cancer classification using CT scans. ASEM-CAD can accurately classify suspicious lung nodules and lung cancer cases with an area under the curve (AUC) of 0.94 (Kaggle), 0.95 (NLST), and 0.88 (LIDC) with significantly fewer labeled images compared to a fully supervised model. This study addresses one of the significant challenges in early lung cancer screenings using low-dose computed tomography (LDCT) scans and is a valuable contribution towards the development and validation of deep learning algorithms for lung cancer screening and other diagnostic radiology examinations.en_US
dc.description.sponsorshipThis work was sponsored by the NSF IUCRC Center for Accelerated Real Time Analytics (CARTA), (https://carta.umbc.edu/, https://carta.miami.edu/, accessed on 9 March 2023) (NSF grant award #1747724).en_US
dc.description.urihttps://www.mdpi.com/2076-3417/13/6/3752en_US
dc.format.extent19 pagesen_US
dc.genrejournal articlesen_US
dc.identifierdoi:10.13016/m2rt8o-07bo
dc.identifier.citationNguyen, Phuong, Ankita Rathod, David Chapman, Smriti Prathapan, Sumeet Menon, Michael Morris, and Yelena Yesha. 2023. "Active Semi-Supervised Learning via Bayesian Experimental Design for Lung Cancer Classification Using Low Dose Computed Tomography Scans" Applied Sciences 13, no. 6: 3752. https://doi.org/10.3390/app13063752en_US
dc.identifier.urihttps://doi.org/10.3390/app13063752
dc.identifier.urihttp://hdl.handle.net/11603/27600
dc.language.isoen_USen_US
dc.publisherMDPIen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.en_US
dc.rightsAttribution 4.0 International (CC BY 4.0)*
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/*
dc.titleActive Semi-Supervised Learning via Bayesian Experimental Design for Lung Cancer Classification Using Low Dose Computed Tomography Scansen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
applsci-13-03752.pdf
Size:
5.51 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: