Research on fine-tuning CNN for cancer diagnosis with gene expression data

dc.contributor.authorLiu, Zhen
dc.contributor.authorWang, Ruoyu
dc.contributor.authorYang, Jin
dc.contributor.authorZhang, Wenbin
dc.date.accessioned2023-03-06T18:09:41Z
dc.date.available2023-03-06T18:09:41Z
dc.date.issued2022-06-21
dc.descriptionICMLC 2022: 2022 14th International Conference on Machine Learning and Computing (ICMLC)February 2022
dc.description.abstractConvolutional neural networks have been used for cancer type prediction with gene expression data. However, its success is impeded by the lack of large labeled datasets in gene expression data. The class imbalance problem leads to that the model ignores the performance of the minority class. To handle the small sample size problem, fine-tuning CNN is used to transfer the knowledge of pre-trained model for cancer type predicting. The dataset with one cancer is used for training a model. The pre-model is finetuned with the training set of a new cancer type, and the fine-tuned model could be used for identifying the new cancer type. And the SMOTE resampling method is used for handling the class imbalance problem. We carried out experiments on The TCGA datasets with 1D-CNN and 2D-CNN models. The fine-tuned 1D-CNN obtains 97.5% accuracy, 98.6% Fscore of cancer type and 78.1% Fscore of normal type on average, and fine-tuned 2D-CNN obtains 97.4% accuracy, 98.5% Fscore of cancer type and 77.4% of normal type on average. Using fine-tuned CNN with SMOTE, the accuracy, Fscore of cancer type and the one of normal type are respectively increased about 1.5%, 0.5% and 21.5% on average.en_US
dc.description.sponsorshipWe thank the anonymous reviewers for their constructive comments. This work is supported by the Key research platforms and projects of colleges and universities in Guangdong Province [Grant No. 2020ZDZX3060, 2019KZDZX1020], National Natural Science Foundation of China [Grant No. 61501128, 61976239], financial support from China Scholarship Council, Natural Science Foundation of Guangdong Province [Grant Nos. 2017A030313345, 2020A1515010783.en_US
dc.description.urihttps://dl.acm.org/doi/abs/10.1145/3529836.3529844en_US
dc.format.extent6 pagesen_US
dc.genreconference papers and proceedingsen_US
dc.genrepostprintsen_US
dc.identifierdoi:10.13016/m2o2as-bluy
dc.identifier.citationLiu, Zhen, et al. "Research on fine-tuning CNN for cancer diagnosis with gene expression data" ICMLC 2022: 2022 14th International Conference on Machine Learning and Computing (ICMLC) (ACM), 2022 pp. 140-145. https://doi.org/10.1145/3529836.3529844.en_US
dc.identifier.urihttps://doi.org/10.1145/3529836.3529844
dc.identifier.urihttp://hdl.handle.net/11603/26951
dc.language.isoen_USen_US
dc.publisherACMen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Information Systems Department Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsPermission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.en_US
dc.titleResearch on fine-tuning CNN for cancer diagnosis with gene expression dataen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ICMLC21.pdf
Size:
488.42 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: