Cross-cancer Prediction: A Novel Machine Learning Approach to Discover Molecular Targets for Development of Treatments for Multiple Cancers

Author/Creator ORCID

Date

2018-10-22

Department

Program

Citation of Original Publication

Katie Gao, Dayong Wang, Yi Huang, Cross-cancer Prediction: A Novel Machine Learning Approach to Discover Molecular Targets for Development of Treatments for Multiple Cancers, Cancer Informatics Volume 17: 1–8, 2018, https://doi.org/10.1177/1176935118805398

Rights

This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)

Abstract

Conventional cancer drug development has long been limited to organ- or tissue-specific cancer types. However, it has become increasingly known that specific genetic abnormalities are responsible for the carcinogenesis of multiple cancers. The recent US Food and Drug Administration (FDA) approval of the first multi-cancer drug, Keytruda, has demonstrated the feasibility of developing new drugs that target multiple cancers. Despite a promising future, methodological development for identifying multi-cancer molecular targets remains encumbered. This study developed a novel machine learning approach to identify such genes responsible for multiple cancers by synthesizing salient genomic information from cancer-specific classification models. This approach centered on the cross-cancer prediction method for identifying groups of cancers with high cross-cancer predictability. Furthermore, a robust hybrid classifier, comprising Prediction Analysis for Microarrays and Random Forest, was developed to integrate predictive models for gene inference. This approach has successfully identified key genes shared by endometrial cancer, mammary gland ductal carcinoma, and small cell lung cancer. The results are supported by published experimental evidence. This framework holds potential to transform the current methods of discovering multi-cancer molecular targets for clinical oncology.