Independent Component Analysis for Trustworthy Cyberspace during High Impact Events: An Application to Covid-19

Author/Creator ORCID

Date

2020-06-30

Department

Program

Citation of Original Publication

Boukouvalas, Zois; Mallinson, Christine; Crothers, Evan; Japkowicz, Nathalie; Piplai, Aritran; Mittal, Sudip; Joshi, Anupam; Adalı, Tülay; Independent Component Analysis for Trustworthy Cyberspace during High Impact Events: An Application to Covid-19; Machine Learning (2020); https://arxiv.org/abs/2006.01284

Rights

This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.

Abstract

Social media has become an important communication channel during high impact events, such as the COVID-19 pandemic. As misinformation in social media can rapidly spread, creating social unrest, curtailing the spread of misinformation during such events is a significant data challenge. While recent solutions that are based on machine learning have shown promise for the detection of misinformation, most widely used methods include approaches that rely on either handcrafted features that cannot be optimal for all scenarios, or those that are based on deep learning where the interpretation of the prediction results is not directly accessible. In this work, we propose a data-driven solution that is based on the ICA model, such that knowledge discovery and detection of misinformation are achieved jointly. To demonstrate the effectiveness of our method and compare its performance with deep learning methods, we developed a labeled COVID-19 Twitter dataset based on socio-linguistic criteria.