Predicting Malware Attributes from Cybersecurity Texts
Loading...
Links to Files
Author/Creator
Author/Creator ORCID
Date
2019-06
Type of Work
Department
Program
Citation of Original Publication
Arpita Roy, Youngja Park, Shimei Pan, Predicting Malware Attributes from Cybersecurity Texts, Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), DOI: 10.18653/v1/N19-1293
Rights
This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
Abstract
Text analytics is a useful tool for studying malware behavior and tracking emerging threats.
The task of automated malware attribute identification based on cybersecurity texts is very
challenging due to a large number of malware
attribute labels and a small number of training
instances. In this paper, we propose a novel
feature learning method to leverage diverse
knowledge sources such as small amount of
human annotations, unlabeled text and specifications about malware attribute labels. Our
evaluation has demonstrated the effectiveness
of our method over the state-of-the-art malware attribute prediction systems.