Catch'em all: Classification of Rare, Prominent, and Novel Malware Families

dc.contributor.authorEren, Maksim E.
dc.contributor.authorBarron, Ryan
dc.contributor.authorBhattarai, Manish
dc.contributor.authorWanna, Selma
dc.contributor.authorSolovyev, Nicholas
dc.contributor.authorRasmussen, Kim
dc.contributor.authorAlexandrov, Boian S.
dc.contributor.authorNicholas, Charles
dc.date.accessioned2024-03-27T13:26:10Z
dc.date.available2024-03-27T13:26:10Z
dc.date.issued2024-03-04
dc.description.abstractNational security is threatened by malware, which remains one of the most dangerous and costly cyber threats. As of last year, researchers reported 1.3 billion known malware specimens, motivating the use of data-driven machine learning (ML) methods for analysis. However, shortcomings in existing ML approaches hinder their mass adoption. These challenges include detection of novel malware and the ability to perform malware classification in the face of class imbalance: a situation where malware families are not equally represented in the data. Our work addresses these shortcomings with MalwareDNA: an advanced dimensionality reduction and feature extraction framework. We demonstrate stable task performance under class imbalance for the following tasks: malware family classification and novel malware detection with a trade-off in increased abstention or reject-option rate.
dc.description.sponsorshipThis manuscript has been assigned LA-UR-24-21917. This research was funded by the LANL LDRD grant 20230067SR and the LANL Institutional Computing Program, supported by the U.S. Department of Energy National Nuclear Security Administration under Contract No. 89233218CNA000001.
dc.description.urihttp://arxiv.org/abs/2403.02546
dc.format.extent6 pages
dc.genrejournal articles
dc.genrepreprints
dc.identifierdoi:10.13016/m2bj7o-bfwv
dc.identifier.urihttps://doi.org/10.48550/arXiv.2403.02546
dc.identifier.urihttp://hdl.handle.net/11603/32675
dc.language.isoen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.rightsThis work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.
dc.rightsPublic Domain Mark 1.0
dc.rights.urihttps://creativecommons.org/publicdomain/mark/1.0/deed.en
dc.subjectComputer Science - Cryptography and Security
dc.titleCatch'em all: Classification of Rare, Prominent, and Novel Malware Families
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0001-9494-7139

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2403.02546.pdf
Size:
1.58 MB
Format:
Adobe Portable Document Format