Random Forest of Tensors

dc.contributor.advisorNicholas, Charles K
dc.contributor.authorEren, Maksim Ekin
dc.contributor.departmentComputer Science and Electrical Engineering
dc.contributor.programComputer Science
dc.date.accessioned2024-03-21T19:37:43Z
dc.date.available2024-03-21T19:37:43Z
dc.date.issued2022-01-01
dc.description.abstractTensor decomposition is a powerful unsupervised Machine Learning method that enables the modeling of multi-dimensional data, including malware data. This thesis introduces a novel ensemble semi-supervised classification algorithm, named Random Forest of Tensors (RFoT), that utilizes tensor decomposition to extract the complex and multi-faceted latent patterns from data. Our hybrid model leverages the strength of multi-dimensional analysis combined with clustering to capture the sample groupings in the latent components, whose combinations distinguish malware and benign-ware. The patterns extracted from a malware data with tensor decomposition depend upon the configuration of the tensor such as dimension, entry, and rank selection. To capture the unique perspectives of different tensor configurations, we employ the "wisdom of crowds" philosophy and make use of decisions made by the majority of a randomly generated ensemble of tensors with varying dimensions, entries, and ranks. We show the capabilities of RFoT when classifying Windows Portable Executable (PE) malware and benign-ware.
dc.formatapplication:pdf
dc.genrethesis
dc.identifierdoi:10.13016/m2h4cb-vbvz
dc.identifier.other12529
dc.identifier.urihttp://hdl.handle.net/11603/32397
dc.languageen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Theses and Dissertations Collection
dc.relation.ispartofUMBC Graduate School Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu
dc.sourceOriginal File Name: Eren_umbc_0434M_12529.pdf
dc.subjectclustering
dc.subjectensemble learning
dc.subjectmalware classification
dc.subjectrandom algorithms
dc.subjectsemi-supervised learning
dc.subjecttensor decomposition
dc.titleRandom Forest of Tensors
dc.typeText
dcterms.accessRightsAccess limited to the UMBC community. Item may possibly be obtained via Interlibrary Loan thorugh a local library, pending author/copyright holder's permission.

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Eren_umbc_0434M_12529.pdf
Size:
1.04 MB
Format:
Adobe Portable Document Format