Nicholas, CharlesS, Raguvir2024-08-092024-08-092024-01-0112849http://hdl.handle.net/11603/35318Exploring and analysing a dataset provides insights into what kind of data is present and how it can be used. This is especially useful for malware datasets. As an area that is growing bigger due to the implementation of machine learning techniques, having knowledge about a dataset may assist in any future machine learning task that can be done on the dataset. This work aims to gain statistical insights about a dataset of malware and to explore patterns of different families of malware. This will provide a gateway to enable categorizing malicious files based on their properties. One of the outcomes of this work is the discovery of patterns and insights as to how different attributes of a malware specimen can act as an indicator of its maliciousness.application:pdfThis item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.eduData AnalysisDescriptive StatisticsFeature ImportanceMalware analysisStatic AnalysisVirus TotalDescriptive Statistics of Malware DataText