Nicholas, CharlesPatel, Tirth Jitendra2023-04-052023-04-052022-01-0112661http://hdl.handle.net/11603/27351Due to the number of daily malware attacks, we have been relying on machinelearning to detect them. Lots of people sell systems that claim to do this, which we refer to as malware classifiers. Evaluating malware classifiers can be tricky. There are many types of malware classifiers, each of which has its purpose. The purpose may be to classify whether a given specimen was malicious or benign, or it may be to classify the malware by its family name, or it may be something else. Nevertheless, for any of these purposes, it has been noted that the malware classifier evaluated similar data on which it was trained. By similar data here, we mean that the training and testing data of the malware classifier included malware samples from similar families. After some false starts, we built a benchmark that can be used to evaluate malware classifiers, even when confronted with malware that they had not seen before.application:pdfBenchmarkingMachine LearningMalware ClassifiersMalware DatasetsMalware FamiliesUnknown FamiliesEvaluating Malware Classifiers on Unknown Malware FamiliesText