A Quantum Algorithm To Locate Unknown Hashes For Known N-Grams Within A Large Malware Corpus

dc.contributor.authorAllgood, Nicholas R.
dc.contributor.authorNicholas, Charles K.
dc.date.accessioned2020-06-08T18:13:18Z
dc.date.available2020-06-08T18:13:18Z
dc.date.issued2020-05-07
dc.description.abstractQuantum computing has evolved quickly in recent years and is showing significant benefits in a variety of fields. Malware analysis is one of those fields that could also take advantage of quantum computing. The combination of software used to locate the most frequent hashes and n-grams between benign and malicious software (KiloGram) and a quantum search algorithm could be beneficial, by loading the table of hashes and n-grams into a quantum computer, and thereby speeding up the process of mapping n-grams to their hashes. The first phase will be to use KiloGram to find the top-k hashes and n-grams for a large malware corpus. From here, the resulting hash table is then loaded into a quantum machine. A quantum search algorithm is then used search among every permutation of the entangled key and value pairs to find the desired hash value. This prevents one from having to re-compute hashes for a set of n-grams, which can take on average O(MN) time, whereas the quantum algorithm could take O(√N) in the number of table lookups to find the desired hash values.en_US
dc.description.sponsorshipWe extend our thanks to our colleagues Sam Lomonaco and Edward Raff for their comments on an earlier version of this paperen_US
dc.description.urihttps://arxiv.org/abs/2005.02911en_US
dc.format.extent6 pagesen_US
dc.genrejournal articles preprintsen_US
dc.identifierdoi:10.13016/m2ol6y-96a8
dc.identifier.citationNicholas R. Allgood and Charles K. Nicholas, A Quantum Algorithm To Locate Unknown Hashes For Known N-Grams Within A Large Malware Corpus, https://arxiv.org/abs/2005.02911en_US
dc.identifier.urihttp://hdl.handle.net/11603/18841
dc.language.isoen_USen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.titleA Quantum Algorithm To Locate Unknown Hashes For Known N-Grams Within A Large Malware Corpusen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2005.02911.pdf
Size:
174.79 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: