A Quantum Algorithm To Locate Unknown Hashes For Known N-Grams Within A Large Malware Corpus

dc.contributor.authorAllgood, Nicholas R.
dc.contributor.authorNicholas, Charles K.
dc.date.accessioned2020-06-08T18:13:18Z
dc.date.available2020-06-08T18:13:18Z
dc.date.issued2020-05-07
dc.description.abstractQuantum computing has evolved quickly in recent years and is showing significant benefits in a variety of fields. Malware analysis is one of those fields that could also take advantage of quantum computing. The combination of software used to locate the most frequent hashes and n-grams between benign and malicious software (KiloGram) and a quantum search algorithm could be beneficial, by loading the table of hashes and n-grams into a quantum computer, and thereby speeding up the process of mapping n-grams to their hashes. The first phase will be to use KiloGram to find the top-k hashes and n-grams for a large malware corpus. From here, the resulting hash table is then loaded into a quantum machine. A quantum search algorithm is then used search among every permutation of the entangled key and value pairs to find the desired hash value. This prevents one from having to re-compute hashes for a set of n-grams, which can take on average O(MN) time, whereas the quantum algorithm could take O(√N) in the number of table lookups to find the desired hash values.en
dc.description.sponsorshipWe extend our thanks to our colleagues Sam Lomonaco and Edward Raff for their comments on an earlier version of this paperen
dc.description.urihttps://arxiv.org/abs/2005.02911en
dc.format.extent6 pagesen
dc.genrejournal articles preprintsen
dc.identifierdoi:10.13016/m2ol6y-96a8
dc.identifier.citationNicholas R. Allgood and Charles K. Nicholas, A Quantum Algorithm To Locate Unknown Hashes For Known N-Grams Within A Large Malware Corpus, https://arxiv.org/abs/2005.02911en
dc.identifier.urihttp://hdl.handle.net/11603/18841
dc.language.isoenen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.titleA Quantum Algorithm To Locate Unknown Hashes For Known N-Grams Within A Large Malware Corpusen
dc.typeTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2005.02911.pdf
Size:
174.79 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: