Using Deep Neural Networks to Translate Multi-lingual Threat Intelligence

dc.contributor.authorRanade, Priyanka
dc.contributor.authorMittal, Sudip
dc.contributor.authorJoshi, Anupam
dc.contributor.authorJoshi, Karuna
dc.date.accessioned2018-09-05T19:44:41Z
dc.date.available2018-09-05T19:44:41Z
dc.description.abstractThe multilingual nature of the Internet increases complications in the cybersecurity community's ongoing efforts to strategically mine threat intelligence from OSINT data on the web. OSINT sources such as social media, blogs, and dark web vulnerability markets exist in diverse languages and hinder security analysts, who are unable to draw conclusions from intelligence in languages they don't understand. Although third party translation engines are growing stronger, they are unsuited for private security environments. First, sensitive intelligence is not a permitted input to third party engines due to privacy and confidentiality policies. In addition, third party engines produce generalized translations that tend to lack exclusive cybersecurity terminology. In this paper, we address these issues and describe our system that enables threat intelligence understanding across unfamiliar languages. We create a neural network based system that takes in cybersecurity data in a different language and outputs the respective English translation. The English translation can then be understood by an analyst, and can also serve as input to an AI based cyber-defense system that can take mitigative action. As a proof of concept, we have created a pipeline which takes Russian threats and generates its corresponding English, RDF, and vectorized representations. Our network optimizes translations on specifically, cybersecurity data.en_US
dc.description.sponsorshipThe work was partially supported by a gift from IBM Research, USA and the Undergraduate Research Award from the University of Maryland, Baltimore County.en_US
dc.description.urihttps://arxiv.org/abs/1807.07517en_US
dc.format.extent6 PAGESen_US
dc.genrejournal articles preprintsen_US
dc.identifierdoi:10.13016/M2J38KM8N
dc.identifier.uri2018arXiv180707517R
dc.identifier.urihttp://hdl.handle.net/11603/11236
dc.language.isoen_USen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please contact the author.
dc.subjectComputer Science - Computation and Languageen_US
dc.subjectComputer Science - Cryptography and Securityen_US
dc.subjectUMBC Ebiquity Research Group
dc.titleUsing Deep Neural Networks to Translate Multi-lingual Threat Intelligenceen_US
dc.typeTexten_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1807.07517.pdf
Size:
941.04 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: