A Comparative Study of Deep Learning based Named Entity Recognition Algorithms for Cybersecurity

Dasgupta, Soham; Piplai, Aritran; Kotal, Anantaa; Joshi, Anupam

dc.contributor.author	Dasgupta, Soham
dc.contributor.author	Piplai, Aritran
dc.contributor.author	Kotal, Anantaa
dc.contributor.author	Joshi, Anupam
dc.date.accessioned	2020-12-14T16:50:09Z
dc.date.available	2020-12-14T16:50:09Z
dc.date.issued	2020-12-10
dc.description	4th International Workshop on Big Data Analytics for Cyber Intelligence and Defense, IEEE International Conference on Big Data	en
dc.description.abstract	Named Entity Recognition (NER) is important in the cybersecurity domain. It helps researchers extract cyber threat information from unstructured text sources. The extracted cyber entities or key expressions can be used to model a cyber-attack described in an open-source text. A large number of general-purpose NER algorithms have been published that work well in text analysis. These algorithms do not perform well when applied to the cybersecurity domain. In the field of cybersecurity, the open-source text available varies greatly in complexity and underlying structure of the sentences. General-purpose NER algorithms can misrepresent domain-specific words, such as “malicious” and “javascript”. In this paper, we compare the recent deep learning-based NER algorithms on a cybersecurity dataset. We created a cybersecurity dataset collected from various sources, including “Microsoft Security Bulletin” and “Adobe Security Updates”. Some of these approaches proposed in the literature were not used for cybersecurity. Others are innovations proposed by us. This comparative study helps us identify the NER algorithms that are robust and can work well in sentences taken from a large number of cybersecurity sources. We tabulate their performance on the test set and identify the best NER algorithm for a cybersecurity corpus. We also discuss the different embedding strategies that aid in the process of NER for the chosen deep learning algorithms.	en
dc.description.sponsorship	This work was supported in part by an award from DoD to Joshi.	en
dc.description.uri	https://ieeexplore.ieee.org/document/9378482	en
dc.format.extent	9 pages	en
dc.genre	conference papers and proceedings preprints	en
dc.identifier	doi:10.13016/m2wds7-b7n8
dc.identifier.citation	S. Dasgupta, A. Piplai, A. Kotal and A. Joshi, "A Comparative Study of Deep Learning based Named Entity Recognition Algorithms for Cybersecurity," 2020 IEEE International Conference on Big Data (Big Data), 2020, pp. 2596-2604, doi: 10.1109/BigData50022.2020.9378482.	en
dc.identifier.uri	http://hdl.handle.net/11603/20255
dc.identifier.uri	https://doi.org/10.1109/BigData50022.2020.9378482
dc.language.iso	en	en
dc.publisher	IEEE	en
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof	UMBC Faculty Collection
dc.relation.ispartof	UMBC Student Collection
dc.rights	This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rights	© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.subject	UMBC Ebiquity Research Group
dc.title	A Comparative Study of Deep Learning based Named Entity Recognition Algorithms for Cybersecurity	en
dc.type	Text	en

Files

Original bundle

Name: 1058.pdf
Size: 1.02 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 2.56 KB
Format: Item-specific license agreed upon to submission

Collections

UMBC Computer Science and Electrical Engineering Department
UMBC Faculty Collection
UMBC Student Collection