Extracting Rich Semantic Information about Cybersecurity Events

dc.contributor.authorSatyapanich, Taneeya
dc.contributor.authorFinin, Tim
dc.contributor.authorFerraro, Francis
dc.date.accessioned2020-07-22T16:46:20Z
dc.date.available2020-07-22T16:46:20Z
dc.date.issued2020-02-24
dc.description2019 IEEE International Conference on Big Data (Big Data), 9-12 Dec. 2019, Los Angeles, CA, USA.en_US
dc.description.abstractArticles about cybersecurity events like data breaches and ransomware attacks are common, both in general news and technical sources. Automatically extracting structured information from these can provide valuable information to inform both human analysts and computer systems. In this paper we describe how cybersecurity events can be described via semantic schemas, examined through an initial set of five event types. Using a collection of 1,000 news articles annotated with these event types, including their semantic roles, arguments, realis, and coreference, we detail a modular, deep-learning based information extraction (IE) pipeline, which extracts useful event information with high accuracy. We argue that the event argument set considered here can support many other cybersecurity events, facilitating the extension to new cybersecurity event types, such as distributed denial of service and SQL injection attacks.en_US
dc.description.sponsorship. Partial support for this research was provided by a gift from the IBM AI Horizons Network.en_US
dc.description.urihttps://ieeexplore.ieee.org/document/9006444en_US
dc.format.extent9 pagesen_US
dc.genreconference papers and proceedings preprintsen_US
dc.identifierdoi:10.13016/m2l6hq-wd0p
dc.identifier.citationT. Satyapanich, T. Finin and F. Ferraro, "Extracting Rich Semantic Information about Cybersecurity Events," 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 2019, pp. 5034-5042, doi: 10.1109/BigData47090.2019.9006444.en_US
dc.identifier.uri10.1109/BigData47090.2019.9006444
dc.identifier.urihttp://hdl.handle.net/11603/19217
dc.language.isoen_USen_US
dc.publisherIEEEen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rights© 2019 IEEE
dc.subjectUMBC Ebiquity Research Group
dc.titleExtracting Rich Semantic Information about Cybersecurity Eventsen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
944.pdf
Size:
838.01 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: