Extracting cybersecurity related linked data from text

Arnav Joshi, Ravendar Lal, Tim Finin, and Anupam Joshi, Extracting cybersecurity related linked data from text, 2013 IEEE Seventh International Conference on Semantic Computing , 2013, DOI: 10.1109/ICSC.2013.50

Rights

This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
© 2013 IEEE

Subjects

cybersecurity
linked data
information extraction
ontology
Computer crime
data mining
CRF-based system
National Vulnerability Database
RDF linked data representation
UMBC Ebiquity Research Group
security of data

Abstract

The Web is typically our first source of information about new software vulnerabilities, exploits and cyber-attacks. Information is found in semi-structured vulnerability databases as well as in text from security bulletins, news reports, cybersecurity blogs and Internet chat rooms. It can be useful to cybersecurity systems if there is a way to recognize and extract relevant information and represent it as easily shared and integrated semantic data. We describe such an automatic framework that generates and publishes a RDF linked data representation of cybersecurity concepts and vulnerability descriptions extracted from the National Vulnerability Database and from text sources. A CRF-based system is used to identify cybersecurity-related entities, concepts and relations in text, which are then represented using custom ontologies for the cybersecurity domain and also mapped to objects in the DBpedia knowledge base. The resulting cybersecurity linked data collection can be used for many purposes, including automating early vulnerability identification, mitigation and prevention efforts.

Extracting cybersecurity related linked data from text

Files

Links to Files

Permanent Link

Collections

Author/Creator

Author/Creator ORCID

Date

Type of Work

Department

Program

Citation of Original Publication

Rights

Subjects

Abstract