Modeling and Extracting Information about Cybersecurity Events from Text

dc.contributor.advisorFinin, Tim
dc.contributor.authorSatyapanich, Taneeya
dc.contributor.departmentComputer Science and Electrical Engineering
dc.contributor.programComputer Science
dc.date.accessioned2021-09-01T13:55:55Z
dc.date.available2021-09-01T13:55:55Z
dc.date.issued2020-01-20
dc.description.abstractPeople now rely on the Internet to carry out much of their daily activities such as banking, ordering food, and socializing with their family and friends. The technology facilitates our lives, but also comes with many problems, including cybercrimes, stolen data, and identity theft. With the large and increasing number of transactions done every day, the frequency of cybercrime events is also growing. Since the number of security-related events is too high for manual review and monitoring, we need to train machines to be able to detect and gather data about potential cyber threats. To support machines that can identify and understand threats, we need standard models to store the cybersecurity information and information extraction systems that can collect information to populate the models with data from text. This dissertations makes two significant contributions. First, we defined rich cybersecurity event schema and annotated a news corpus following the schema. Our schema consists of event type definitions, semantic roles, and event arguments. Second, we present CASIE, a cybersecurity event extraction system. CASIE can detect cybersecurity events, identify event participants and their roles, including specifying realis values. It also groups the events, which are coreference. CASIE produces output in an easy to use format, as a JSON object. We believe that this work will be useful for cybersecurity management in the future. It will quickly grasp cybersecurity event information out of the unstructured text and fill in the event frame. So we can keep up with many cybersecurity events that happen every day.
dc.formatapplication:pdf
dc.genredissertations
dc.identifierdoi:10.13016/m2dqe2-8sc5
dc.identifier.other12123
dc.identifier.urihttp://hdl.handle.net/11603/22920
dc.languageen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Theses and Dissertations Collection
dc.relation.ispartofUMBC Graduate School Collection
dc.relation.ispartofUMBC Student Collection
dc.sourceOriginal File Name: Satyapanich_umbc_0434D_12123.pdf
dc.subjectCybersecurity
dc.subjectCybersecurity Event Schema
dc.subjectDeep learning
dc.subjectEvent detection
dc.subjectInformation Extraction
dc.subjectNatural Language Processing
dc.titleModeling and Extracting Information about Cybersecurity Events from Text
dc.typeText
dcterms.accessRightsAccess limited to the UMBC community. Item may possibly be obtained via Interlibrary Loan thorugh a local library, pending author/copyright holder's permission.
dcterms.accessRightsThis item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Satyapanich_umbc_0434D_12123.pdf
Size:
1.56 MB
Format:
Adobe Portable Document Format