UMBC at SemEval-2018 Task 8: Understanding Text about Malware

dc.contributor.author: Padia, Ankur
dc.contributor.author: Roy, Arpita
dc.contributor.author: Satyapanich, Taneeya W.
dc.contributor.author: Ferraro, Francis
dc.contributor.author: Pan, Shimei
dc.contributor.author: Park, Youngja
dc.contributor.author: Joshi, Anupam
dc.contributor.author: Finin, Tim
dc.date.accessioned: 2018-10-23T13:26:38Z
dc.date.available: 2018-10-23T13:26:38Z
dc.date.issued: 2018-06-05
dc.description: Proceedings of International Workshop on Semantic Evaluation (SemEval-2018) [en_US]
dc.description.abstract: We describe the systems developed by the UMBC team for 2018 SemEval Task 8, SecureNLP (Semantic Extraction from CybersecUrity REports using Natural Language Processing). We participated in three of the subtasks: (1) classifying sentences as being relevant or irrelevant to malware, (2) predicting token labels for sentences, and (4) predicting attribute labels from the Malware Attribute Enumeration and Characterization vocabulary for defining malware characteristics. We achieved F1 scores of 50.34/18.0 (dev/test), 22.23 (test data), and 31.98 (test data) for Subtasks 1, 2, and 4, respectively. We also make our cybersecurity embeddings publicly available at https://bit.ly/cybr2vec. [en_US]
dc.description.sponsorship: The research described in this paper was partially supported by gifts from IBM and Northrop Grumman. We thank Agniva Banerjee, Sudip Mittal, Sandeep Narayanan, Maithilee Prabodh, Vishal Rathod, and Arya Renjan for helping with annotations. [en_US]
dc.description.uri: https://www.aclweb.org/anthology/S18-1142/ [en_US]
dc.format.extent: 7 pages [en_US]
dc.genre: conference papers and proceedings [en_US]
dc.identifier: doi:10.13016/M2WS8HQ5Q
dc.identifier.uri: http://hdl.handle.net/11603/11641
dc.identifier.uri: 10.18653/v1/S18-1142
dc.language.iso: en_US [en_US]
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Information Systems Department Collection
dc.relation.ispartof: UMBC Faculty Collection
dc.relation.ispartof: UMBC Student Collection
dc.relation.ispartof: UMBC Computer Science and Electrical Engineering Department
dc.rights: This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rights: Attribution 4.0 International
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject: cybersecurity [en_US]
dc.subject: information extraction [en_US]
dc.subject: natural language processing [en_US]
dc.subject: UMBC Ebiquity Research Group [en_US]
dc.title: UMBC at SemEval-2018 Task 8: Understanding Text about Malware [en_US]
dc.type: Text [en_US]

Files

Original bundle

Name: S18-1142.pdf
Size: 272.73 KB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.68 KB
Format: Item-specific license agreed upon to submission