UMBC at SemEval-2018 Task 8: Understanding Text about Malware

dc.contributor.authorPadia, Ankur
dc.contributor.authorRoy, Arpita
dc.contributor.authorSatyapanich, Taneeya W.
dc.contributor.authorFerraro, Francis
dc.contributor.authorPan, Shimei
dc.contributor.authorPark, Youngja
dc.contributor.authorJoshi, Anupam
dc.contributor.authorFinin, Tim
dc.date.accessioned2018-10-23T13:26:38Z
dc.date.available2018-10-23T13:26:38Z
dc.date.issued2018-06-05
dc.descriptionProceedings of International Workshop on Semantic Evaluation (SemEval-2018)en
dc.description.abstractWe describe the systems developed by the UMBC team for 2018 SemEval Task 8, SecureNLP (Semantic Extraction from CybersecUrity REports using Natural Language Processing). We participated in three of the sub-tasks: (1) classifying sentences as being relevant or irrelevant to malware, (2) predicting token labels for sentences, and (4) predicting attribute labels from the Malware Attribute Enumeration and Characterization vocabulary for defining malware characteristics. We achieved F1 scores of 50.34/18.0 (dev/test), 22.23 (test-data), and 31.98 (test-data) for Task1, Task2 and Task2 respectively. We also make our cybersecurity embeddings publicly available at https://bit.ly/cybr2vec.en
dc.description.sponsorshipThe research described in this paper was partially supported by gifts from IBM and Northrop Grumman. We thank Agniva Banerjee, Sudip Mittal, Sandeep Narayanan, Maithilee Prabodh, Vishal Rathod, and Arya Renjan for helping with annotations.en
dc.description.urihttps://www.aclweb.org/anthology/S18-1142/en
dc.format.extent7 pagesen
dc.genreconference papers and proceedingsen
dc.identifierdoi:10.13016/M2WS8HQ5Q
dc.identifier.urihttp://hdl.handle.net/11603/11641
dc.identifier.uri10.18653/v1/S18-1142
dc.language.isoenen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Information Systems Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectcybersecurityen
dc.subjectinformation extractionen
dc.subjectnatural language processingen
dc.subjectUMBC Ebiquity Research Groupen
dc.titleUMBC at SemEval-2018 Task 8: Understanding Text about Malwareen
dc.typeTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
S18-1142.pdf
Size:
272.73 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: