Improving Neural Named Entity Recognition with Gazetteers

dc.contributor.authorSong, Chan Hee
dc.contributor.authorLawrie, Dawn
dc.contributor.authorFinin, Tim
dc.contributor.authorMayfield, James
dc.date.accessioned2020-04-10T15:53:20Z
dc.date.available2020-04-10T15:53:20Z
dc.date.issued2020-03-06
dc.description.abstractThe goal of this work is to improve the performance of a neural named entity recognition system by adding input features that indicate a word is part of a name included in a gazetteer. This article describes how to generate gazetteers from the Wikidata knowledge graph as well as how to integrate the information into a neural NER system. Experiments reveal that the approach yields performance gains in two distinct languages: a high-resource, word-based language, English and a high-resource, character-based language, Chinese. Experiments were also performed in a low-resource language, Russian on a newly annotated Russian NER corpus from Reddit tagged with four core types and twelve extended types. This article reports a baseline score. It is a longer version of a paper in the 33rd FLAIRS conference (Song et al. 2020).en
dc.description.urihttps://arxiv.org/abs/2003.03072en
dc.format.extent8 pagesen
dc.genrejournal articles preprintsen
dc.identifierdoi:10.13016/m2ifgq-yed1
dc.identifier.citationSong, Chan Hee; Lawrie, Dawn; Finin, Tim; Mayfield, James; Improving Neural Named Entity Recognition with Gazetteers; Computation and Language (2020); https://arxiv.org/abs/2003.03072en
dc.identifier.urihttp://hdl.handle.net/11603/17981
dc.language.isoenen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsAttribution 4.0 International (CC BY 4.0)*
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/*
dc.titleImproving Neural Named Entity Recognition with Gazetteersen
dc.typeTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2003.03072.pdf
Size:
741.7 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: