Entity Disambiguation for Knowledge Base Population

Dredze, MarkMcNamee, PaulRao, DelipGerber, AdamFinin, TimEntity Disambiguation for Knowledge Base PopulationMy University2010information extractionknowledge basenatural language processingnatural language processingUMBC Ebiquity Research GroupMy UniversityMy University2018-11-142018-11-142010-08-23enTexthttp://hdl.handle.net/11603/119869 pagesThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.Proceedings of the 23rd International Conference on Computational LinguisticsThe integration of facts derived from information extraction systems into existing knowledge bases requires a system to disambiguate entity mentions in the text. This is challenging due to issues such as non-uniform variations in entity names, mention ambiguity, and entities absent from a knowledge base. We present a state of the art system for entity disambiguation that not only addresses these challenges but also scales to knowledge bases with several million entries using very little resources. Further, our approach achieves performance of up to 95% on entities mentioned from newswire and 80% on a public test set that was designed to include challenging queries.