KELVIN: Extracting Knowledge from Large Text Collections
Loading...
Files
Permanent Link
Author/Creator
Author/Creator ORCID
Date
2014-11-13
Type of Work
Department
Program
Citation of Original Publication
Mayfield, James, Paul McNamee, Craig Harman, Tim Finin, and Dawn Lawrie. “KELVIN: Extracting Knowledge from Large Text Collections.” Natural Language Access to Big Data: Papers from the 2014 AAAI Fall Symposium. AAAI Press (2022). https://aaai.org/papers/09140-kelvin-extracting-knowledge-from-large-text-collections/.
Rights
This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
Abstract
We describe the KELVIN system for extracting entities and relations from large text collections and its use in the TAC Knowledge Base Population Cold Start task run by the U.S. National Institute of Standards and Technology. The Cold Start task starts with an empty knowledge base defined by an ontology or entity types, properties and relations. Evaluations in 2012 and 2013 were done using a collection of text from local Web and news to de-emphasize the linking entities to a background knowledge bases such as Wikipedia. Interesting features of KELVIN include a cross-document entity coreference module based on entity mentions, removal of suspect intra-document conference chains, a slot value consolidator for entities, the application of inference rules to expand the number of asserted facts and a set of analysis and browsing tools supporting development.