Comparing and Evaluating Semantic Data Automatically Extracted from Text

dc.contributor.authorLawrie, Dawn
dc.contributor.authorFinin, Tim
dc.contributor.authorMayfield, James
dc.contributor.authorMcNamee, Paul
dc.date.accessioned2018-11-02T15:28:24Z
dc.date.available2018-11-02T15:28:24Z
dc.date.issued2013-11-15
dc.descriptionAAAI 2013 Fall Symposium on Semantics for Big Dataen_US
dc.description.abstractOne way to obtain large amounts of semantic data is to extract facts from the vast quantities of text that is now available on-line. The relatively low accuracy of current information extraction techniques introduces a need for evaluating the quality of the knowledge bases (KBs) they generate. We frame the problem as comparing KBs generated by different systems from the same documents and show that exploiting provenance leads to more efficient techniques for aligning them and identifying their differences. We describe two types of tools: entity-match focuses on differences in entities found and linked; kbdiff focuses on differences in relations among those entities. Together, these tools support assessment of relative KB accuracy by sampling the parts of two KBs that disagree. We explore the usefulness of the tools through the construction of tens of different KBs built from the same 26,000 Washington Post articles and identifying the differences.en_US
dc.description.sponsorshipPartial support for this work was provided by NSF grant CCF 0916081 and AFOSR grant FA9550-08-1-0265.en_US
dc.description.urihttps://www.aaai.org/ocs/index.php/FSS/FSS13/paper/viewFile/7623/7549en_US
dc.format.extent8 pagesen_US
dc.genreconference papers and proceedings pre-printen_US
dc.identifierdoi:10.13016/M2TT4FX5S
dc.identifier.citationDawn Lawrie, Tim Finin, James Mayfield, and Paul McNamee, Comparing and Evaluating Semantic Data Automatically Extracted from Text, AAAI 2013 Fall Symposium on Semantics for Big Data, https://www.aaai.org/ocs/index.php/FSS/FSS13/paper/viewFile/7623/7549en_US
dc.identifier.urihttp://hdl.handle.net/11603/11844
dc.language.isoen_USen_US
dc.publisherAAAI Pressen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subjectbig dataen_US
dc.subjectInformation Extractionen_US
dc.subjectnatural language processingen_US
dc.subjectSemanticsen_US
dc.subjectUMBC Ebiquity Research Groupen_US
dc.titleComparing and Evaluating Semantic Data Automatically Extracted from Texten_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
688.pd.pdf
Size:
546.68 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: