Show simple item record

dc.contributor.authorMulwad, Varish
dc.contributor.authorFinin, Tim
dc.contributor.authorJoshi, Anupam
dc.date.accessioned2018-11-12T17:59:52Z
dc.date.available2018-11-12T17:59:52Z
dc.date.issued2011-09-03
dc.descriptionProceedings of the First International Workshop on Searching and Integrating New Web Data Sourcesen_US
dc.description.abstractVast amounts of information is encoded in structured tables found in documents, on the Web, and in spreadsheets or databases. Integrating or searching over this information benefits from understanding its intended meaning. Evidence for a table's meaning can be found in its column headers, cell values, implicit relations between columns, caption and surrounding text but also requires general and domain-specific background knowledge. We represent a table's meaning by mapping columns to classes in an appropriate ontology, linking cell values to literal constants, implied measurements, or entities in the linked data cloud (existing or new) and discovering or and identifying relations between columns. We describe techniques grounded in graphical models and probabilistic reasoning to infer meaning (semantics) associated with a table. Using background knowledge from the Linked Open Data cloud, we jointly infer the semantics of column headers, table cell values (e.g.,strings and numbers) and relations between columns and represent the inferred meaning as graph of RDF triples. We motivate the value of this approach using tables from the medical domain, discussing some of the challenges presented by these tables and describing techniques to tackle them.en_US
dc.description.sponsorshipThis research was supported in part by NSF awards 0326460 and 0910838,MURI award FA9550-08-1-0265 from AFOSR, and a gift from Microsoft Research.en_US
dc.description.urihttp://ceur-ws.org/Vol-880/VLDS-p17-Mulwad.pdfen_US
dc.format.extent6 pagesen_US
dc.genreconference papers and proceedingsen_US
dc.identifierdoi:10.13016/M2KK94G7H
dc.identifier.urihttp://hdl.handle.net/11603/11962
dc.language.isoen_USen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subjectSemantic Weben_US
dc.subjectlinked dataen_US
dc.subjecthuman language technologyen_US
dc.subjectentity linkingen_US
dc.subjectinformation retrievalen_US
dc.subjectUMBC Ebiquity Research Groupen_US
dc.titleGenerating Linked Data by Inferring the Semantics of Tablesen_US
dc.typeTexten_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record