Wikipedia as an Ontology for Describing Documents

dc.contributor.authorSyed, Zareen
dc.contributor.authorFinin, Tim
dc.contributor.authorJoshi, Anupam
dc.date.accessioned2018-11-21T19:00:56Z
dc.date.available2018-11-21T19:00:56Z
dc.date.issued2008-03-31
dc.descriptionProceedings of the Second International Conference on Weblogs and Social Mediaen_US
dc.description.abstractIdentifying topics and concepts associated with a set of documents is a task common to many applications. It can help in the annotation and categorization of documents and be used to model a person's current interests for improving search results, business intelligence or selecting appropriate advertisements. One approach is to associate a document with a set of topics selected from a fixed ontology or vocabulary of terms. We have investigated using Wikipedia's articles and associated pages as a topic ontology for this purpose. The benefits are that the ontology terms are developed through a social process, maintained and kept current by the Wikipedia community, represent a consensus view, and have meaning that can be understood simply by reading the associated Wikipedia page. We use Wikipedia articles and the category and article link graphs to predict concepts common to a set of documents. We describe several algorithms to aggregate and refine results, including the use of spreading activation to select the most appropriate terms. While the Wikipedia category graph can be used to predict generalized concepts, the article links graph helps by predicting more specific concepts and concepts not in the category hierarchy. Our experiments demonstrate the feasibility of extending the category system with new concepts identified as a union of pages from the page link graph.en_US
dc.description.urihttps://www.aaai.org/Papers/ICWSM/2008/ICWSM08-024.pdfen_US
dc.format.extent9 pagesen_US
dc.genreconference papers and proceedings preprintsen_US
dc.identifierdoi:10.13016/M2JS9HC3J
dc.identifier.citationZareen Syed, Tim Finin, and Anupam Joshi, Wikipedia as an Ontology for Describing Documents, Proceedings of the Second International Conference on Weblogs and Social Media, 2008, https://www.aaai.org/Papers/ICWSM/2008/ICWSM08-024.pdfen_US
dc.identifier.urihttp://hdl.handle.net/11603/12073
dc.language.isoen_USen_US
dc.publisherAAAIen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subjectWikipediaen_US
dc.subjectOntologyen_US
dc.subjectDescribing Documentsen_US
dc.subjectUMBC Ebiquity Research Groupen_US
dc.titleWikipedia as an Ontology for Describing Documentsen_US
dc.typeTexten_US

Files

Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
385.pdf
Size:
386.23 KB
Format:
Adobe Portable Document Format
Description:
No Thumbnail Available
Name:
394.ppt
Size:
860 KB
Format:
Microsoft Powerpoint
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: