Wikipedia as an Ontology for Describing Documents

dc.contributor.authorSyed, Zareen
dc.contributor.authorFinin, Tim
dc.contributor.authorJoshi, Anupam
dc.date.accessioned2018-11-21T19:00:56Z
dc.date.available2018-11-21T19:00:56Z
dc.date.issued2008-03-31
dc.descriptionProceedings of the Second International Conference on Weblogs and Social Mediaen
dc.description.abstractIdentifying topics and concepts associated with a set of documents is a task common to many applications. It can help in the annotation and categorization of documents and be used to model a person's current interests for improving search results, business intelligence or selecting appropriate advertisements. One approach is to associate a document with a set of topics selected from a fixed ontology or vocabulary of terms. We have investigated using Wikipedia's articles and associated pages as a topic ontology for this purpose. The benefits are that the ontology terms are developed through a social process, maintained and kept current by the Wikipedia community, represent a consensus view, and have meaning that can be understood simply by reading the associated Wikipedia page. We use Wikipedia articles and the category and article link graphs to predict concepts common to a set of documents. We describe several algorithms to aggregate and refine results, including the use of spreading activation to select the most appropriate terms. While the Wikipedia category graph can be used to predict generalized concepts, the article links graph helps by predicting more specific concepts and concepts not in the category hierarchy. Our experiments demonstrate the feasibility of extending the category system with new concepts identified as a union of pages from the page link graph.en
dc.description.urihttps://www.aaai.org/Papers/ICWSM/2008/ICWSM08-024.pdfen
dc.format.extent9 pagesen
dc.genreconference papers and proceedings preprintsen
dc.identifierdoi:10.13016/M2JS9HC3J
dc.identifier.citationZareen Syed, Tim Finin, and Anupam Joshi, Wikipedia as an Ontology for Describing Documents, Proceedings of the Second International Conference on Weblogs and Social Media, 2008, https://www.aaai.org/Papers/ICWSM/2008/ICWSM08-024.pdfen
dc.identifier.urihttp://hdl.handle.net/11603/12073
dc.language.isoenen
dc.publisherAAAIen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subjectWikipediaen
dc.subjectOntologyen
dc.subjectDescribing Documentsen
dc.subjectUMBC Ebiquity Research Groupen
dc.titleWikipedia as an Ontology for Describing Documentsen
dc.typeTexten

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
385.pdf
Size:
386.23 KB
Format:
Adobe Portable Document Format
Description:
Loading...
Thumbnail Image
Name:
394.ppt
Size:
860 KB
Format:
Microsoft Powerpoint
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: