Text Based Similarity Metrics and Delta for Semantic Web Graphs

Author/Creator ORCID

Date

2010-11-07

Type of Work

Department

Program

Citation of Original Publication

Rights

This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.

Abstract

Recognizing that two Semantic Web documents or graphs are similar and characterizing their differences is useful in many tasks, including retrieval, updating, version control and knowledge base editing. We describe several text-based similarity metrics that characterize the relation between Semantic Web graphs and evaluate these metrics for three specialc cases of similarity: similarity in classes and properties, similarity disregarding differences in base-URIs, and versioning relation- ship. We apply these techniques for a special use case - generating a delta between versions of a Semantic Web graph. We have evaluated our system on several tasks using a collection of graphs from the archive of the Swoogle Semantic Web search engine.