A Layered Approach to Semantic Similarity Analysis of XML Schemas

Author/Creator ORCID

Date

2008-08-05

Department

Program

Citation of Original Publication

Jaewook Kim, Yun Peng, Serm Kulvatunyou, Nenad Ivezic, and Albert Jones, A Layered Approach to Semantic Similarity Analysis of XML Schemas, 2008 IEEE International Conference on Information Reuse and Integration , DOI: 10.1109/IRI.2008.4583042

Rights

Public Domain Mark 1.0
http://creativecommons.org/publicdomain/mark/1.0/
This work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.

Abstract

One of the most critical steps to integrating heterogeneous e-Business applications using different XML schemas is schema mapping, which is known to be costly and error-prone. Past research on schema mapping has not fully utilized semantic information in the XML schemas. In this paper, we propose a semantic similarity analysis approach to facilitate XML schema mapping, merging and reuse. Several key innovations are introduced to better utilize available semantic information. These innovations, including: 1) a layered semantic structure of XML schema, 2) layered specific similarity measures using an information content based approach, and 3) a scheme for integrating similarities at all layers. Experimental results using two different schemas from an real world application demonstrate that the proposed approach is valuable for addressing difficulties in XML schema mapping.