A Layered Approach to Semantic Similarity Analysis of XML Schemas
Date
2008-08-05
Citation of Original Publication
Jaewook Kim, Yun Peng, Serm Kulvatunyou, Nenad Ivezic, and Albert Jones, A Layered Approach to Semantic Similarity Analysis of XML Schemas, 2008 IEEE International Conference on Information Reuse and Integration, DOI: 10.1109/IRI.2008.4583042
Rights
Public Domain Mark 1.0
http://creativecommons.org/publicdomain/mark/1.0/
This work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.
Abstract
One of the most critical steps in integrating heterogeneous e-Business applications that use different XML schemas is schema mapping, which is known to be costly and error-prone. Past research on schema mapping has not fully utilized the semantic information in XML schemas. In this paper, we propose a semantic similarity analysis approach to facilitate XML schema mapping, merging, and reuse. Several key innovations are introduced to better utilize the available semantic information. These innovations include: 1) a layered semantic structure of XML schemas, 2) layer-specific similarity measures using an information-content-based approach, and 3) a scheme for integrating similarities at all layers. Experimental results using two different schemas from a real-world application demonstrate that the proposed approach is valuable for addressing difficulties in XML schema mapping.
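As a rough illustration of the two ideas named in the abstract, an information-content-based similarity and the integration of per-layer scores, the following minimal Python sketch may help. It is not the authors' method: this record does not give the paper's formulas, so the sketch substitutes Lin's well-known information content measure and a plain weighted average. The taxonomy, the term counts, the layer names, and the weights are all hypothetical values chosen for illustration.

```python
import math

# Hypothetical toy taxonomy: child -> parent (None marks the root).
PARENT = {
    "entity": None,
    "party": "entity",
    "customer": "party",
    "supplier": "party",
    "document": "entity",
    "invoice": "document",
}

# Hypothetical occurrence counts of each term (made-up numbers).
COUNTS = {"entity": 1, "party": 2, "customer": 5,
          "supplier": 4, "document": 3, "invoice": 5}


def ancestors(term):
    """Return the path from term up to the root, term included."""
    path = []
    while term is not None:
        path.append(term)
        term = PARENT[term]
    return path


def information_content(term):
    """IC(c) = -log p(c), where p(c) counts c and all of its descendants."""
    total = sum(COUNTS.values())
    subtree = sum(n for t, n in COUNTS.items() if term in ancestors(t))
    return -math.log(subtree / total)


def lin_similarity(a, b):
    """Lin's measure: 2 * IC(lowest common subsumer) / (IC(a) + IC(b))."""
    if a == b:
        return 1.0
    anc_b = set(ancestors(b))
    # First ancestor of a that is also an ancestor of b.
    lcs = next(t for t in ancestors(a) if t in anc_b)
    denom = information_content(a) + information_content(b)
    return 2 * information_content(lcs) / denom if denom > 0 else 0.0


def combine_layers(scores, weights):
    """Integrate per-layer similarities as a weighted average (an assumption)."""
    total_weight = sum(weights[k] for k in scores)
    return sum(scores[k] * weights[k] for k in scores) / total_weight
```

In this toy setup, `lin_similarity("customer", "supplier")` scores higher than `lin_similarity("customer", "invoice")`, because the first pair shares the more specific ancestor `party` while the second pair shares only the root. A call such as `combine_layers({"label": 0.8, "structure": 0.4}, {"label": 3, "structure": 1})` then merges hypothetical layer scores into a single value.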