A Machine Learning Approach to Linking FOAF Instances
Links to Fileshttps://www.google.com/url?q=https://www.aaai.org/ocs/index.php/SSS/SSS10/paper/download/1156/1464&sa=U&ved=0ahUKEwjeq8ytucDeAhVNzVMKHai-B2IQFggEMAA&client=internal-uds-cse&cx=016314354884912110518:gwmynp16xuu&usg=AOvVaw0zdE8NUHKEbKIkHQsOlweS
MetadataShow full item record
Type of Work7 pages
conference papers and proceedings
Citation of Original PublicationJennifer Sleeman and Tim Finin, A Machine Learning Approach to Linking FOAF Instances, Proceedings of the AAAI Spring Symposium on Linked Data Meets Artificial Intelligence, 2010, https://www.google.com/url?q=https://www.aaai.org/ocs/index.php/SSS/SSS10/paper/download/1156/1464&sa=U&ved=0ahUKEwjeq8ytucDeAhVNzVMKHai-B2IQFggEMAA&client=internal-uds-cse&cx=016314354884912110518:gwmynp16xuu&usg=AOvVaw0zdE8NUHKEbKIkHQsOlweS
RightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
The friend of a friend (FOAF) vocabulary is widely used on the Web to describe individual people and their properties. Since FOAF does not require a unique ID for a person, it is not clear when two FOAF agents should be linked as coreferent, i.e., denote the same person in the world. One approach is to use the presence of inverse functional properties (e.g., foaf:mbox) as evidence that two individuals are the same. Another applies heuristics based on the string similarity of values of FOAF properties such as name and school as evidence for or against co-reference. Performance is limited, however, by many factors: non-semantic string matching, noise, changes in the world, and the lack of more sophisticated graph analytics. We describe a supervised machine learning approach that uses features defined over pairs of FOAF individuals to produce a classifier for identifying co-referent FOAF instances. We present initial results using data collected from Swoogle and other sources and describe plans for additional analysis.