Robust Semantic Text Similarity Using LSA, Machine Learning and Linguistic Resources

Kashyap, Abhay L.; Han, Lushan; Yus, Roberto; Sleeman, Jennifer; Satyapanich, Taneeya W.; Gandhi, Sunil R; Finin, Tim

dc.contributor.author	Kashyap, Abhay L.
dc.contributor.author	Han, Lushan
dc.contributor.author	Yus, Roberto
dc.contributor.author	Sleeman, Jennifer
dc.contributor.author	Satyapanich, Taneeya W.
dc.contributor.author	Gandhi, Sunil R
dc.contributor.author	Finin, Tim
dc.date.accessioned	2018-10-31T17:01:42Z
dc.date.available	2018-10-31T17:01:42Z
dc.date.issued	2016-03-01
dc.description.abstract	Semantic textual similarity is a measure of the degree of semantic equivalence between two pieces of text. We describe the SemSim system and its performance in the *SEM 2013 and SemEval-2014 tasks on semantic textual similarity. At the core of our system lies a robust distributional word similarity component that combines latent semantic analysis and machine learning augmented with data from several linguistic resources. We used a simple term alignment algorithm to handle longer pieces of text. Additional wrappers and resources were used to handle task-specific challenges that include processing Spanish text, comparing text sequences of different lengths, handling informal words and phrases, and matching words with sense definitions. In the *SEM 2013 task on Semantic Textual Similarity, our best performing system ranked first among the 89 submitted runs. In the SemEval-2014 task on Multilingual Semantic Textual Similarity, we ranked a close second in both the English and Spanish subtasks. In the SemEval-2014 task on Cross-Level Semantic Similarity, we ranked first in Sentence–Phrase, Phrase–Word, and Word–Sense subtasks and second in the Paragraph–Sentence subtask.	en
dc.description.sponsorship	This research was supported by awards 1228198, 1250627 and 0910838 from the U.S. National Science Foundation. We would like to thank the anonymous reviewers for their valuable comments on an earlier version of this paper.	en
dc.description.uri	https://link.springer.com/article/10.1007%2Fs10579-015-9319-2	en
dc.format.extent	33 pages	en
dc.genre	journal articles post-print	en
dc.identifier	doi:10.13016/M2HQ3S29F
dc.identifier.citation	Abhay L. Kashyap, Lushan Han, Roberto Yus, Jennifer Sleeman, Taneeya W. Satyapanich, Sunil R Gandhi, and Tim Finin, Robust Semantic Text Similarity Using LSA, Machine Learning and Linguistic Resources, Language Resources and Evaluation March 2016, Volume 50, Issue 1, pp 125–161, http://dx.doi.org/10.1007/s10579-015-9319-2	en
dc.identifier.uri	http://dx.doi.org/10.1007/s10579-015-9319-2
dc.identifier.uri	http://hdl.handle.net/11603/11802
dc.language.iso	en	en
dc.publisher	Springer	en
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof	UMBC Faculty Collection
dc.relation.ispartof	UMBC Student Collection
dc.rights	This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rights	This is a post-peer-review, pre-copyedit version of an article published in Language Resources and Evaluation. The final authenticated version is available online at: http://dx.doi.org/10.1007/s10579-015-9319-2
dc.subject	Latent Semantic Analysis	en
dc.subject	WordNet	en
dc.subject	term alignment	en
dc.subject	semantic similarity	en
dc.subject	UMBC Ebiquity Research Group	en
dc.title	Robust Semantic Text Similarity Using LSA, Machine Learning and Linguistic Resources	en
dc.type	Text	en

Files

Original bundle

Name: 777.pdf
Size: 1.49 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.68 KB
Format: Item-specific license agreed upon to submission

Collections

UMBC Computer Science and Electrical Engineering Department
UMBC Faculty Collection
UMBC Student Collection