Annotating named entities in Twitter data with crowdsourcing

Finin, Tim; Murnane, Will; Karandikar, Anand; Keller, Nicholas; Martineau, Justin; Dredze, Mark

Annotating named entities in Twitter data with crowdsourcing

dc.contributor.author	Finin, Tim
dc.contributor.author	Murnane, Will
dc.contributor.author	Karandikar, Anand
dc.contributor.author	Keller, Nicholas
dc.contributor.author	Martineau, Justin
dc.contributor.author	Dredze, Mark
dc.date.accessioned	2018-11-13T19:53:55Z
dc.date.available	2018-11-13T19:53:55Z
dc.date.issued	2010-06-06
dc.description	Proceedings of the NAACL Workshop on Creating Speech and Text Language Data With Amazon's Mechanical Turk	en
dc.description.abstract	We describe our experience using both Amazon Mechanical Turk (MTurk) and Crowd Flower to collect simple named entity annotations for Twitter status updates. Unlike most genres that have traditionally been the focus of named entity experiments, Twitter is far more informal and abbreviated. The collected annotations and annotation techniques will provide a first step towards the full study of named entity recognition in domains like Facebook and Twitter. We also briefly describe how to use MTurk to collect judgements on the quality of “word clouds.”	en
dc.description.sponsorship	This work was done with partial support from the Office of Naval Research and the Johns Hopkins University Human Language Technology Center of Excellence. We thank both Amazon and Dolores Labs for grants that allowed us to use their systems for the experiments.	en
dc.description.uri	https://ebiquity.umbc.edu/paper/html/id/476/Annotating-named-entities-in-Twitter-data-with-crowdsourcing	en
dc.format.extent	9 pages	en
dc.genre	conference papers and proceedings preprints	en
dc.identifier	doi:10.13016/M21J97C2P
dc.identifier.citation	Tim Finin, Will Murnane, Anand Karandikar, Nicholas Keller, Justin Martineau, and Mark Dredze, Annotating named entities in Twitter data with crowdsourcing, Proceedings of the NAACL Workshop on Creating Speech and Text Language Data With Amazon's Mechanical Turk, 2010. https://aclanthology.org/W10-0713/	en
dc.identifier.uri	http://hdl.handle.net/11603/11974
dc.language.iso	en	en
dc.publisher	Association for Computational Linguistics	en
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof	UMBC Faculty Collection
dc.relation.ispartof	UMBC Student Collection
dc.rights	This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subject	amazon mechanical turk	en
dc.subject	crowdsourcing	en
dc.subject	information extraction	en
dc.subject	mturk	en
dc.subject	named entities	en
dc.subject	named entity recognition	en
dc.subject	natural language processing	en
dc.subject	social media	en
dc.subject	social	en
dc.subject	twitter	en
dc.subject	word clouds	en
dc.subject	UMBC Ebiquity Research Group	en
dc.title	Annotating named entities in Twitter data with crowdsourcing	en
dc.type	Text	en

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.68 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

UMBC Computer Science and Electrical Engineering Department
UMBC Faculty Collection
UMBC Student Collection