Two-stream indexing for spoken web search

Jitendra, Ajmera; Joshi, Anupam; Mukherjea, Sougata; Rajput, Nitendra; Sahay, Shrey; Shrivastava, Mayank; Srivastava, Kundan

Two-stream indexing for spoken web search

dc.contributor.author	Jitendra, Ajmera
dc.contributor.author	Joshi, Anupam
dc.contributor.author	Mukherjea, Sougata
dc.contributor.author	Rajput, Nitendra
dc.contributor.author	Sahay, Shrey
dc.contributor.author	Shrivastava, Mayank
dc.contributor.author	Srivastava, Kundan
dc.date.accessioned	2018-11-20T17:19:54Z
dc.date.available	2018-11-20T17:19:54Z
dc.date.issued	2011-04-01
dc.description	Proceedings of the 20th international conference companion on world wide web	en_US
dc.description.abstract	This paper presents two-stream processing of audio to index the audio content for Spoken Web search. The first stream indexes the meta-data associated with a particular audio document. The meta-data is usually very sparse, but accurate. This therefore results in a high-precision, low-recall index. The second stream uses a novel language-independent speech recognition to generate text to be indexed. Owing to the multiple languages and the noise in user generated content on the Spoken Web, the speech recognition accuracy of such systems is not high, thus they result in a low-precision, high-recall index. The paper attempts to use these two complementary streams to generate a combined index to increase the precision-recall performance in audio content search. The problem of audio content search is motivated by the real world implication of the Web in developing regions, where due to literacy and affordability issues, people use Spoken Web which consists of interconnected VoiceSites, which have content in audio. The experiments are based on more than 20,000 audio documents spanning over seven live VoiceSites and four different languages. The results suggest significant improvement over a meta-data-only or a speech-recognitiononly system, thus justifying the two-stream processing approach. Audio content search is a growing problem area and this paper wishes to be a first step to solving this at a large scale, across languages, in a Web context.	en_US
dc.description.uri	https://ebiquity.umbc.edu/paper/html/id/527/Two-stream-indexing-for-spoken-web-search	en_US
dc.format.extent	10 pages	en_US
dc.genre	conference papers and proceedings preprints	en_US
dc.identifier	doi:10.13016/M21Z41X4K
dc.identifier.uri	http://hdl.handle.net/11603/12067
dc.language.iso	en_US	en_US
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof	UMBC Faculty Collection
dc.rights	This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subject	World Wide Telecom Web	en_US
dc.subject	Spoken Web	en_US
dc.subject	developing regions	en_US
dc.subject	mobile phone	en_US
dc.subject	literacy	en_US
dc.subject	audio search	en_US
dc.subject	Algorithms	en_US
dc.subject	Experimentation	en_US
dc.subject	Human Factors	en_US
dc.subject	UMBC Ebiquity Research Group	en_US
dc.title	Two-stream indexing for spoken web search	en_US
dc.type	Text	en_US

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.56 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

UMBC Computer Science and Electrical Engineering Department
UMBC Faculty Collection