Two-stream indexing for spoken web search

dc.contributor.author: Jitendra, Ajmera
dc.contributor.author: Joshi, Anupam
dc.contributor.author: Mukherjea, Sougata
dc.contributor.author: Rajput, Nitendra
dc.contributor.author: Sahay, Shrey
dc.contributor.author: Shrivastava, Mayank
dc.contributor.author: Srivastava, Kundan
dc.date.accessioned: 2018-11-20T17:19:54Z
dc.date.available: 2018-11-20T17:19:54Z
dc.date.issued: 2011-04-01
dc.description: Proceedings of the 20th international conference companion on World Wide Web [en_US]
dc.description.abstract: This paper presents two-stream processing of audio to index the audio content for Spoken Web search. The first stream indexes the meta-data associated with a particular audio document. The meta-data is usually very sparse, but accurate. This therefore results in a high-precision, low-recall index. The second stream uses a novel language-independent speech recognition to generate text to be indexed. Owing to the multiple languages and the noise in user generated content on the Spoken Web, the speech recognition accuracy of such systems is not high, thus they result in a low-precision, high-recall index. The paper attempts to use these two complementary streams to generate a combined index to increase the precision-recall performance in audio content search. The problem of audio content search is motivated by the real world implication of the Web in developing regions, where due to literacy and affordability issues, people use Spoken Web which consists of interconnected VoiceSites, which have content in audio. The experiments are based on more than 20,000 audio documents spanning over seven live VoiceSites and four different languages. The results suggest significant improvement over a meta-data-only or a speech-recognition-only system, thus justifying the two-stream processing approach. Audio content search is a growing problem area and this paper wishes to be a first step to solving this at a large scale, across languages, in a Web context. [en_US]
dc.description.uri: https://ebiquity.umbc.edu/paper/html/id/527/Two-stream-indexing-for-spoken-web-search [en_US]
dc.format.extent: 10 pages [en_US]
dc.genre: conference papers and proceedings preprints [en_US]
dc.identifier: doi:10.13016/M21Z41X4K
dc.identifier.uri: http://hdl.handle.net/11603/12067
dc.language.iso: en_US [en_US]
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof: UMBC Faculty Collection
dc.rights: This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subject: World Wide Telecom Web [en_US]
dc.subject: Spoken Web [en_US]
dc.subject: developing regions [en_US]
dc.subject: mobile phone [en_US]
dc.subject: literacy [en_US]
dc.subject: audio search [en_US]
dc.subject: Algorithms [en_US]
dc.subject: Experimentation [en_US]
dc.subject: Human Factors [en_US]
dc.subject: UMBC Ebiquity Research Group [en_US]
dc.title: Two-stream indexing for spoken web search [en_US]
dc.type: Text [en_US]
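
The abstract describes combining a sparse but accurate meta-data index with a noisy but broad speech-recognition index. The record does not say how the paper merges the two, so the following is only a minimal sketch under stated assumptions: both streams are reduced to plain text, each is indexed as a simple inverted index, and a query hit in the meta-data stream counts for more than a hit in the transcript stream. The function names, the weights, and the sample documents are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def build_index(docs):
    """Map each lowercase token to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.lower().split():
            index[token].add(doc_id)
    return index


def search(query, meta_index, asr_index, meta_weight=2.0, asr_weight=1.0):
    """Rank documents by a weighted sum of hits in the two streams.

    The weights are an illustrative assumption: meta-data hits are trusted
    more (high precision), transcript hits widen coverage (high recall).
    """
    scores = defaultdict(float)
    for token in query.lower().split():
        for doc_id in meta_index.get(token, ()):
            scores[doc_id] += meta_weight
        for doc_id in asr_index.get(token, ()):
            scores[doc_id] += asr_weight
    # Highest score first; ties broken by document id for a stable order.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical sample data: sparse meta-data and noisy recognizer output.
metadata = {"a1": "crop prices", "a2": "weather"}
transcripts = {
    "a1": "wheat prize today",
    "a2": "rain and crop news",
    "a3": "crop loan scheme",
}

meta_index = build_index(metadata)
asr_index = build_index(transcripts)
results = search("crop", meta_index, asr_index)
```

In this toy setup, a document matched only through its transcript (such as the hypothetical `a3`) is still retrieved, while a document matched through its meta-data is ranked ahead of it. That is the complementary behaviour the abstract attributes to the two streams, though the paper's actual combination method may differ.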
