Context-based multimodal input understanding in conversational systems

dc.contributor.author: Chai, J.
dc.contributor.author: Pan, Shimei
dc.contributor.author: Zhou, M. X.
dc.contributor.author: Houck, K.
dc.date.accessioned: 2025-06-05T14:03:45Z
dc.date.available: 2025-06-05T14:03:45Z
dc.date.issued: 2002-10
dc.description: Fourth IEEE International Conference on Multimodal Interfaces
dc.description.abstract: In a multimodal human-machine conversation, user inputs are often abbreviated or imprecise. Sometimes, merely fusing multimodal inputs together cannot derive a complete understanding. To address these inadequacies, we are building a semantics-based multimodal interpretation framework called MIND (Multimodal Interpretation for Natural Dialog). The unique feature of MIND is the use of a variety of contexts (e.g., domain context and conversation context) to enhance multimodal fusion. In this paper we present a semantically rich modeling scheme and a context-based approach that enable MIND to gain a full understanding of user inputs, including ambiguous and incomplete ones.
dc.description.uri: https://ieeexplore.ieee.org/document/1166974/
dc.format.extent: 6 pages
dc.genre: conference papers and proceedings
dc.genre: postprints
dc.identifier: doi:10.13016/m2udti-olhe
dc.identifier.citation: Chai, J., Shimei Pan, M. X. Zhou, and K. Houck. “Context-Based Multimodal Input Understanding in Conversational Systems.” In Proceedings. Fourth IEEE International Conference on Multimodal Interfaces, 87–92, 2002. https://doi.org/10.1109/ICMI.2002.1166974.
dc.identifier.uri: https://doi.org/10.1109/ICMI.2002.1166974
dc.identifier.uri: http://hdl.handle.net/11603/38752
dc.language.iso: en_US
dc.publisher: IEEE
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Information Systems Department
dc.rights: © 2002 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.subject: Speech processing
dc.subject: USA Councils
dc.subject: Context modeling
dc.subject: Switches
dc.subject: Displays
dc.subject: Speech recognition
dc.subject: Text recognition
dc.subject: Natural languages
dc.subject: Cities and towns
dc.subject: History
dc.title: Context-based multimodal input understanding in conversational systems
dc.type: Text
dcterms.creator: https://orcid.org/0000-0002-5989-8543
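
The abstract describes resolving ambiguous or incomplete user inputs by combining multimodal fusion with contexts such as conversation context. As a rough illustration only (this is not the paper's MIND implementation; every class, function, and value below is hypothetical), a minimal Python sketch of the idea:

# Hypothetical sketch of context-based multimodal fusion; not the MIND system.
from dataclasses import dataclass
from typing import Dict, List, Optional

@dataclass
class SpeechInput:
    text: str
    referents: List[str]              # ambiguous phrases, e.g. "this house"

@dataclass
class GestureInput:
    pointed_object: Optional[str]     # object under a pointing gesture, if any

@dataclass
class ConversationContext:
    last_mentioned: Optional[str]     # most salient object from earlier turns

def fuse(speech: SpeechInput, gesture: GestureInput,
         ctx: ConversationContext) -> Dict[str, object]:
    """Bind each ambiguous referent: prefer the deictic gesture,
    then fall back to conversation context."""
    bindings = {}
    for ref in speech.referents:
        if gesture.pointed_object is not None:
            bindings[ref] = gesture.pointed_object   # gesture resolves it
        elif ctx.last_mentioned is not None:
            bindings[ref] = ctx.last_mentioned       # dialog history resolves it
        else:
            bindings[ref] = None                     # still ambiguous: clarify
    return {"utterance": speech.text, "bindings": bindings}

# "How much is this house?" with no gesture is incomplete on its own;
# conversation context supplies the missing referent.
ctx = ConversationContext(last_mentioned="123 Main St")
print(fuse(SpeechInput("How much is this house?", ["this house"]),
           GestureInput(pointed_object=None), ctx))

In this toy version a deictic gesture, when present, binds the referent directly; otherwise the most salient object from the dialog history fills the gap, mirroring the abstract's point that fusion alone cannot always derive a complete understanding.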

Files

Original bundle

Name: contextbased.pdf
Size: 242.08 KB
Format: Adobe Portable Document Format