Mind: A Context-Based Multimodal Interpretation Framework in Conversational Systems

dc.contributor.authorChai, Joyce Y.
dc.contributor.authorPan, Shimei
dc.contributor.authorZhou, Michelle X.
dc.date.accessioned2025-06-05T14:02:42Z
dc.date.available2025-06-05T14:02:42Z
dc.date.issued2005
dc.description.abstractIn a multimodal human-machine conversation, user inputs are often abbreviated or imprecise. Simply fusing multimodal inputs together may not be sufficient to derive a complete understanding of the inputs. Aiming to handle a wide variety of multimodal inputs, we are building a context-based multimodal interpretation framework called MIND (Multimodal Interpreter for Natural Dialog). MIND is unique in its use of a variety of contexts, such as domain context and conversation context, to enhance multimodal interpretation. In this chapter, we first describe a fine-grained semantic representation that captures salient information from user inputs and the overall conversation, and then present a context-based interpretation approach that enables MIND to reach a full understanding of user inputs, including those abbreviated or imprecise ones.
dc.description.urihttps://link.springer.com/chapter/10.1007/1-4020-3933-6_12
dc.format.extent21 pages
dc.genrebook chapters
dc.genrepostprints
dc.identifierdoi:10.13016/m2rh9s-ab1a
dc.identifier.citationChai, Joyce Y., Shimei Pan, and Michelle X. Zhou. “Mind: A Context-Based Multimodal Interpretation Framework in Conversational Systems.” In Advances in Natural Multimodal Dialogue Systems, edited by Jan C. J. van Kuppevelt, Laila Dybkjær, and Niels Ole Bernsen, 265–85. Dordrecht: Springer Netherlands, 2005. https://doi.org/10.1007/1-4020-3933-6_12.
dc.identifier.urihttps://doi.org/10.1007/1-4020-3933-6_12
dc.identifier.urihttp://hdl.handle.net/11603/38577
dc.language.isoen_US
dc.publisherSpringer Nature
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Information Systems Department
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.titleMind: A Context-Based Multimodal Interpretation Framework in Conversational Systems
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-5989-8543

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
MIND.pdf
Size:
663.48 KB
Format:
Adobe Portable Document Format