Towards Robust Visual Understanding: from Recognition to Reasoning

Gokhale, Tejas

Towards Robust Visual Understanding: from Recognition to Reasoning

dc.contributor.author	Gokhale, Tejas
dc.date.accessioned	2024-05-06T15:05:54Z
dc.date.available	2024-05-06T15:05:54Z
dc.date.issued	2024-03-24
dc.description	Proceedings of the 38th AAAI Conference on Artificial Intelligence
dc.description.abstract	Models that learn from data are widely and rapidly being deployed today for real-world use, but they suffer from unforeseen failures due to distribution shift, adversarial attacks, noise and corruption, and data scarcity. But many failures also occur because many modern AI tasks require reasoning beyond pattern matching -- and such reasoning abilities are difficult to formulate as data-based input-output function fitting. The reliability problem has become increasingly important under the new paradigm of semantic ``multimodal'' learning. My research provides avenues to develop robust and reliable computer vision systems, particularly by leveraging the interactions between vision and language. In this AAAI New Faculty highlights talk, I will cover three thematic areas of my research, ranging from robustness in computer vision, open-domain reliability in visual reasoning, and challenges and opportunities in evaluation of generative models. Readers are encouraged to refer to my website (www.tejasgokhale.com) for more details and updates from my lab's activities towards the goal of robust visual understanding.
dc.description.uri	https://ojs.aaai.org/index.php/AAAI/article/view/30281
dc.format.extent	1 page
dc.genre	conference papers and proceedings
dc.genre	postprints
dc.identifier	doi:10.13016/m2yjam-jsv6
dc.identifier.citation	Gokhale, Tejas. “Towards Robust Visual Understanding: From Recognition to Reasoning.” Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 20 (March 24, 2024): 22665–22665. https://doi.org/10.1609/aaai.v38i20.30281.
dc.identifier.uri	https://doi.org/10.1609/aaai.v38i20.30281
dc.identifier.uri	http://hdl.handle.net/11603/33610
dc.language.iso	en
dc.publisher	AAAI
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Faculty Collection
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department
dc.subject	Multimodal Learning
dc.title	Towards Robust Visual Understanding: from Recognition to Reasoning
dc.type	Text
dcterms.creator	https://orcid.org/0000-0002-5593-2804

Files

Original bundle

Now showing 1 - 1 of 1

Name:: aaai_nfh_cameraready.pdf
Size:: 151.75 KB
Format:: Adobe Portable Document Format

Download

Collections

UMBC Faculty Collection
UMBC Computer Science and Electrical Engineering Department