Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images

Luo, Yiran; Feinglass, Joshua; Gokhale, Tejas; Lee, Kuan-Cheng; Baral, Chitta; Yang, Yezhou

Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images

dc.contributor.author	Luo, Yiran
dc.contributor.author	Feinglass, Joshua
dc.contributor.author	Gokhale, Tejas
dc.contributor.author	Lee, Kuan-Cheng
dc.contributor.author	Baral, Chitta
dc.contributor.author	Yang, Yezhou
dc.date.accessioned	2024-07-12T14:57:25Z
dc.date.available	2024-07-12T14:57:25Z
dc.date.issued	2024-09-27
dc.description	2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 17-18 June 2024, Seattle, WA, USA
dc.description.abstract	Domain Generalization (DG) is a challenging task in machine learning that requires a coherent ability to comprehend shifts across various domains through extraction of domain-invariant features. DG performance is typically evaluated by performing image classification in domains of various image styles. However, current methodology lacks quantitative understanding about shifts in stylistic domain, and relies on a vast amount of pre-training data, such as ImageNet1K, which are predominantly in photorealistic style with weakly supervised class labels. Such a data-driven practice could potentially result in spurious correlation and inflated performance on DG benchmarks. In this paper, we introduce a new 3-part DG paradigm to address these risks. We first introduce two new quantitative measures ICV and IDD to describe domain shifts in terms of consistency of classes within one domain and similarity between two stylistic domains. We then present SuperMarioDomains (SMD), a novel synthetic multi-domain dataset sampled from video game scenes with more consistent classes and sufficient dissimilarity compared to ImageNet1K. We demonstrate our DG method SMOS. SMOS uses SMD to first train a precursor model, which is then used to ground the training on a DG benchmark. We observe that SMOS+SMD altogether contributes to state-ofthe-art performance across five DG benchmarks, gaining large improvements to performances on abstract domains along with on-par or slight improvements to those on photorealistic domains. Our qualitative analysis suggests that these improvements can be attributed to reduced distributional divergence between originally distant domains.
dc.description.sponsorship	The authors acknowledge Research Computing at Arizona State University for providing HPC resources and support for this work. This work was supported by NSF RI grants #1750082 and #2132724. The views and opinions of the authors expressed herein do not necessarily state or reflect those of the funding agencies and employers.
dc.description.uri	https://ieeexplore.ieee.org/document/10678092
dc.format.extent	11 pages
dc.genre	conference papers and proceedings
dc.genre	postprints
dc.identifier	doi:10.13016/m2exdj-rr5k
dc.identifier.citation	Luo, Yiran, Joshua Feinglass, Tejas Gokhale, Kuan-Cheng Lee, Chitta Baral, and Yezhou Yang. “Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images.” 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), June 2024, 7303–13. https://doi.org/10.1109/CVPRW63382.2024.00726.
dc.identifier.uri	http://hdl.handle.net/11603/34886
dc.language.iso	en_US
dc.publisher	IEEE
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Faculty Collection
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department
dc.rights	© 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.subject	Computer Science - Computer Vision and Pattern Recognition
dc.title	Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images
dc.type	Text
dcterms.creator	https://orcid.org/0000-0002-5593-2804

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 2405.15961v1.pdf
Size:: 2.69 MB
Format:: Adobe Portable Document Format

Download

Collections

UMBC Faculty Collection
UMBC Computer Science and Electrical Engineering Department