Unsupervised Domain Adaptation for Action Recognition via Self-Ensembling and Conditional Embedding Alignment

dc.contributor.author: Ghosh, Indrajeet
dc.contributor.author: Chugh, Garvit
dc.contributor.author: Faridee, Abu Zaher Md
dc.contributor.author: Roy, Nirmalya
dc.date.accessioned: 2024-12-11T17:02:40Z
dc.date.available: 2024-12-11T17:02:40Z
dc.date.issued: 2024-10-23
dc.description.abstract: Recent advancements in deep learning-based wearable human action recognition (wHAR) have improved the capture and classification of complex motions, but adoption remains limited due to the lack of expert annotations and domain discrepancies arising from user variations. Limited annotations hinder the model's ability to generalize to out-of-distribution samples. While data augmentation can improve generalizability, unsupervised augmentation techniques must be applied carefully to avoid introducing noise. Unsupervised domain adaptation (UDA) addresses domain discrepancies by aligning conditional distributions without labeled target samples, but vanilla pseudo-labeling can lead to error propagation. To address these challenges, we propose μDAR, a novel joint optimization architecture comprising three functions: (i) a consistency regularizer between augmented samples to improve model classification generalizability, (ii) a temporal ensemble for robust pseudo-label generation, and (iii) conditional distribution alignment to improve domain generalizability. The temporal ensemble aggregates predictions from past epochs to smooth out noisy pseudo-label predictions, which are then used in the conditional distribution alignment module to minimize the kernel-based class-wise conditional maximum mean discrepancy (kCMMD) between the source and target feature spaces and learn a domain-invariant embedding. The consistency-regularized augmentations ensure that multiple augmentations of the same sample share the same label; this results in (a) strong generalization with limited source domain samples and (b) consistent pseudo-label generation on target samples. The novel integration of these three modules in μDAR yields a ≈4-12% average macro-F1 score improvement over six state-of-the-art UDA methods on four benchmark wHAR datasets.
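
The two mechanisms the abstract names, temporal ensembling of pseudo-labels and class-wise conditional MMD alignment, can be illustrated concretely. The following is a minimal NumPy sketch, not the authors' implementation: the update rule follows Laine & Aila-style temporal ensembling, and the function names, smoothing factor alpha, and RBF bandwidth gamma are illustrative assumptions.

import numpy as np

def temporal_ensemble_update(Z, probs, epoch, alpha=0.6):
    # Exponential moving average of per-sample class probabilities across
    # epochs; z_hat corrects the startup bias of the running average
    # (epoch is 0-indexed, so the first call returns probs unchanged).
    Z = alpha * Z + (1.0 - alpha) * probs
    z_hat = Z / (1.0 - alpha ** (epoch + 1))
    return Z, z_hat.argmax(axis=1)  # new ensemble state, smoothed pseudo-labels

def rbf_kernel(x, y, gamma=1.0):
    # Gaussian (RBF) kernel matrix between two sets of row vectors.
    d2 = ((x[:, None, :] - y[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def class_conditional_mmd(src_x, src_y, tgt_x, tgt_pseudo, n_classes, gamma=1.0):
    # Squared MMD between source and target features, computed per class
    # (true labels on the source side, pseudo-labels on the target side)
    # and averaged over the classes present in both domains.
    total, used = 0.0, 0
    for c in range(n_classes):
        s, t = src_x[src_y == c], tgt_x[tgt_pseudo == c]
        if len(s) == 0 or len(t) == 0:
            continue  # class missing on one side of this batch
        total += (rbf_kernel(s, s, gamma).mean()
                  + rbf_kernel(t, t, gamma).mean()
                  - 2.0 * rbf_kernel(s, t, gamma).mean())
        used += 1
    return total / max(used, 1)

In a joint objective of the kind the abstract describes, the smoothed pseudo-labels returned by the first function would determine which target rows enter each per-class MMD estimate in the second.
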
dc.description.sponsorship: This work has been partially supported by NSF CAREER Award #1750936, ONR Grant #N00014-23-1-2119, U.S. Army Grant #W911NF2120076 and NSF CNS EAGER Grant #2233879.
dc.description.uri: http://arxiv.org/abs/2410.17489
dc.format.extent: 6 pages
dc.genre: journal articles
dc.genre: preprints
dc.identifier: doi:10.13016/m2yqmf-ceqo
dc.identifier.uri: https://doi.org/10.48550/arXiv.2410.17489
dc.identifier.uri: http://hdl.handle.net/11603/37092
dc.language.iso: en_US
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Student Collection
dc.relation.ispartof: UMBC Information Systems Department
dc.relation.ispartof: UMBC Center for Real-time Distributed Sensing and Autonomy
dc.relation.ispartof: UMBC Faculty Collection
dc.rights: Attribution 4.0 International CC BY 4.0
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject: Computer Science - Artificial Intelligence
dc.subject: Computer Science - Computer Vision and Pattern Recognition
dc.subject: UMBC Mobile, Pervasive and Sensor Computing Lab (MPSC Lab)
dc.title: Unsupervised Domain Adaptation for Action Recognition via Self-Ensembling and Conditional Embedding Alignment
dc.type: Text
dcterms.creator: https://orcid.org/0000-0003-2868-3766

Files

Name: 2410.17489v1.pdf
Size: 3.26 MB
Format: Adobe Portable Document Format