SKGHOI: Spatial-Semantic Knowledge Graph for Human-Object Interaction Detection

Zhu, Lijing; Lan, Qizhen; Velasquez, Alvaro; Song,  Houbing; Kamal, Acharya; Tian,  Qing; Niu, Shuteng

SKGHOI: Spatial-Semantic Knowledge Graph for Human-Object Interaction Detection

dc.contributor.author	Zhu, Lijing
dc.contributor.author	Lan, Qizhen
dc.contributor.author	Velasquez, Alvaro
dc.contributor.author	Song, Houbing
dc.contributor.author	Kamal, Acharya
dc.contributor.author	Tian, Qing
dc.contributor.author	Niu, Shuteng
dc.date.accessioned	2023-04-17T19:05:22Z
dc.date.available	2023-04-17T19:05:22Z
dc.date.issued	2023-03-15
dc.description.abstract	—Detecting human-object interactions (HOIs) is a challenging problem in computer vision. Existing techniques for HOI detection heavily rely on appearance-based features, which may not capture other essential characteristics for accurate detection. Furthermore, the use of transformer-based models for sentiment representation of human-object pairs can be computationally expensive. To address these challenges, we propose a novel graph-based approach, SKGHOI (Spatial-Semantic Knowledge Graph for Human-Object Interaction Detection), that effectively captures the sentiment representation of HOIs by integrating both spatial and semantic knowledge. In a graph, SKGHOI takes the components of interaction as nodes, and the spatial relationships between them as edges. Our approach employs a spatial encoder and a semantic encoder to extract spatial and semantic information, respectively, and then combines these encodings to create a knowledge graph that captures the sentiment representation of HOIs. Compared to existing techniques, SKGHOI is computationally efficient and allows for the incorporation of prior knowledge, making it practical for use in real-world applications. We demonstrate the effectiveness of our proposed method on the widely-used HICO-DET datasets, where it outperforms existing state-of-the-art graph-based methods by a significant margin. Our results indicate that the SKGHOI approach has the potential to significantly improve the accuracy and efficiency of HOI detection, and we anticipate that it will be of great interest to researchers and practitioners working on this challenging task.	en_US
dc.description.sponsorship	This work was supported by the College of Arts and Sciences and the Department of Computer Science at Bowling Green State University. The authors express their sincere gratitude to the colleges who contributed to this work. Specially, the authors thank Dr. Qing Tian, the Assistant Professor of Computer Science at Bowling Green State University, who generously provided computational resource and valuable advice.	en_US
dc.description.uri	https://arxiv.org/abs/2303.04253	en_US
dc.format.extent	10 pages	en_US
dc.genre	journal articles	en_US
dc.genre	preprints	en_US
dc.identifier	doi:10.13016/m2nxyf-kpll
dc.identifier.uri	https://doi.org/10.48550/arXiv.2303.04253 Focus to learn more
dc.identifier.uri	http://hdl.handle.net/11603/27614
dc.language.iso	en_US	en_US
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Information Systems Department Collection
dc.relation.ispartof	UMBC Faculty Collection
dc.rights	This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.	en_US
dc.title	SKGHOI: Spatial-Semantic Knowledge Graph for Human-Object Interaction Detection	en_US
dc.type	Text	en_US
dcterms.creator	https://orcid.org/0000-0003-2631-9223	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 2303.04253.pdf
Size:: 19.85 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.56 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

UMBC Information Systems Department
UMBC Faculty Collection