SKGHOI: Spatial-Semantic Knowledge Graph for Human-Object Interaction Detection
dc.contributor.author | Zhu, Lijing | |
dc.contributor.author | Lan, Qizhen | |
dc.contributor.author | Velasquez, Alvaro | |
dc.contributor.author | Song, Houbing | |
dc.contributor.author | Kamal, Acharya | |
dc.contributor.author | Tian, Qing | |
dc.contributor.author | Niu, Shuteng | |
dc.date.accessioned | 2023-04-17T19:05:22Z | |
dc.date.available | 2023-04-17T19:05:22Z | |
dc.date.issued | 2023-03-15 | |
dc.description.abstract | —Detecting human-object interactions (HOIs) is a challenging problem in computer vision. Existing techniques for HOI detection heavily rely on appearance-based features, which may not capture other essential characteristics for accurate detection. Furthermore, the use of transformer-based models for sentiment representation of human-object pairs can be computationally expensive. To address these challenges, we propose a novel graph-based approach, SKGHOI (Spatial-Semantic Knowledge Graph for Human-Object Interaction Detection), that effectively captures the sentiment representation of HOIs by integrating both spatial and semantic knowledge. In a graph, SKGHOI takes the components of interaction as nodes, and the spatial relationships between them as edges. Our approach employs a spatial encoder and a semantic encoder to extract spatial and semantic information, respectively, and then combines these encodings to create a knowledge graph that captures the sentiment representation of HOIs. Compared to existing techniques, SKGHOI is computationally efficient and allows for the incorporation of prior knowledge, making it practical for use in real-world applications. We demonstrate the effectiveness of our proposed method on the widely-used HICO-DET datasets, where it outperforms existing state-of-the-art graph-based methods by a significant margin. Our results indicate that the SKGHOI approach has the potential to significantly improve the accuracy and efficiency of HOI detection, and we anticipate that it will be of great interest to researchers and practitioners working on this challenging task. | en_US |
dc.description.sponsorship | This work was supported by the College of Arts and Sciences and the Department of Computer Science at Bowling Green State University. The authors express their sincere gratitude to the colleges who contributed to this work. Specially, the authors thank Dr. Qing Tian, the Assistant Professor of Computer Science at Bowling Green State University, who generously provided computational resource and valuable advice. | en_US |
dc.description.uri | https://arxiv.org/abs/2303.04253 | en_US |
dc.format.extent | 10 pages | en_US |
dc.genre | journal articles | en_US |
dc.genre | preprints | en_US |
dc.identifier | doi:10.13016/m2nxyf-kpll | |
dc.identifier.uri | https://doi.org/10.48550/arXiv.2303.04253 Focus to learn more | |
dc.identifier.uri | http://hdl.handle.net/11603/27614 | |
dc.language.iso | en_US | en_US |
dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
dc.relation.ispartof | UMBC Information Systems Department Collection | |
dc.relation.ispartof | UMBC Faculty Collection | |
dc.rights | This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author. | en_US |
dc.title | SKGHOI: Spatial-Semantic Knowledge Graph for Human-Object Interaction Detection | en_US |
dc.type | Text | en_US |
dcterms.creator | https://orcid.org/0000-0003-2631-9223 | en_US |