SKGHOI: Spatial-Semantic Knowledge Graph for Human-Object Interaction Detection

dc.contributor.author: Zhu, Lijing
dc.contributor.author: Lan, Qizhen
dc.contributor.author: Velasquez, Alvaro
dc.contributor.author: Song, Houbing
dc.contributor.author: Kamal, Acharya
dc.contributor.author: Tian, Qing
dc.contributor.author: Niu, Shuteng
dc.date.accessioned: 2023-04-17T19:05:22Z
dc.date.available: 2023-04-17T19:05:22Z
dc.date.issued: 2023-03-15
dc.description.abstract: Detecting human-object interactions (HOIs) is a challenging problem in computer vision. Existing techniques for HOI detection heavily rely on appearance-based features, which may not capture other essential characteristics for accurate detection. Furthermore, the use of transformer-based models for sentiment representation of human-object pairs can be computationally expensive. To address these challenges, we propose a novel graph-based approach, SKGHOI (Spatial-Semantic Knowledge Graph for Human-Object Interaction Detection), that effectively captures the sentiment representation of HOIs by integrating both spatial and semantic knowledge. In a graph, SKGHOI takes the components of interaction as nodes, and the spatial relationships between them as edges. Our approach employs a spatial encoder and a semantic encoder to extract spatial and semantic information, respectively, and then combines these encodings to create a knowledge graph that captures the sentiment representation of HOIs. Compared to existing techniques, SKGHOI is computationally efficient and allows for the incorporation of prior knowledge, making it practical for use in real-world applications. We demonstrate the effectiveness of our proposed method on the widely used HICO-DET dataset, where it outperforms existing state-of-the-art graph-based methods by a significant margin. Our results indicate that the SKGHOI approach has the potential to significantly improve the accuracy and efficiency of HOI detection, and we anticipate that it will be of great interest to researchers and practitioners working on this challenging task. [en_US]
dc.description.sponsorship: This work was supported by the College of Arts and Sciences and the Department of Computer Science at Bowling Green State University. The authors express their sincere gratitude to the colleagues who contributed to this work. In particular, the authors thank Dr. Qing Tian, Assistant Professor of Computer Science at Bowling Green State University, who generously provided computational resources and valuable advice. [en_US]
dc.description.uri: https://arxiv.org/abs/2303.04253 [en_US]
dc.format.extent: 10 pages [en_US]
dc.genre: journal articles [en_US]
dc.genre: preprints [en_US]
dc.identifier: doi:10.13016/m2nxyf-kpll
dc.identifier.uri: https://doi.org/10.48550/arXiv.2303.04253
dc.identifier.uri: http://hdl.handle.net/11603/27614
dc.language.iso: en_US [en_US]
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Information Systems Department Collection
dc.relation.ispartof: UMBC Faculty Collection
dc.rights: This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author. [en_US]
dc.title: SKGHOI: Spatial-Semantic Knowledge Graph for Human-Object Interaction Detection [en_US]
dc.type: Text [en_US]
dcterms.creator: https://orcid.org/0000-0003-2631-9223 [en_US]

Files

Original bundle

Name:
2303.04253.pdf
Size:
19.85 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: