Room-Scale Hand Gesture Recognition Using Smart Speakers

dc.contributor.authorLi, Dong
dc.contributor.authorLiu, Jialin
dc.contributor.authorLee, Sunghoon Ivan
dc.contributor.authorXiong, Jie
dc.date.accessioned2026-03-05T19:36:04Z
dc.date.issued2023-01-24
dc.descriptionSenSys '22: The 20th ACM Conference on Embedded Networked Sensor Systems, Boston Massachusetts, November 6 - 9, 2022
dc.description.abstractAcoustic signal has been recently adopted for contact-free hand gesture recognition due to its fine-grained sensing granularity and wide availability of microphone and speaker in consumer-grade electronic devices such as smartphones. However, a very limited sensing range constrains acoustic sensing to application scenarios where users interact with devices in close proximity. In this paper, we improve the range of acoustic sensing and demonstrate the feasibility of enabling room-scale hand gesture recognition using commodity smart speakers. We develop a series of novel signal processing techniques and implement our system on two commodity smart speaker prototypes with different numbers of microphones. Extensive evaluations are performed in three different environments with 1440 gestures collected from 16 participants. Experiment results show that our system can significantly increase the sensing range from 1 m to 4--5 m. In the challenging scenario where the user is 4 m away from the smart speaker and there is strong interference, the achieved gesture recognition accuracy is still higher than 90%.
dc.description.urihttps://dl.acm.org/doi/10.1145/3560905.3568528
dc.format.extent15 pages
dc.genreconference papers and proceedings
dc.identifierdoi:10.13016/m2xuhj-4zzw
dc.identifier.citationLi, Dong, Jialin Liu, Sunghoon Ivan Lee, and Jie Xiong. “Room-Scale Hand Gesture Recognition Using Smart Speakers.” Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems (New York, NY, USA), SenSys ’22, January 24, 2023, 462–75. https://doi.org/10.1145/3560905.3568528.
dc.identifier.urihttps://doi.org/10.1145/3560905.3568528
dc.identifier.urihttp://hdl.handle.net/11603/42081
dc.language.isoen
dc.publisherACM
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.titleRoom-Scale Hand Gesture Recognition Using Smart Speakers
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-3144-5104

Files