Multimodal Intention Recognition Combining Head Motion and Throat Vibration for Underwater Superlimbs

By AncoraSIR 2024-09-01

Rongzheng Zhang, Wanghongjie Qiu, Jianuo Qiu, Yuqin Guo, Chengxiao Dong, Tuo Zhang, Juan Yi, Chaoyang Song, Harry Asada, Fang Wan: Multimodal Intention Recognition Combining Head Motion and Throat Vibration for Underwater Superlimbs. In: IEEE Transactions on Automation Science and Engineering, vol. 0, no. 0, pp. 0, 2025.

Abstract

This paper presents a novel solution for underwater intention recognition that simultaneously detects head motion and throat vibration, enhancing multimodal human-robot interactions for underwater diving. The system pairs with an underwater supernumerary robotic limb (SuperLimb), providing propulsion assistance to reduce the diver's physical load and mental fatigue. An inertial measurement unit monitors head motion, while a throat microphone captures vocal vibrations. Learning algorithms process these signals to accurately interpret the diver's intentions and map them to the SuperLimb for posture management. The system features a compact design optimized for diving scenarios and includes a multimodal, real-time classification algorithm to distinguish various head motions and vocal signals. By collecting and analyzing underwater throat vibration data, the study demonstrates the feasibility of this approach, enabling continuous motion commands for enhanced diving assistance. The results show that the head motion recognition component of the system achieved a high classification accuracy of 95%, and throat vibration classification reached 86% accuracy on land and 89% underwater for various purposes.

Links

doi:10.1109/TASE.2025.3554036

BibTeX

@article{Zhang2024MultiModal,
title = {Multimodal Intention Recognition Combining Head Motion and Throat Vibration for Underwater Superlimbs},
author = {Rongzheng Zhang and Wanghongjie Qiu and Jianuo Qiu and Yuqin Guo and Chengxiao Dong and Tuo Zhang and Juan Yi and Chaoyang Song and Harry Asada and Fang Wan},
doi = {10.1109/TASE.2025.3554036},
year = {2025},
date = {2025-03-20},
urldate = {2025-03-20},
journal = {IEEE Transactions on Automation Science and Engineering},
volume = {0},
number = {0},
pages = {0},
abstract = {This paper presents a novel solution for underwater intention recognition that simultaneously detects head motion and throat vibration, enhancing multimodal human-robot interactions for underwater diving. The system pairs with an underwater supernumerary robotic limb (SuperLimb), providing propulsion assistance to reduce the diver's physical load and mental fatigue. An inertial measurement unit monitors head motion, while a throat microphone captures vocal vibrations. Learning algorithms process these signals to accurately interpret the diver's intentions and map them to the SuperLimb for posture management. The system features a compact design optimized for diving scenarios and includes a multimodal, real-time classification algorithm to distinguish various head motions and vocal signals. By collecting and analyzing underwater throat vibration data, the study demonstrates the feasibility of this approach, enabling continuous motion commands for enhanced diving assistance. The results show that the head motion recognition component of the system achieved a high classification accuracy of 95%, and throat vibration classification reached 86% accuracy on land and 89% underwater for various purposes.},
keywords = {Authorship - Co-Author, JCR Q1, Jour - IEEE Trans. Autom. Sci. Eng. (T-ASE)},
pubstate = {published},
tppubtype = {article}
}

Co-Author, IEEE Trans. Autom. Sci. Eng. (T-ASE), JCR Q1

Last updated on 2024-09-20
