Publication
Improving Human-Robot Communication in Noisy Environments with Visual Voice Activity Detection
Arunima Gopi Krishnan; Adrian Auer; Lisa Gutzeit
In: Josef F. Krems; Hugo Plácido da Silva; Pietro Cipresso (Hrsg.). Computer-Human Interaction Research and Applications - Proceedings, Part II. International Conference on Computer-Human Interaction Research and Applications (CHIRA-2025), 9th, October 20-21, Marbella, Spain, Pages 71-91, Communications in Computer and Information Science (CCIS), Vol. 2835, ISBN 978-3-032-16451-3, Springer Nature Switzerland, 2/2026.
Abstract
This paper investigates the integration of Visual Voice Activity Detection (VVAD) into human robot dialogue systems to enhance communication in noisy environments. Usually, speech recognition systems often falter under acoustic interference, limiting their effectiveness in real-world human-robot interactions. By leveraging visual cues, especially lip movements, VVAD supports more accurate speech detection and turn-taking.
