Home /Research /Building for speech: designing the next-generation of social robots for audio interaction
HRI

Building for speech: designing the next-generation of social robots for audio interaction

Angus Addlesee, Ioannis Papaioannou

Year
2025
Citations
4
Access
Open access

Abstract

There have been significant advances in robotics, conversational AI, and spoken dialogue systems (SDSs) over the past few years, but we still do not find social robots in public spaces such as train stations, shopping malls, or hospital waiting rooms. In this paper, we argue that early-stage collaboration between robot designers and SDS researchers is crucial for creating social robots that can legitimately be used in real-world environments. We draw from our experiences running experiments with social robots, and the surrounding literature, to highlight recurring issues. Robots need better speakers, a greater number of high-quality microphones, quieter motors, and quieter fans to enable human-robot spoken interaction in the wild. If a robot was designed to meet these requirements, researchers could create SDSs that are more accessible, and able to handle multi-party conversations in populated environments. Robust robot joints are also needed to limit potential harm to older adults and other more vulnerable groups. We suggest practical steps towards future real-world deployments of conversational AI systems for human-robot interaction.

Keywords

RobotComputer scienceRoboticsHuman–computer interactionHarmSocial robotHuman–robot interactionArtificial intelligenceMobile robotRobot control

Related papers

Browse all HRI papers