首页 /研究 /Human-robot interaction through real-time auditory and visual multiple-talker tracking
HRI

Human-robot interaction through real-time auditory and visual multiple-talker tracking

Hiroshi G. Okuno, Kazuhiro Nakadai, K. Hidai, Hiroshi Mizoguchi, Hiroaki Kitano

发表年份
2002
引用次数
70

摘要

Nakadai et al. (2001) have developed a real-time auditory and visual multiple-talker tracking technique. In this paper, this technique is applied to human-robot interaction including a receptionist robot and a companion robot at a party. The system includes face identification, speech recognition, focus-of-attention control, and sensorimotor task in tracking multiple talkers. The system is implemented on a upper-torso humanoid and the talker tracking is attained by distributed processing on three nodes connected by 100Base-TX network. The delay of tracking is 200 msec. Focus-of-attention is controlled by associating auditory and visual streams by using the sound source direction and talker position as a clue. Once an association is established, the humanoid keeps its face to the direction of the associated talker.

关键词

Computer scienceTorsoHumanoid robotFocus (optics)Computer visionArtificial intelligenceRobotTracking (education)Task (project management)Human–robot interaction

相关论文

查看 HRI 分类全部论文