Talking about 3D scenes: integration of image and speech understanding in a hybrid distributed system
Gudrun Socher, Gerhard Sagerer, Franz Kümmert, T. Fuhr
- 发表年份
- 2002
- 引用次数
- 8
摘要
We present a hybrid system that integrates speech and image understanding. Given spoken references, it is able to identify objects of a 3D scene perceived via a stereo camera. Central to our approach is the extraction of qualitative object features and spatial scene properties from acoustic and visual data. The interaction of the understanding processes is performed using a procedural semantic network that interfaces with signal recognition and reconstruction modules, thus integrating semantic, neural and Bayesian networks and Hidden Markov Models. 1. INTRODUCTION Man-Machine-Interaction in real environments is one of the greatest challenges for a number of scientific fields related to Computer Vision, Speech Understanding, and Robotics. At the University of Bielefeld the joint project "Situated Artificial Communicators" has been established with the goal to develop an integrated system where visual, linguistic, senso-motoric, and cognitive abilities interact. The system plays the rol...
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002