首页 /研究 /Language, Camera, Autonomy! Prompt-engineered Robot Control for Rapidly Evolving Deployment
LOCOMOTION

Language, Camera, Autonomy! Prompt-engineered Robot Control for Rapidly Evolving Deployment

Jacob P. Macdonald, Rohit Mallick, Allan Wollaber, Jaime Peña, Nathan J. McNeese, Ho Chit Siu

发表年份
2024
引用次数
13
访问权限
开放获取

摘要

The Context-observant LLM-Enabled Autonomous Robots (CLEAR) platform offers a general solution for large language model (LLM)-enabled robot autonomy. CLEAR-controlled robots use natural language to perceive and interact with their environment: contextual description deriving from computer vision and optional human commands prompt intelligent LLM responses that map to robotic actions. By emphasizing prompting, system behavior is programmed without manipulating code, and unlike other LLM-based robot control methods, we do not perform any model fine-tuning. CLEAR employs off-the-shelf pre-trained machine learning models for controlling robots ranging from simulated quadcopters to terrestrial quadrupeds. We provide the open-source CLEAR platform, along with sample implementations for a Unity-based quadcopter and Boston Dynamics Spot® robot. Each LLM used, GPT-3.5, GPT-4, and LLaMA2, exhibited behavioral differences when embodied by CLEAR, contrasting in actuation preference, ability to apply new knowledge, and receptivity to human instruction. GPT-4 demonstrates best performance compared to GPT-3.5 and LLaMA2, showing successful task execution 97% of the time. The CLEAR platform contributes to HRI by increasing the usability of robotics for natural human interaction.

关键词

Software deploymentAutonomyRobotComputer scienceControl (management)Human–computer interactionArtificial intelligenceSoftware engineeringPolitical scienceLaw

相关论文

查看 LOCOMOTION 分类全部论文