首页 /研究 /Language, Camera, Autonomy! Prompt-engineered Robot Control for Rapidly Evolving Deployment

LOCOMOTION

Language, Camera, Autonomy! Prompt-engineered Robot Control for Rapidly Evolving Deployment

Jacob P. Macdonald, Rohit Mallick, Allan Wollaber, Jaime Peña, Nathan J. McNeese, Ho Chit Siu

发表年份: 2024
引用次数: 13
访问权限: 开放获取

摘要

The Context-observant LLM-Enabled Autonomous Robots (CLEAR) platform offers a general solution for large language model (LLM)-enabled robot autonomy. CLEAR-controlled robots use natural language to perceive and interact with their environment: contextual description deriving from computer vision and optional human commands prompt intelligent LLM responses that map to robotic actions. By emphasizing prompting, system behavior is programmed without manipulating code, and unlike other LLM-based robot control methods, we do not perform any model fine-tuning. CLEAR employs off-the-shelf pre-trained machine learning models for controlling robots ranging from simulated quadcopters to terrestrial quadrupeds. We provide the open-source CLEAR platform, along with sample implementations for a Unity-based quadcopter and Boston Dynamics Spot® robot. Each LLM used, GPT-3.5, GPT-4, and LLaMA2, exhibited behavioral differences when embodied by CLEAR, contrasting in actuation preference, ability to apply new knowledge, and receptivity to human instruction. GPT-4 demonstrates best performance compared to GPT-3.5 and LLaMA2, showing successful task execution 97% of the time. The CLEAR platform contributes to HRI by increasing the usability of robotics for natural human interaction.

关键词

Software deploymentAutonomyRobotComputer scienceControl (management)Human–computer interactionArtificial intelligenceSoftware engineeringPolitical scienceLaw

Language, Camera, Autonomy! Prompt-engineered Robot Control for Rapidly Evolving Deployment

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory