首页 /研究 /Dynamic Task Planning for Multi-Arm Apple-Harvesting Robots Using LSTM-PPO Reinforcement Learning Algorithm

LEARNING

Dynamic Task Planning for Multi-Arm Apple-Harvesting Robots Using LSTM-PPO Reinforcement Learning Algorithm

Zhengwei Guo, Heng Fu, Jiahao Wu, Wenkai Han, Wengang Zheng, Tao Li

发表年份: 2025
引用次数: 16
访问权限: 开放获取

摘要

This paper presents a dynamic task planning approach for multi-arm apple-picking robots based on a deep reinforcement learning (DRL) framework incorporating Long Short-Term Memory (LSTM) networks and Proximal Policy Optimization (PPO). In the context of rising labor costs and labor shortages in agriculture, automated apple harvesting is becoming increasingly important. The proposed algorithm addresses key challenges such as efficient task coordination, optimal picking sequences, and real-time decision-making in complex, dynamic orchard environments. The system’s performance is validated through simulations in both static and dynamic environments, with the algorithm demonstrating significant improvements in task completion time and robot efficiency compared to existing strategies. The results show that the LSTM-PPO approach outperforms other methods, offering enhanced adaptability, fault tolerance, and task execution efficiency, particularly under changing and unpredictable conditions. This research lays the foundation for the development of more efficient, adaptable robotic systems in agricultural applications.

关键词

Reinforcement learningTask (project management)Computer scienceRobotArtificial intelligenceAlgorithmMachine learningEngineering

Dynamic Task Planning for Multi-Arm Apple-Harvesting Robots Using LSTM-PPO Reinforcement Learning Algorithm

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory