Home /Research /Deep Q-Learning-Based Gain Scheduling for Nonlinear Quadcopter Dynamics

LEARNING

Deep Q-Learning-Based Gain Scheduling for Nonlinear Quadcopter Dynamics

Hossein Rastgoftar, Muhammad J. H. Zahed

Year: 2026
Access: Open access

Abstract

This paper presents a deep Q-network (DQN)-based gain-scheduling framework for safety-critical quadcopter trajectory tracking. Instead of directly learning control inputs, the proposed approach selects from a finite set of pre-certified stabilizing gain vectors, enabling reinforcement learning to operate within a structured and stability-preserving control architecture. By exploiting the isotropic structure of the translational dynamics, feedback gains are shared across spatial axes to reduce dimensionality while preserving performance. The learned policy adapts feedback aggressiveness in real time, applying high authority during large transients and reducing gains near convergence to limit control effort. Simulation results using a high-fidelity nonlinear quadcopter model demonstrate accurate trajectory tracking, bounded attitude excursions, smooth transition to hover after the final time, and consistent reward improvement, validating the effectiveness and robustness of the proposed learning-based gain scheduling strategy.

Keywords

eess.SYmath.DS

Deep Q-Learning-Based Gain Scheduling for Nonlinear Quadcopter Dynamics

Abstract

Keywords

Related papers

The Organization of Behavior

Fractional Brownian Motions, Fractional Noises and Applications

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

A guide to deep learning in healthcare