首页 /研究 /Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results
LEARNING

Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results

Sridhar Mahadevan

发表年份
1996
引用次数
20
访问权限
开放获取

关键词

Reinforcement learningComputer scienceConvergence (economics)Asynchronous communicationMetric (unit)Temporal difference learningLearning automataQ-learningArtificial intelligencePerformance metric

相关论文

查看 LEARNING 分类全部论文