首页 /研究 /Computing the Exact Pareto Front in Average-Cost Multi-Objective Markov Decision Processes

OTHER

Computing the Exact Pareto Front in Average-Cost Multi-Objective Markov Decision Processes

Jiping Luo, Nikolaos Pappas

发表年份: 2026
访问权限: 开放获取

摘要

Many communication and control problems are cast as multi-objective Markov decision processes (MOMDPs). The complete solution to an MOMDP is the Pareto front. Much of the literature approximates this front via scalarization into single-objective MDPs. Recent work has begun to characterize the full front in discounted or simple bi-objective settings by exploiting its geometry. In this work, we characterize the exact front in average-cost MOMDPs. We show that the front is a continuous, piecewise-linear surface lying on the boundary of a convex polytope. Each vertex corresponds to a deterministic policy, and adjacent vertices differ in exactly one state. Each edge is realized as a convex combination of the policies at its endpoints, with the mixing coefficient given in closed form. We apply these results to a remote state estimation problem, where each vertex on the front corresponds to a threshold policy. The exact Pareto front and solutions to certain non-convex MDPs can be obtained without explicitly solving any MDP.

关键词

eess.SYcs.ITcs.LGcs.NI

Computing the Exact Pareto Front in Average-Cost Multi-Objective Markov Decision Processes

摘要

关键词

相关论文

Statistical Learning Theory

Fractional Differential Equations

Applied Nonlinear Control

Genetic Programming: On the Programming of Computers by Means of Natural Selection