Efficient Methods for Multi-Objective Decision-Theoretic Planning. - Citegraph

Paper Info

Title
Efficient Methods for Multi-Objective Decision-Theoretic Planning.

Abstract
In decision-theoretic planning problems, such as (partially observable) Markov decision problems [Wiering and Van Otterlo, 2012] or coordination graphs [Guestrin et al., 2002], agents typically aim to optimize a scalar value function. However, in many real-world problems agents are faced with multiple possibly conflicting objectives, e.g., maximizing the economic benefits of timber harvesting while minimizing ecological damage in a forest management scenario [Bone and Dragicevic, 2009]. In such multi-objective problems, the value is a vector rather than a scalar [Roijers et al., 2013a]. Even when there are multiple objectives, it might not be necessary to have specialized multi-objective methods. When the problem can be scalarized, i.e., converted to a singleobjective problem before planning, existing single-objective methods may apply. Unfortunately, such a priori scalarization is not possible when the scalarization weights, i.e., the parameters of the scalarization, are not known in advance. For example, consider a company that mines different metals whose market prices vary. If there is not enough time to re-solve the decision problem for each price change, we need specialized multi-objective methods that compute a coverage set, i.e., a set of solutions optimal for all scalarizations. What constitutes a coverage set depends on the type scalarization. Much existing research assumes the Pareto coverage set (PCS), or Pareto front, as the optimal solution set. However, we argue that this is not always the best choice. In the highly prevalent case when the objectives will be linearly weighted, the convex coverage set (CCS) suffices. Because CCSs are typically much smaller, and have exploitable mathematical properties, CCSs are often much cheaper to compute than PCSs. Futhermore, when policies can be stochastic, all optimal value-vectors can be attained by mixing policies from the CCS [Vamplew et al., 2009]. Thefore, this project focuses on finding planning methods that compute the CCS.

Year	Venue	Field
2015	IJCAI	Mathematical optimization,Decision problem,Observable,Computer science,Markov chain,Scalar (physics),Multi-objective optimization,Regular polygon,Solution set,Mathematical properties
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
7	1

Authors (1 rows)

Cited by (0 rows)

References (7 rows)

Name	Order	Citations	PageRank
Diederik Marijn Roijers	1	22	1.79

1