Title
Deep reinforcement learning and parameter transfer based approach for the multi-objective agile earth observation satellite scheduling problem
Abstract
The agile earth observation satellite scheduling problem (AEOSSP) consists of selecting and scheduling a number of tasks from a set of user requests in order to optimize one or multiple criteria. In this paper, we consider a multi-objective version of AEOSSP (called MO-AEOSSP) where the failure rate and the timeliness of scheduled requests are optimized simultaneously. Due to its NP-hardness, traditional iterative problem-tailored heuristic methods are sensitive to problem instances and require massive computational overhead. We thus propose a deep reinforcement learning and parameter transfer based approach (RLPT) to tackle the MO-AEOSSP in a non-iterative manner. RLPT first decomposes the MO-AEOSSP into a number of scalarized sub-problems by a weight sum approach where each subproblem can be formulated as a Markov Decision Process (MDP). RLPT then applies an encoder-decoder structure neural network (NN) trained by a deep reinforcement learning procedure to producing a highquality schedule for each sub-problem. The resulting schedules of all scalarized sub-problems form an approximate pareto front for the MO-AEOSSP. Once a NN of a subproblem is trained, RLPT applies a parameter transfer strategy to reducing the training expenses for its neighboring sub-problems. Experimental results on a large set of randomly generated instances show that RLPT outperforms three classical multi-objective evolutionary algorithms (MOEAs) in terms of solution quality, solution distribution and computational efficiency. Results on various-size instances also show that RLPT is highly general and scalable. To the best of our knowledge, this study is the first attempt that applies deep reinforcement learning to a satellite scheduling problem considering multiple objectives. (C) 2021 Elsevier B.V. All rights reserved.
Year
DOI
Venue
2021
10.1016/j.asoc.2021.107607
APPLIED SOFT COMPUTING
Keywords
DocType
Volume
Agile satellite scheduling, Multi-objective, Deep reinforcement learning, MOEA
Journal
110
ISSN
Citations 
PageRank 
1568-4946
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Luona Wei100.34
Yuning Chen223.41
Chen Ming31211.07
Ying-Wu Chen420519.89