Deep reinforcement learning and parameter transfer based approach for the multi-objective agile earth observation satellite scheduling problem - Citegraph

Paper Info

Title
Deep reinforcement learning and parameter transfer based approach for the multi-objective agile earth observation satellite scheduling problem

Abstract
The agile earth observation satellite scheduling problem (AEOSSP) consists of selecting and scheduling a number of tasks from a set of user requests in order to optimize one or multiple criteria. In this paper, we consider a multi-objective version of AEOSSP (called MO-AEOSSP) where the failure rate and the timeliness of scheduled requests are optimized simultaneously. Due to its NP-hardness, traditional iterative problem-tailored heuristic methods are sensitive to problem instances and require massive computational overhead. We thus propose a deep reinforcement learning and parameter transfer based approach (RLPT) to tackle the MO-AEOSSP in a non-iterative manner. RLPT first decomposes the MO-AEOSSP into a number of scalarized sub-problems by a weight sum approach where each subproblem can be formulated as a Markov Decision Process (MDP). RLPT then applies an encoder-decoder structure neural network (NN) trained by a deep reinforcement learning procedure to producing a highquality schedule for each sub-problem. The resulting schedules of all scalarized sub-problems form an approximate pareto front for the MO-AEOSSP. Once a NN of a subproblem is trained, RLPT applies a parameter transfer strategy to reducing the training expenses for its neighboring sub-problems. Experimental results on a large set of randomly generated instances show that RLPT outperforms three classical multi-objective evolutionary algorithms (MOEAs) in terms of solution quality, solution distribution and computational efficiency. Results on various-size instances also show that RLPT is highly general and scalable. To the best of our knowledge, this study is the first attempt that applies deep reinforcement learning to a satellite scheduling problem considering multiple objectives. (C) 2021 Elsevier B.V. All rights reserved.

Year	DOI	Venue
2021	10.1016/j.asoc.2021.107607	APPLIED SOFT COMPUTING
Keywords	DocType	Volume
Agile satellite scheduling, Multi-objective, Deep reinforcement learning, MOEA	Journal	110
ISSN	Citations	PageRank
1568-4946	0	0.34
References	Authors
0	4

Authors (4 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Luona Wei	1	0	0.34
Yuning Chen	2	2	3.41
Chen Ming	3	12	11.07
Ying-Wu Chen	4	205	19.89

1