Policy Gradient for Observer Trajectory Planning with Application in Multi-target Tracking Problems (original) (raw)
2018 52nd Asilomar Conference on Signals, Systems, and Computers, 2018
Abstract
Tracking multiple moving targets with bearing-only measurement is a challenging task, due to the inherent difficulties in determining the correct trajectory of the observer that will meet observability conditions. The work presented here formulates Observer Trajectory Planning (OTP) as a continuous control problem, and proposes reinforcement learning as a solution. The proposed architecture in this work constitutes a model-independent framework that allows for the estimation of the states of targets, and that allows multiple targets to be tracked in a realistic scenario, where the agent has no prior information about the initial locations and velocities of the targets.
Aliakbar Gorji hasn't uploaded this paper.
Create a free Academia account to let Aliakbar Gorji know you want this paper to be uploaded.
Ask for this paper to be uploaded.