Policy Gradient for Observer Trajectory Planning with Application in Multi-target Tracking Problems

2018 52nd Asilomar Conference on Signals, Systems, and Computers, 2018

Abstract

Tracking multiple moving targets with bearing-only measurements is challenging because it is difficult to determine an observer trajectory that satisfies the observability conditions. This work formulates Observer Trajectory Planning (OTP) as a continuous control problem and proposes reinforcement learning as a solution. The proposed architecture is a model-independent framework that estimates the states of the targets and tracks multiple targets in a realistic scenario in which the agent has no prior information about the targets' initial locations and velocities.
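To illustrate why the observer's trajectory matters with bearing-only measurements, the following is a minimal sketch and is not taken from the paper. It assumes a single stationary target in the plane and noise-free bearings, which is much simpler than the multi-target, moving-target setting the abstract describes. The function names `bearing` and `triangulate` are hypothetical. A bearing gives only the direction to the target, so range can be recovered only if the observer moves off its line of sight.

```python
import math

def bearing(observer_xy, target_xy):
    """Line-of-sight angle in radians from observer to target, measured from the x-axis."""
    dx = target_xy[0] - observer_xy[0]
    dy = target_xy[1] - observer_xy[1]
    return math.atan2(dy, dx)

def triangulate(obs_a, theta_a, obs_b, theta_b, tol=1e-9):
    """Intersect two lines of sight taken from two observer positions.

    Assumes a stationary target (illustrative simplification).
    Returns the estimated (x, y), or None when the two lines of sight are
    (nearly) parallel, i.e. the observer motion added no range information.
    """
    ux_a, uy_a = math.cos(theta_a), math.sin(theta_a)
    ux_b, uy_b = math.cos(theta_b), math.sin(theta_b)
    # Solve obs_a + r_a * u_a = obs_b + r_b * u_b for the range r_a.
    det = ux_b * uy_a - ux_a * uy_b  # equals sin(theta_a - theta_b)
    if abs(det) < tol:
        return None
    dx = obs_b[0] - obs_a[0]
    dy = obs_b[1] - obs_a[1]
    r_a = (ux_b * dy - uy_b * dx) / det
    return (obs_a[0] + r_a * ux_a, obs_a[1] + r_a * uy_a)

target = (3.0, 4.0)
start = (0.0, 0.0)

# Observer moves across the line of sight: the two bearings differ.
crossing = (1.0, 0.0)
estimate = triangulate(start, bearing(start, target),
                       crossing, bearing(crossing, target))

# Observer moves along the line of sight: the bearing does not change.
along = (0.3, 0.4)
degenerate = triangulate(start, bearing(start, target),
                         along, bearing(along, target))
```

In this sketch, `estimate` recovers the target position, while `degenerate` is `None` because moving straight toward the target leaves the bearing unchanged. Choosing observer maneuvers that avoid such uninformative geometry is the kind of decision a trajectory planner has to make.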

Aliakbar Gorji hasn't uploaded this paper.
