Fabian Hart, and Ostap Okhrin. Enhanced method for reinforcement learning based dynamic obstacle avoidance by assessment of collision risk. Neurocomputing, (568):127097, 2024. [PUMA: Collision Dynamic Reinforcement Training avoidance environment learning metric obstacle risk] URL