A platform-agnostic deep reinforcement learning framework for effective Sim2Real transfer towards autonomous driving. Commun Eng, (3)1:147, Springer Science and Business Media LLC, October 2024. [PUMA: Sim2Real Xack autonomous deep driving framework learning platform-agnostic reinforcement]
Two-step dynamic obstacle avoidance. Knowledge-Based Systems, (302), Elsevier Science B.V., Oct 25, 2024. [PUMA: topic_engineering Deep Dynamic FIS_scads Local Supervised avoidance, learning learning, obstacle path planning, reinforcement]
Spatial–temporal recurrent reinforcement learning for autonomous ships. Neural Networks, (2023)165:634--653, Elsevier Science B.V., Jun 15, 2023. [PUMA: topic_engineering Algorithms, Autonomous COLREG, Computer, Deep FIS_scads Networks, Neural Psychology, Recurrency, Reinforcement, Reward Ships, learning, reinforcement surface vehicle,]
Addressing maximization bias in reinforcement learning with two-sample testing. Artificial Intelligence, (336), Elsevier Science B.V., November 2024. [PUMA: topic_engineering Estimation FIS_scads Maximum Q-learning, Reinforcement Two-sample bias, expected learning, testing value,]
Robust path following on rivers using bootstrapped reinforcement learning. Ocean Engineering, (298), Elsevier Science B.V., Apr 15, 2024. [PUMA: topic_engineering Autonomous Deep FIS_scads Path Restricted following, learning, reinforcement surface vessel, waterways]
Deep reinforcement learning with artificial microswimmers. Emerging Topics in Artificial Intelligence (ETAI) 2022, (12204):104--110, 2022. [PUMA: topic_physchemistry Deep artificial learning microswimmers reinforcement]
Steuerung von Compliant-Mechanismen durch Reinforcement Learning. GETRIEBETAGUNG 2022, 121, 2022. [PUMA: topic_engineering Compliant-Mechanismen Learning Reinforcement Steuerung]
Self-organized free-flight arrival for urban air mobility. Transportation Research Part C: Emerging Technologies, (167):104806, 2024. [PUMA: topic_engineering Deep Urban air eVTOL learning mobility reinforcement]
Enhanced method for reinforcement learning based dynamic obstacle avoidance by assessment of collision risk. Neurocomputing, (568):127097, 2024. [PUMA: topic_engineering Collision Dynamic Reinforcement Training avoidance environment learning metric obstacle risk]
Chapter 5 - Artificial intelligence (AI) enhanced nanomotors and active matter. In Yuebing Zheng and Zilong Wu (Eds.), Intelligent Nanotechnology, 113--144, Elsevier, 2023. [PUMA: topic_physchemistry Active Feedback Machine Multi Optical Reinforcement agent control control, learning, particles, reinforcement]