Publications

Fabian Hart, Martin Waltz, and Ostap Okhrin. Two-step dynamic obstacle avoidance. Knowledge-based systems, (302)Elsevier Science B.V., Oct 25, 2024. [PUMA: topic_engineering Deep Dynamic FIS_scads Local Supervised avoidance, learning learning, obstacle path planning, reinforcement]

Martin Waltz, and Ostap Okhrin. Spatial–temporal recurrent reinforcement learning for autonomous ships. Neural Networks, (2023)165:634--653, Elsevier Science B.V., Jun 15, 2023. [PUMA: topic_engineering Algorithms, Autonomous COLREG, Computer, Deep FIS_scads Networks, Neural Psychology, Recurrency, Reinforcement, Reward Ships, learning, reinforcement surface vehicle,]

Martin Waltz, and Ostap Okhrin. Addressing maximization bias in reinforcement learning with two-sample testing. Artificial intelligence, (336)Elsevier Science B.V., November 2024. [PUMA: topic_engineering Estimation FIS_scads Maximum Q-learning, Reinforcement Two-sample bias, expected learning, testing value,]

Niklas Paulig, and Ostap Okhrin. Robust path following on rivers using bootstrapped reinforcement learning. Ocean engineering, (298)Elsevier Science B.V., Apr 15, 2024. [PUMA: topic_engineering Autonomous Deep FIS_scads Path Restricted following, learning, reinforcement surface vessel, waterways]

Frank Cichos, Santiago Mui�os Landin, and Ravi Pradip. Chapter 5 - Artificial intelligence (AI) enhanced nanomotors and active matter. In Yuebing Zheng, and Zilong Wu (Eds.), Intelligent Nanotechnology, 113--144, Elsevier, 2023. [PUMA: topic_physchemistry Active Feedback Machine Multi Optical Reinforcement agent control control, learning, particles, reinforcement] URL