Publications

Dianzhao Li, und Ostap Okhrin. A platform-agnostic deep reinforcement learning framework for effective Sim2Real transfer towards autonomous driving. Commun Eng, (3)1:147, Springer Science and Business Media LLC, Oktober 2024. [PUMA: Sim2Real Xack autonomous deep driving framework learning platform-agnostic reinforcement]

Fabian Hart, Martin Waltz, und Ostap Okhrin. Two-step dynamic obstacle avoidance. Knowledge-based systems, (302)Elsevier Science B.V., 25.10.2024. [PUMA: topic_engineering Deep Dynamic FIS_scads Local Supervised avoidance, learning learning, obstacle path planning, reinforcement]

Martin Waltz, und Ostap Okhrin. Spatial–temporal recurrent reinforcement learning for autonomous ships. Neural Networks, (2023)165:634--653, Elsevier Science B.V., 15.06.2023. [PUMA: topic_engineering Algorithms, Autonomous COLREG, Computer, Deep FIS_scads Networks, Neural Psychology, Recurrency, Reinforcement, Reward Ships, learning, reinforcement surface vehicle,]

Martin Waltz, und Ostap Okhrin. Addressing maximization bias in reinforcement learning with two-sample testing. Artificial intelligence, (336)Elsevier Science B.V., November 2024. [PUMA: topic_engineering Estimation FIS_scads Maximum Q-learning, Reinforcement Two-sample bias, expected learning, testing value,]

Niklas Paulig, und Ostap Okhrin. Robust path following on rivers using bootstrapped reinforcement learning. Ocean engineering, (298)Elsevier Science B.V., 15.04.2024. [PUMA: topic_engineering Autonomous Deep FIS_scads Path Restricted following, learning, reinforcement surface vessel, waterways]

Ravi Pradip, und Frank Cichos. Deep reinforcement learning with artificial microswimmers. Emerging Topics in Artificial Intelligence (ETAI) 2022, (12204):104--110, 2022. [PUMA: topic_physchemistry Deep artificial learning microswimmers reinforcement]

Lars Muschalski, Joanna Wollmann, Andreas Hornig, und Niels Modler. Steuerung von Compliant-Mechanismen durch Reinforcement Learning. GETRIEBETAGUNG 2022, 121, 2022. [PUMA: topic_engineering Compliant-Mechanismen Learning Reinforcement Steuerung]

Martin Waltz, Ostap Okhrin, und Michael Schultz. Self-organized free-flight arrival for urban air mobility. Transportation Research Part C: Emerging Technologies, (167):104806, 2024. [PUMA: topic_engineering Deep Urban air eVTOL learning mobility reinforcement] URL

Fabian Hart, und Ostap Okhrin. Enhanced method for reinforcement learning based dynamic obstacle avoidance by assessment of collision risk. Neurocomputing, (568):127097, 2024. [PUMA: topic_engineering Collision Dynamic Reinforcement Training avoidance environment learning metric obstacle risk] URL

Frank Cichos, Santiago Mui�os Landin, und Ravi Pradip. Chapter 5 - Artificial intelligence (AI) enhanced nanomotors and active matter. In Yuebing Zheng, und Zilong Wu (Hrsg.), Intelligent Nanotechnology, 113--144, Elsevier, 2023. [PUMA: topic_physchemistry Active Feedback Machine Multi Optical Reinforcement agent control control, learning, particles, reinforcement] URL