Hardware Efficient Direct Policy Imitation Learning for Robotic Navigation in Resource-Constrained Settings
Sumanasena, Vidura; Fernando, Heshan; De Silva, Daswin; Thileepan, Beniel; Pasan, Amila; Samarawickrama, Jayathu; Osipov, Evgeny; Alahakoon, Damminda (2023-12-28)
Sumanasena, Vidura
Fernando, Heshan
De Silva, Daswin
Thileepan, Beniel
Pasan, Amila
Samarawickrama, Jayathu
Osipov, Evgeny
Alahakoon, Damminda
MDPI
28.12.2023
Sumanasena V, Fernando H, De Silva D, Thileepan B, Pasan A, Samarawickrama J, Osipov E, Alahakoon D. Hardware Efficient Direct Policy Imitation Learning for Robotic Navigation in Resource-Constrained Settings. Sensors. 2024; 24(1):185. https://doi.org/10.3390/s24010185
https://creativecommons.org/licenses/by/4.0/
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
https://creativecommons.org/licenses/by/4.0/
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
https://creativecommons.org/licenses/by/4.0/
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:oulu-202401221399
https://urn.fi/URN:NBN:fi:oulu-202401221399
Tiivistelmä
Abstract
Direct policy learning (DPL) is a widely used approach in imitation learning for time-efficient and effective convergence when training mobile robots. However, using DPL in real-world applications is not sufficiently explored due to the inherent challenges of mobilizing direct human expertise and the difficulty of measuring comparative performance. Furthermore, autonomous systems are often resource-constrained, thereby limiting the potential application and implementation of highly effective deep learning models. In this work, we present a lightweight DPL-based approach to train mobile robots in navigational tasks. We integrated a safety policy alongside the navigational policy to safeguard the robot and the environment. The approach was evaluated in simulations and real-world settings and compared with recent work in this space. The results of these experiments and the efficient transfer from simulations to real-world settings demonstrate that our approach has improved performance compared to its hardware-intensive counterparts. We show that using the proposed methodology, the training agent achieves closer performance to the expert within the first 15 training iterations in simulation and real-world settings.
Direct policy learning (DPL) is a widely used approach in imitation learning for time-efficient and effective convergence when training mobile robots. However, using DPL in real-world applications is not sufficiently explored due to the inherent challenges of mobilizing direct human expertise and the difficulty of measuring comparative performance. Furthermore, autonomous systems are often resource-constrained, thereby limiting the potential application and implementation of highly effective deep learning models. In this work, we present a lightweight DPL-based approach to train mobile robots in navigational tasks. We integrated a safety policy alongside the navigational policy to safeguard the robot and the environment. The approach was evaluated in simulations and real-world settings and compared with recent work in this space. The results of these experiments and the efficient transfer from simulations to real-world settings demonstrate that our approach has improved performance compared to its hardware-intensive counterparts. We show that using the proposed methodology, the training agent achieves closer performance to the expert within the first 15 training iterations in simulation and real-world settings.
Kokoelmat
- Avoin saatavuus [42879]

