Global Path Planning for Unmanned Surface Vehicles based on Adaptive Parameter Q-Learning Algorithm

Haoran Wang; Shaoyi Guo

doi:10.6919/ICJE.202602_12(2).0011

Authors

Haoran Wang
Shaoyi Guo

DOI:

https://doi.org/10.6919/ICJE.202602_12(2).0011

Keywords:

Unmanned Surface Vehicle (USV); Global Path Planning; Q-Learning Algorithm; Adaptive Parameters; Reinforcement Learning.

Abstract

Global path planning is one of the core issues in the field of Unmanned Surface Vehicle (USV) navigation. As a classic method in reinforcement learning, the Q-Learning algorithm is widely applied to USV global path planning. However, the traditional Q-Learning algorithm typically employs a fixed learning rate, greedy rate, and discount factor, which leads to issues such as redundant exploration in the late training stage and slow convergence speed. To address these limitations, this paper proposes an adaptive parameter Q-Learning path planning algorithm. The proposed algorithm dynamically adjusts the greedy rate to balance exploration and exploitation, adaptively optimizes the learning rate based on the temporal difference (TD) error, and dynamically adjusts the discount factor by combining the distance between the current state and the destination. These improvements enhance both the convergence efficiency and path planning performance of the algorithm. Comparative experiments between the improved algorithm and the traditional Q-Learning algorithm were conducted in a simulated water environment. Experimental results demonstrate that the improved algorithm increases the training convergence speed by over 30% and shortens the optimal path length by 11%, indicating a significant improvement in algorithm performance. This study provides a more efficient algorithmic scheme for USV global path planning and holds great significance for enhancing the real-time performance, reliability, and environmental adaptability of USV navigation.

Downloads

Download data is not yet available.

References

[1] Meng Xiangdu. Research on Path Planning Algorithms for Unmanned Vessels [D]. Tianjin: Tianjin University, 2017

[2] Zhang Daheng. Research on Autonomous Path Planning for Intelligent Ships Based on Deep Learning [D]. Dalian Maritime University 2022.DOI:10.26989/d.cnki.gdlhu.2022.000001.

[3] DIJKSTRA E. W. A note on two problems in connexion with graphs[J]. Numerische mathematik, 1959, 1(1): 269-271.

[4] Hart P E, Nilsson N J, Raphael B. A formal basis for the heuristic determination of minimum cost paths[J]. IEEE Trans-actions on Systems Science and Cybernetics, 1968, 4(2) : 100－107.

[5] HOLLAND J H. Adaptation In Natural and Artificial Systems[M]. Bradford: A Bradford Book, 1992.

[6] DORIGO M, MANIEZZO V, COLOMI A. Ant system : optimization by a colony of cooperating agents[J]. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 1996, 26(1): 29-41.

[7] Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G., Petersen, S., Beattie, C., Sadik, A., Antonoglou, I., King, H., Kumaran, D., Wierstra, D., Legg, S., Hassabis, D., 2015.

[8] Lee H T, Kim M K. Optimal path planning for a ship in coastal waters with deep Q network[J]. Ocean Engineering, 2024, 307: 118193.

[9] Chen, C., Chen, X.Q., Ma, F., Zeng, X.J., Wang, J., 2019a. A knowledge-free path planning approach for smart ships based on reinforcement learning. Ocean Eng. 189, 106299 https://doi.org/10.1016/j.oceaneng.2019.106299.

[10] Shen Haiqing. Collision Avoidance Navigation and Control for Unmanned Ships Based on Reinforcement Learning [D]. Dalian Maritime University, 2018.

[11] Yoo B, Kim J. Path optimization for marine vehicles in ocean currents using reinforcement learning[J]. Journal of Marine Science & Technology, 2016, 21(2): 334-343.

[12] Wang Yinan. Research on Ship Collision Avoidance Based on Navigation Rules in Q-Learning [D]. Dalian Maritime University, 2022.

[13] Wang Y, Lu C, Wu P, et al. Path planning for unmanned surface vehicle based on improved Q-Learning algorithm[J]. Ocean Engineering, 2024, 292: 116510.

[14] Chen X, Hu R, Luo K, et al. Intelligent ship route planning via an A∗ search model enhanced double-deep Q-network[J]. Ocean Engineering, 2025, 327: 120956.

[15] Yang Juncheng, Li Shuxia, Cai Zengyu. Research and Development of Path Planning Algorithms [J]. Control Engineering 2017,24(07):1473- 1480.