黄玉春, 宋俊, 王宸, 孙楠, 王勤, 钟毓宁. 基于变体天牛须改进QL的AGV路径规划算法研究[J]. 制造技术与机床, 2024, (10): 89-97. DOI: 10.19287/j.mtmt.1005-2402.2024.10.012
引用本文: 黄玉春, 宋俊, 王宸, 孙楠, 王勤, 钟毓宁. 基于变体天牛须改进QL的AGV路径规划算法研究[J]. 制造技术与机床, 2024, (10): 89-97. DOI: 10.19287/j.mtmt.1005-2402.2024.10.012
HUANG Yuchun, SONG Jun, WANG Chen, SUN Nan, WANG Qin, ZHONG Yuning. Research on AGV path planning algorithm based on variant beetle antennae search algorithm improved Q-learning[J]. Manufacturing Technology & Machine Tool, 2024, (10): 89-97. DOI: 10.19287/j.mtmt.1005-2402.2024.10.012
Citation: HUANG Yuchun, SONG Jun, WANG Chen, SUN Nan, WANG Qin, ZHONG Yuning. Research on AGV path planning algorithm based on variant beetle antennae search algorithm improved Q-learning[J]. Manufacturing Technology & Machine Tool, 2024, (10): 89-97. DOI: 10.19287/j.mtmt.1005-2402.2024.10.012

基于变体天牛须改进QL的AGV路径规划算法研究

Research on AGV path planning algorithm based on variant beetle antennae search algorithm improved Q-learning

  • 摘要: 针对Q-learning(QL)在解决AGV(automated guided vehicle)路径规划时前期收敛速度慢、后期易陷入局部最优等问题,提出了一种变体天牛须改进QL的进化强化学习算法(BAS-QL)。BAS-QL主要特点有三方面,首先使用变体天牛须算法对Q表格进行初始化,加快AGV路径规划的前期寻优速度;然后使用渐变衰减Epsilon-Greedy搜索策略,利用衰减Epsilon值来避免算法在后期陷入局部最优避免出现结果难收敛的现象。最后求解AGV行走的最优路径,并通过实验对BAS-QL算法进行验证。在n=15和n=20栅格图对比实验中,BAS-QL表现出平均耗时短、平均路程短和平均迭代次数少的特点。说明该方法在智能规划路径的同时还可以有效提升AGV的路径规划效率。

     

    Abstract: Aiming at the problems of Q-learning (QL) in solving automated guided vehicle(AGV) path planning, such as slow convergence speed in the early stage and easy to fall into local optimum in the late stage, an evolutionary reinforcement learning (BAS-QL) algorithm is proposed to improve QL. There are three main features of BAS-QL. Firstly, a variant of the beetle antennae search algorithm is used to initialize the Q table, which can accelerate the optimization speed in the early stage of AGV path planning. Secondly, use the asymptotic decay Epsilon-Greedy search strategy, using the decay Epsilon value to avoid the algorithm to fall into the local optimum at the later stage to avoid the phenomenon of difficult convergence of the results. Finally, the optimal path of AGV traveling is solved, and the BAS-QL algorithm is verified through experiments. In the n=15 and n=20 raster map comparison experiments, BAS-QL shows the characteristics of short average time consumption, short average distance traveled and low average number of iterations. It shows that the method can also effectively improve the path planning efficiency of AGV while intelligently planning paths.

     

/

返回文章
返回