Journal of Jilin University(Engineering and Technology Edition) ›› 2025, Vol. 55 ›› Issue (12): 3918-3927.doi: 10.13229/j.cnki.jdxbgxb.20240606

Previous Articles    

Autonomous driving decision⁃making model based on language reasoning and cognitive memory

Xiang WANG1(),Guo-zhen TAN1,Yan-fei PENG1,Hao REN2,Jian-ping LI1   

  1. 1.School of Computer Science and Technology,Dalian University of Technology,Dalian 116081,China
    2.Department of Precision Instrument,Tsinghua University,Beijing 100084,China
  • Received:2024-05-31 Online:2025-12-01 Published:2026-02-03

Abstract:

To address the issues of insufficent safety performance and low learning inefficient in traditional autonomous driving systems, an autonomous driving safety decision-making model capable of continuous learning and understanding linguistic information was proposed. Referring to the reasoning decision-making and experience accumulation processes in human driving, this model leverages a large language model (LLM) as the decision-making agent, integrating chain-of-thought reasoning, a two-stage attention mechanism, and cognitive memory storage and retrieval into the contextual safety learning of the driving process. Meanwhile, a kinematic module is employed to convert LLM decisions into executable driving commands, enabling the continuous learning of safe driving experiences. Experimental results demonstrate that the proposed decision-making model significantly improves safety and efficiency compared to rule, reinforcement learning, and knowledge-based approaches, and possesses the capability of continuous learning and adapting driving behaviors based on human instructions, providing a reference for human-like autonomous driving.

Key words: vehicle engineering, autonomous driving, continuous learning, large language model, chain-of-thought reasoning, two-stage attention mechanism

CLC Number: 

  • U495

Fig.1

Overall architecture of the model"

Fig.2

Architecture of ego-attention head"

Fig.3

Two-stage memory query"

Fig.4

In-context learning guiding decision-making"

Fig.5

Vehicle kinematics model"

Table 1

Llama 2 data flow"

数据流参数类型说明
输入modelstringllama2-7b-chat-v2
messageslist输入的内容
result_formatstring用户返回的内容类型
输出request_idstringllama2-7b-chat-v2
outputlist调用结果信息

usage.input_

tokens

int用户输入文本转换为Token后的长度

usage.output_

tokens

int模型生成回复转换为Token后的长度

Table 2

Attention mechanism parameter setting"

结构取值
输入[·,6]
层数编码器[64,64];2个注意力头;dk =32;解码器[64,64]
参数量3.4×104

Table 3

Decision evaluation index"

场景方法CζtPDPdCn

高速

环境

LRCMM012.826.41.351.3113.89
RBM426.819.73.253.1327.83
MFRLM835.215.73.843.4546.95
KDM314.629.52.472.3120.84

十字

路口

LRCMM032.847.21.840.9124.22
RBM275.132.14.212.6448.11
MFRLM968.430.55.281.0650.50
KDM742.558.53.983.6140.31

Fig.6

Highway decision process"

Fig.7

Intersection decision process"

Fig.8

Continuous learning result"

Table 4

Ablation experiment"

场景TAMCMCζtPDPdCn

高速

环境

××669.534.34.353.9448.00
×464.736.53.843.1243.82
×338.628.42.371.2626.90
014.727.11.441.2914.97

十字

路口

××233.748.94.213.9538.03
×332.547.84.183.8437.35
×132.147.33.893.5235.57
031.746.23.743.4334.55

Table 5

Driving behavior with different commands"

指导命令aˉ/(m·s-2sˉ/radvˉ/(m·s-1gˉ/mlˉ
DMA3.500.0334.78.67
DMC0.210.0121.241.81
NEC1.870.0228.325.62

Fig.9

Driving behavior"

[1] 马依宁, 姜为, 吴靖宇, 等. 基于不同风格行驶模型的自动驾驶仿真测试自演绎场景研究[J]. 中国公路报, 2023, 36(2): 216-228.
Ma Yi-ning, Jiang Wei, Wu Jing-yu, et al. Self- evolution scenarios for simulation tests of autonomous vehicles based on different models of driving styles[J]. China Journal of Highway and Transport, 2023, 36(2): 216-228.
[2] 李伟东, 马草原, 史浩, 等. 基于分层强化学习的自动驾驶决策控制算法[J]. 吉林大学学报: 工学版, 2025, 55(5): 1798-1805.
Li Wei-dong, Ma Cao-yuan, Shi Hao, et al. An automatic driving decision control algorithm based on
hierarchical reinforcement learning[J]. Journal of Jilin University (Engineering and Technology Edition), 2025, 55(5): 1798-1805.
[3] 朱波, 张纪伟, 谈东奎, 等. 基于多源传感器与导航地图的端到端自动驾驶方法[J]. 汽车安全与节能学报, 2022, 13(4): 738-749.
Zhu Bo, Zhang Ji-wei, Tan Dong-kui, et al. End-to-end autonomous driving method based on multi-source sensor and navigation map[J]. Journal of Automotive Safety and Energy, 2022, 13(4): 738-749.
[4] Zhang Q X, Zhao Y H, Wang Y J, et al. Towards cross-task universal perturbation against black-box object detectors in autonomous driving[J]. Computer Networks, 2020, 180: No.107388.
[5] Wang S Y, Zhu Y X, Li Z H, et al. ChatGPT as your vehicle co-pilot: An initial attempt[J]. IEEE Transactions on Intelligent Vehicles, 2023, 8(12): 4706-4721.
[6] Cui Y D, Huang S C, Zhong J M, et al. DriveLLM: charting the path toward full autonomous driving with large language models[J]. IEEE Transactions on Intelligent Vehicles, 2023, 9(1): 1450-1464.
[7] Kojima T, Gu S S, Reid M, et al. Large language models are zero-shot reasoners[J]. Advances in Neural Information Processing Systems, 2022, 35: 22199- 22213.
[8] 王祥, 谭国真. 基于知识与大语言模型的高速环境自动驾驶决策研究[J]. 系统仿真学报, 2025(5): 1246-1255.
Wang Xiang, Tan Guo-zhen. Research on decision-making of autonomous driving in highway environment based on knowledge and large language model[J]. Journal of System Simulation, 2025(5): 1246-1255.
[9] Peng Y F, Tan G Z, Si H W, et al. DRL-GAT-SA: Deep reinforcement learning for autonomous driving planning based on graph attention networks and simplex architecture[J]. Journal of Systems Architecture, 2022, 126: No.102505.
[10] 胡宏宇, 张慧珺, 姚荣涵, 等. L3级自动驾驶接管过程驾驶员情景意识研究[J]. 吉林大学学报: 工学版, 2024, 54(2): 410-418.
Hu Hong-yu, Zhang Hui-jun, Yao Rong-han, et al. Driver's situational awareness in takeover process of L3 automated vehicles[J]. Journal of Jilin University (Engineering and Technology Edition), 2024, 54(2): 410-418.
[11] Nie X T, Liang Y P, Ohkura K. Autonomous highway driving using reinforcement learning with safety check system based on time-to-collision[J]. Artificial Life and Robotics, 2023, 28(1): 158-165.
[12] Chang M K, Lee S H, Chung C C. Comparative evaluation of dynamic and kinematic vehicle models[C]∥Conference on Decision and Control, Los Angeles, CA, USA, 2015: 648-653.
[13] Treiber M, Hennecke A, Helbing D. Congested traffic states in empirical observations and microscopic simulations[J]. Physical Review E, 2000, 62(2): 1805.
[14] Xin L, Kong Y T, Li S E, et al. Enable faster and smoother spatio-temporal trajectory planning for autonomous vehicles in constrained dynamic environment[J].Journal of Automobile Engineering, 2021, 235(4): 1101-1112.
[15] Li G F, Li S L, Li S, et al. Deep reinforcement learning enabled decision-making for autonomous driving at intersections[J]. Automotive Innovation, 2020, 3: 374-385.
[1] Wei LAN,Zheng ZHOU,Guan-yu WANG,Wei WANG,Miao-miao ZHANG. Intelligent fitting method for vehicle design based on machine learning [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(9): 2858-2863.
[2] Bing ZHU,Peng-xiang MENG,Bin LIU,Jia-yi HAN,Jian ZHAO,Zhi-cheng CHEN,Dong-jian SONG,Xiao-wen TAO. Virtual lane lines fitting method based on traffic environment information [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(9): 2935-2945.
[3] Shou-tao LI,Xiang-yi JIA,Jun ZHU,Hong-yan GUO,Ding-li YU. Uncontrolled intersections decision⁃making method for intelligent driving vehicles based on Level⁃K [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(9): 3069-3078.
[4] Gui-shen YU,Xin CHEN,Yue TANG,Chun-hui ZHAO,Ai-jia NIU,Hui CHAI,Jing-xin NA. Effect of laser surface treatment on the shear strength of aluminum-aluminum bonding joints [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(8): 2555-2569.
[5] Jun-wu ZHAO,Ting QU,Yun-feng HU. Trajectory planning for intelligent vehicles based on adaptive sampling [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(8): 2802-2816.
[6] Jin-wu GAO,Shao-long SUN,Shun-yao WANG,Bing-zhao GAO. Speed fluctuation suppression strategy of range extender based on motor torque compensation [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(8): 2475-2486.
[7] Mei-xia JIA,Jian-jun HU,Feng XIAO. Multi⁃physics simulation method of vehicle motor under varying working conditions based on multi⁃software combination [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(6): 1862-1872.
[8] Xue-wei SONG,Ze-ping YU,Yang XIAO,De-ping WANG,Quan YUAN,Xin-zhuo LI,Jia-wen ZHENG. Research progress on the performance changes of lithium⁃ion batteries after aging [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(6): 1817-1833.
[9] Chun XIAO,Zi-chun YI,Bing-yin ZHOU,Shao-rui ZHANG. Fuzzy energy management strategy of fuel cell electric vehicle based on improved pigeon⁃inspired optimization [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(6): 1873-1882.
[10] Jian WANG,Chen-wei JIA. Trajectory prediction model for intelligent connected vehicle [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(6): 1963-1972.
[11] Wei-dong LI,Cao-yuan MA,Hao SHI,Heng CAO. An automatic driving decision control algorithm based on hierarchical reinforcement learning [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(5): 1798-1805.
[12] Dang LU,Yan-ru SUO,Yu-hang SUN,Hai-dong WU. Estimation of tire camber and sideslip combined mechanical characteristics based on dimensionless expression [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(5): 1516-1524.
[13] Zhen-hai GAO,Cheng-yuan ZHENG,Rui ZHAO. Review of active safety verification and validation for autonomous vehicles in real and virtual scenarios [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(4): 1142-1162.
[14] Tao ZHANG,Huang-da LIN,Zhong-jun YU. Real-time rolling optimization control method for gearshift of hybrid electric vehicles [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(4): 1215-1224.
[15] Dang LU,Xiao-fan WANG,Hai-dong WU. Analysis of uniform distribution characteristics of contact pressure of TWEEL tires [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(3): 811-819.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!