基于深度学习的多模态多任务端到端自动驾驶研究

田晟; 冯宇鹏; 张裕天; 黄伟; 王蕾

doi:10.12198/j.issn.1673-159X.3416

基于深度学习的多模态多任务端到端自动驾驶研究

Research on Multi-Modal Multi-Task End-to-end Autonomous Driving Method Based on Deep Learning

摘要

摘要: 当前端到端自动驾驶系统的研究方法主要是采用图像或图像序列作为输入，使用卷积神经网络直接预测方向盘转角，取得了较好的效果，但仅通过转向命令并不足以完成自动驾驶车辆的控制。为了更好地实现对自动驾驶车辆的横纵向控制，构建基于端到端学习的CNN-LSTM（卷积神经网络-长短时记忆）多模态多任务神经网络模型，将图像、速度序列和方向盘转角序列作为输入，从而同时预测车辆的方向盘转角和速度值。在搭建的基于GTAV（Grand Theft Auto V, 侠盗猎车5）仿真平台数据集和真实场景数据集上进行实验和测试，实验结果表明模型能够较好地完成车道保持的驾驶行为和基本实现自动驾驶避障测试。

Abstract: The current research methods of the end-to-end automatic driving system mainly take an image or an image sequence as input, and directly predict the steering wheel angle with convolutional neural network. It has achieved good results, but it is not sufficient for the self-driving vehicle control by taking the advantage of the steering wheel alone. In order to better realize the horizontal and vertical control of the self-driving vehicle, the CNN-LSTM multi-modal multi-task neural network model based on end-to-end learning was constructed, the image, speed sequence and steering wheel angle sequence were taken as inputs, and the steering angle and speed of the vehicle were predicted. The model was trained and tested on the GTAV simulation platform dataset and the real-world scene dataset. The experimental results show that the model can better complete the driving behavior of lane keeping and basically realize the obstacle avoidance test of automatic driving.

HTML全文

参考文献(19)

文章被引

资源附件(0)