基于深度学习的自动驾驶技术综述

doi:10.16511/j.cnki.qhdxxb.2018.21.010

摘要
图/表
参考文献
相关文章
Metrics

全文: PDF(2102 KB)
输出: BibTeX | EndNote (RIS)

摘要该文在行人检测技术方面介绍了基于卷积神经网络（CNN）模型的目标识别、检测技术与改进的区域卷积神经网络（R-CNN）、任务辅助卷积神经网络（TA-CNN）模型技术。在立体匹配技术方面简述了基于孪生网络的立体匹配的深度学习模型技术。在多传感器融合技术方面回顾了基于深度学习网络的视觉传感器、雷达传感器与摄像机传感器的多源数据融合技术。在汽车控制技术方面分析了基于卷积神经网络实现无人驾驶车辆端到端的横向与纵向控制技术。深度学习技术在自动驾驶领域的感知层、决策层与控制层的广泛运用，不断地提高感知、检测、决策与控制的准确率，并取得一定的成功，分析表明深度学习技术将加速自动驾驶技术的发展速度，为自动驾驶成为现实带来巨大的可能性。

	服务

	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS

	作者相关文章
	张新钰
	高洪波
	赵建辉
	周沫

关键词 ：计算机视觉, 深度学习, 无人驾驶车辆, 传感器

Abstract：This paper introduces target recognition and detection methods based on the convolutional neural network (CNN) model, the improved regions with convolutional neural network (R-CNN) and the task-assistant convolutional neural network (TA-CNN) model for pedestrian detection. This paper also describes stereo matching based on a deep learning model for stereo matching using the Siamese network. Multi-source data fusion is also introduced based on a vision sensor, a radar sensor and a camera using a deep learning network. The CNN is used for end-to-end horizontal and vertical control of autonomous vehicles. Deep learning is widely used in the perception level, decision-making level and control level in automatic driving systems to continuously improve the perception, detection, decision-making and control accuracy. Analyses show that deep learning will improve of autonomous driving systems.

Key words： computer vision deep learning autonomous vehicle sensor

收稿日期: 2017-12-30 出版日期: 2018-04-15

ZTFLH:

TP399

基金资助:国家重点研究和发展计划（2016YFB0100903）；北京市科学技术委员会重大专项（d171100005017002，d171100005117002）；中国博士后基金（2017M620765）

通讯作者: 高洪波,助理研究员,E-mail:ghb48@tsinghu.edu.cn E-mail: ghb48@tsinghu.edu.cn

作者简介: 张新钰(1974-),男,副研究员。

引用本文:

张新钰, 高洪波, 赵建辉, 周沫. 基于深度学习的自动驾驶技术综述[J]. 清华大学学报（自然科学版）, 2018, 58(4): 438-444.
ZHANG Xinyu, GAO Hongbo, ZHAO Jianhui, ZHOU Mo. Overview of deep learning intelligent driving methods. Journal of Tsinghua University(Science and Technology), 2018, 58(4): 438-444.

链接本文:

http://jst.tsinghuajournals.com/CN/10.16511/j.cnki.qhdxxb.2018.21.010 或 http://jst.tsinghuajournals.com/CN/Y2018/V58/I4/438

图１谷歌无人驾驶车辆架构

表１行人检测方法

图３双目系统获取深度信息原理图 <sup>[２２]</sup>

图４孪生深度网络原理图

图５雷达的密度深度图及对应的 HHA特征图

图６３种无人驾驶控制方案

[1] DÖRR D, GRABENGIESSER D, GAUTERIN F. Online driving style recognition using fuzzy logic[C]//Proceedings of the 17th International Conference on Intelligent Transportation Systems (ITSC). Qingdao, China:IEEE, 2014:1021-1026.
[2] DALAL N, TRIGGS B. Histograms of oriented gradients for human detection[C]//Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Diego, CA, USA:IEEE, 2005, 1:886-893.
[3] WU B, NEVATIA R. Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors[C]//Proceedings of the 10th IEEE International Conference on Computer Vision. Beijing, China:IEEE, 2005, 1:90-97.
[4] GAVRILA D M. A Bayesian, exemplar-based approach to hierarchical shape matching[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007, 29(8):1408-1421.
[5] MU Y D, YAN S C, LIU Y, et al. Discriminative local binary patterns for human detection in personal album[C]//Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition. Anchorage, Alaska, USA:IEEE, 2008:1-8.
[6] WANG X Y, HAN T X, YAN S C. An HOG-LBP human detector with partial occlusion handling[C]//Proceedings of the 12th International Conference on Computer Vision. Kyoto, Japan:IEEE, 2009:32-39.
[7] TUZEL O, PORIKLI F, MEER P. Pedestrian detection via classification on Riemannian manifolds[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, 30(10):1713-1727.
[8] WATANABE T, ITO S, YOKOI K. Co-occurrence histograms of oriented gradients for human detection[J]. IPSJ Transactions on Computer Vision and Applications, 2010, 2:39-47.
[9] DOLLAR P, TU Z W, PERONA P, et al. Integral channel features[C]//Proceedings of the British Machine Vision Conference. London, UK:BMVC, 2009:1-11.
[10] GAO W, AI H Z, LAO S H. Adaptive Contour Features in oriented granular space for human detection and segmentation[C]//Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, FL, USA:IEEE, 2009:1786-1793.
[11] LIU Y Z, SHAN S G, ZHANG W C, et al. Granularity-tunable gradients partition (GGP) descriptors for human detection[C]//Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, FL, USA:IEEE, 2009:1255-1262.
[12] WU J X, GEYER C, REHG J M. Real-time human detection using contour cues[C]//Proceedings of the 2011 IEEE International Conference on Robotics and Automation. Shanghai, China:IEEE, 2011:860-867.
[13] WALK S, MAJER N, SCHINDLER K, et al. New features and insights for pedestrian detection[C]//Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Francisco, CA, USA:IEEE, 2010:1030-1037.
[14] PLÖCHL M, EDELMANN J. Driver models in automobile dynamics application[J]. Vehicle System Dynamics, 2007, 45(7-8):699-741.
[15] GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. Columbus, OH, USA:IEEE, 2014:580-587.
[16] WALLACE B, GOUBRAN R, KNOEFEL F, et al. Measuring variation in driving habits between drivers[C]//Proceedings of the 2014 IEEE International Symposium on Medical Measurements and Applications. Lisboa, Portugal:IEEE, 2014:1-6.
[17] GIRSHICK R. Fast R-CNN[C]//Proceedings of the 2015 IEEE International conference on Computer Vision. Santiago, Chile:IEEE, 2015:1440-1448.
[18] REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN:Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6):1137-1149.
[19] TIAN Y L, LUO P, WANG X G, et al. Pedestrian detection aided by deep learning semantic tasks[C]//Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Boston, MA, USA:IEEE, 2015:5079-5087.
[20] LUO L H. Adaptive cruise control design with consideration of humans' driving psychology[C]//Proceedings of the 11th World Congress on Intelligent Control and Automation. Shenyang, China:IEEE, 2014:2973-2978.
[21] PACHECO J E, LÓPEZ E. Monitoring driving habits through an automotive CAN network[C]//Proceedings of the 23rd International Conference on Electronics, Communications and Computing. Cholula, Mexico:IEEE, 2013:138-143.
[22] MVHLMANN K, MAIER D, HESSER J, et al. Calculating dense disparity maps from color stereo images, an efficient implementation[J]. International Journal of Computer Vision, 2002, 47(1-3):79-88.
[23] CHOPRA S, HADSELL R, LECUN Y. Learning a similarity metric discriminatively, with application to face verification[C]//Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Diego, CA, USA:IEEE, 2005, 1:539-546.
[24] LUO W J, SCHWING A G, URTASUN R. Efficient deep learning for stereo matching[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, USA:IEEE, 2016:5695-5703.
[25] BUTAKOV V, IOANNOU P. Driving autopilot with personalization feature for improved safety and comfort[C]//Proceedings of the 18th International Conference on Intelligent Transportation Systems. Las Palmas, Spain:IEEE, 2015:387-393.
[26] ALJAAFREH A, ALSHABATAT N, NAJIM AL-DIN M S. Driving style recognition using fuzzy logic[C]//Proceedings of the 2012 IEEE International Conferenceon Vehicular Electronics and Safety. Istanbul, Turkey:IEEE, 2012:460-463.
[27] JOHNSON D A, TRIVEDI M M. Driving style recognition using a smartphone as a sensor platform[C]//Proceedings of the 14th International IEEE Conference on Intelligent Transportation Systems. Washington, DC, USA:IEEE, 2011:1609-1615.
[28] SCHLOSSER J, CHOW C K, KIRA Z. Fusing LIDAR and images for pedestrian detection using convolutional neural networks[C]//Proceedings of the2016 IEEE International Conference on Robotics and Automation. Stockholm, Sweden:IEEE, 2016:2198-2205.
[29] VAN LY M, MARTIN S, TRIVEDI M M. Driver classification and driving style recognition using inertial sensors[C]//Proceedings of the 2013 IEEE Intelligent Vehicles Symposium (IV). Gold Coast, QLD, Australia:IEEE, 2013:1040-1045.
[30] POMERLEAU D A. ALVINN:An autonomous land vehicle in a neural network[C]//Advances in Neural Information Processing Systems. San Francisco, CA, USA:ACM, 1989:305-313.
[31] BOJARSKI M, DEL TESTA D, DWORAKOWSKI D, et al. End to end learning for self-driving cars[J/OL]. (2016-04-25). http://arxiv.org/pdf/1604.07316.pdf.
[32] CHEN C Y, SEFF A, KORNHAUSER A, et al. DeepDriving:Learning affordance for direct perception in autonomous driving[C]//Proceedings of the 2015 IEEE International Conference on Computer Vision. Santiago, Chile:IEEE, 2015:2722-2730.

[1]	冯宇星, 郑军, 林劲松. 基于恒聚焦光路的线激光视觉传感器的标定方案[J]. 清华大学学报（自然科学版）, 2024, 64(4): 738-748.
[2]	黄贲, 康飞, 唐玉. 基于目标检测的混凝土坝裂缝实时检测方法[J]. 清华大学学报（自然科学版）, 2023, 63(7): 1078-1086.
[3]	苗旭鹏, 张敏旭, 邵蓥侠, 崔斌. PS-Hybrid: 面向大规模推荐模型训练的混合通信框架[J]. 清华大学学报（自然科学版）, 2022, 62(9): 1417-1425.
[4]	张羽中, 柳世强, 张俊昌, 朱荣. 人体下肢运动协调模型[J]. 清华大学学报（自然科学版）, 2022, 62(3): 458-462.
[5]	李彦霖, 秦本科, 薄涵亮. 电容式棒位测量传感器的解析模型及验证[J]. 清华大学学报（自然科学版）, 2022, 62(10): 1636-1644.
[6]	梅杰, 李庆斌, 陈文夫, 邬昆, 谭尧升, 刘春风, 王东民, 胡昱. 基于目标检测模型的混凝土坯层覆盖间歇时间超时预警[J]. 清华大学学报（自然科学版）, 2021, 61(7): 688-693.
[7]	管志斌, 王晓萌, 辛伟, 王嘉捷. 源代码缺陷检测数据生成及标注方法[J]. 清华大学学报（自然科学版）, 2021, 61(11): 1240-1245.
[8]	韩坤, 潘海为, 张伟, 边晓菲, 陈春伶, 何舒宁. 基于多模态医学图像的Alzheimer病分类方法[J]. 清华大学学报（自然科学版）, 2020, 60(8): 664-671,682.
[9]	王志国, 章毓晋. 监控视频异常检测：综述[J]. 清华大学学报（自然科学版）, 2020, 60(6): 518-529.
[10]	蒋文斌, 王宏斌, 刘湃, 陈雨浩. 基于AVX2指令集的深度学习混合运算策略[J]. 清华大学学报（自然科学版）, 2020, 60(5): 408-414.
[11]	余传明, 原赛, 胡莎莎, 安璐. 基于深度学习的多语言跨领域主题对齐模型[J]. 清华大学学报（自然科学版）, 2020, 60(5): 430-439.
[12]	宋欣瑞, 张宪琦, 张展, 陈新昊, 刘宏伟. 多传感器数据融合的复杂人体活动识别[J]. 清华大学学报（自然科学版）, 2020, 60(10): 814-821.
[13]	庞齐齐, 张丽霞, 何一超, 宫正, 冯占宗, 陈亚龙, 危银涛, 杜永昌. 磁流变半主动悬架控制算法验证平台[J]. 清华大学学报（自然科学版）, 2019, 59(7): 567-574.
[14]	孙博文, 朱志明, 郭吉昌, 张天一. 基于组合激光结构光的视觉传感器检测算法及图像处理流程优化[J]. 清华大学学报（自然科学版）, 2019, 59(6): 445-452.
[15]	张继文, 宋立滨, 许君杰, 石循磊, 刘莉. 仿人足球机器人的非预定义足球检测算法[J]. 清华大学学报（自然科学版）, 2019, 59(4): 298-305.

Viewed

Full text

Abstract

Cited

Shared

Discussed