Issue |
JNWPU
Volume 41, Number 6, Decembre 2023
|
|
---|---|---|
Page(s) | 1190 - 1197 | |
DOI | https://doi.org/10.1051/jnwpu/20234161190 | |
Published online | 26 February 2024 |
3D point cloud object detection algorithm based on Transformer
基于Transformer的3D点云目标检测算法
1
Shenyang Aircraft Design Research Institute, Shenyang 110035, China
2
School of Electronics and Information, Northwestern Polytechnical University, Xi'an 710072, China
3
CSSC Systems Engineering Research Institute, Beijing 100094, China
4
No. 1 Military Representative Office of Equipment Department of PLA Airforce in Shenyang, Shenyang 110850, China
Received:
9
January
2023
In response to the difficulty in deploying anchor box based methods in 3D object detection due to the increase in spatial dimensions, this paper studies a point cloud object detection algorithm based on set prediction. This article proposes a Transformer based 3D point cloud object detection algorithm, and combines the characteristics of point clouds in autonomous driving scenarios to propose an improved spatial modulation attention and heat map initialization strategy for training acceleration and query initialization, achieving good detection performance in shallow networks. This article compares it with other algorithms on the KITTI dataset, and the results show that our algorithm has reached an advanced level in performance. We also conducted ablation experiments on the main components of the algorithm to verify the contribution of each module to the detection effect.
摘要
针对在三维目标检测中由于空间维度的增加基于锚框的方法难以部署的问题, 研究了基于集合预测的点云目标检测算法。提出一种基于Transformer的3D点云目标检测算法, 并结合自动驾驶场景下的点云特点, 提出了改进空间调制注意力和热图初始化策略进行训练加速和查询初始化, 在浅层网络下取得了良好的检测性能。在KITTI数据集上与其他算法进行比较, 结果表明所提算法在性能上已经达到先进水平, 进一步对算法中的主要组成部分进行了消融实验, 验证了各个模块对检测效果的贡献。
Key words: Transformer / spatial modulation attention mechanism / heat map initialization / target detection / deep learning
关键字 : Transformer / 空间调制注意力机制 / 热图初始化 / 目标检测 / 深度学习
© 2023 Journal of Northwestern Polytechnical University. All rights reserved.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.