Title
F-Transformer: Point Cloud Fusion Transformer for Cooperative 3D Object Detection
Abstract
We present a novel cooperative detection framework to fuse multi-view point clouds, for accurately detecting hard samples (e.g., partly or fully occluded, or small objects). Building on a two-step communication scheme to transmit the pillar features between views, it is possible to observe the same object from different viewpoints. We then design a feature fusion scheme based on Transformer to fuse the pillar features by discretizing the point clouds. Considering the sparsity of information, we improve Transformer's self-attention mechanism, with Re-Scaled Dot-Product Attention, which allows the sparse information to capture valuable information more effectively. We evaluate the performance of our method by generating synthetic cooperative datasets over multiple complex traffic scenarios. The results show that our method surpasses all other cooperative perception methods with significant margins.
Year
DOI
Venue
2022
10.1007/978-3-031-15919-0_15
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I
Keywords
DocType
Volume
Feature fusion, Cooperative detection, Self-attention
Conference
13529
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Jie Wang100.34
Guiyang Luo200.34
Quan Yuan300.34
Jinglin Li415030.39