Title
A Scale Aggregation and Spatial-Aware Network for Multi-View Crowd Counting
Abstract
Previous multi-view crowd counting methods underperform in maintaining scale consistency across views and overlook the negative effect of the complex background. To solve these problems, a scale aggregation and spatial-aware network for multi-view crowd counting (SASNet) is proposed. Firstly, we design a multi-branch adaptive scale aggregation module which aggregates the appropriate scale for each pixel in each view. Benefiting from the automatic feature learning process, it can help all camera-view features maintain the scale consistency as much as possible. Then, a crowd-centric selection module is used to reasonably assign the weight of pixels at different spatial locations, thereby selecting the region of crowd and suppressing the background information. Finally, we project each view selected features to the consistent world coordinate system and fuse them. Experimental results demonstrate that the proposed SASNet outperforms the state-of-the-art methods. Our SASNet achieves 7.44 MAE (9.46 RMSE) and 1.01 MAE(1.24 RMSE) in City Street and DukeMTMC respectively.
Year
DOI
Venue
2022
10.1109/ACCESS.2022.3213267
IEEE ACCESS
Keywords
DocType
Volume
Feature extraction, Convolution, Three-dimensional displays, Cameras, Estimation, Transformers, Aggregates, Multi-view crowd counting, one-stage, scale aggregation, spatial attention
Journal
10
ISSN
Citations 
PageRank 
2169-3536
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Caihua Liu152.08
Yifan Chen247470.39
Xinyu He301.01
Tao Xu49813.14