Title
A Low-Altitude Remote Sensing Inspection Method on Rural Living Environments Based on a Modified YOLOv5s-ViT
Abstract
The governance of rural living environments is one of the important tasks in the implementation of a rural revitalization strategy. At present, the illegal behaviors of random construction and random storage in public spaces have seriously affected the effectiveness of the governance of rural living environments. The current supervision on such problems mainly relies on manual inspection. Due to the large number and wide distribution of rural areas to be inspected, this method is limited by obvious disadvantages, such as low detection efficiency, long-time spending, and huge consumption of human resources, so it is difficult to meet the requirements of efficient and accurate inspection. In response to the difficulties encountered, a low-altitude remote sensing inspection method on rural living environments was proposed based on a modified YOLOv5s-ViT (YOLOv5s-Vision Transformer) in this paper. First, the BottleNeck structure was modified to enhance the multi-scale feature capture capability of the model. Then, the SimAM attention mechanism module was embedded to intensify the model's attention to key features without increasing the number of parameters. Finally, the Vision Transformer component was incorporated to improve the model's ability to perceive global features in the image. The testing results of the established model showed that, compared with the original YOLOv5 network, the Precision, Recall, and mAP of the modified YOLOv5s-ViT model improved by 2.2%, 11.5%, and 6.5%, respectively; the total number of parameters was reduced by 68.4%; and the computation volume was reduced by 83.3%. Relative to other mainstream detection models, YOLOv5s-ViT achieved a good balance between detection performance and model complexity. This study provides new ideas for improving the digital capability of the governance of rural living environments.
Year
DOI
Venue
2022
10.3390/rs14194784
REMOTE SENSING
Keywords
DocType
Volume
Vision Transformer, attention mechanism, target detection, unmanned aerial vehicle (UAV), YOLOv5
Journal
14
Issue
ISSN
Citations 
19
2072-4292
0
PageRank 
References 
Authors
0.34
0
7
Name
Order
Citations
PageRank
Chunshan Wang111.02
Wei Sun253175.97
Hua-Rui Wu31111.57
J.-C. Zhao413552.42
Guifa Teng501.01
Yingru Yang600.34
Pengfei Du700.34