Paper
12 September 2024 CA-P2PNet: improve P2PNet counting and localization with confidence aggregation
Qi Zhang, Yuan Li, Yanzhao Zhou, Wei Sun, Lichuan Xiang, Jianbin Jiao
Author Affiliations +
Proceedings Volume 13256, Fourth International Conference on Computer Vision and Pattern Analysis (ICCPA 2024); 132560P (2024) https://doi.org/10.1117/12.3038082
Event: Fourth International Conference on Computer Vision and Pattern Analysis (ICCPA 2024), 2024, Anshan, China
Abstract
Crowd counting and localization are two crucial tasks that provide technical support for crowd analysis. In recent years, P2PNet has emerged as a mile-stone work in this field, presenting an end-to-end framework that combines these two tasks and exhibits strong performance. However, it has been observed that solely utilizing CNN to predict the category of reference points neglects the influence of the surrounding environment. To address this issue, we model the reference points as a graph, with each reference point connected to other reference points within a neighborhood range. We employ GCN to aggregate the confidence of reference points, thus incorporating important contextual information. Our method is straightforward to implement, requiring only a slight increase in model parameters, and it is plug-and-play, allowing for easy integration into other P2PNet-like methods. Additionally, in order to assess the localization performance more precisely, we devise a new metric called Normalized Mean Offset(NMO). Our method, namely CA-P2PNet, is evaluated on multiple public datasets. The results consistently surpass other baselines, thus the State-of-the-Art(SOTA) performance of our model is demonstrated.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Qi Zhang, Yuan Li, Yanzhao Zhou, Wei Sun, Lichuan Xiang, and Jianbin Jiao "CA-P2PNet: improve P2PNet counting and localization with confidence aggregation", Proc. SPIE 13256, Fourth International Conference on Computer Vision and Pattern Analysis (ICCPA 2024), 132560P (12 September 2024); https://doi.org/10.1117/12.3038082
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Performance modeling

Object detection

Matrices

Data modeling

Education and training

Visualization

Ablation

Back to Top