LCDnet: a lightweight crowd density estimation model for real-time video surveillance
Automatic crowd counting using density estimation has gained significant attention in computer vision research. As a result, a large number of crowd counting and density estimation models using convolution neural networks (CNN) have been published in the last few years. These models have achieved good accuracy over benchmark datasets. However, attempts to improve the accuracy often lead to higher complexity in these models. In real-time video surveillance applications using drones with limited computing resources, deep models incur intolerable higher inference delay. In this paper, we propose (i) a Lightweight Crowd Density estimation model (LCDnet) for real-time video surveillance, and (ii) an improved training method using curriculum learning (CL). LCDnet is trained using CL and evaluated over two benchmark datasets i.e., DroneRGBT and CARPK. Results are compared with existing crowd models. Our evaluation shows that the LCDnet achieves a reasonably good accuracy while significantly reducing the inference time and memory requirement and thus can be deployed over edge devices with very limited computing resources.
Other Information
Published in: Journal of Real-Time Image Processing
License: https://creativecommons.org/licenses/by/4.0
See article on publisher's website: https://dx.doi.org/10.1007/s11554-023-01286-8
Funding
Open Access funding provided by the Qatar National Library.
History
Language
- English
Publisher
Springer NaturePublication Year
- 2023
License statement
This Item is licensed under the Creative Commons Attribution 4.0 International License.Institution affiliated with
- Qatar University
- College of Engineering - QU
- Qatar Mobility Innovations Center