Abstract
Automated detection of large animals in road scenes plays a crucial role in enhancing the safety of autonomous vehicles, particularly in regions where wildlife-related accidents are common. This paper introduces a deep learning-based explanation for detecting and classifying ten large animal classes within road scene environments, such as dogs, horses, cows, and bears. A specialized dataset was fetched using selected classes from the COCO and Open Images V5 datasets, annotated in the COCO format. Four advanced object detection models were trained and evaluated with the EfficientDet-D1, RetinaNet R-50-FPN, Faster R-CNN R-50-FPN, and Cascade R-CNN R-50-FPN. Results show that RetinaNet R-50-FPN achieved the highest mean Average Precision (mAP) of 0.83 for one joint class and 0.69 for ten classes while also delivering the fastest inference speed at 50.6 FPS for one-class detection and 45.2 FPS for multi-class detection. EfficientDet-D1 achieved a mAP of 0.89 for one joint class and 0.77 for ten classes, offering competitive performance but with slightly slower inference speeds. The findings highlight RetinaNet as the most effective and efficient model for real-time large animal detection in road scenes, offering significant potential for integration into modern autonomous driving systems.
References
- K. Anjali, D. C. Reddy, B. Prasanthi, S. M. Hussain, L. V. CH, and R. B. Budithi, “A comprehensive survey of technologies for wildlife detection and accident prevention,” in 2025 3rd International Conference on Intelligent Systems, Advanced Computing and Communication (ISACC). IEEE, 2025, pp. 823–830.
- H. L. Majeed, O. A. Hassen, D. A. Farhan, Y. Y. Gromov, K. Sheoran, and G. Dhand, “Computer vision of smile detection
- based on machine and deep learning approach,” Journal of Cybersecurity and Information Management (JCIM) Vol, vol. 16,
- no. 01, pp. 208–230, 2025.
- Madapudi, Rudra Kumar, A. Ananda Rao, and Gopichand Merugu. "Change requests artifacts to assess impact on structural design of SDLC phases." Int’l J. Computer Applications 54.18 (2012): 21-26.
- Reddy, K. Uday Kumar, S. Shabbiha, and M. Rudra Kumar. "Design of high security smart health care monitoring system using IoT." Int. J 8 (2020).
- Seshakagari, Haranadha Reddy Busireddy, et al. "Dynamic Financial Sentiment Analysis and Market Forecasting through Large Language Models." International Journal of Human Computations & Intelligence 4.1 (2025): 397-410.
- S. Beery, A. Agarwal, E. Cole, and V. Birodkar, “The iwildcam 2021 competition dataset,” arXiv preprint arXiv:2105.03494, 2021.
- J. Hong, K. Huang, H.-N. Liang, X. Wang, and R. Zhang, “Fine-grained image classification with object-part model,” in
- Advances in Brain Inspired Cognitive Systems: 10th International Conference, BICS 2019, Guangzhou, China, July 13–14,
- , Proceedings 10. Springer, 2020, pp. 233–243.
- H. Singh, S. Swagatika, R. S. Venkat, and S. Saxena, “Justification of stl-10 dataset using a competent cnn model trained on cifar-10,” in 2019 3rd International conference on Electronics, Communication and Aerospace Technology (ICECA). IEEE,
- , pp. 1254–1257.
- K. Parkavi, A. Ganguly, A. Banerjee, S. Sharma, and K. Kejriwal, “Enhancing road safety: Detection of animals on highways during night,” IEEE Access, 2025.
- S. Sengan, K. Kotecha, I. Vairavasundaram, P. Velayutham, V. Varadarajan, L. Ravi, and S. Vairavasundaram, “Real-time automatic investigation of indian roadway animals by 3d reconstruction detection using deep learning for r-3d-yolov3 image classification and filtering,” Electronics, vol. 10, no. 24, p. 3079, 2021.
- F. Simões, C. Bouveyron, and F. Precioso, “Deepwild: Wildlife identification, localisation and estimation on camera trap videos using deep learning,” Ecological Informatics, vol. 75, p. 102095, 2023.
- D. Sato, A. J. Zanella, and E. J. X. Costa, “Computational classification of animals for a highway detection system,” Brazilian Journal of Veterinary Research and Animal Science, vol. 58, no. esp, pp. 1–10, 2021.
- J. Siddaiyan and K. Poonusamy, “Identify the driving space for vehicle movement by lane line and road object detection
- using deep learning technique.” Journal of New Materials for Electrochemical Systems, vol. 28, no. 1, 2025.
- Y. Zhang, X. Xiong, S. Yang, Q. Zhang, M. Chi, X. Wen, X. Zhang, and J. Wang, “Enhancing the visual environment of
- urban coastal roads through deep learning analysis of street-view images: A perspective of aesthetic and distinctiveness,” PloS one, vol. 20, no. 1, p. e0317585, 2025.
- S. Jain, S. Dash, R. Deorari et al., “Object detection using coco dataset,” in 2022 International Conference on Cyber Resilience (ICCR). IEEE, 2022, pp. 1–4.
- I. Krylov, S. Nosov, and V. Sovrasov, “Open images v5 text annotation and yet another mask text spotter,” in Asian Conference on Machine Learning. PMLR, 2021, pp. 379–389.
- M. Tan, R. Pang, and Q. V. Le, “Efficientdet: Scalable and efficient object detection,” in Proceedings of the IEEE/CVF
- conference on computer vision and pattern recognition, 2020, pp. 10 781–10 790.
- Y. Wang, W. Liu, and W. Xing, “Balanced-retinanet: solving the imbalanced problems in object detection,” Journal of Electronic Imaging, vol. 30, no. 3, pp. 033 009–033 009, 2021.
- S. Ren, K. He, R. Girshick, and J. Sun, “Faster r-cnn: Towards real-time object detection with region proposal networks,” Advances in neural information processing systems, vol. 28, 2015.
- Z. Cai and N. Vasconcelos, “Cascade r-cnn: High quality object detection and instance segmentation,” IEEE transactions on pattern analysis and machine intelligence, vol. 43, no. 5, pp. 1483–1498, 2019.