0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Binary Classification of Capuchin Bird Calls via Spectrogram-Enhanced Frequency-Aware Convolutional Neural Networks
Authors :
Samad Najjar-Ghabel
1
Shamim Yousefi
2
Reza Danandeh Bileh Savar
3
1- Department of Computer Engineering, University of Mohaghegh Ardabili
2- Department of Computer Engineering, University of Mohaghegh Ardabili
3- Department of Computer Engineering, University of Mohaghegh Ardabili
Keywords :
Bioacoustic monitoring،Bird call classification،Capuchinbird detection،Convolutional Neural Network (CNN)،Spectrogram preprocessing
Abstract :
Automated recognition of bird vocalizations plays a critical role in ecological research, particularly in challenging environments. In this paper, we propose a frequency-aware Deep Learning (DL) framework for the binary classification of Capuchinbird vocalizations using a tailored Convolutional Neural Network (CNN) and smart spectrogram preprocessing. The model was trained and evaluated using a curated subset of the Z by HP Unlocked Challenge 3 – Signal Processing dataset, focusing on short audio clips ranging from 2 to 5 seconds. The preprocessing pipeline included duration standardization, zero-padding, and a novel smart cropping method that emphasizes low-frequency energy concentrations relevant to bird calls. Spectrograms were generated using Short-Time Fourier Transform (STFT) and normalized to enhance biologically informative regions. The CNN achieved outstanding performance, with 99% accuracy, 99.5% precision, 98% recall, and a 98.5% F1-score. Visualization tools, along with confusion matrix analysis, confirmed the robustness, generalization, and minimal overfitting of our model. The results demonstrate the effectiveness of our frequency-aware CNN approach for real-world bioacoustic classification tasks. The ability of the framework to reliably detect rare vocalizations under realistic conditions also makes it a valuable tool for scalable wildlife monitoring.
Papers List
List of archived papers
Advancing Brain Tumor Detection via ViRCNN: A Fusion of Vision Transformers and Faster R-CNN
Mehrshad Momen-Tayefeh - S. AmirAli GH. Ghahramani - Ali Mohammad Afshin Hemmatyar
Persis: A Persian Font Recognition Pipeline Using Convolutional Neural Networks
Mehrdad Mohammadian - Neda Maleki - Tobias Olsson - Fredrik Ahlgren
Crack Segmentation in Civil Structure Images Using a Deep Learning Based Multi-Classifier System
Mohammadreza Asadi - Seyedeh Sogand Hashemi - Mohammad Taghi Sadeghi
Speech Emotion Recognition Using a Hierarchical Adaptive Weighted Multi-Layer Sparse Auto-Encoder Extreme Learning Machine with New Weighting and Spectral/SpectroTemporal Gabor Filter Bank Features
Fatemeh Daneshfar - Seyed Jahanshah Kabudian
A Cost-Sensitive Genetic Algorithm for Customer Segmentation in Auto Insurances
Alireza Khajenoori - Mohammad Saniee Abadeh - Mohsen Mohammadzadeh
Bridging the Synthetic-to-Real Gap (BSRG): Creating Simulated Datasets for Domain Adaptation to Enhance Vehicle Detection
Behnaz Sadeghigol - Mohammad Ali Keyvanrad
Automatic Detection and Risk Assessment of Session Management Vulnerabilities in Web Applications
Nasrin Garmabi - Mohammad Ali Hadavi
An Ensemble CNN for Brain Age Estimation based on Hippocampal Region Applicable to Alzheimer's Diagnosis
Zahra Qodrati - Seyedeh Masoumeh Taji - Habibollah Danyali - Kamran Kazemi
Community-Based QoE Enhancement for User-Generated Content Live Streaming
Reza Saeedinia - S.Omid Fatemi - Daniele Lorenzi - Farzad Tashtarian - Christian Timmerer
Security Analysis of MiniApps: Vulnerabilities, Exploits, and a Tailored Mitigation Framework
Keyhan Mohammadi - Arman Moradi - Reza Ebrahimi Atani
more
Samin Hamayesh - Version 43.7.0