0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Binary Classification of Capuchin Bird Calls via Spectrogram-Enhanced Frequency-Aware Convolutional Neural Networks
Authors :
Samad Najjar-Ghabel
1
Shamim Yousefi
2
Reza Danandeh Bileh Savar
3
1- Department of Computer Engineering, University of Mohaghegh Ardabili
2- Department of Computer Engineering, University of Mohaghegh Ardabili
3- Department of Computer Engineering, University of Mohaghegh Ardabili
Keywords :
Bioacoustic monitoring،Bird call classification،Capuchinbird detection،Convolutional Neural Network (CNN)،Spectrogram preprocessing
Abstract :
Automated recognition of bird vocalizations plays a critical role in ecological research, particularly in challenging environments. In this paper, we propose a frequency-aware Deep Learning (DL) framework for the binary classification of Capuchinbird vocalizations using a tailored Convolutional Neural Network (CNN) and smart spectrogram preprocessing. The model was trained and evaluated using a curated subset of the Z by HP Unlocked Challenge 3 – Signal Processing dataset, focusing on short audio clips ranging from 2 to 5 seconds. The preprocessing pipeline included duration standardization, zero-padding, and a novel smart cropping method that emphasizes low-frequency energy concentrations relevant to bird calls. Spectrograms were generated using Short-Time Fourier Transform (STFT) and normalized to enhance biologically informative regions. The CNN achieved outstanding performance, with 99% accuracy, 99.5% precision, 98% recall, and a 98.5% F1-score. Visualization tools, along with confusion matrix analysis, confirmed the robustness, generalization, and minimal overfitting of our model. The results demonstrate the effectiveness of our frequency-aware CNN approach for real-world bioacoustic classification tasks. The ability of the framework to reliably detect rare vocalizations under realistic conditions also makes it a valuable tool for scalable wildlife monitoring.
Papers List
List of archived papers
FarCQA: A Farsi Community Dataset for Question Classification and Answer Selection
Saba Emami - Maedeh Mosharraf
REMA: Reinforced Exponential Moving Average for Real-Time Anomaly Detection in Sensor Data
Mohammad Hossein Jafari Naeimi - Ali Norouzi - Athena Abdi
Efficient Sub-Carrier Relationship Extraction for Human Activity Recognition via EEGNet in Wireless Sensing
Siavash Zaravashan - Sadegh ArefiZadeh - Sajjad Torabi
Lightweight Local Transformer for COVID-19 Detection Using Chest CT Scans
Hojat Asgarian Dehkordi - Hossein Kashiani - Amir Abbas Hamidi Imani - Shahriar Baradaran Shokouhi
DIPT: Diversified Personalized Transformer for QAC systems
Mahdi Dehghani - Samira Vaez Barenji - Saeed Farzi
Synthetic Trajectory Sharing Indoors under Privacy Constraints
Mahdi Soltanpour - Vahideh Moghtadaiee - Mina Alishahi
Implementation of a Low-Overhead 2-Bit Parity-Preserving Reversible Vedic Multiplier for Quantum Architectures
Shekoofeh Moghimi - Negin Mashayekhi - Mohammad Reza Reshadinezhad
Trust Management Enhancement for the Internet of Things: a Smart Contract Approach
Amin Rouzbahani - Fattaneh Taghiyareh
Lempel-Ziv-based Hyper-Heuristic Solution for Longest Common Subsequence Problem
Mahdi Nasrollahi - Reza Shami Tanha - Mohsen Hooshmand
XAI for Transparent Autonomous Vehicles: A New Approach to Understanding Decision-Making in Self-driving Cars
Maryam Sadat Hosseini Azad - Amir Abbas Hamidi Imani - Shahriar Baradaran Shokouhi
more
Samin Hamayesh - Version 43.7.0