0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Binary Classification of Capuchin Bird Calls via Spectrogram-Enhanced Frequency-Aware Convolutional Neural Networks
Authors :
Samad Najjar-Ghabel
1
Shamim Yousefi
2
Reza Danandeh Bileh Savar
3
1- Department of Computer Engineering, University of Mohaghegh Ardabili
2- Department of Computer Engineering, University of Mohaghegh Ardabili
3- Department of Computer Engineering, University of Mohaghegh Ardabili
Keywords :
Bioacoustic monitoring،Bird call classification،Capuchinbird detection،Convolutional Neural Network (CNN)،Spectrogram preprocessing
Abstract :
Automated recognition of bird vocalizations plays a critical role in ecological research, particularly in challenging environments. In this paper, we propose a frequency-aware Deep Learning (DL) framework for the binary classification of Capuchinbird vocalizations using a tailored Convolutional Neural Network (CNN) and smart spectrogram preprocessing. The model was trained and evaluated using a curated subset of the Z by HP Unlocked Challenge 3 – Signal Processing dataset, focusing on short audio clips ranging from 2 to 5 seconds. The preprocessing pipeline included duration standardization, zero-padding, and a novel smart cropping method that emphasizes low-frequency energy concentrations relevant to bird calls. Spectrograms were generated using Short-Time Fourier Transform (STFT) and normalized to enhance biologically informative regions. The CNN achieved outstanding performance, with 99% accuracy, 99.5% precision, 98% recall, and a 98.5% F1-score. Visualization tools, along with confusion matrix analysis, confirmed the robustness, generalization, and minimal overfitting of our model. The results demonstrate the effectiveness of our frequency-aware CNN approach for real-world bioacoustic classification tasks. The ability of the framework to reliably detect rare vocalizations under realistic conditions also makes it a valuable tool for scalable wildlife monitoring.
Papers List
List of archived papers
Real-Time Gender Recognition with a Deep Neural Network
Samad Azimi Abriz - Majid Meghdadi
A Graph-based Feature Selection using Class-Feature Association Map (CFAM)
Motahare Akhavan - Seyed Mohammad Hossein Hasheminejad
Deep Learning-Based Malaysian Sign Language (MSL) Recognition: Exploring the Impact of Color Spaces
Ervin Gubin Moung - Precilla Fiona Suwek - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Wei Leong Khong
Damage Detection After the Earthquake Using Sentinel-1 and 2 Images and Machine Learning Algorithms (Case Study: Sarpol-e Zahab Earthquake)
Niloofar Alizadeh - Behnam Asghari Beirami - Mehdi Mokhtarzade
Area-Efficient VLSI Implementation of Bit-Serial Multiplier Using Polynomial Basis over GF(2m)
Saeideh Nabipour - Javad Javidan - Gholamreza Zare Fatin
FedBrain-Distill: Communication-Efficient Federated Brain Tumor Classification Using Ensemble Knowledge Distillation on Non-IID Data
Rasoul Jafari Gohari - Laya Aliahmadipour - Ezat Valipour
DEW-WIN: A Dynamic Energy-aware Window-based Scheduler for Mixed-criticality Systems
Mahin Moradiyan - Yasser Sedaghat - Pouria Hosseini - Yousef Rezazadeh
Segmentation of Coronary Artery Stenosis in X-ray Angiography using Mamba Models
Fatemeh Fouladi - Ali Rostami - Hedieh Sajedi
Brain Age Estimation with Twin Vision Transformer using Hippocampus Information Applicable to Alzheimer Dementia Diagnosis
Zahra Qodrati - Seyedeh Masoumeh Taji - Amirhossein Ghaemi - Habibollah Danyali - Kamran Kazemi - Alireza Ghaemi
Robust Distributed Learning over Heterogeneous Adaptive Networks based on Federated BSP Model
Fatemeh Barani - MohammadHafez Yari - Abdorreza Savadi - Hadi Sadoghi Yazdi
more
Samin Hamayesh - Version 43.7.0