0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Binary Classification of Capuchin Bird Calls via Spectrogram-Enhanced Frequency-Aware Convolutional Neural Networks
Authors :
Samad Najjar-Ghabel
1
Shamim Yousefi
2
Reza Danandeh Bileh Savar
3
1- Department of Computer Engineering, University of Mohaghegh Ardabili
2- Department of Computer Engineering, University of Mohaghegh Ardabili
3- Department of Computer Engineering, University of Mohaghegh Ardabili
Keywords :
Bioacoustic monitoring،Bird call classification،Capuchinbird detection،Convolutional Neural Network (CNN)،Spectrogram preprocessing
Abstract :
Automated recognition of bird vocalizations plays a critical role in ecological research, particularly in challenging environments. In this paper, we propose a frequency-aware Deep Learning (DL) framework for the binary classification of Capuchinbird vocalizations using a tailored Convolutional Neural Network (CNN) and smart spectrogram preprocessing. The model was trained and evaluated using a curated subset of the Z by HP Unlocked Challenge 3 – Signal Processing dataset, focusing on short audio clips ranging from 2 to 5 seconds. The preprocessing pipeline included duration standardization, zero-padding, and a novel smart cropping method that emphasizes low-frequency energy concentrations relevant to bird calls. Spectrograms were generated using Short-Time Fourier Transform (STFT) and normalized to enhance biologically informative regions. The CNN achieved outstanding performance, with 99% accuracy, 99.5% precision, 98% recall, and a 98.5% F1-score. Visualization tools, along with confusion matrix analysis, confirmed the robustness, generalization, and minimal overfitting of our model. The results demonstrate the effectiveness of our frequency-aware CNN approach for real-world bioacoustic classification tasks. The ability of the framework to reliably detect rare vocalizations under realistic conditions also makes it a valuable tool for scalable wildlife monitoring.
Papers List
List of archived papers
Advancing Brain Tumor Detection via ViRCNN: A Fusion of Vision Transformers and Faster R-CNN
Mehrshad Momen-Tayefeh - S. AmirAli GH. Ghahramani - Ali Mohammad Afshin Hemmatyar
Deep Learning-Driven Beamforming Optimization for High-Performance 5G Planar Antenna Arrays
Rahman Mohammadi - Seyed Reza Razavi Pour
Optimization of quantum secret sharing communication using corresponding bits
Mahsa Khorrampanah - Mohammad Bolokian - Monireh Houshmand
Camouflage Object Segmentation with Attention-Guided Pix2Pix and Boundary Awareness
Erfan Akbarnezhad Sany - Fatemeh Naserizadeh - Parsa Sinichi - Seyyed Abed Hosseini
Classification of benign and malignant tumors in Digital Breast Tomosynthesis images using Radiomic-based methods
Farangis Sajadi moghadam - Saeid Rashidi
Evaluating the Impact of Traveling on COVID-19 Prevalence and Predicting the New Confirmed Cases According to the Travel Rate Using Machine Learning: A Case Study in Iran
Anita Ghandehari - Soheil Shirvani - Hadi Moradi
Age Estimation Based on Facial Images Using Hybrid Features and Particle Swarm Optimization
NILOUFAR MEHRABI - SAYED PEDRAM HAERI BOROUJENI
Blind image quality assessment based on Multi-resolution Local Structures
Seyed Majid Khorashadizadeh - Mehdi Sadeghi Bakhi - Fatemeh Seifishahpar - AliMohammad Latif
A New Time Series Approach in Churn Prediction with Discriminatory Intervals
Hedieh Ahmadi - Seyed Mohammad Hossein Hasheminejad
Real-Time Forecasting Using Mixed Frequency Time-Series Data
Armin Khayati - Mohammad Taheri - Koorush Ziarati
more
Samin Hamayesh - Version 43.7.0