12th International Conference on Computer and Knowledge Engineering
Robat-e-Beheshti: A Persian Wake Word Detection Dataset for Robotic Purposes
Authors :
Parisa Ahmadzadeh Raji (1), Yasser Shekofteh (2)
1- Faculty of Computer Science and Engineering, Shahid Beheshti University
2- Faculty of Computer Science and Engineering, Shahid Beheshti University
Keywords :
Speech Recognition, Wake Word, Convolutional Neural Network, Dataset, Transfer Learning
Abstract :
This paper describes a dataset collected for a wake word detection project, which is framed as an audio classification task. The data consists of audio files in the Persian language. We collected 5738 audio files in various formats, each at most 3 seconds long, and then converted them to a single format and sample rate. The dataset contains both positive and negative examples. The recordings were made with different audio recorders; most of the data was gathered from 187 individuals, and the remaining files were taken from the open-source ShEMO dataset, a large-scale validated database for Persian speech emotion detection. We describe the data collection process, how the data is organized into files, and the data analysis. The dataset will be freely available for academic use upon official request to the authors.
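The preprocessing step mentioned in the abstract (bringing every clip to one sample rate and a length of at most 3 seconds) could look roughly like the sketch below. It is only an illustration under stated assumptions: the abstract does not name the target sample rate, so 16 kHz is a placeholder; the function names are hypothetical; and linear interpolation stands in for whatever resampler the authors actually used.

```python
def resample_linear(samples, src_rate, dst_rate):
    """Resample a mono signal by linear interpolation.

    A simple stand-in for a proper band-limited resampler.
    """
    if src_rate == dst_rate or not samples:
        return list(samples)
    n_out = int(round(len(samples) * dst_rate / src_rate))
    last = len(samples) - 1
    out = []
    for i in range(n_out):
        pos = i * src_rate / dst_rate      # position in the source signal
        lo = min(int(pos), last)           # left neighbour
        hi = min(lo + 1, last)             # right neighbour (clamped at the end)
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


def normalize_clip(samples, src_rate, dst_rate=16000, max_seconds=3.0):
    """Convert a clip to dst_rate and cut it to at most max_seconds.

    dst_rate=16000 is an assumed value, not taken from the paper.
    """
    resampled = resample_linear(samples, src_rate, dst_rate)
    max_len = int(dst_rate * max_seconds)
    return resampled[:max_len]
```

Each normalized clip would then be paired with a label, for instance 1 for a positive (wake word) example and 0 for a negative one; that encoding is likewise an assumption.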