12th International Conference on Computer and Knowledge Engineering
Robat-e-Beheshti: A Persian Wake Word Detection Dataset for Robotic Purposes
Authors :
Parisa Ahmadzadeh Raji (1)
Yasser Shekofteh (2)
1- Faculty of Computer Science and Engineering, Shahid Beheshti University
2- Faculty of Computer Science and Engineering, Shahid Beheshti University
Keywords :
Speech Recognition, Wake Word, Convolutional Neural Network, Dataset, Transfer Learning
Abstract :
This paper describes a dataset collected for a wake word detection project that classifies audio data. The data consists of audio files in the Persian language. We collected 5738 audio files in various formats, each with a maximum length of 3 seconds, and then converted them to a single format and sample rate. The dataset contains both positive and negative examples. The audio was recorded with a variety of recording devices; most of the data was gathered from 187 individuals, and the remaining files were taken from the open-source ShEMO dataset, a large-scale validated database for Persian speech emotion detection. We explain the data collection process, how the data is organized into files, and the data analysis. The dataset will be freely available for academic use upon official request to the authors.