0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
Robat-e-Beheshti: A Persian Wake Word Detection Dataset for Robotic Purposes
Authors :
Parisa Ahmadzadeh Raji
1
Yasser Shekofteh
2
1- Faculty of Computer Science and Engineering, Shahid Beheshti University
2- Faculty of Computer Science and Engineering, Shahid Beheshti University
Keywords :
Speech Recognition،Wake Word،Convolutional Neural Network،Dataset،Transfer Learning
Abstract :
In this paper we explain a dataset that we collected for a wake word detection project which works on classifying audio data. The data is audio files in persian language. We collected 5738 audio files with different formats and a maximum length of 3 seconds. Then we changed the formats and their sample rates to the specific one. We have positive and negative examples for our dataset. Different audio recorders were used for recording the audios and most of the data gathered from 187 individuals and the other files collected from open source ShEMO dataset: a large-scale validated database for Persian speech emotion detection. In this paper we explain the process of collecting data, how they are organized in different files and also we explain the data analysis. This dataset will be free for academic usage with official request to us.
Papers List
List of archived papers
LPCNet: Lane detection by lane points correction network in challenging environments based on deep learning
Sina BaniasadAzad - Seyed Mohammadreza Mousavi mirkolaei
Area-Efficient VLSI Implementation of Bit-Serial Multiplier Using Polynomial Basis over GF(2m)
Saeideh Nabipour - Javad Javidan - Gholamreza Zare Fatin
Deep Deterministic Policy Gradient in Acoustic To Articulatory inversion
Farzane Abdoli - Hamid Sheikhzade - Vahid Pourahmadi
Improve the utility of tensor cores by compacting sparse matrix technique
Mohammad.S Abazari - Mahsa Zahedi - Abdorreza Savadi
Histopathology Image-Based Cancer Classification Utilizing Transfer Learning Approach
Amir Meydani - Alireza Meidani - Ali Ramezani - Maryam Shabani - Mohammad Mehdi Kazeminasab - Shahriar Shahablavasani
A Graph-based Feature Selection using Class-Feature Association Map (CFAM)
Motahare Akhavan - Seyed Mohammad Hossein Hasheminejad
Joint ADC-less Analog Demodulator and Decoder for Extended Binary (8, 4, 4) Hamming Channel Code
Mir Mahdi Safari - Jafar Pourrostam - Behzad Mozaffari Tazehkand
Attention-Boosted Ensemble of Pre-trained Convolutional Neural Networks for Accurate Diabetic Retinopathy Detection
Benyamin Mirab Golkhatmi - Mohammad Hossein Moattar
Fine-tuned Generative Adversarial Network-based Model for Medical Image Super-Resolution
Alireza Aghelan - Modjtaba Rouhani
Automatic Generation of XACML Code using Model-Driven Approach
Athareh Fatemian - Bahman Zamani - Marzieh Masoumi - Mehran Kamranpour - Behrouz Tork Ladani - Shekoufeh Kolahdouz Rahimi
more
Samin Hamayesh - Version 41.7.6