0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
Word-level Persian Lipreading Dataset
Authors :
Javad Peymanfard
1
Ali Lashini
2
Samin Heydarian
3
Hossein Zeinali
4
Nasser Mozayani
5
1- School of Computer Engineering Iran University of Science and Technology, Tehran, Iran
2- School of Computer Engineering Iran University of Science and Technology, Tehran, Iran
3- School of Computer Engineering Iran University of Science and Technology, Tehran, Iran
4- Department of Computer Engineering Amirkabir University of Technology, Tehran, Iran
5- School of Computer Engineering Iran University of Science and Technology, Tehran, Iran
Keywords :
Lip-reading،Persian dataset،audio-visual speech recognition
Abstract :
Lip-reading has made impressive progress in recent years, driven by advances in deep learning. Nonetheless, the prerequisite such advances is a suitable dataset. This paper provides a new in-the-wild dataset for Persian word-level lip reading containing 244,000 videos from approximately 1,800 speakers. We evaluated the state-of-the-art method in this field and used a novel approach for word-level lip reading. In this method, we used the AV-Hubert model for feature extraction and obtained significantly better performance on our dataset.
Papers List
List of archived papers
Improve the utility of tensor cores by compacting sparse matrix technique
Mohammad.S Abazari - Mahsa Zahedi - Abdorreza Savadi
Vaccine Distribution Modelling in Pandemics through Multi-Agent Systems: COVID-19 Case
Hossein Yarahmadi - Mohammad Ebrahim Shiri - Hamid Reza Navidi - Arash Sharifi - Moharram Challenger - Hassan Piriaei
Multi Model CNN Based Gas Meter Characters Recognition
Sanaz Tarhib - Jafar Tanha - Soodabeh Imanzadeh - Sahar Hassanzadeh Mostafaei
PowerLinear Activation Functions with application to the first layer of CNNs
Kamyar Nasiri - Kamaledin Ghiasi-Shirazi
Speech Emotion Recognition Using a Hierarchical Adaptive Weighted Multi-Layer Sparse Auto-Encoder Extreme Learning Machine with New Weighting and Spectral/SpectroTemporal Gabor Filter Bank Features
Fatemeh Daneshfar - Seyed Jahanshah Kabudian
SCDS: A Secure Clustering Protocol Using Dempster-Shafer Theory for VANET in Smart City
Hoda Mosadegh - Nazbanoo Farzaneh
Multi-Layer Collaborative Graph with BPR Similarity Embedding for Recommender System
Mostafa Ghorbani - Azadeh Mansouri
Joint mobility-aware offloading and UAV position optimization in Blockchain-enabled 5G
Zeinab Rabbani - Zeinab Movahedi
Word-level Persian Lipreading Dataset
Javad Peymanfard - Ali Lashini - Samin Heydarian - Hossein Zeinali - Nasser Mozayani
Area-Efficient VLSI Implementation of Bit-Serial Multiplier Using Polynomial Basis over GF(2m)
Saeideh Nabipour - Javad Javidan - Gholamreza Zare Fatin
more
Samin Hamayesh - Version 41.7.6