12th International Conference on Computer and Knowledge Engineering
Word-level Persian Lipreading Dataset
Authors :
Javad Peymanfard (1)
Ali Lashini (2)
Samin Heydarian (3)
Hossein Zeinali (4)
Nasser Mozayani (5)
1- School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran
2- School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran
3- School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran
4- Department of Computer Engineering, Amirkabir University of Technology, Tehran, Iran
5- School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran
Keywords :
Lip-reading, Persian dataset, audio-visual speech recognition
Abstract :
Lip-reading has made impressive progress in recent years, driven by advances in deep learning. Nonetheless, a prerequisite for such advances is a suitable dataset. This paper presents a new in-the-wild dataset for Persian word-level lip-reading, containing 244,000 videos from approximately 1,800 speakers. We evaluated the state-of-the-art method in this field and also applied a novel approach to word-level lip-reading: using the AV-HuBERT model for feature extraction, we obtained significantly better performance on our dataset.
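
The general pattern the abstract describes, classifying a word from features produced by a pretrained encoder, can be sketched roughly as follows. The abstract does not say how the features are classified, so this is only an illustration under stated assumptions: random vectors stand in for AV-HuBERT frame features, and the mean pooling, the linear softmax head, the sizes, and every function name here are hypothetical rather than the authors' method.

```python
import math
import random


def mean_pool(frames):
    """Average T frame vectors (each of length D) into one length-D clip vector."""
    num_frames = len(frames)
    return [sum(column) / num_frames for column in zip(*frames)]


def softmax(logits):
    """Convert logits to probabilities, subtracting the max for numerical stability."""
    peak = max(logits)
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify_word(frames, weights, bias):
    """Score one clip against every word class.

    frames:  T x D frame-level features (assumed to come from a pretrained encoder)
    weights: num_words x D linear head (hypothetical, would be trained on the dataset)
    bias:    num_words offsets
    Returns (predicted class index, class probabilities).
    """
    clip = mean_pool(frames)
    logits = [
        sum(w * x for w, x in zip(row, clip)) + b
        for row, b in zip(weights, bias)
    ]
    probs = softmax(logits)
    predicted = max(range(len(probs)), key=probs.__getitem__)
    return predicted, probs


# Placeholder sizes, chosen only for the demo (not taken from the paper).
NUM_FRAMES, FEAT_DIM, NUM_WORDS = 29, 16, 5

rng = random.Random(0)
frames = [[rng.gauss(0.0, 1.0) for _ in range(FEAT_DIM)] for _ in range(NUM_FRAMES)]
weights = [[rng.gauss(0.0, 0.1) for _ in range(FEAT_DIM)] for _ in range(NUM_WORDS)]
bias = [0.0] * NUM_WORDS

predicted, probs = classify_word(frames, weights, bias)
print(predicted, [round(p, 3) for p in probs])
```

In a real pipeline the random `frames` would be replaced by features extracted from the video clips, and `weights` and `bias` would be learned from the labeled word classes.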