0% Complete
Home
/
12th International Conference on Computer and Knowledge Engineering
Word-level Persian Lipreading Dataset
Authors :
Javad Peymanfard
1
Ali Lashini
2
Samin Heydarian
3
Hossein Zeinali
4
Nasser Mozayani
5
1- School of Computer Engineering Iran University of Science and Technology, Tehran, Iran
2- School of Computer Engineering Iran University of Science and Technology, Tehran, Iran
3- School of Computer Engineering Iran University of Science and Technology, Tehran, Iran
4- Department of Computer Engineering Amirkabir University of Technology, Tehran, Iran
5- School of Computer Engineering Iran University of Science and Technology, Tehran, Iran
Keywords :
Lip-reading،Persian dataset،audio-visual speech recognition
Abstract :
Lip-reading has made impressive progress in recent years, driven by advances in deep learning. Nonetheless, the prerequisite such advances is a suitable dataset. This paper provides a new in-the-wild dataset for Persian word-level lip reading containing 244,000 videos from approximately 1,800 speakers. We evaluated the state-of-the-art method in this field and used a novel approach for word-level lip reading. In this method, we used the AV-Hubert model for feature extraction and obtained significantly better performance on our dataset.
Papers List
List of archived papers
Data-Optimized Dry Rock Property Prediction Using Ensemble and Kernel-Based ML Methods
Esmael Makarian - Hassanreza Ghasemitabar - Alireza Behinrad - Mahdi Fathi - Andisheh Alimoradi - Ayub Elyasi
Efficient Vision Transformer for Accurate Traffic Sign Detection
Javad Mirzapour Kaleybar - Hooman Khaloo - Avaz Naghipour
Multi-Layer Collaborative Graph with BPR Similarity Embedding for Recommender System
Mostafa Ghorbani - Azadeh Mansouri
A New Application of Machine Learning Based Methods for Disk Space Variation Fault Diagnosis in Transformer Windings
Reza Behkam - Amir Lotfi - Gevork B. Gharehpetian
Enhanced Hate Speech Detection Using Focal Loss and Multi-Head Attention for Imbalanced Social Media Text
Ali Rezazadeh - Hadi Shahriar Shahhoseini
An Adaptive Budget and Deadline-aware Algorithm for Scheduling Workflows Ensemble in IaaS Clouds
Negin Shafinezhad - Hamid Abrishami - Saeid Abrishami
Multi-Layered Defense Against Modern Phishing: A Dual-Sandbox and CDR Approach
Mahdi Seyfipoor - Mohammad Mahdi Eskandari
FinTNet: From Tweets to Trades
Dorsa Tavakoli - Saman Haratizadeh
Segmentation of Hard Exudates in Retinal Fundus Images Using BCDU-Net
Nafise Ameri - Nasser Shoeibi - Mojtaba Abrishami
Minimizing Quantum Overhead: A Fault-Tolerant ALU Design with Reduced T Metrics
Sarallah Keshavarz - Shekoofeh Moghimi - Mohammad Reza Reshadinezhad
more
Samin Hamayesh - Version 43.7.0