12th International Conference on Computer and Knowledge Engineering
Word-level Persian Lipreading Dataset
Authors :
Javad Peymanfard (1)
Ali Lashini (2)
Samin Heydarian (3)
Hossein Zeinali (4)
Nasser Mozayani (5)
1- School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran
2- School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran
3- School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran
4- Department of Computer Engineering, Amirkabir University of Technology, Tehran, Iran
5- School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran
Keywords :
Lip-reading, Persian dataset, audio-visual speech recognition
Abstract :
Lip-reading has made impressive progress in recent years, driven by advances in deep learning. Nonetheless, a prerequisite for such advances is a suitable dataset. This paper introduces a new in-the-wild dataset for Persian word-level lip-reading, containing 244,000 videos from approximately 1,800 speakers. We evaluated the state-of-the-art method in this field on the dataset and also applied a novel approach to word-level lip-reading: using the AV-HuBERT model for feature extraction, we obtained significantly better performance on our dataset.