13th International Conference on Computer and Knowledge Engineering
Towards Efficient Video Object Detection on Embedded Devices
Authors :
Mohammad Hajizadeh (1)
Adel Rahmani (2)
Mohammad Sabokrou (3)
1- School of Computer Engineering, Iran University of Science and Technology
2- School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran
3- School of Computer Science, IPM Institute for Research in Fundamental Sciences
Keywords :
Object Detection, Embedded Device, Deep Neural Networks
Abstract :
Adapting object recognition techniques from still images to video remains an open problem. Methods designed for still images perform poorly on video because of blur, shifting or ambiguous object locations, low frame quality, and similar issues. In addition, the lack of an effective long-term memory in video object detection has yet to be addressed. Consecutive frames in a video usually produce highly similar results, and this redundancy can be exploited to improve performance. Moreover, a series of frames, whether consecutive or not, carries more information than any single frame. We introduce a novel recurrent cell for feature propagation and determine the layer placement that best extends the memory span, which yields higher precision than previously published methods. Hardware constraints make these challenges harder still, so we focus on implementing these techniques on embedded devices and improving their effectiveness there. Our approach achieves 67.5% mAP on the ImageNet VID dataset while running in real time on mobile devices at 62 fps.