14th International Conference on Computer and Knowledge Engineering
A Novel Approach for Image-Text Matching Cross-Modal Space Learning
Authors :
Amirreza Ebrahimi (1), Mohammad Javad Parseh (2), Pejman Rasti (3)
1- Jahrom University
2- Jahrom University
3- Université d’Angers
Keywords :
Image-text matching, Computer Vision, NLP, Visual-semantic embedding
Abstract :
Image-text matching, an important problem in image processing and AI, involves computing the similarity between a natural-language sentence and an image in a unified space where the two can be compared. Traditional techniques often struggle to bridge the inherent gap between the visual and textual modalities, leading to suboptimal performance. Our approach addresses this challenge by employing matrix operations that directly handle the distinct characteristics of visual and textual data. The method improves the speed and accuracy of the matching process, reduces computational complexity, and eliminates the need for additional resources. Experimental results demonstrate significant improvements in matching precision and processing time, underscoring the potential of our method to advance the state of the art in image-text matching. This research contributes to the broader field of multimodal AI, paving the way for more integrated systems capable of understanding and interpreting complex visual and textual information.
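As a rough illustration of what matching in a unified space means, the sketch below projects image and sentence feature vectors into a shared space and scores each pair by cosine similarity. It is a generic visual-semantic embedding baseline, not the method proposed in the paper: the abstract does not specify the matrix operations used, so the projection matrices `w_img` and `w_txt`, the toy feature vectors, and all function names here are hypothetical.

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return list(vec)
    return [x / norm for x in vec]

def project(matrix, vec):
    """Multiply a (d_out x d_in) matrix by a d_in vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]

def similarity_matrix(image_feats, text_feats, w_img, w_txt):
    """Cosine similarity between every image and every sentence
    after projecting both modalities into the same space."""
    img_emb = [l2_normalize(project(w_img, f)) for f in image_feats]
    txt_emb = [l2_normalize(project(w_txt, f)) for f in text_feats]
    return [[sum(a * b for a, b in zip(i, t)) for t in txt_emb]
            for i in img_emb]

def retrieve(sim_row):
    """Index of the best-matching sentence for one image."""
    return max(range(len(sim_row)), key=lambda j: sim_row[j])

# Hypothetical toy data: 3-d image features, 2-d text features, 2-d shared space.
w_img = [[1.0, 0.0, 0.0],
         [0.0, 1.0, 0.0]]
w_txt = [[1.0, 0.0],
         [0.0, 1.0]]
image_feats = [[1.0, 0.0, 0.5], [0.0, 2.0, 0.1]]
text_feats = [[0.9, 0.1], [0.2, 0.8]]

sims = similarity_matrix(image_feats, text_feats, w_img, w_txt)
best = [retrieve(row) for row in sims]
print(best)
```

In a trained system the two projection matrices would be learned so that matching image-sentence pairs score higher than mismatched ones; here they are fixed by hand purely to keep the example self-contained.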