0% Complete
Home
/
14th International Conference on Computer and Knowledge Engineering
A Novel Approach for Image-Text Matching Cross-Modal Space Learning
Authors :
Amirreza Ebrahimi
1
Mohammad Javad Parseh
2
Pejman Rasti
3
1- jahrom university
2- jahrom university
3- Universite d’Angers,
Keywords :
Image-text matching،Computer Vision،NLP،Visual-semantic embedding
Abstract :
Image-text matching, a crucial area of study in image processing and AI, involves computing the similarity between a natural language sentence and an image to create a unified space for comparison. Traditional techniques often struggle to bridge the inherent gap between visual and verbal communication, leading to suboptimal performance. Our approach addresses this challenge by employing advanced matrix operations that directly handle the distinct characteristics of visual and textual data. This innovative method enhances the speed and accuracy of the matching process, reduces computational complexity, and eliminates the need for additional resources. Experimental results demonstrate significant improvements in matching precision and processing time, underscoring the potential of our method to advance the state-of-the-art in image-text matching. This research contributes to the broader field of multimodal AI, paving the way for more integrated and sophisticated systems capable of understanding and interpreting complex visual and textual information. These findings highlight the transformative potential of our approach in advancing the field of image-text matching.
Papers List
List of archived papers
ROCT-Net: A new ensemble deep convolutional model with improved spatial resolution learning for detecting common diseases from retinal OCT images
Mohammad Rahimzadeh - Mahmoud Reza Mohammadi
Improving the classification of high dimensional class-imbalanced data using the Chaos particle swarm optimization with Levy Flight
Mohammad Ali Zarif - Javad Hamidzadeh
Farsi Text in Scene: A new dataset
Ali Salmasi - Ehsanollah Kabir
Supervised Contrastive Learning for Short Text Classification in Natural Language Processing
Mitra Esmaeili - Hamed Vahdat nejad
The process of multi class fake news dataset generation
Sajjad Rezaei - Mohsen Kahani - Behshid Behkamal
A Deep Reinforcement Learning Approach Combining Technical and Fundamental Analyses with a Large Language Model for Stock Trading
Mahan Veisi - Sadra Berangi - Mahdi Shahbazi Khojasteh - Armin Salimi-Badr
ExaASC: A General Target-Based Stance Detection Corpus in Arabic Language
Mohammad Mehdi Jaziriyan - Ahmad Akbari - Hamed Karbasi
AvashoG2P: A multi-module G2P Converter for Persian
Ali Moghadaszadeh - Fatemeh Pasban - Mohsen Mahmoudzadeh - Maryam Vatanparast - Amirmohammad Salehoof
Financial Market Prediction Using Deep Neural Networks with Hardware Acceleration
Dara Rahmati - Mohammad Hadi Foroughi - Ali Bagherzadeh - Mehdi Foroughi - Saeid Gorgin
Non-Functional Requirement Extracting Methods for AI-based Systems: A Survey
Reza Damirchi - Amineh Amini
more
Samin Hamayesh - Version 42.2.1