0% Complete
Home
/
14th International Conference on Computer and Knowledge Engineering
A Novel Approach for Image-Text Matching Cross-Modal Space Learning
Authors :
Amirreza Ebrahimi
1
Mohammad Javad Parseh
2
Pejman Rasti
3
1- jahrom university
2- jahrom university
3- Universite d’Angers,
Keywords :
Image-text matching،Computer Vision،NLP،Visual-semantic embedding
Abstract :
Image-text matching, a crucial area of study in image processing and AI, involves computing the similarity between a natural language sentence and an image to create a unified space for comparison. Traditional techniques often struggle to bridge the inherent gap between visual and verbal communication, leading to suboptimal performance. Our approach addresses this challenge by employing advanced matrix operations that directly handle the distinct characteristics of visual and textual data. This innovative method enhances the speed and accuracy of the matching process, reduces computational complexity, and eliminates the need for additional resources. Experimental results demonstrate significant improvements in matching precision and processing time, underscoring the potential of our method to advance the state-of-the-art in image-text matching. This research contributes to the broader field of multimodal AI, paving the way for more integrated and sophisticated systems capable of understanding and interpreting complex visual and textual information. These findings highlight the transformative potential of our approach in advancing the field of image-text matching.
Papers List
List of archived papers
Financial Market Prediction Using Deep Neural Networks with Hardware Acceleration
Dara Rahmati - Mohammad Hadi Foroughi - Ali Bagherzadeh - Mehdi Foroughi - Saeid Gorgin
Virtual Network Embedding based on Univariate Distribution Estimation
Arezoo Jahani
Fast and Accurate Motif Discovery in Protein Sequences Using Parallel Processing with OpenMP
Rahele Mohammadi - Mahmoud Naghibzadeh - Abdorreza Savadi
TriMAE: Fashion visual search with Triplet Masked Auto Encoder Vision Transformer
Lachin Zamani - Reza Azmi
Analysis of Insect-plant Interactions Affected by Mining operations, A Graph Mining Approach
Mohammad Heydari - Ali Bayat - Amir Albadvi
Adaptive Channel Estimation for MIMO-OFDM Systems in Impulsive Noise Environments
Mojtaba Hajiabadi
Android Malware Detection using Supervised Deep Graph Representation Learning
Fatemeh Deldar - Mahdi Abadi - Mohammad Ebrahimifard
SUT: a new multi-purpose synthetic dataset for Farsi document image analysis
Elham Shabaninia - Fatemeh sadat Eslami - Ali Afkari Fahandari - Hossein Nezamabadi-pour
Improve the utility of tensor cores by compacting sparse matrix technique
Mohammad.S Abazari - Mahsa Zahedi - Abdorreza Savadi
Towards Efficient Video Object Detection on Embedded Devices
Mohammad Hajizadeh - Adel Rahmani - Mohammad Sabokrou
more
Samin Hamayesh - Version 41.7.6