14th International Conference on Computer and Knowledge Engineering
A Novel Approach for Image-Text Matching Cross-Modal Space Learning
Authors :
Amirreza Ebrahimi 1
Mohammad Javad Parseh 2
Pejman Rasti 3
1- Jahrom University
2- Jahrom University
3- Université d'Angers
Keywords :
Image-text matching, Computer Vision, NLP, Visual-semantic embedding
Abstract :
Image-text matching, a crucial area of study in image processing and AI, involves computing the similarity between a natural language sentence and an image by mapping both into a unified space for comparison. Traditional techniques often struggle to bridge the inherent gap between the visual and textual modalities, leading to suboptimal performance. Our approach addresses this challenge by employing advanced matrix operations that directly handle the distinct characteristics of visual and textual data. This method improves the speed and accuracy of the matching process, reduces computational complexity, and eliminates the need for additional resources. Experimental results demonstrate significant improvements in matching precision and processing time, underscoring the potential of our method to advance the state of the art in image-text matching. This research contributes to the broader field of multimodal AI, paving the way for more integrated and sophisticated systems capable of understanding and interpreting complex visual and textual information.
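To make the notion of a unified comparison space concrete, the following is a minimal sketch of generic visual-semantic matching by cosine similarity. It is not the method proposed in this paper, whose matrix operations are not detailed in the abstract. The function names and the toy embeddings are hypothetical and stand in for the outputs of image and text encoders that are assumed to already share one space.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def similarity_matrix(image_embs, text_embs):
    """Cosine similarity between every image and every sentence embedding.

    Both inputs are lists of equal-length vectors that are assumed to
    already live in the same shared space (hypothetical encoder outputs).
    """
    imgs = [l2_normalize(v) for v in image_embs]
    txts = [l2_normalize(v) for v in text_embs]
    return [[sum(a * b for a, b in zip(i, t)) for t in txts] for i in imgs]


def retrieve(sim_row):
    """Index of the best-matching item for one row of the matrix."""
    return max(range(len(sim_row)), key=lambda j: sim_row[j])


# Toy 3-dimensional embeddings, invented for illustration only.
image_embs = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0]]
text_embs = [[0.0, 3.0, 0.0], [0.5, 0.0, 0.0]]

sims = similarity_matrix(image_embs, text_embs)
best_caption_per_image = [retrieve(row) for row in sims]
```

Each row of `sims` scores one image against all sentences, so taking the highest-scoring column per row gives image-to-text retrieval. Applying the same lookup to the columns would give text-to-image retrieval.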