0% Complete
Home
/
14th International Conference on Computer and Knowledge Engineering
A Novel Approach for Image-Text Matching Cross-Modal Space Learning
Authors :
Amirreza Ebrahimi
1
Mohammad Javad Parseh
2
Pejman Rasti
3
1- jahrom university
2- jahrom university
3- Universite d’Angers,
Keywords :
Image-text matching،Computer Vision،NLP،Visual-semantic embedding
Abstract :
Image-text matching, a crucial area of study in image processing and AI, involves computing the similarity between a natural language sentence and an image to create a unified space for comparison. Traditional techniques often struggle to bridge the inherent gap between visual and verbal communication, leading to suboptimal performance. Our approach addresses this challenge by employing advanced matrix operations that directly handle the distinct characteristics of visual and textual data. This innovative method enhances the speed and accuracy of the matching process, reduces computational complexity, and eliminates the need for additional resources. Experimental results demonstrate significant improvements in matching precision and processing time, underscoring the potential of our method to advance the state-of-the-art in image-text matching. This research contributes to the broader field of multimodal AI, paving the way for more integrated and sophisticated systems capable of understanding and interpreting complex visual and textual information. These findings highlight the transformative potential of our approach in advancing the field of image-text matching.
Papers List
List of archived papers
Adaptive Hybrid TRCA–CORRCA algorithm for enhanced accuracy in SSVEP-based brain-computer interfaces
Sepehr Tayebeh Khabbaz - Sina Tayebeh Khabbaz - Arshia Barani - Arsalan Ganjeh - Sasan Harifi - Seyed Mohsen Mirhosseini
Analyzing the Impact of COVID-19 on Economy from the Perspective of User’s Reviews
Fatemeh Salmani - Hamed Vahdat-Nejad - Hamideh Hajiabadi
Dual Memory Structure for Memory Augmented Neural Networks for Question-Answering Tasks
Amir Bidokhti - Shahrokh Ghaemmaghami
Optimal PMU Placement Considering Reliability of Measurement System in Smart Grids
Mohammad Shahraeini - Shahla Khormali - Ahad Alvandi
Vaccine Distribution Modelling in Pandemics through Multi-Agent Systems: COVID-19 Case
Hossein Yarahmadi - Mohammad Ebrahim Shiri - Hamid Reza Navidi - Arash Sharifi - Moharram Challenger - Hassan Piriaei
Fatty Liver Level Recognition Using Particle Swarm Optimization (PSO) Image Segmentation and Analysis
Seyed Muhammad Hossein Mousavi - Vyacheslav Lyashenko - Atiye Ilanloo - S. Younes Mirinezhad
Graph Representation Learning Towards Patents Network Analysis
Mohammad Heydari - Babak Teimourpour
Multi Model CNN Based Gas Meter Characters Recognition
Sanaz Tarhib - Jafar Tanha - Soodabeh Imanzadeh - Sahar Hassanzadeh Mostafaei
SUT: a new multi-purpose synthetic dataset for Farsi document image analysis
Elham Shabaninia - Fatemeh sadat Eslami - Ali Afkari Fahandari - Hossein Nezamabadi-pour
Implementation of a Low-Overhead 2-Bit Parity-Preserving Reversible Vedic Multiplier for Quantum Architectures
Shekoofeh Moghimi - Negin Mashayekhi - Mohammad Reza Reshadinezhad
more
Samin Hamayesh - Version 43.7.0