0% Complete
Home
/
14th International Conference on Computer and Knowledge Engineering
TriMAE: Fashion visual search with Triplet Masked Auto Encoder Vision Transformer
Authors :
Lachin Zamani
1
Reza Azmi
2
1- Department of Computer Engineering, Faculty of Engineering, Alzahra University, Tehran, Iran
2- Department of Computer Engineering, Faculty of Engineering, Alzahra University, Tehran, Iran
Keywords :
Visual Search،Triplet Network،Masked Auto Encoders Vision Transformer
Abstract :
Visual search is a technology that identifies images similar to a provided query image and presents results ranked by similarity. In the realm of apparel, this innovative tool revolutionizes shopping by enabling users to effortlessly find desired items based on visual preference. Visual search remains a challenging problem despite its potential to significantly enhance user experience. The existence of differences in minute details, the presence of multiple garments in a single image, discrepancies between user-taken and catalog images, and the inherent flexibility of clothing are among the challenges associated with this issue. By selecting robust features and improving the learning of similarity and dissimilarity between images, superior results can be obtained. Consequently, a method has been proposed to yield enhanced outcomes. Convolutional Neural Networks and Vision Transformers are commonly used as the backbone of triplet neural networks for visual search tasks. These networks are designed to better learn the similarities and differences between images. In this research, we employ a combination of triplet neural networks and a masked auto-encoder vision transformer model. A triplet loss function is used during network training to learn the similarity between images. We evaluate our method on the DeepFashion In-shop dataset, which comprises different categories of clothing images. Through extensive experiments on this benchmark, our model achieves an impressive Recall@1 of 93.2% for visual search.
Papers List
List of archived papers
Link Prediction for Recommendation based on Complex Representation of Items Similarities
Masoumeh Alinia - Seyed Mohammad Hossein Hasheminejad - Hadi Shakibian
Towards Study of Research Topics Evolution in Artificial Intelligence based on Topic Embedding
Seyyed Reza Taher Harikandeh - Sadegh Aliakbary - Soroush Taheri
Deep Deterministic Policy Gradient in Acoustic To Articulatory inversion
Farzane Abdoli - Hamid Sheikhzade - Vahid Pourahmadi
Crack Segmentation in Civil Structure Images Using a Deep Learning Based Multi-Classifier System
Mohammadreza Asadi - Seyedeh Sogand Hashemi - Mohammad Taghi Sadeghi
Solving the influence maximization problem by using entropy and weight of edges
Farzaneh Kazemzadeh - Amir Karian - Mitra Mirzarezaee - Ali Asghar Safaei
Parallel Local Feature Selection For High-dimensional Data
Zhaleh Manbari - Chiman Salavati - Fardin AkhlaghianTab - Barzan Saeedpoor - Himan Delbina - Mahmud Abdulla Mohammad
Generating Hand-Written Symbols With Trajectory Planning Using A Robotic Arm
Arya Parvizi - Armin Salimi-Badr
Energy Efficient Power Allocation in MIMO-NOMA Systems with ZF Receiver Beamforming in Multiple Clusters
Mahdi Nangir - Abdolrasoul Sakhaei Gharagezlou - Nima Imani
Analyzing the Impact of COVID-19 on Economy from the Perspective of User’s Reviews
Fatemeh Salmani - Hamed Vahdat-Nejad - Hamideh Hajiabadi
I-ACS: An Improved Ant Colony System to Solve the Time-Dependent Orienteering Problem
Zahra Bakhshandeh - Morteza Keshtkaran
more
Samin Hamayesh - Version 41.3.1