International Conference on Computer and Knowledge Engineering

Home / 15th International Conference on Computer and Knowledge Engineering

Adaptive Prioritization in Experience Replay Using Feedback from Multiple Learning Signals

Authors :

Seyed Hossein Mostafavi¹ Mohammad Bagher Naghibi Sistani²

1- Ferdowsi university of mashhad 2- Ferdowsi university of mashhad

Keywords :

Deep reinforcement learning،experience replay،prioritized experience replay،multi-criteria sampling،adaptive weighting

Abstract :

Deep reinforcement learning (DRL) has made significant progress in recent years. Many DRL algorithms utilize experience replay to store past experiences and reuse them during training. One main challenge in this process is choosing which experiences to sample. While most previous methods rely on one or two sampling criteria, this study introduces a method that incorporates four distinct criteria, with their weights adaptively tuned based on environmental feedback. Simulation results demonstrate that the proposed method outperforms previous approaches in various reinforcement learning environments.

List of archived papers

Facial Emotion Recognition Under Mask Coverage Using a Data Augmentation Technique

Aref Farhadipour - Pouya Taghipour

A Survey of the AVOA Metaheuristic Algorithm and its Suitability for Power System Optimization and Damping Controller Design

Aliyu Sabo - Theophilus Ebuka Odoh - Samuel Habu - Hossien Shahinzadeh - Farshad Ebrahimi

An Efficient Approach for Breast Abnormality Detection through High-Level Features of Thermography Images

Farhad Abedinzadeh Torghabeh - Yeganeh Modaresnia - Seyyed Abed Hosseini

Hardware-Efficient Pruned CNN Optimized by Neural Architecture Search and Genetic Algorithm for Diabetic Retinopathy Detection on STM32F746

Omid Askari Haddad - Sara Ershadi-Nasab

Developing Convolutional Neural Networks using a Novel Lamarckian Co-Evolutionary Algorithm

Zaniar Sharifi - Khabat Soltanian - Ali Amiri

Deep Learning Feature Extraction for COVID-19 Detection Algorithm using Computerized Tomography Scan

Maisarah Mohd Sufian - Ervin Gubin Moung - Chong Joon Hou - Ali Farzamnia

Energy Efficient Power Allocation in MIMO-NOMA Systems with ZF Receiver Beamforming in Multiple Clusters

Mahdi Nangir - Abdolrasoul Sakhaei Gharagezlou - Nima Imani

Reversible Data Insertion in Encryption Domain Based on Reduced Quad Difference Expansion

Alireza Ghaemi - Mohammad Zare Ehteshami - Amirhossein Ghaemi

Parallel Local Feature Selection For High-dimensional Data

Zhaleh Manbari - Chiman Salavati - Fardin AkhlaghianTab - Barzan Saeedpoor - Himan Delbina - Mahmud Abdulla Mohammad

Camouflage Object Segmentation with Attention-Guided Pix2Pix and Boundary Awareness

Erfan Akbarnezhad Sany - Fatemeh Naserizadeh - Parsa Sinichi - Seyyed Abed Hosseini

more

Samin Hamayesh - Version 44.5.0