0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Adaptive Prioritization in Experience Replay Using Feedback from Multiple Learning Signals
Authors :
Seyed Hossein Mostafavi
1
Mohammad Bagher Naghibi Sistani
2
1- Ferdowsi university of mashhad
2- Ferdowsi university of mashhad
Keywords :
Deep reinforcement learning،experience replay،prioritized experience replay،multi-criteria sampling،adaptive weighting
Abstract :
Deep reinforcement learning (DRL) has made significant progress in recent years. Many DRL algorithms utilize experience replay to store past experiences and reuse them during training. One main challenge in this process is choosing which experiences to sample. While most previous methods rely on one or two sampling criteria, this study introduces a method that incorporates four distinct criteria, with their weights adaptively tuned based on environmental feedback. Simulation results demonstrate that the proposed method outperforms previous approaches in various reinforcement learning environments.
Papers List
List of archived papers
A Deep Reinforcement Learning Approach Combining Technical and Fundamental Analyses with a Large Language Model for Stock Trading
Mahan Veisi - Sadra Berangi - Mahdi Shahbazi Khojasteh - Armin Salimi-Badr
Improve the utility of tensor cores by compacting sparse matrix technique
Mohammad.S Abazari - Mahsa Zahedi - Abdorreza Savadi
A Survey of the AVOA Metaheuristic Algorithm and its Suitability for Power System Optimization and Damping Controller Design
Aliyu Sabo - Theophilus Ebuka Odoh - Samuel Habu - Hossien Shahinzadeh - Farshad Ebrahimi
DIPT: Diversified Personalized Transformer for QAC systems
Mahdi Dehghani - Samira Vaez Barenji - Saeed Farzi
Performance Evaluation Study of Color Space Selection In Video Based Facial Expression Recognition Using Deep Neural Networks For Sentiment Analysis
Phee Wei Qin - Ervin Gubin Moung - Ali Farzamnia - Farashazillah Yahya - John Julius Danker Khoo - Maisarah Mohd Sufian
Dynamic Hand Gesture Recognition with 2DCNN-LSTM and Improved Keyframe Extraction
Narjes Heidari - Javid Norouzi - Mohammad Sadegh Helfroush - Habibollah Danyal
A large input-space-margin approach for adversarial training
Reihaneh Nikouei - Mohammad Taheri
Optimal PMU Placement Considering Reliability of Measurement System in Smart Grids
Mohammad Shahraeini - Shahla Khormali - Ahad Alvandi
DTranIDS: A Two-Tiered Intrusion Detection System for RPL-based IoT Networks based on Decision Tree and Transformer Models
Mohammad Fazeli - Mohsen Raji - Mohammad Mahdi Fazeli
DEW-WIN: A Dynamic Energy-aware Window-based Scheduler for Mixed-criticality Systems
Mahin Moradiyan - Yasser Sedaghat - Pouria Hosseini - Yousef Rezazadeh
more
Samin Hamayesh - Version 43.7.0