0% Complete
Home
/
15th International Conference on Computer and Knowledge Engineering
Adaptive Prioritization in Experience Replay Using Feedback from Multiple Learning Signals
Authors :
Seyed Hossein Mostafavi
1
Mohammad Bagher Naghibi Sistani
2
1- Ferdowsi university of mashhad
2- Ferdowsi university of mashhad
Keywords :
Deep reinforcement learning،experience replay،prioritized experience replay،multi-criteria sampling،adaptive weighting
Abstract :
Deep reinforcement learning (DRL) has made significant progress in recent years. Many DRL algorithms utilize experience replay to store past experiences and reuse them during training. One main challenge in this process is choosing which experiences to sample. While most previous methods rely on one or two sampling criteria, this study introduces a method that incorporates four distinct criteria, with their weights adaptively tuned based on environmental feedback. Simulation results demonstrate that the proposed method outperforms previous approaches in various reinforcement learning environments.
Papers List
List of archived papers
Enhancing Vehicle Make and Model Recognition with 3D Attention Modules
Narges Semiromizadeh - Omid Nejati Manzari - Shahriar B. Shokouhi - Sattar Mirzakuchaki
FGM Copula based Analysis of Coverage Region for Wireless Three-User Multiple Access Channel with Correlated Channel Coefficients
Mona Sadat Mohsenzadeh - Ghosheh Abed Hodtani
DFIG-WECS Renewable Integration to the Grid and Stability Improvement through Optimal Damping Controller Design
Theophilus Ebuka Odoh - Aliyu Sabo - Hossien Shahinzadeh - Noor Izzri Abdul Wahab - Farshad Ebrahimi
EpiGraph: Anomaly Detection in Contact Networks for Early Disease Outbreak Prediction
Abolfazl Zarghani
Reversible Data Insertion in Encryption Domain Based on Reduced Quad Difference Expansion
Alireza Ghaemi - Mohammad Zare Ehteshami - Amirhossein Ghaemi
Pyramid Transformer for Traffic Sign Detection
Omid Nejati manzari - Amin Boudesh - Shahriar B. Shokouhi
A Survey of the AVOA Metaheuristic Algorithm and its Suitability for Power System Optimization and Damping Controller Design
Aliyu Sabo - Theophilus Ebuka Odoh - Samuel Habu - Hossien Shahinzadeh - Farshad Ebrahimi
A New Inter-layer Similarity metric for link prediction in multilayer networks
Alireza Abdollahpouri - Samira Rafiee
Graph Attention Networks for Modeling Multi-Sensor Relationships in Early Prediction of Critical Events in ICU Patients
Amir Akhavan Saffar - Danial Eskandari Faruji - Javad Hamidzadeh
Paddy Plant Stress Identification Using Few-Shot Learning Framework
Ervin Gubin Moung - Pavindrah Naidu a/l Narayanasamy Naiidu - Maisarah Mohd Sufian - Valentino Liaw - Ali Farzamnia - Lorita Angeline
more
Samin Hamayesh - Version 43.7.0