Implemented Behavior Cloning, DAgger, Double Q-Learning, Dueling DQN, and Proximal Policy Optimization (PPO) in a simulated environment and analyzed/compared their performance in terms of efficiency, ...
ABSTRACT: Offline reinforcement learning (RL) focuses on learning policies using static datasets without further exploration. With the introduction of distributional reinforcement learning into ...
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3 ...
Abstract: A multi-objective adaptive traffic signal control algorithm using fuzzy control and Q-learning was proposed to improve the efficiency, traffic safety, and operational stability of signalized ...
Abstract: This paper focuses on solving the linear quadratic regulator problem for discrete-time linear systems without knowing system matrices. The classical Q-learning methods for linear systems can ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results