Pinned Repositories
AC-Off-POC
Off-Policy Correction for Actor-Critic Algorithms in Deep Reinforcement Learning
DASE
Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms
DISCOVER
Deep Intrinsically Motivated Exploration in Continuous Control
ICL-task-repr
In-Context Learning Task Representations
LA3P
Actor Prioritized Experience Replay
Q-Error-Exploration
An Optimistic Approach to the Q-Network Error in Actor-Critic Methods
RIS-MISO-Deep-Reinforcement-Learning
Joint Transmit Beamforming and Phase Shifts Design with Deep Reinforcement Learning
RIS-MISO-PDA-Deep-Reinforcement-Learning
Joint Transmit Beamforming and Phase Shifts Design with Deep Reinforcement Learning Under the Phase-Dependent Amplitude Model
SWTD3
Stochastic Weighted Twin Delayed Deep Deterministic Policy Gradient (SWTD3)
TCD3
Author's PyTorch implementation of TCD3 for OpenAI Gym continuous control tasks
baturaysaglam's Repositories
baturaysaglam/RIS-MISO-Deep-Reinforcement-Learning
Joint Transmit Beamforming and Phase Shifts Design with Deep Reinforcement Learning
baturaysaglam/RIS-MISO-PDA-Deep-Reinforcement-Learning
Joint Transmit Beamforming and Phase Shifts Design with Deep Reinforcement Learning Under the Phase-Dependent Amplitude Model
baturaysaglam/LA3P
Actor Prioritized Experience Replay
baturaysaglam/AC-Off-POC
Off-Policy Correction for Actor-Critic Algorithms in Deep Reinforcement Learning
baturaysaglam/DISCOVER
Deep Intrinsically Motivated Exploration in Continuous Control
baturaysaglam/SWTD3
Stochastic Weighted Twin Delayed Deep Deterministic Policy Gradient (SWTD3)
baturaysaglam/DASE
Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms
baturaysaglam/ICL-task-repr
In-Context Learning Task Representations
baturaysaglam/Q-Error-Exploration
An Optimistic Approach to the Q-Network Error in Actor-Critic Methods
baturaysaglam/TCD3
Author's PyTorch implementation of TCD3 for OpenAI Gym continuous control tasks
baturaysaglam/baturaysaglam
Config files for my GitHub profile.
baturaysaglam/compat-grad-approx
Compatible Policy Gradient Approximations for Actor-Critic Algorithms
baturaysaglam/risk-averse-constrained-RL
Risk-Averse Constrained Reinforcement Learning with Optimized Certainty Equivalents