Pinned Repositories
Actor-Critic
agents
TF-Agents is a library for Reinforcement Learning in TensorFlow
Angry-Dragon
Starter kit for new players of Terminal. Contains starter-algo and a basic CLI for running/debugging algo's locally.
Atari_with_TF_Agents
This repository contains code for training reinforcement learning agents to play some of the Atari-2600 games. These agents are built using TensorFlow Agents library.
Chessica
Chess GUI that lets the user play against an engine (or an human opponent on a different computer connected to the same network)
Dyna
Iterative Policy Evaluation for estimating state-value function from an arbitrary policy.
GridMind
A library of reinforcement learning (RL) algorithms.
NST
Neural Style Transfer
REINFORCE
Naive implementation of Monte-Carlo Policy-Gradient Control
Toddler
A very basic chess engine.
shuvoxcd01's Repositories
shuvoxcd01/GridMind
A library of reinforcement learning (RL) algorithms.
shuvoxcd01/REINFORCE
Naive implementation of Monte-Carlo Policy-Gradient Control
shuvoxcd01/Valkyrie
A population based approach to train a reinforcement learning agent to play Atari.
shuvoxcd01/Actor-Critic
shuvoxcd01/agents
TF-Agents is a library for Reinforcement Learning in TensorFlow
shuvoxcd01/Angry-Dragon
Starter kit for new players of Terminal. Contains starter-algo and a basic CLI for running/debugging algo's locally.
shuvoxcd01/Data-Mining
Data Mining implementation dump that was written during my undergrad period.
shuvoxcd01/Dyna
Iterative Policy Evaluation for estimating state-value function from an arbitrary policy.
shuvoxcd01/tictactoe
A tictactoe environment to use with OpenAI gym.
shuvoxcd01/Toddler
A very basic chess engine.
shuvoxcd01/char-GPT
A fork from Karpathy's ng-video-lecture repo.
shuvoxcd01/DQN
Vanilla DQN Implementation
shuvoxcd01/gym
A toolkit for developing and comparing reinforcement learning algorithms.
shuvoxcd01/Gymnasium
A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym)
shuvoxcd01/iGibsonChallenge2021
shuvoxcd01/Medical-Knowledge-Base
A knowledge-base that captures relations among diseases, symptoms and physicians.
shuvoxcd01/MonteCarlo-Algorithm-Suite
shuvoxcd01/neural_tic_tac_toe
shuvoxcd01/Parrot
Ontology based text acquisition.
shuvoxcd01/Phoenix
shuvoxcd01/Random-Walk-Env
A random walk gymnasium environment.
shuvoxcd01/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
shuvoxcd01/REINFORCE_with_Baseline
shuvoxcd01/RL-Worlds
Environments for Reinforcement Learning
shuvoxcd01/Schrodingers-Dealer
A Blackjack solver using Reinforcement Learning, inspired by Schrödinger's paradox. Just like the uncertainty in the famous thought experiment, this agent learns to navigate the unknown and optimize its strategy through trial and error, ultimately mastering the game of Blackjack.
shuvoxcd01/scispacy
A full spaCy pipeline and models for scientific/biomedical documents.
shuvoxcd01/shuvoxcd01
shuvoxcd01/shuvoxcd01.github.io
Personal website.
shuvoxcd01/TD-0-Prediction
TD(0) Prediction
shuvoxcd01/TD-Algorithm-Suite
A suite of Temporal Difference algorithms.