shuvoxcd01

Machine Learning Engineer. Love Reinforcement Learning and Chess.

IQVIADhaka, Bangladesh

Pinned Repositories

Actor-Critic
Language:Python0 1 00
agents
TF-Agents is a library for Reinforcement Learning in TensorFlow
Language:Python0 0 00
Angry-Dragon
Starter kit for new players of Terminal. Contains starter-algo and a basic CLI for running/debugging algo's locally.
Language:Python0 0 00
Atari_with_TF_Agents
This repository contains code for training reinforcement learning agents to play some of the Atari-2600 games. These agents are built using TensorFlow Agents library.
Language:Jupyter Notebook0 2 00
Chessica
Chess GUI that lets the user play against an engine (or an human opponent on a different computer connected to the same network)
Language:Java1 1 00
Dyna
Iterative Policy Evaluation for estimating state-value function from an arbitrary policy.
Language:Python0 1 00
GridMind
A library of reinforcement learning (RL) algorithms.
Language:Python7 1 01
NST
Neural Style Transfer
Language:Jupyter Notebook1 2 00
REINFORCE
Naive implementation of Monte-Carlo Policy-Gradient Control
Language:Python1 1 01
Toddler
A very basic chess engine.
Language:Python0 1 00

shuvoxcd01's Repositories

shuvoxcd01/GridMind
A library of reinforcement learning (RL) algorithms.
Language:Python7 1 01
shuvoxcd01/REINFORCE
Naive implementation of Monte-Carlo Policy-Gradient Control
Language:Python1 1 01
shuvoxcd01/Valkyrie
A population based approach to train a reinforcement learning agent to play Atari.
Language:Python1 1 00
shuvoxcd01/Actor-Critic
Language:Python0 1 00
shuvoxcd01/agents
TF-Agents is a library for Reinforcement Learning in TensorFlow
Language:Python0 0 00
shuvoxcd01/Angry-Dragon
Starter kit for new players of Terminal. Contains starter-algo and a basic CLI for running/debugging algo's locally.
Language:Python0 0 00
shuvoxcd01/Data-Mining
Data Mining implementation dump that was written during my undergrad period.
Language:Python0 1 00
shuvoxcd01/Dyna
Iterative Policy Evaluation for estimating state-value function from an arbitrary policy.
Language:Python0 1 00
shuvoxcd01/tictactoe
A tictactoe environment to use with OpenAI gym.
Language:Python0 1 00
shuvoxcd01/Toddler
A very basic chess engine.
Language:Python0 1 00
shuvoxcd01/char-GPT
A fork from Karpathy's ng-video-lecture repo.
Language:Python
shuvoxcd01/DQN
Vanilla DQN Implementation
Language:Jupyter Notebook1 0
shuvoxcd01/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language:Python0 0
shuvoxcd01/Gymnasium
A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym)
Language:Python0 0
shuvoxcd01/iGibsonChallenge2021
Language:Python0 0
shuvoxcd01/Medical-Knowledge-Base
A knowledge-base that captures relations among diseases, symptoms and physicians.
1 0
shuvoxcd01/MonteCarlo-Algorithm-Suite
Language:Python
shuvoxcd01/neural_tic_tac_toe
Language:Python1 0
shuvoxcd01/Parrot
Ontology based text acquisition.
Language:Python1 0
shuvoxcd01/Phoenix
Language:C++1 0
shuvoxcd01/Random-Walk-Env
A random walk gymnasium environment.
Language:Python
shuvoxcd01/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
Language:Python0 0
shuvoxcd01/REINFORCE_with_Baseline
Language:Python1 0
shuvoxcd01/RL-Worlds
Environments for Reinforcement Learning
Language:Python
shuvoxcd01/Schrodingers-Dealer
A Blackjack solver using Reinforcement Learning, inspired by Schrödinger's paradox. Just like the uncertainty in the famous thought experiment, this agent learns to navigate the unknown and optimize its strategy through trial and error, ultimately mastering the game of Blackjack.
1 0
shuvoxcd01/scispacy
A full spaCy pipeline and models for scientific/biomedical documents.
Language:Python0 0
shuvoxcd01/shuvoxcd01
0 0
shuvoxcd01/shuvoxcd01.github.io
Personal website.
Language:HTML1 0
shuvoxcd01/TD-0-Prediction
TD(0) Prediction
Language:Python
shuvoxcd01/TD-Algorithm-Suite
A suite of Temporal Difference algorithms.
Language:Python