Pinned Repositories
60_Days_RL_Challenge
Learn Deep Reinforcement Learning in Depth in 60 days
adversarial-robustness-toolbox
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
camera_model_and_stereo_depth_sensing
Camera model and stereo depth sensing using OpenCV
cse536-xv6-os
Writing code for xv6 OS.
FinRL
A Deep Reinforcement Learning Framework for Automated Trading in Quantitative Finance. NeurIPS 2020 & ICAIF 2021. 🔥
gpt4-testing-tom
Testing GPT4 completions for Theory-of-Mind (vs ChatGPT & text-davinci-003)
ImageBind_testing
ImageBind One Embedding Space to Bind Them All
imitation-learning
Imitation learning algorithms
learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
wordle_using_rollouts
This repository contains the official code for the paper: "Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach" by Siddhant Bhambri, Amrita Bhattacharjee & Dimitri Bertsekas, accepted at IEEE CoG 2023.
sbhambr1's Repositories
sbhambr1/camera_model_and_stereo_depth_sensing
Camera model and stereo depth sensing using OpenCV
sbhambr1/wordle_using_rollouts
This repository contains the official code for the paper: "Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach" by Siddhant Bhambri, Amrita Bhattacharjee & Dimitri Bertsekas, accepted at IEEE CoG 2023.
sbhambr1/gpt4-testing-tom
Testing GPT4 completions for Theory-of-Mind (vs ChatGPT & text-davinci-003)
sbhambr1/60_Days_RL_Challenge
Learn Deep Reinforcement Learning in Depth in 60 days
sbhambr1/adversarial-robustness-toolbox
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
sbhambr1/cse536-xv6-os
Writing code for xv6 OS.
sbhambr1/FinRL
A Deep Reinforcement Learning Framework for Automated Trading in Quantitative Finance. NeurIPS 2020 & ICAIF 2021. 🔥
sbhambr1/ImageBind_testing
ImageBind One Embedding Space to Bind Them All
sbhambr1/imitation-learning
Imitation learning algorithms
sbhambr1/learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
sbhambr1/lmql-playground
Playing with LMQL:https://lmql.ai
sbhambr1/MarkovGameSolvers
This is code for finding the minimax/nash/stackelberg strategy of players in Markov Games.
sbhambr1/MAS-Memory-Aware-Synapses
Memory Aware Synapses method implementation code
sbhambr1/sbhambr1.github.io
sbhambr1/StackelbergEquilibribumSolvers
Solves a Mixed Integer Linear Program to generate the Stacklberg Equilibrium of a General-sum (+Bayesian) Games.
sbhambr1/segment_anything_playground
Playing with SAM model by MetaAI
sbhambr1/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
sbhambr1/StatisticalML-Course
sbhambr1/symbolic_planning_and_rl
Spring 2021 - CSE 574 Project
sbhambr1/turtlebot3_simulations
Simulating TurtleBot3 in custom worlds & playing the evader-pursuer game.
sbhambr1/videos
Code for the manim-generated scenes used in 3blue1brown videos
sbhambr1/XAI-papers