sbhambr1

“七転び八起き”

Arizona State University

Pinned Repositories

60_Days_RL_Challenge
Learn Deep Reinforcement Learning in Depth in 60 days
Language:Jupyter Notebook0 0 00
adversarial-robustness-toolbox
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
Language:Python0 0 00
camera_model_and_stereo_depth_sensing
Camera model and stereo depth sensing using OpenCV
Language:Python7 1 00
cse536-xv6-os
Writing code for xv6 OS.
Language:Assembly0 0 00
FinRL
A Deep Reinforcement Learning Framework for Automated Trading in Quantitative Finance. NeurIPS 2020 & ICAIF 2021. 🔥
Language:Jupyter Notebook0 0 00
gpt4-testing-tom
Testing GPT4 completions for Theory-of-Mind (vs ChatGPT & text-davinci-003)
Language:Python1 1 00
ImageBind_testing
ImageBind One Embedding Space to Bind Them All
Language:Python0 0 00
imitation-learning
Imitation learning algorithms
Language:Python0 0 00
learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
Language:Python0 0 00
wordle_using_rollouts
This repository contains the official code for the paper: "Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach" by Siddhant Bhambri, Amrita Bhattacharjee & Dimitri Bertsekas, accepted at IEEE CoG 2023.
Language:Jupyter Notebook4 1 00

sbhambr1's Repositories

sbhambr1/camera_model_and_stereo_depth_sensing
Camera model and stereo depth sensing using OpenCV
Language:Python7 1 00
sbhambr1/wordle_using_rollouts
This repository contains the official code for the paper: "Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach" by Siddhant Bhambri, Amrita Bhattacharjee & Dimitri Bertsekas, accepted at IEEE CoG 2023.
Language:Jupyter Notebook4 1 00
sbhambr1/gpt4-testing-tom
Testing GPT4 completions for Theory-of-Mind (vs ChatGPT & text-davinci-003)
Language:Python1 1 00
sbhambr1/60_Days_RL_Challenge
Learn Deep Reinforcement Learning in Depth in 60 days
Language:Jupyter Notebook0 0 00
sbhambr1/adversarial-robustness-toolbox
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
Language:Python0 0 00
sbhambr1/cse536-xv6-os
Writing code for xv6 OS.
Language:Assembly0 0 00
sbhambr1/FinRL
A Deep Reinforcement Learning Framework for Automated Trading in Quantitative Finance. NeurIPS 2020 & ICAIF 2021. 🔥
Language:Jupyter Notebook0 0 00
sbhambr1/ImageBind_testing
ImageBind One Embedding Space to Bind Them All
Language:Python0 0 00
sbhambr1/imitation-learning
Imitation learning algorithms
Language:Python0 0 00
sbhambr1/learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
Language:Python0 0 00
sbhambr1/lmql-playground
Playing with LMQL:https://lmql.ai
Language:Python0 1 00
sbhambr1/MarkovGameSolvers
This is code for finding the minimax/nash/stackelberg strategy of players in Markov Games.
Language:Python0 0 00
sbhambr1/MAS-Memory-Aware-Synapses
Memory Aware Synapses method implementation code
Language:Jupyter Notebook0 0 00
sbhambr1/sbhambr1.github.io
Language:JavaScript0 1 00
sbhambr1/StackelbergEquilibribumSolvers
Solves a Mixed Integer Linear Program to generate the Stacklberg Equilibrium of a General-sum (+Bayesian) Games.
Language:Python0 0 00
sbhambr1/segment_anything_playground
Playing with SAM model by MetaAI
Language:Jupyter Notebook1 0
sbhambr1/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
Language:Python0 0
sbhambr1/StatisticalML-Course
Language:Python1 0
sbhambr1/symbolic_planning_and_rl
Spring 2021 - CSE 574 Project
Language:Python0 0
sbhambr1/turtlebot3_simulations
Simulating TurtleBot3 in custom worlds & playing the evader-pursuer game.
Language:C++0 0
sbhambr1/videos
Code for the manim-generated scenes used in 3blue1brown videos
Language:Python0 0
sbhambr1/XAI-papers
0 0

sbhambr1

Pinned Repositories

60_Days_RL_Challenge

adversarial-robustness-toolbox

camera_model_and_stereo_depth_sensing

cse536-xv6-os

FinRL

gpt4-testing-tom

ImageBind_testing

imitation-learning

learning-from-human-preferences

wordle_using_rollouts

sbhambr1's Repositories

sbhambr1/camera_model_and_stereo_depth_sensing

sbhambr1/wordle_using_rollouts

sbhambr1/gpt4-testing-tom

sbhambr1/60_Days_RL_Challenge

sbhambr1/adversarial-robustness-toolbox

sbhambr1/cse536-xv6-os

sbhambr1/FinRL

sbhambr1/ImageBind_testing

sbhambr1/imitation-learning

sbhambr1/learning-from-human-preferences

sbhambr1/lmql-playground

sbhambr1/MarkovGameSolvers

sbhambr1/MAS-Memory-Aware-Synapses

sbhambr1/sbhambr1.github.io

sbhambr1/StackelbergEquilibribumSolvers

sbhambr1/segment_anything_playground

sbhambr1/self-instruct

sbhambr1/StatisticalML-Course

sbhambr1/symbolic_planning_and_rl

sbhambr1/turtlebot3_simulations

sbhambr1/videos

sbhambr1/XAI-papers