koulanurag
Applied Scientist 2 at Amazon | LLM for Code | Deep Reinforcement Learning
Amazon | New York, New York
koulanurag's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
tldraw/tldraw
whiteboard / infinite canvas SDK
maybe-finance/maybe
The OS for your personal finances
preservim/nerdtree
A tree explorer plugin for vim.
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
ahmedbahaaeldin/From-0-to-Research-Scientist-resources-guide
Detailed and tailored guide for undergraduate students or anybody who wants to dig deep into the field of AI with a solid foundation.
karpathy/ng-video-lecture
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
danijar/dreamerv3
Mastering Diverse Domains through World Models
opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
google-deepmind/mujoco_mpc
Real-time behaviour synthesis with MuJoCo, using Predictive Control
google-research/robopianist
[CoRL '23] Dexterous piano playing with deep reinforcement learning.
google-deepmind/dqn_zoo
DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.
google-deepmind/alphastar
nicklashansen/tdmpc
Code for "Temporal Difference Learning for Model Predictive Control"
VIRL-Platform/VIRL
(ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life
locuslab/differentiable-mpc
awjuliani/neuro-nav
A library for neuroscience-inspired navigation and decision making research.
liuzuxin/FSRL
🚀 A fast safe reinforcement learning library in PyTorch
jurgisp/memory-maze
Evaluating long-term memory of reinforcement learning algorithms
kc-ml2/SimpleDreamer
A Simplified PyTorch Version of the Dreamer Algorithm
danijar/director
Deep Hierarchical Planning from Pixels
gwthomas/IQL-PyTorch
A PyTorch implementation of Implicit Q-Learning
frt03/generalized_dt
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)
fareskalaboud/LearnPDDL
A beginner's guide to learning, implementing and using PDDL.
marc-rigter/waker
Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.
simorxb/MPC-Pendulum-Python
Model Predictive Control implemented in Python, using scipy.optimize.minimize, on the model of a pendulum.
robotsorcerer/levelsetpy
A GPU-accelerated toolbox for hyperbolic PDEs in a weaker (viscosity) sense. It leverages the integral to the solution of the conservation of momentum problem (being equivalent to the derivative of Hamilton-Jacobi equations) in one spatial dimension. We resolve such hyperbolic differential equations using wave-front propagating schemes on a spatial-by-spatial dimension in resolving the classical value in dynamic programming (respectively optimal control and differential games) problems.
koulanurag/opcc
Benchmark for "Offline Policy Comparison with Confidence"