koulanurag

Applied Scientist 2 at Amazon | LLM for Code | Deep Reinforcement Learning

AmazonNew York, New York

koulanurag's Stars

Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language:Python170k 1.5k 3k44.7k
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language:Jupyter Notebook40.4k 418 694.3k
tldraw/tldraw
whiteboard / infinite canvas SDK
Language:TypeScript37.8k 156 1.3k2.3k
maybe-finance/maybe
The OS for your personal finances
Language:Ruby34.6k 174 4542.5k
preservim/nerdtree
A tree explorer plugin for vim.
Language:Vim Script19.7k 307 9781.4k
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Language:Python10.9k 45 4201.6k
ahmedbahaaeldin/From-0-to-Research-Scientist-resources-guide
Detailed and tailored guide for undergraduate students or anybody want to dig deep into the field of AI with solid foundation.
7.5k 212 141.1k
karpathy/ng-video-lecture
Language:Python3.6k 57 30961
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
3k 52 8253
danijar/dreamerv3
Mastering Diverse Domains through World Models
Language:Python1.4k 28 142236
opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Language:Python1.2k 12 111129
google-deepmind/mujoco_mpc
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Language:C++1.1k 24 111169
google-research/robopianist
[CoRL '23] Dexterous piano playing with deep reinforcement learning.
Language:Python595 12 1549
google-deepmind/dqn_zoo
DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.
Language:Python462 18 2380
google-deepmind/alphastar
Language:Python432 11 657
nicklashansen/tdmpc
Code for "Temporal Difference Learning for Model Predictive Control"
Language:Python382 7 1955
VIRL-Platform/VIRL
(ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life
Language:Python319 12 513
locuslab/differentiable-mpc
Language:Python252 10 452
awjuliani/neuro-nav
A library for neuroscience-inspired navigation and decision making research.
Language:Jupyter Notebook201 9 817
liuzuxin/FSRL
🚀 A fast safe reinforcement learning library in PyTorch
Language:Python167 4 227
jurgisp/memory-maze
Evaluating long-term memory of reinforcement learning algorithms
Language:Python136 3 3014
kc-ml2/SimpleDreamer
A Simplified Pytorch Version of the Dreamer Algorithm
Language:Python114 7 714
danijar/director
Deep Hierarchical Planning from Pixels
Language:Python91 4 822
gwthomas/IQL-PyTorch
A PyTorch implementation of Implicit Q-Learning
Language:Python70 2 48
frt03/generalized_dt
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)
Language:Python66 0 44
fareskalaboud/LearnPDDL
A beginner's guide to learning, implementing and using PDDL.
Language:PDDL54 4 011
marc-rigter/waker
Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.
Language:Python27 1 02
simorxb/MPC-Pendulum-Python
Model Predictive Control implemented in Python, using scipy.optimize.minimize, on the model of a pendulum.
Language:Python23 4 03
robotsorcerer/levelsetpy
A GPU-accelerated toolbox for hyperbolic PDEs in a weaker (viscosity) sense. It leverages the integral to the solution of the conservation of momentum problem (being equivalent to the derivative of Hamilton-Jacobi equations) in one spatial dimension. We resolve such hyperbolic differential equations using wave-front propagating schemes on a spatial-by-spatial dimension in resolving the classical value in dynamic programming (respectively optimal control and differential games) problems.
Language:Jupyter Notebook9 2 01
koulanurag/opcc
Benchmark for "Offline Policy Comparison with Confidence"
Language:Python3 1 00

koulanurag

koulanurag's Stars

Significant-Gravitas/AutoGPT

mlabonne/llm-course

tldraw/tldraw

maybe-finance/maybe

preservim/nerdtree

jacobgil/pytorch-grad-cam

ahmedbahaaeldin/From-0-to-Research-Scientist-resources-guide

karpathy/ng-video-lecture

YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy

danijar/dreamerv3

opendilab/LightZero

google-deepmind/mujoco_mpc

google-research/robopianist

google-deepmind/dqn_zoo

google-deepmind/alphastar

nicklashansen/tdmpc

VIRL-Platform/VIRL

locuslab/differentiable-mpc

awjuliani/neuro-nav

liuzuxin/FSRL

jurgisp/memory-maze

kc-ml2/SimpleDreamer

danijar/director

gwthomas/IQL-PyTorch

frt03/generalized_dt

fareskalaboud/LearnPDDL

marc-rigter/waker

simorxb/MPC-Pendulum-Python

robotsorcerer/levelsetpy

koulanurag/opcc