nishaanthkanna
Independent ML Researcher. Interested in researching methods to make ML models reliable, trustworthy and robust to distribution shift
Fredericton
nishaanthkanna's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
floodsung/Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
modularml/mojo
The Mojo Programming Language
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
triton-lang/triton
Development repository for the Triton language and compiler
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
karpathy/arxiv-sanity-preserver
Web interface for browsing, search and filtering recent arxiv submissions
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
facebookresearch/ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
Farama-Foundation/PettingZoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
jonbarron/website
MineDojo/MineDojo
Building Open-Ended Embodied Agents with Internet-Scale Knowledge
keras-team/keras-cv
Industry-strength Computer Vision workflows with Keras
optimass/continual_learning_papers
Relevant papers in Continual Learning
CarperAI/OpenELM
Evolution Through Large Models
uber-research/go-explore
Code for Go-Explore: a New Approach for Hard-Exploration Problems
danijar/dreamer
Dream to Control: Learning Behaviors by Latent Imagination
adaptive-intelligent-robotics/QDax
Accelerated Quality-Diversity
ShengranHu/Thought-Cloning
[NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
facebookresearch/mtrl
Multi Task RL Baselines
mila-iqia/spr
Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"
Shanghai-Digital-Brain-Laboratory/BDM-DB1
A large-scale multi-modal pre-trained model
aidangomez/weblm
Drive a browser with Cohere
CarperAI/ArchitextRL