akbir's Stars
twitter/the-algorithm
Source code for Twitter's Recommendation Algorithm
zed-industries/zed
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
google/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
huggingface/candle
Minimalist ML framework for Rust
ggerganov/kbd-audio
🎤⌨️ Acoustic keyboard eavesdropping
facebookresearch/metaseq
Repo for external large-scale work
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
google-deepmind/alphatensor
Farama-Foundation/PettingZoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
Farama-Foundation/chatarena
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
elicit/machine-learning-list
A curriculum for learning about foundation models, from scratch to the frontier
facebookresearch/nle
The NetHack Learning Environment
zkonduit/ezkl
ezkl is an engine for doing inference for deep learning models and other computational graphs in a zk-snark (ZKML). Use it from Python, Javascript, or the command line.
Tanuki/tanuki.py
Prompt engineering for developers
ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
WhatsApp/waraft
An Erlang implementation of RAFT from WhatsApp
RobertTLange/evosax
Evolution Strategies in JAX 🦎
srush/annotated-s4
Implementation of https://srush.github.io/annotated-s4
facebookresearch/moolib
A library for distributed ML training with PyTorch
rabbitscam/rabbitr1
facebookresearch/optimizers
For optimization algorithm research and development.
rowanz/hellaswag
HellaSwag: Can a Machine _Really_ Finish Your Sentence?
kandouss/marlgrid
Gridworld for MARL experiments
davisyoshida/lorax
LoRA for arbitrary JAX models and functions
facebookresearch/dcd
Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.
google-deepmind/debate
Formalizing stochastic doubly-efficient debate
CarperAI/autocrit
A repository for transformer critique learning and generation
ucl-dark/llm_debate
Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"
longtermrisk/marltoolbox
A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).
ucl-dark/pax
Scalable Opponent Shaping Experiments in JAX