akbir

PhD student at @ucl-dark

akbir's Stars

twitter/the-algorithm
Source code for Twitter's Recommendation Algorithm
Language:Scala62.1k 341 95612.1k
zed-industries/zed
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
Language:Rust47.9k 210 8.5k2.8k
google/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language:Python30k 328 5.5k2.7k
huggingface/candle
Minimalist ML framework for Rust
Language:Rust15.4k 151 683905
ggerganov/kbd-audio
🎤⌨️ Acoustic keyboard eavesdropping
Language:C++8.5k 132 36584
facebookresearch/metaseq
Repo for external large-scale work
Language:Python6.5k 112 294724
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python3.6k 67 229829
google-deepmind/alphatensor
Language:Python2.7k 57 11233
Farama-Foundation/PettingZoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
Language:Python2.6k 18 371408
Farama-Foundation/chatarena
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
Language:Python1.3k 18 23129
elicit/machine-learning-list
A curriculum for learning about foundation models, from scratch to the frontier
941 21 170
facebookresearch/nle
The NetHack Learning Environment
Language:C937 30 113114
zkonduit/ezkl
ezkl is an engine for doing inference for deep learning models and other computational graphs in a zk-snark (ZKML). Use it from Python, Javascript, or the command line.
Language:Rust925 21 112130
Tanuki/tanuki.py
Prompt engineering for developers
Language:Python670 7 5323
ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Language:Jupyter Notebook613 12 865
WhatsApp/waraft
An Erlang implementation of RAFT from WhatsApp
Language:Erlang545 19 334
RobertTLange/evosax
Evolution Strategies in JAX 🦎
Language:Python488 10 4143
srush/annotated-s4
Implementation of https://srush.github.io/annotated-s4
Language:Python462 9 2761
facebookresearch/moolib
A library for distributed ML training with PyTorch
Language:C++366 12 1920
rabbitscam/rabbitr1
30117
facebookresearch/optimizers
For optimization algorithm research and development.
Language:Python253 14 1225
rowanz/hellaswag
HellaSwag: Can a Machine _Really_ Finish Your Sentence?
Language:Python180 4 822
kandouss/marlgrid
Gridworld for MARL experiments
Language:Python137 5 625
davisyoshida/lorax
LoRA for arbitrary JAX models and functions
Language:Python128 4 124
facebookresearch/dcd
Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.
Language:Python121 6 925
google-deepmind/debate
Formalizing stochastic doubly-efficient debate
Language:Lean88 9 014
CarperAI/autocrit
A repository for transformer critique learning and generation
Language:Python84 6 317
ucl-dark/llm_debate
Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"
Language:Python75 4 28
longtermrisk/marltoolbox
A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).
Language:Python29 4 43
ucl-dark/pax
Scalable Opponent Shaping Experiments in JAX
Language:Python21 6 215

akbir

akbir's Stars

twitter/the-algorithm

zed-industries/zed

google/jax

huggingface/candle

ggerganov/kbd-audio

facebookresearch/metaseq

ikostrikov/pytorch-a2c-ppo-acktr-gail

google-deepmind/alphatensor

Farama-Foundation/PettingZoo

Farama-Foundation/chatarena

elicit/machine-learning-list

facebookresearch/nle

zkonduit/ezkl

Tanuki/tanuki.py

ikostrikov/jaxrl

WhatsApp/waraft

RobertTLange/evosax

srush/annotated-s4

facebookresearch/moolib

rabbitscam/rabbitr1

facebookresearch/optimizers

rowanz/hellaswag

kandouss/marlgrid

davisyoshida/lorax

facebookresearch/dcd

google-deepmind/debate

CarperAI/autocrit

ucl-dark/llm_debate

longtermrisk/marltoolbox

ucl-dark/pax