upiterbarg's Stars
heiner/nle
The NetHack Learning Environment
allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
karpathy/llm.c
LLM training in simple, raw C/CUDA
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
piercefreeman/gpt-json
Structured and typehinted GPT responses in Python
upiterbarg/diff_history
[ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)
microsoft/PythonProgrammingPuzzles
A Dataset of Python Challenges for AI Research
flowersteam/lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
flowersteam/Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
notmahi/dobb-e
Dobb·E: An open-source, general framework for learning household robotic manipulation
meta-llama/codellama
Inference code for CodeLlama models
OSU-NLP-Group/Mind2Web
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"
upiterbarg/hihack
[NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)
allenai/longformer
Longformer: The Long-Document Transformer
lyutyuh/ASP
PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxiv.org/pdf/2210.14698.pdf
abetlen/llama-cpp-python
Python bindings for llama.cpp
ggerganov/llama.cpp
LLM inference in C/C++
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
LingDong-/shan-shui-inf
Procedurally generated Chinese landscape painting.
allenai/open-instruct
google-research/FLAN
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
srush/Tensor-Puzzles
Solve puzzles. Improve your pytorch.
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
google-research/robotics_transformer
ngoodger/nle-language-wrapper
Nethack Learning Environment Wrapper for Language Interface
facebookresearch/torchbeast
A PyTorch Platform for Distributed RL
maciej-sypetkowski/autoascend
The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge
google/jax-cfd
Computational Fluid Dynamics in JAX