upiterbarg

Ph.D. student

NYU CourantNew York, New York

upiterbarg's Stars

heiner/nle
The NetHack Learning Environment
Language:C428
allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
Language:Python92296
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda23.4k2.6k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python27.2k4k
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language:Python4.5k388
piercefreeman/gpt-json
Structured and typehinted GPT responses in Python
Language:Python73530
upiterbarg/diff_history
[ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)
Language:Python172
microsoft/PythonProgrammingPuzzles
A Dataset of Python Challenges for AI Research
Language:Python96192
flowersteam/lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
Language:Python18918
flowersteam/Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
Language:Python21424
notmahi/dobb-e
Dobb·E: An open-source, general framework for learning household robotic manipulation
Language:G-code56751
meta-llama/codellama
Inference code for CodeLlama models
Language:Python15.9k1.8k
OSU-NLP-Group/Mind2Web
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"
Language:Jupyter Notebook65794
upiterbarg/hihack
[NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)
Language:Python91
allenai/longformer
Longformer: The Long-Document Transformer
Language:Python2k271
lyutyuh/ASP
PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxiv.org/pdf/2210.14698.pdf
Language:Python9815
abetlen/llama-cpp-python
Python bindings for llama.cpp
Language:Python7.8k934
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++65.3k9.4k
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
Language:C++1.7k223
LingDong-/shan-shui-inf
Procedurally generated Chinese landscape painting.
Language:HTML5.5k445
allenai/open-instruct
Language:Python1.2k165
google-research/FLAN
Language:Python1.5k152
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Language:Python11.9k804
srush/Tensor-Puzzles
Solve puzzles. Improve your pytorch.
Language:Jupyter Notebook3.1k259
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python36.3k5.7k
google-research/robotics_transformer
Language:Python1.3k149
ngoodger/nle-language-wrapper
Nethack Learning Environment Wrapper for Language Interface
Language:Python332
facebookresearch/torchbeast
A PyTorch Platform for Distributed RL
Language:Python737113
maciej-sypetkowski/autoascend
The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge
Language:Python5515
google/jax-cfd
Computational Fluid Dynamics in JAX
Language:Jupyter Notebook724101

upiterbarg

upiterbarg's Stars

heiner/nle

allenai/dolma

karpathy/llm.c

vllm-project/vllm

huggingface/alignment-handbook

piercefreeman/gpt-json

upiterbarg/diff_history

microsoft/PythonProgrammingPuzzles

flowersteam/lamorel

flowersteam/Grounding_LLMs_with_online_RL

notmahi/dobb-e

meta-llama/codellama

OSU-NLP-Group/Mind2Web

upiterbarg/hihack

allenai/longformer

lyutyuh/ASP

abetlen/llama-cpp-python

ggerganov/llama.cpp

flexflow/FlexFlow

LingDong-/shan-shui-inf

allenai/open-instruct

google-research/FLAN

openai/tiktoken

srush/Tensor-Puzzles

karpathy/nanoGPT

google-research/robotics_transformer

ngoodger/nle-language-wrapper

facebookresearch/torchbeast

maciej-sypetkowski/autoascend

google/jax-cfd