kohjingyu's Stars
kohjingyu/search-agents
Code for the paper 🌳 Tree Search for Language Model Agents
web-arena-x/visualwebarena
VisualWebArena is a benchmark for multimodal agents.
openai/prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
1j01/jspaint
🎨 Classic MS Paint, REVIVED + ✨Extras
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
kohjingyu/gill
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
allenai/mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
rom1504/clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
hendrycks/test
Measuring Massive Multitask Language Understanding | ICLR 2021
TalSchuster/VitaminC
Contrastive Fact Verification
jhuangtw/xg2xg
by ex-googlers, for ex-googlers - a lookup table of similar tech & services
kohjingyu/fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
mlfoundations/open_clip
An open source implementation of CLIP.
Aleph-Alpha/magma
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
mjpost/sacrebleu
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
kakaobrain/coyo-dataset
COYO-700M: Large-scale Image-Text Pair Dataset
google-research/se3ds
This repository hosts the code for our paper, "Simple and Effective Synthesis of Indoor 3D Scenes".
google-research/parti
kakaobrain/rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
state-spaces/s4
Structured state space sequence models
facebookresearch/metaseq
Repo for external large-scale work
echen/restricted-boltzmann-machines
Restricted Boltzmann Machines in Python.
yell/boltzmann-machines
Boltzmann Machines in TensorFlow with examples
allenai/ScienceWorld
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
facebookresearch/StyleNeRF
This is the open source implementation of the ICLR2022 paper "StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis"
LunjunZhang/world-model-as-a-graph
Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)