kohjingyu

ML PhD student at CMU. Previously at Google Research.

Carnegie Mellon University

kohjingyu's Stars

kohjingyu/search-agents
Code for the paper 🌳 Tree Search for Language Model Agents
Language:Python13014
web-arena-x/visualwebarena
VisualWebArena is a benchmark for multimodal agents.
Language:Python21740
openai/prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
Language:Python1.5k97
1j01/jspaint
🎨 Classic MS Paint, ＲＥＶＩＶＥＤ + ✨Extras
Language:JavaScript7.2k568
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Language:Python3.7k278
kohjingyu/gill
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
Language:Jupyter Notebook42335
allenai/mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
Language:Python89934
rom1504/clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
Language:Jupyter Notebook2.4k208
hendrycks/test
Measuring Massive Multitask Language Understanding | ICLR 2021
Language:Python1.2k90
TalSchuster/VitaminC
Contrastive Fact Verification
Language:Python6911
jhuangtw/xg2xg
by ex-googlers, for ex-googlers - a lookup table of similar tech & services
14.6k1k
kohjingyu/fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
Language:Jupyter Notebook47435
JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
Language:Python1.3k100
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python36.5k5.8k
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language:Python25.4k5.3k
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python9.9k959
Aleph-Alpha/magma
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
Language:Python47555
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Language:Python6.1k615
mjpost/sacrebleu
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
Language:Python1.1k162
kakaobrain/coyo-dataset
COYO-700M: Large-scale Image-Text Pair Dataset
Language:Python1.1k36
google-research/se3ds
This repository hosts the code for our paper, "Simple and Effective Synthesis of Indoor 3D Scenes".
Language:Jupyter Notebook383
google-research/parti
1.5k87
kakaobrain/rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
Language:Jupyter Notebook76883
state-spaces/s4
Structured state space sequence models
Language:Jupyter Notebook2.4k285
facebookresearch/metaseq
Repo for external large-scale work
Language:Python6.5k724
echen/restricted-boltzmann-machines
Restricted Boltzmann Machines in Python.
Language:Python946373
yell/boltzmann-machines
Boltzmann Machines in TensorFlow with examples
Language:Jupyter Notebook845134
allenai/ScienceWorld
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
Language:Scala20626
facebookresearch/StyleNeRF
This is the open source implementation of the ICLR2022 paper "StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis"
Language:Python95791
LunjunZhang/world-model-as-a-graph
Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)
Language:Python623

kohjingyu

kohjingyu's Stars

kohjingyu/search-agents

web-arena-x/visualwebarena

openai/prm800k

1j01/jspaint

mlfoundations/open_flamingo

kohjingyu/gill

allenai/mmc4

rom1504/clip-retrieval

hendrycks/test

TalSchuster/VitaminC

jhuangtw/xg2xg

kohjingyu/fromage

JonasGeiping/cramming

karpathy/nanoGPT

huggingface/diffusers

mlfoundations/open_clip

Aleph-Alpha/magma

bitsandbytes-foundation/bitsandbytes

mjpost/sacrebleu

kakaobrain/coyo-dataset

google-research/se3ds

google-research/parti

kakaobrain/rq-vae-transformer

state-spaces/s4

facebookresearch/metaseq

echen/restricted-boltzmann-machines

yell/boltzmann-machines

allenai/ScienceWorld

facebookresearch/StyleNeRF

LunjunZhang/world-model-as-a-graph