wilson1yan

PhD student interested in generative modeling and representation learning

University of California, BerkeleyBerkeley, CA

Pinned Repositories

LWM
Language:Python7k 65 66539
deepul
Language:Jupyter Notebook733 61 12363
contrastive-forward-model
Language:Python30 5 68
cs294-158-ssl
Language:Python14 4 14
lpips-jax
Language:Python9 5 02
povt
Language:Python12 3 10
rlpyt
Reinforcement Learning in PyTorch
Language:Python25 4 05
teco
Language:Python98 5 29
VideoGPT
Language:Jupyter Notebook938 23 37107
VideoGPT-Paper
Language:Python15 6 24

wilson1yan's Repositories

wilson1yan/VideoGPT
Language:Jupyter Notebook938 23 37107
wilson1yan/teco
Language:Python98 5 29
wilson1yan/contrastive-forward-model
Language:Python30 5 68
wilson1yan/povt
Language:Python12 3 10
wilson1yan/i3d-jax
Language:Python2 3 0
wilson1yan/collect-minecraft
Language:Python1 3 0
wilson1yan/Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Language:Python1 1 0
wilson1yan/video2dataset
Easily create large video dataset from video urls
Language:Python1 1 0
wilson1yan/collect-habitat
Language:Python3 0
wilson1yan/cs330
Language:Python3 0
wilson1yan/deepul
Language:Jupyter Notebook1 0
wilson1yan/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python1 0
wilson1yan/habitat-sim
A flexible, high-performance 3D simulator for Embodied AI research.
Language:C++2 0
wilson1yan/htmldate
Fast and robust date extraction from web pages, with Python or on the command-line
Language:Python2 0
wilson1yan/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
Language:Python1 0
wilson1yan/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
Language:Python1 0
wilson1yan/long-video-gan
Official PyTorch implementation of LongVideoGAN
Language:Python2 0
wilson1yan/LongChat
Official repository for LongChat and LongEval
Language:Python1 0
wilson1yan/Megatron-LM
Ongoing research training transformer models at scale
Language:Python1 0
wilson1yan/mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
Language:Python0 0
wilson1yan/MMMU
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
Language:Python1 0
wilson1yan/ninjax
General Modules for JAX
Language:Python
wilson1yan/RaMViD
Language:Python2 0
wilson1yan/shell_scripts
Language:Shell3 0
wilson1yan/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
Language:Jupyter Notebook2 0
wilson1yan/TATS
Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)
Language:Python1 0
wilson1yan/trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Language:Python2 0
wilson1yan/tux
Tools and Utils for Experiments (TUX). Modified from many others' code to fit my needs.
Language:Python1 0
wilson1yan/Valley
The official repository of "Video assistant towards large language model makes everything easy"
Language:Python1 0
wilson1yan/wilson1yan.github.io
Language:HTML2 0