wilson1yan
PhD student interested in generative modeling and representation learning
University of California, BerkeleyBerkeley, CA
Pinned Repositories
LWM
deepul
contrastive-forward-model
cs294-158-ssl
lpips-jax
povt
rlpyt
Reinforcement Learning in PyTorch
teco
VideoGPT
VideoGPT-Paper
wilson1yan's Repositories
wilson1yan/VideoGPT
wilson1yan/teco
wilson1yan/contrastive-forward-model
wilson1yan/povt
wilson1yan/i3d-jax
wilson1yan/collect-minecraft
wilson1yan/Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
wilson1yan/video2dataset
Easily create large video dataset from video urls
wilson1yan/collect-habitat
wilson1yan/cs330
wilson1yan/deepul
wilson1yan/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
wilson1yan/habitat-sim
A flexible, high-performance 3D simulator for Embodied AI research.
wilson1yan/htmldate
Fast and robust date extraction from web pages, with Python or on the command-line
wilson1yan/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
wilson1yan/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
wilson1yan/long-video-gan
Official PyTorch implementation of LongVideoGAN
wilson1yan/LongChat
Official repository for LongChat and LongEval
wilson1yan/Megatron-LM
Ongoing research training transformer models at scale
wilson1yan/mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
wilson1yan/MMMU
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
wilson1yan/ninjax
General Modules for JAX
wilson1yan/RaMViD
wilson1yan/shell_scripts
wilson1yan/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
wilson1yan/TATS
Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)
wilson1yan/trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
wilson1yan/tux
Tools and Utils for Experiments (TUX). Modified from many others' code to fit my needs.
wilson1yan/Valley
The official repository of "Video assistant towards large language model makes everything easy"
wilson1yan/wilson1yan.github.io