MikeWangWZHL

CS PhD at UIUC, Research Assistant at BLENDER Lab advised by Prof. Heng Ji | Intern at Tencent AI Lab | Intern at MSRA

UIUC | Champaign, Illinois

Pinned Repositories

acl-anthology
Data and software for building the ACL Anthology.
Language: Python | 10
Aida_COVID
Repo for Aida Covid Hackathon src
Language: Smalltalk | 1 2 00
EEG-To-Text
Code for AAAI 2022 paper "Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot Sentiment Classification"
Language: Python | 142 9 1132
Multitask-Finetuning_CLIP
Code for paper "Rethinking Task Sampling for Few-shot Vision-Language Transfer Learning" COLING 2022 workshop
Language: Python | 3 3 01
Paxion
Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" NeurIPS 2023 Spotlight
Language: Python | 31 1 02
Solo-Performance-Prompting
Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"
Language: Python | 294 3 427
VDLM
Repo for paper: Text-based Reasoning About Vector Graphics
Language: Python | 16 1 01
VidIL
PyTorch code for "Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners"
Language: Python | 110 5 112
Wikinews_Pipeline
Get news from Wikipedia page's reference section
Language: Python | 30
Zemi
Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings
Language: Python | 16 4 10

MikeWangWZHL's Repositories

MikeWangWZHL/Solo-Performance-Prompting
Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"
Language: Python | 294 3 427
MikeWangWZHL/EEG-To-Text
Code for AAAI 2022 paper "Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot Sentiment Classification"
Language: Python | 142 9 1132
MikeWangWZHL/VidIL
PyTorch code for "Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners"
Language: Python | 110 5 112
MikeWangWZHL/Paxion
Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" NeurIPS 2023 Spotlight
Language: Python | 31 1 02
MikeWangWZHL/VDLM
Repo for paper: Text-based Reasoning About Vector Graphics
Language: Python | 16 1 01
MikeWangWZHL/Zemi
Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings
Language: Python | 16 4 10
MikeWangWZHL/Multitask-Finetuning_CLIP
Code for paper "Rethinking Task Sampling for Few-shot Vision-Language Transfer Learning" COLING 2022 workshop
Language: Python | 3 3 01
MikeWangWZHL/Wikinews_Pipeline
Get news from Wikipedia page's reference section
Language: Python | 30
MikeWangWZHL/MikeWangWZHL.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language: JavaScript | 1 1 0
MikeWangWZHL/1d-tokenizer
This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation
MikeWangWZHL/alfworld-docker-setup
MikeWangWZHL/Cutie
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
Language: Python | 0 0
MikeWangWZHL/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Language: Python | 1 01
MikeWangWZHL/Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language: Jupyter Notebook
MikeWangWZHL/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Python | 1 0
MikeWangWZHL/LaVIT
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
Language: Jupyter Notebook | 0 0
MikeWangWZHL/LLaVA
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.
Language: Python | 0 0
MikeWangWZHL/MathVista
MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts
Language: Jupyter Notebook | 0 0
MikeWangWZHL/maze-dataset
maze datasets for investigating OOD behavior of ML systems
Language: Jupyter Notebook
MikeWangWZHL/MiniGPT4-video
MikeWangWZHL/parti-pytorch
Implementation of Parti, Google's pure attention-based text-to-image neural network, in PyTorch
MikeWangWZHL/rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
Language: Jupyter Notebook
MikeWangWZHL/sam-hq
Segment Anything in High Quality [NeurIPS 2023]
Language: Python | 0 0
MikeWangWZHL/self-refine
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
MikeWangWZHL/singularity
Official PyTorch code for Singularity model in the paper "Revealing Single Frame Bias for Video-and-Language Learning"
MikeWangWZHL/Tracking-Anything-with-DEVA
Forked from paper [ICCV 2023] Tracking Anything with Decoupled Video Segmentation
Language: Python
MikeWangWZHL/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language: Python | 0 0
MikeWangWZHL/Video-ChatGPT
"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
MikeWangWZHL/viper
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
Language: Jupyter Notebook | 0 0
MikeWangWZHL/VQGAN-LC