king159
CS PhD at Chinese University of Hong Kong (CUHK)
The Chinese University of Hong KongHongKong, China
king159's Stars
ChenglongMa/zoplicate
A plugin that does one thing only: Detect and manage duplicate items in Zotero.
TencentARC/Open-MAGVIT2
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
cshw2021/Learned-Image-Video-Compression
A collection of papers related to image and video compression
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
kovidgoyal/calibre
The official source code repository for the calibre ebook manager
jmliu206/LIC_TCM
TQTQliu/MVSGaussian
MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo
yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
huggingface/lerobot
🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch
githubnext/monaspace
An innovative superfamily of fonts for code
lllyasviel/IC-Light
More relighting!
lucidrains/mmdit
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
lucidrains/magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
meta-llama/llama3
The official Meta Llama 3 GitHub site
king159/svd-mv
Unofficial Implementation of "Stable Video Diffusion Multi-View"
shulin16/MMInA
Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"
snap-research/Panda-70M
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
ziqihuangg/Awesome-Evaluation-of-Visual-Generation
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
nashsu/FreeAskInternet
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU needed. The user can ask a question and the system will make a multi engine search and combine the search result to LLM and generate the answer based on search results. It's all FREE to use.
astral-sh/ruff
An extremely fast Python linter and code formatter, written in Rust.
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
harry0703/MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
QY-H00/attention-interpolation-diffusion
Interpolation Between Text-to-Image Generation!
jabir-zheng/TCD
Official Repository of the paper "Trajectory Consistency Distillation"