kxgong's Stars
lllyasviel/Paints-UNDO
Understand Human Behavior to Align True Needs
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
KwaiVGI/LivePortrait
Bring portraits to life!
kijai/ComfyUI-LivePortraitKJ
ComfyUI nodes for LivePortrait
huggingface/diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
krennic999/STAR
STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
voxel51/fiftyone
Refine high-quality datasets and visual AI models
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
sczhou/CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
OpenImagingLab/LenslessFace
LenslessFace : An End-to-End Optimized Lensless System for Privacy-Preserving Face Verification
Picsart-AI-Research/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
pytorch/torchtune
PyTorch native post-training library
pytorch/torchtitan
A native PyTorch Library for large model training
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
er-muyue/BeMapNet
harry0703/MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
xai-org/grok-1
Grok open release
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
guoyww/AnimateDiff
Official implementation of AnimateDiff.
csuhan/OneLLM
[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language
tim-learn/awesome-test-time-adaptation
Collection of awesome test-time (domain/batch/instance) adaptation methods
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
Alpha-VLLM/WeMix-LLM
rosinality/vq-vae-2-pytorch
Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch
invictus717/MetaTransformer
Meta-Transformer for Unified Multimodal Learning
Chanzhaoyu/chatgpt-web
用 Express 和 Vue3 搭建的 ChatGPT 演示网页
jrzaurin/tabulardl-benchmark
Benchmark tabular Deep Learning models against each other and other non-DL techniques
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework