Pinned Repositories
1d-tokenizer
This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation
algorithm-visualizer
:fireworks:Interactive Online Platform that Visualizes Algorithms from Code
AnimateDiff
Official implementation of AnimateDiff.
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
awesome-deep-learning
A curated list of awesome Deep Learning tutorials, projects and communities.
CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
ComfyUI
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
ControlNet
Let us control diffusion models!
data-science-ipython-notebooks
Continually updated data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
DataProcessing
JerryWei1985's Repositories
JerryWei1985/1d-tokenizer
This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation
JerryWei1985/ComfyUI
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
JerryWei1985/generative-ai-for-beginners
12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
JerryWei1985/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
JerryWei1985/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
JerryWei1985/Embodied-AI-Guide
具身智能入门路径&信息总结
JerryWei1985/Embodied_AI_Paper_List
[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI
JerryWei1985/Emu3
Next-Token Prediction is All You Need
JerryWei1985/generative-models
Generative Models by Stability AI
JerryWei1985/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
JerryWei1985/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
JerryWei1985/InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
JerryWei1985/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
JerryWei1985/kohya_ss
JerryWei1985/Kolors
Kolors Team
JerryWei1985/LaVIT
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
JerryWei1985/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
JerryWei1985/manim
Animation engine for explanatory math videos
JerryWei1985/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
JerryWei1985/Open-Sora
Building your own video generation model like OpenAI's Sora
JerryWei1985/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
JerryWei1985/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
JerryWei1985/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
JerryWei1985/seq-monkey-data
JerryWei1985/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
JerryWei1985/StoryDiffusion
Create Magic Story!
JerryWei1985/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
JerryWei1985/video-subtitle-remover
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
JerryWei1985/VideoMamba
VideoMamba: State Space Model for Efficient Video Understanding
JerryWei1985/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters