Pinned Repositories
2D_audio_driven_digital_human
第十四届全国大学生服务外包创新创业大赛企业类命题A15-2d虚拟人语音驱动算法
4K4D
4K4D: Real-Time 4D View Synthesis at 4K Resolution
AI-
AI视频创作,开发使用python支持多国语音配音,ffmpeg+openai-whisper+tts,项目仅作参考,技术探讨研究,拒绝非法使用
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
AnimeSR
Codes for "AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos"
Awesome-Video-Diffusion-Models
[Arxiv] A Survey on Video Diffusion Models
Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 16K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs, including 16K long context models)
CogVideo
Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"
CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Mr-Harry's Repositories
Mr-Harry/4K4D
4K4D: Real-Time 4D View Synthesis at 4K Resolution
Mr-Harry/DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Mr-Harry/generative-models
Generative Models by Stability AI
Mr-Harry/EMO
Mr-Harry/IC-Light
More relighting!
Mr-Harry/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
Mr-Harry/interactdiffusion
[CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".
Mr-Harry/Latte
The official implementation of Latte: Latent Diffusion Transformer for Video Generation.
Mr-Harry/LGM
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
Mr-Harry/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Mr-Harry/LWM
Mr-Harry/mamba
A simple and efficient Mamba implementation in PyTorch and MLX.
Mr-Harry/MambaTransformer
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
Mr-Harry/minisora
The Mini Sora project aims to explore the implementation path and future development direction of Sora.
Mr-Harry/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
Mr-Harry/OneTo3D
OneTo3D: One Image to Editable Dynamic 3D Model and Video Generation
Mr-Harry/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Mr-Harry/Open-Sora-Plan
This project aim to reproducing Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
Mr-Harry/PixArt-alpha
Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Mr-Harry/ragas
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
Mr-Harry/Retinexformer
"Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement" (ICCV 2023)
Mr-Harry/sitcom-simulator
A tool that combines ChatGPT, Stable Diffusion, FakeYou, and FreePD to create AI-generated videos.
Mr-Harry/StableCascade
Official Code for Stable Cascade
Mr-Harry/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Mr-Harry/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Mr-Harry/sunotoapi
将 sunoAi web转成 openai 格式进行调用
Mr-Harry/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Mr-Harry/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
Mr-Harry/VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Mr-Harry/WonderJourney