JimLee4530

postgraduate at CS Department, HangZhou Dianzi University.

Media Intelligence Laboratory(MIL@HDU)HangZhou,China

JimLee4530's Stars

yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
Language:Python83.6k 501 7.8k6.5k
tiangolo/fastapi
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Language:Python74k 674 3.4k6.2k
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python54.4k 448 1325.6k
xai-org/grok-1
Grok open release
Language:Python49.5k 562 2098.3k
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Language:Python44k 899 6335.2k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python21.7k 185 4872.1k
harry0703/MoneyPrinterTurbo
利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.
Language:Python16.3k 135 3792.6k
cumulo-autumn/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Language:Python9.5k 79 116683
xxlllq/system_architect
:100: 2024年系统架构设计师（软考高级）备考资料。
Language:HTML6.6k 196 01.8k
levihsu/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Language:Python5.4k 74 203802
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Language:Python4.5k 61 183569
VAST-AI-Research/TripoSR
Language:Python4.4k 49 99505
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language:Python4.2k 247 115517
openai/transformer-debugger
Language:Python4k 25 14233
alibaba-damo-academy/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python4k 48 841456
geekan/scrapy-examples
Multifarious Scrapy examples. Spiders for alexa / amazon / douban / douyu / github / linkedin etc.
Language:Python3.2k 233 151k
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Language:Python2.5k 49 186305
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Language:Python2.4k 33 116254
harlanhong/awesome-talking-head-generation
1.4k 75 4110
Picsart-AI-Research/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Language:Python1.4k 42 56141
AILab-CVC/UniRepLKNet
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
Language:Python904 12 1953
mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
842 58 1333
sail-sg/MDT
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
Language:Python511 18 5038
TIGER-AI-Lab/AnyV2V
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks"
Language:Jupyter Notebook462 17 1034
MStypulkowski/diffused-heads
Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Language:Python461 85 2631
whlzy/FiT
[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model
358 32 77
TIGER-AI-Lab/ConsistI2V
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)
Language:Python202 16 2314
PatrickZH/DeepCore
Code for coreset selection methods
Language:Python201 4 1638
johndpope/Emote-hack
Emote Portrait Alive - using ai to reverse engineer code from white paper. (abandoned)
Language:Python168 21 367
AGI-Edgerunners/IIL
Code for our Paper "All in an Aggregated Image for In-Image Learning"
Language:Python27 1 00

JimLee4530

JimLee4530's Stars

yt-dlp/yt-dlp

tiangolo/fastapi

labmlai/annotated_deep_learning_paper_implementations

xai-org/grok-1

geekan/MetaGPT

hpcaitech/Open-Sora

harry0703/MoneyPrinterTurbo

cumulo-autumn/StreamDiffusion

xxlllq/system_architect

levihsu/OOTDiffusion

Zejun-Yang/AniPortrait

VAST-AI-Research/TripoSR

fudan-generative-vision/champ

openai/transformer-debugger

alibaba-damo-academy/FunASR

geekan/scrapy-examples

TMElyralab/MuseTalk

TMElyralab/MuseV

harlanhong/awesome-talking-head-generation

Picsart-AI-Research/StreamingT2V

AILab-CVC/UniRepLKNet

mayuelala/FollowYourClick

sail-sg/MDT

TIGER-AI-Lab/AnyV2V

MStypulkowski/diffused-heads

whlzy/FiT

TIGER-AI-Lab/ConsistI2V

PatrickZH/DeepCore

johndpope/Emote-hack

AGI-Edgerunners/IIL