gfl699468's Stars
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
dockur/windows
Windows inside a Docker container.
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
mifi/lossless-cut
The swiss army knife of lossless video/audio editing
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
netbox-community/netbox
The premier source of truth powering network automation. Open source under Apache 2. Try NetBox Cloud free: https://netboxlabs.com/free-netbox-cloud/
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
yorukot/superfile
Pretty fancy and modern terminal file manager
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
lllyasviel/Omost
Your image is almost there!
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
yisol/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
truefoundry/cognita
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
nerfstudio-project/gsplat
CUDA accelerated rasterization of gaussian splatting
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
lichao-sun/Mora
Mora: More like Sora for Generalist Video Generation
egzumer/uv-k5-firmware-custom
A merge between https://github.com/OneOfEleven/uv-k5-firmware-custom and https://github.com/fagci/uv-k5-firmware-fagci-mod
TencentQQGYLab/ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
JackAILab/ConsistentID
Customized ID Consistent for human
HuiZeng/Image-Adaptive-3DLUT
Learning Image-adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-time
sergeytulyakov/mocogan
MoCoGAN: Decomposing Motion and Content for Video Generation
jnjaby/KEEP
[ECCV'24] Kalman-Inspired Feature Propagation for Video Face Super-Resolution
tensorflower/seetaFace6Python
简单、快速搞定人脸识别应用,觉得有帮助,给个start吧!
e2b-dev/awesome-devins
Awesome Devin-inspired AI agents
TencentQQGYLab/LinguaLinker
LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement
mulns/PerVFI
Official code base of "Perception-Oriented Video Frame Interpolation via Asymmetric Blending" (CVPR 2024), also denoted as ''PerVFI''.