pia-ai's Stars
alibaba/SmartEngine
SmartEngine is a lightweight business orchestration engine.
alibaba/Elastic-Federated-Learning-Solution
alibaba/nacos
an easy-to-use dynamic service discovery, configuration and service management platform for building cloud native applications.
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
kyutai-labs/moshi
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Stability-AI/generative-models
Generative Models by Stability AI
yael-vinker/CLIPasso
mli/autocut
用文本编辑器剪视频
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
modelscope/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
williechai/speedup-plugin-for-stable-diffusions
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
KwaiVGI/LivePortrait
Bring portraits to life!
deepfakes/faceswap
Deepfakes Software For All
lllyasviel/Paints-UNDO
Understand Human Behavior to Align True Needs
deezer/spleeter
Deezer source separation library including pretrained models.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
roboflow/supervision
We write your reusable computer vision tools. 💜
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
TMElyralab/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
antgroup/echomimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Kingfish404/segment-anything-webui
Yet another SAM webui + CLIP
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
facefusion/facefusion
Industry leading face manipulation platform