CatherineZhou

CatherineZhou's Stars

PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Language:Python42.9k 440 9.3k7.7k
lllyasviel/ControlNet
Let us control diffusion models!
Language:Python29.9k 217 5432.7k
google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
Language:C++26.9k 496 5.1k5.1k
iperov/DeepFaceLive
Real-time face swap for PC streaming or video calls
Language:Python26.2k 364 1444.5k
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language:Python18.2k 109 1.2k1.8k
lllyasviel/style2paints
sketch + style = paints :art: (TOG2018/SIGGRAPH2018ASIA)
Language:JavaScript18k 561 2122.1k
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Language:Python11.6k 83 1181k
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Language:Python11.6k 121 688960
netease-youdao/QAnything
Question and Answer based on Anything.
Language:Python11.5k 102 4001.1k
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language:Python7k 44 993508
PeterL1n/BackgroundMattingV2
Real-Time High-Resolution Background Matting
Language:Python6.8k 148 195950
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Language:Jupyter Notebook5.8k 86 143577
YaoFANGUK/video-subtitle-remover
基于AI的图片/视频硬字幕去除、文本水印去除，无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API，本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
Language:Python4k 31 81526
yisol/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Language:Python3.7k 56 148583
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Language:Jupyter Notebook3.4k 46 165282
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Language:Python3.1k 40 151323
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Language:Python2.5k 49 186303
facebookresearch/DPR
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
Language:Python1.7k 23 210299
jrottenberg/ffmpeg
Docker build for FFmpeg on Ubuntu / Alpine / Centos / Scratch / nvidia / vaapi
Language:Dockerfile1.4k 48 193454
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Language:Python1.3k 62 221146
RUC-NLPIR/FlashRAG
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
Language:Python1.1k 10 7684
hao-ai-lab/LookaheadDecoding
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Language:Python1.1k 11 5564
wuhuikai/FaceSwap
Swap face between two photos.
Language:Python722 20 37224
JinhuaLiang/WavCraft
Official repo for WavCraft, an AI agent for audio creation and editing
Language:Python648 71 396
Hujiazeng/Vach
Real time streaming talking head
Language:Python414 9 2358
hao-ai-lab/Consistency_LLM
[ICML 2024] CLLMs: Consistency Large Language Models
Language:Python339 9 1016
HuskyInSalt/CRAG
Corrective Retrieval Augmented Generation
Language:Python275 9 2226
eric-ai-lab/swap-anything
"SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"
208 32 15
starsuzi/Adaptive-RAG
Language:Jsonnet153 6 1221
NanKeRen2020/UVR5_Linux
ultimate vocal remover application run on linux ubuntu1604
Language:Python51 4 24