AlvinZheng

AlvinZheng's Stars

huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python134k 1.1k 15.9k26.7k
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python69.9k 573 08.2k
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook47.2k 305 6645.6k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python34.7k 287 1.1k4.2k
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language:Python25.6k 199 4.1k5.3k
AliaksandrSiarohin/first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
Language:Jupyter Notebook14.5k 353 5313.2k
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Language:Python13.8k 126 3142k
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python12.3k 101 554862
NVlabs/stylegan2
StyleGAN2 - Official TensorFlow Implementation
Language:Python11k 371 02.5k
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Language:Python10.5k 167 6592.3k
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Language:Python10.4k 104 1461.1k
carson-katri/dream-textures
Stable Diffusion built-in to Blender
Language:Python7.8k 110 544425
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Language:Python6.5k 72 244962
cvg/LightGlue
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
Language:Python3.4k 48 108323
magicleap/SuperGluePretrainedNetwork
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
Language:Python3.3k 56 140668
cvg/Hierarchical-Localization
Visual localization made easy with hloc
Language:Python3.2k 88 306584
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Language:Python2.9k 33 134260
cszn/KAIR
Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR
Language:Python2.9k 47 163629
wkeeling/selenium-wire
Extends Selenium's Python bindings to give you the ability to inspect requests made by the browser.
Language:Python1.9k 25 629254
YuliangXiu/ICON
[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals
Language:Python1.6k 42 236218
premAI-io/state-of-open-source-ai
:closed_book: Clarity in the current fast-paced mess of Open Source innovation
Language:TeX1.5k 23 4189
cvg/GlueStick
Joint Deep Matcher for Points and Lines 🖼️💥🖼️ (ICCV 2023)
Language:Jupyter Notebook555 16 2944
mitmedialab/AI-generated-characters
AI-generated-character
Language:Jupyter Notebook455 20 10107
sibozhang/Text2Video
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
Language:Python421 11 2292
fcakyon/content-moderation-deep-learning
Deep learning based content moderation from text, audio, video & image input modalities.
309 5 018
mamonraab/Real-Time-Violence-Detection-in-Video-
Language:Jupyter Notebook123 4 757
CheshireCaat/selenium-with-fingerprints
Anonymous automation via selenium with fingerprint replacement technology.
Language:JavaScript79 3 2413
mamonraab/violance-detection-in-video-with-pytroch
Language:Python56 3 617
geoyee/Imatch-P
A demo using SuperGlue and SuperPoint to do the image matching task based PaddlePaddle.
Language:Python21 1 14
AI-change-the-world/chinese-poetry
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人，21050首词。
Language:JavaScript21