AlvinZheng's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
AliaksandrSiarohin/first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
NVlabs/stylegan2
StyleGAN2 - Official TensorFlow Implementation
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
carson-katri/dream-textures
Stable Diffusion built-in to Blender
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
cvg/LightGlue
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
magicleap/SuperGluePretrainedNetwork
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
cvg/Hierarchical-Localization
Visual localization made easy with hloc
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
cszn/KAIR
Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR
wkeeling/selenium-wire
Extends Selenium's Python bindings to give you the ability to inspect requests made by the browser.
YuliangXiu/ICON
[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals
premAI-io/state-of-open-source-ai
:closed_book: Clarity in the current fast-paced mess of Open Source innovation
cvg/GlueStick
Joint Deep Matcher for Points and Lines 🖼️💥🖼️ (ICCV 2023)
mitmedialab/AI-generated-characters
AI-generated-character
sibozhang/Text2Video
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
fcakyon/content-moderation-deep-learning
Deep learning based content moderation from text, audio, video & image input modalities.
mamonraab/Real-Time-Violence-Detection-in-Video-
CheshireCaat/selenium-with-fingerprints
Anonymous automation via selenium with fingerprint replacement technology.
mamonraab/violance-detection-in-video-with-pytroch
geoyee/Imatch-P
A demo using SuperGlue and SuperPoint to do the image matching task based PaddlePaddle.
AI-change-the-world/chinese-poetry
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。