Pinned Repositories
APILayerKit
An abstraction layer for an app's APIs.
csv-localizer
Removes the pain of copy-pasting strings from a localized string list: put them in a CSV file and convert it to iOS, Android, or JSON localizable strings.
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
EasyRequest
an easy request
EventDriven-Programming
Event-driven programming.
Exhibition
A highly experimental 3D room layout for a gallery that aims to show exhibition details in an interesting way.
IosInstructure
A generic iOS app template: API abstraction, data persistence, ....
Irecorder
Audio recorder.
jbot
Make Slack and Facebook Bots in Java.
ML
Machine learning
AlvinZheng's Repositories
AlvinZheng/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
AlvinZheng/peclr
This is the pretraining code for PeCLR, an equivariant contrastive learning framework for 3D hand pose estimation. The paper was presented at ICCV 2021.
AlvinZheng/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
AlvinZheng/3D-art-gallery
This is an interactive 3D art gallery made with Three.js, perfect for artists or designers to exhibit their portfolio of artworks and projects.
AlvinZheng/AI-generated-characters
AI-generated-character
AlvinZheng/chinese-poetry
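The most comprehensive database of classical Chinese poetry 🧶: nearly 14,000 poets from the Tang and Song dynasties, close to 55,000 Tang poems and 260,000 Song poems, plus 1,564 ci poets of the Song period and 21,050 ci.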
AlvinZheng/first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
AlvinZheng/ICON
[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals
AlvinZheng/Imatch-P
A demo that uses SuperGlue and SuperPoint for image matching, based on PaddlePaddle.
AlvinZheng/KAIR
Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR
AlvinZheng/LightGlue
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
AlvinZheng/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
AlvinZheng/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
AlvinZheng/python-qrcode
Python QR Code image generator
AlvinZheng/Real-Time-Violence-Detection-in-Video-
AlvinZheng/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
AlvinZheng/SimSwap
An arbitrary face-swapping framework on images and videos with one single trained model!
AlvinZheng/stylegan2
StyleGAN2 - Official TensorFlow Implementation
AlvinZheng/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
AlvinZheng/Swin-Transformer-Semantic-Segmentation
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Semantic Segmentation.
AlvinZheng/SwinTransformer
torch implementation of SwinTransformer
AlvinZheng/Text2Video
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
AlvinZheng/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
AlvinZheng/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
AlvinZheng/undetected-chromedriver
Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva / DataDome / Cloudflare IUAM)
AlvinZheng/VGen
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
AlvinZheng/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
AlvinZheng/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.
AlvinZheng/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
AlvinZheng/yolov5_obb
yolov5 + csl_label. (Oriented Object Detection) (Rotation Detection) (Rotated BBox) Rotated object detection based on yolov5.