Pinned Repositories
ABCPruner
Pytorch implementation of our paper under review -- Channel Pruning via Automatic Structure Search
AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Alibaba DAMO Academy.
awesome-anomaly-detection
A curated list of awesome anomaly detection resources
awesome-face
😎 face releated algorithm, dataset and paper
generate-image
Learning-Deep-Features-for-One-Class-Classification
#Deep One class #Anomaly Detection
Noise2noise
TF_yolov3
This repo for enhacing the performance of yolov3
UIpython
yolact
Improve some features from original github https://github.com/dbolya/yolact
buiduchanh's Repositories
buiduchanh/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Alibaba DAMO Academy.
buiduchanh/buiduchanh
buiduchanh/CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).
buiduchanh/Cloth2Tex
Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On
buiduchanh/CnSTD
CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包
buiduchanh/decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
buiduchanh/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image (uncensored)
buiduchanh/DemoFusion
Let us democratise high-resolution generation! (arXiv 2023)
buiduchanh/DifFace
DifFace: Blind Face Restoration with Diffused Error Contraction (PyTorch)
buiduchanh/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
buiduchanh/FaceRecognizer
人脸识别应用
buiduchanh/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
buiduchanh/FETNet
FETNet: Feature Erasing and Transferring Network for Scene Text Removal
buiduchanh/LivePortrait
Make one portrait alive!
buiduchanh/lossless-cut
The swiss army knife of lossless video/audio editing
buiduchanh/moshi
buiduchanh/nanosam
A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT
buiduchanh/night-enhancement
[ECCV2022] "Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression", https://arxiv.org/abs/2207.10564
buiduchanh/norfair
Lightweight Python library for adding real-time multi-object tracking to any detector.
buiduchanh/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
buiduchanh/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
buiduchanh/pipeless
An open-source computer vision framework to build and deploy apps in minutes without worrying about multimedia pipelines
buiduchanh/RapidOCR
Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle.
buiduchanh/refiners
A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation
buiduchanh/surya
OCR, layout analysis, and line detection in 90+ languages
buiduchanh/VideoMamba
VideoMamba: State Space Model for Efficient Video Understanding
buiduchanh/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
buiduchanh/watsor
Object detection for video surveillance
buiduchanh/YOWOv3
buiduchanh/ZIM
ZIM: Zero-Shot Image Matting for Anything