Pinned Repositories
analysis-synthesis-deblurring
BSRGAN-PyTorch
CVPR2021-Transformer-and-Low-level-Vision
DSG
This project is the official implementation of our paper Diverse Sample Generation: Pushing the Limit of Data-free Quantization.
easyportrait
EasyPortrait - Face Parsing and Portrait Segmentation Dataset
EffectiveModernCppChinese
Chinese translation of Effective Modern C++ (completed)
FasterNet
[CVPR 2023] Code release for PConv and FasterNet
GAN-Slimming
[ECCV 2020] "All-in-One GAN Compression by Unified Optimization" by Haotao Wang, Shupeng Gui, Haichuan Yang, Ji Liu, and Zhangyang Wang
KDFReVS-3D
Repository for "A Distillation Framework for Accurate and Robust Single View 3D Face Reconstruction with Video Supervision"
Towards-Compact-CNNs-via-Collaborative-Compression
liuguoyou's Repositories
liuguoyou/resemble-enhance
AI powered speech denoising and enhancement
liuguoyou/VIM
liuguoyou/adetailer
Auto detecting, masking and inpainting with detection model.
liuguoyou/Af-DCD
The official project website of "Augmentation-free Dense Contrastive Distillation for Efficient Semantic Segmentation" (Af-DCD for short, accepted to NeurIPS 2023).
liuguoyou/anything_in_anyscene
liuguoyou/BiMatting
This project is the official implementation of our accepted NeurIPS 2023 paper BiMatting: Efficient Video Matting via Binarization.
liuguoyou/BVI-VFI-database
[IEEE TIP'2023] "BVI-VFI: A Video Quality Database for Video Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull
liuguoyou/CA-SUM-360
A PyTorch implementation of our method from "An Integrated System for Spatio-Temporal Summarization of 360-degrees Videos", Proc. MMM 2024
liuguoyou/COMM
PyTorch code for the paper "From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models"
liuguoyou/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
liuguoyou/EResFD
Lightweight Face Detector from CLOVA
liuguoyou/FastLLVE
FastLLVE: Real-Time Low-Light Video Enhancement with Intensity-Aware Lookup Table (ACM MM 2023)
liuguoyou/frame-interpolation-pytorch
PyTorch implementation of FILM: Frame Interpolation for Large Motion, in ECCV 2022.
liuguoyou/gpt_academic
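A practical interactive interface for ChatGPT/GLM, optimized in particular for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; analysis and self-explanation of Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm2. Compatible with Wenxin Yiyan (ERNIE Bot), moss, llama2, rwkv, claude2, Tongyi Qianwen, InternLM, iFlytek Spark, and others.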
liuguoyou/HybridSORT
[AAAI 2024] Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking
liuguoyou/Lightweight-Face-Detector-Pruning
Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geometric Median Criterion", Proc. IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW 2024), Waikoloa, Hawaii, USA, Jan. 2024.
liuguoyou/Matting-Anything
Matting Anything Model (MAM), an efficient and versatile framework for estimating the alpha matte of any instance in an image with flexible and interactive visual or linguistic user prompt guidance.
liuguoyou/Metric3D
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image"
liuguoyou/MFT
MFT: Long-Term Tracking of Every Pixel -- code for the WACV 2024 paper
liuguoyou/MISO-VFI
Official implementation of "A Multi-In-Single-Out Network for Video Frame Interpolation without Optical Flow"
liuguoyou/MobileSAM-pytorch
Reproduction of MobileSAM using PyTorch
liuguoyou/modular-memorability
Official implementation for the CVPR 2023 paper "Modular memorability: tiered representations for video memorability prediction"
liuguoyou/PixArt-alpha
Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
liuguoyou/sd-webui-fastblend
Make videos smooth!
liuguoyou/SlimSAM
SlimSAM: 0.1% Data Makes Segment Anything Slim
liuguoyou/ssmnet_ISMIR2023
liuguoyou/syenet
SYENet: A Simple Yet Effective Network for Multiple Low-Level Vision Tasks with Real-Time Performance on Mobile Device, in ICCV 2023
liuguoyou/terminaltexteffects
Visual effects applied to text in the terminal.
liuguoyou/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
liuguoyou/XMem2
A tool for efficient semi-supervised video object segmentation (great results with minimal manual labor) and a dataset for benchmarking