Pinned Repositories
audio-annotator
Audio annotation tool
augmix
AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty
BeamerStyleSlides
🌈 A collection of Beamer-style slide templates, available in both PowerPoint and Keynote formats.
classifier-balancing
This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020
CONTRIQUE
Official implementation for "Image Quality Assessment using Contrastive Learning"
ConvNeXt
Code release for ConvNeXt model
CRNN_Chinese_Characters_Rec
(CRNN) Chinese Characters Recognition.
Decoupled-attention-network
PyTorch implementation for "Decoupled attention network for text recognition".
examples
TensorFlow examples
OCR_DataSet
A collection of OCR datasets, organized and converted to a unified annotation format for use in experiments
WPU93's Repositories
WPU93/augmix
AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty
WPU93/CONTRIQUE
Official implementation for "Image Quality Assessment using Contrastive Learning"
WPU93/ConvNeXt
Code release for ConvNeXt model
WPU93/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
WPU93/examples
TensorFlow examples
WPU93/generators-with-stylegan2
Here is a series of face generators based on StyleGAN2
WPU93/headnerf
PyTorch implementation of HeadNeRF
WPU93/HowToCook
A programmer's guide to cooking at home (Chinese only).
WPU93/Implicit-feature-alignment
PyTorch implementation for "Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter".
WPU93/Long-Tailed-Recognition.pytorch
[NeurIPS 2020] This project provides a strong single-stage baseline for Long-Tailed Classification, Detection, and Instance Segmentation (LVIS). It is also a PyTorch implementation of the NeurIPS 2020 paper 'Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect'.
WPU93/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
WPU93/open-pose-editor
Online 3D OpenPose editor for Stable Diffusion and ControlNet
WPU93/OpenChatKit
WPU93/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
WPU93/photometric_optimization
Photometric optimization code for creating the FLAME texture space and other applications
WPU93/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXT, DPN, EfficientNet, MixNet, MobileNet-V3/V2, MNASNet, Single-Path NAS, FBNet, and more
WPU93/pytorch-metric-learning
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
WPU93/pytorch3d
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
WPU93/SAR_TF
An implementation of Show, Attend and Read in TensorFlow
WPU93/Scene-Text-Recognition
WPU93/Server
A personally maintained version of PanDownload
WPU93/Text-To-Video-Finetuning
Finetune ModelScope's Text To Video model using Diffusers 🧨
WPU93/the-algorithm
Source code for Twitter's Recommendation Algorithm
WPU93/TumbleOCR
WPU93/VAN-Classification
WPU93/Video-P2P
Video-P2P: Video Editing with Cross-attention Control
WPU93/VNN
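VNN is a high-performance, lightweight neural network deployment framework from Joyy Inc. It currently provides more than 20 AI capabilities to apps such as Hago, VOO, VFly, and 马克相机, covering entertainment scenarios such as live streaming, short video, and video editing, as well as engineering scenarios.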
WPU93/WPU93
Config files for my GitHub profile.
WPU93/yn
A Hackable Markdown Note Application for Programmers. Documents encryption, code snippet running, integrated terminal, chart embedding, HTML applets, plug-in, and macro replacement.
WPU93/yolov5-face
YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931)