Pinned Repositories
3d-ken-burns
an implementation of 3D Ken Burns Effect from a Single Image using PyTorch
Attention-ocr-Chinese-Version
Attention OCR Based On Tensorflow
Attention_ocr.pytorch
This repository implements the the encoder and decoder model with attention model for OCR
Awesome-Edge-Detection-Papers
:books: A collection of edge/contour/boundary detection papers.
caffe_ocr
主流ocr算法研究实验性的项目,目前实现了CNN+BLSTM+CTC架构
capsnet-traffic-sign-classifier
A Tensorflow implementation of CapsNet(Capsules Net) apply on german traffic sign dataset
captcha_crack
选字验证码破解,试验过网易和极验,破解率99
CaptchaRecognition
End-to-end variable length Captcha recognition using CNN+RNN+Attention/CTC (pytorch implementation). 端到端的不定长验证码识别
freetype-py
Python binding for the freetype library
tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
eLavin11's Repositories
eLavin11/3d-ken-burns
an implementation of 3D Ken Burns Effect from a Single Image using PyTorch
eLavin11/CRAFT-pytorch
Pytorch implementation of CRAFT text detector
eLavin11/Cross-modal-retrieval
Activity image-based video retrieval
eLavin11/cvat
Powerful and efficient Computer Vision Annotation Tool (CVAT)
eLavin11/Cycle-IR
This is a Tensorflow implementation of Cycle-IR approach for content-aware image retargeting.
eLavin11/Decoupled-attention-network
Pytorch implementation for "Decoupled attention network for text recognition".
eLavin11/DeepNude-an-Image-to-Image-technology
DeepNude's algorithm and general image generation theory and practice research, including pix2pix, CycleGAN, UGATIT, DCGAN, SinGAN and VAE models (TensorFlow2 implementation). DeepNude的算法以及通用GAN图像生成的理论与实践研究。
eLavin11/DewarpNet
Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)
eLavin11/eLavin11.github.io
eLavin11/fast-bert
Super easy library for BERT based NLP models
eLavin11/ghostnet
[CVPR2020] Surpassing MobileNetV3: "GhostNet: More Features from Cheap Operations"
eLavin11/Hand-CNN
ICCV 2019, Hand Detection
eLavin11/InGAN
Official code for the paper "InGAN: Capturing and Retargeting the DNA of a Natural Image"
eLavin11/keras-bert
Implementation of BERT that could load official pre-trained models for feature extraction and prediction
eLavin11/lost
Label Objects and Save Time (LOST) - Design your own smart Image Annotation process in a web-based environment.
eLavin11/moco.tensorflow
A TensorFlow re-implementation of Momentum Contrast (MoCo): https://arxiv.org/abs/1911.05722
eLavin11/moviepy
Video editing with Python
eLavin11/ocr_invoice
ocr system of invoice
eLavin11/OIDv4_ToolKit
Download and visualize single or multiple classes from the huge Open Images v4 dataset
eLavin11/OpenSelfSup
Self-Supervised Learning Toolbox and Benchmark
eLavin11/PhotoDemon
A free portable photo editor focused on pro-grade features, high performance, and maximum usability.
eLavin11/pycorrector
pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.
eLavin11/retinanet-examples
Fast and accurate object detection with end-to-end GPU optimization
eLavin11/RxFFmpeg
🔥RxFFmpeg 是基于 ( FFmpeg 4.0 + X264 + mp3lame + fdk-aac ) 编译的适用于 Android 平台的音视频编辑、视频剪辑的快速处理框架,包含以下功能(视频拼接,转码,压缩,裁剪,片头片尾,分离音视频,变速,添加静态贴纸和gif动态贴纸,添加字幕,添加滤镜,添加背景音乐,加速减速视频,倒放音视频,音频裁剪,变声,混音,图片合成视频,视频解码图片等主流特色功能
eLavin11/shotcut
cross-platform (Qt), open-source (GPLv3) video editor
eLavin11/sketch-code
Keras model to generate HTML code from hand-drawn website mockups. Implements an image captioning architecture to drawn source images.
eLavin11/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
eLavin11/Video-to-Online-Platform
An intelligent multimodal-learning based system for video, product and ads analysis. Based on the system, people can build a lot of downstream applications such as product recommendation, video retrieval, etc.
eLavin11/video_analyst
A series of basic algorithms that are useful for video understanding, including Single Object Tracking (SOT), Video Object Segmentation (VOS) and so on.
eLavin11/vmaf
Perceptual video quality assessment based on multi-method fusion.