Pinned Repositories
AdvancedEAST
AdvancedEAST is an algorithm for scene image text detection. It is primarily based on EAST, with significant improvements that make long-text predictions more accurate.
algo
Set up a personal IPSEC VPN in the cloud
alpr-unconstrained
License Plate Detection and Recognition in Unconstrained Scenarios
aster
Recognizing cropped text in natural images.
clandmark
Open Source Landmarking Library
ComputeLibrary
The ARM Computer Vision and Machine Learning library is a set of functions optimised for both ARM CPUs and GPUs using SIMD technologies.
xshhhm's Repositories
xshhhm/AdvancedEAST
AdvancedEAST is an algorithm for scene image text detection. It is primarily based on EAST, with significant improvements that make long-text predictions more accurate.
xshhhm/algo
Set up a personal IPSEC VPN in the cloud
xshhhm/alpr-unconstrained
License Plate Detection and Recognition in Unconstrained Scenarios
xshhhm/aster
Recognizing cropped text in natural images.
xshhhm/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
xshhhm/Bytedance_ICME_challenge
xshhhm/cascade-rcnn
Caffe implementation of multiple popular object detection frameworks
xshhhm/cnn_lstm_ctc_ocr_for_ICPR
Forked from weinman/cnn_lstm_ctc_ocr for the ICPR MTWI 2018 challenge 1
xshhhm/DeepInterestNetwork
xshhhm/DensePose
A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body
xshhhm/enas
TensorFlow Code for paper "Efficient Neural Architecture Search via Parameter Sharing"
xshhhm/Flow-Guided-Feature-Aggregation
Flow-Guided Feature Aggregation for Video Object Detection
xshhhm/GRU4Rec
GRU4Rec is the original Theano implementation of the algorithm in "Session-based Recommendations with Recurrent Neural Networks" paper, published at ICLR 2016 and its follow-up "Recurrent Neural Networks with Top-k Gains for Session-based Recommendations". The code is optimized for execution on the GPU.
xshhhm/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
xshhhm/mace
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
xshhhm/NeuralBabyTalk
PyTorch code for our CVPR 2018 paper "Neural Baby Talk"
xshhhm/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
xshhhm/PocketFlow
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
xshhhm/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
xshhhm/Recommenders
Best Practices on Recommendation Systems
xshhhm/roop
one-click deepfake (face swap)
xshhhm/Summary-of-Recommender-System-Papers
A categorized summary of the recommender system papers I have read, continuously updated…
xshhhm/TextRecognitionDataGenerator
A synthetic data generator for text recognition
xshhhm/textspotter
xshhhm/the-algorithm
Source code for Twitter's Recommendation Algorithm
xshhhm/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
xshhhm/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
xshhhm/vid2vid
PyTorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.
xshhhm/video-nonlocal-net
Non-local Neural Networks for Video Classification
xshhhm/x-deeplearning
An industrial deep learning framework for high-dimension sparse data