ndcuong91's Stars
lutzroeder/netron
Visualizer for neural network, deep learning and machine learning models
aleju/imgaug
Image augmentation for machine learning experiments.
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
yemount/pose-animator
open-mmlab/mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
google/automl
Google Brain AutoML
Kaggle/kaggle-api
Official Kaggle API
zylo117/Yet-Another-EfficientDet-Pytorch
A PyTorch re-implementation of the official EfficientDet with SOTA real-time performance and pretrained weights.
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Belval/TextRecognitionDataGenerator
A synthetic data generator for text recognition
kba/awesome-ocr
Links to awesome OCR projects
ahupp/python-magic
A python wrapper for libmagic
facebookresearch/av_hubert
A self-supervised learning framework for audio-visual speech
zhang0jhon/AttentionOCR
Scene text recognition
microsoft/RegionCLIP
[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
VinAIResearch/PhoBERT
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
binhvq/news-corpus
Vietnamese corpus
mpc001/Visual_Speech_Recognition_for_Multiple_Languages
Visual Speech Recognition for Multiple Languages
VIPL-Audio-Visual-Speech-Understanding/LipNet-PyTorch
A state-of-the-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)
mpc001/end-to-end-lipreading
PyTorch code for End-to-End Audiovisual Speech Recognition
ngthuong45/vietnam-number
A number-processing library built specifically for Vietnamese.
Chris10M/Lip2Speech
A pipeline that reads lips and generates speech for the content read, i.e., lip-to-speech synthesis.
hllj/ZaloAIChallenge-2022
Top 2 Solution for Zalo AI Challenge 2022 - Liveness Detection track
hungk64it1x/zac-2022
1st place solution for Zalo AI Challenge 2022
bpluta/Dogtector
Dogtector is a dog breed detection app for iOS that combines a YOLOv5 model with a Metal-based object decoder optimized for ultra-fast live detection on iOS devices.
ski-net/lipnet
LipNet with gluon
SwePalm/video_crop
Using OpenCV to crop a video
ndcuong91/viText
ndcuong91/conversion_tools
Collection of tools to convert dataset
Laurawly/tvm-1
End-to-end Tensor IR/DSL stack for deploying deep learning workloads to hardware