Fengdalu

A Gray Cat.

Pinned Repositories

acm-icpc
2015 to 2017, ACM-ICPC Training Codes, Team SpadeAce
Language: C++ 8 6 05
auto_avsr
Auto-AVSR: Lip-Reading Sentences Project
Language: Python 0 0 00
av_hubert
A self-supervised learning framework for audio-visual speech
Language: Python 0 0 00
awesome-audio-visualization
A curated list about Audio Visualization.
Language: Shell 0 1 00
FacePose_pytorch
🔥🔥 A PyTorch implementation of head pose estimation (yaw, roll, pitch) and emotion detection with SOTA performance in real time. Easy to deploy, easy to use, and highly accurate. Solves all face detection problems at once. (Minimal, fast, and efficient: that is our guiding principle.)
Language: Python 1 0 00
Lipreading_using_Temporal_Convolutional_Networks
ICASSP'20 Lipreading using Temporal Convolutional Networks
Language: Python 1 1 00
learn-an-effective-lip-reading-model-without-pains
PyTorch code and model for "Learn an Effective Lip Reading Model without Pains" (https://arxiv.org/abs/2011.07557), which reaches state-of-the-art performance on the LRW-1000 dataset.
Language: Python 146 1 2636
LipNet-PyTorch
A state-of-the-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)
Language: Python 203 5 3648
Lipreading-DenseNet3D
DenseNet3D model from "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild" (https://arxiv.org/abs/1810.06990)
Language: Python 117 6 1221

Fengdalu's Repositories

Fengdalu/FacePose_pytorch
🔥🔥 A PyTorch implementation of head pose estimation (yaw, roll, pitch) and emotion detection with SOTA performance in real time. Easy to deploy, easy to use, and highly accurate. Solves all face detection problems at once. (Minimal, fast, and efficient: that is our guiding principle.)
Language: Python 1 0 00
Fengdalu/Lipreading_using_Temporal_Convolutional_Networks
ICASSP'20 Lipreading using Temporal Convolutional Networks
Language: Python 1 1 00
Fengdalu/auto_avsr
Auto-AVSR: Lip-Reading Sentences Project
Language: Python 0 0 00
Fengdalu/av_hubert
A self-supervised learning framework for audio-visual speech
Language: Python 0 0 00
Fengdalu/awesome-audio-visualization
A curated list about Audio Visualization.
Language: Shell 0 1 00
Fengdalu/Awesome-Video-Datasets
Video datasets
0 1 00
Fengdalu/bark
🔊 Text-Prompted Generative Audio Model
Language: Python 0 0 00
Fengdalu/chinese_text_normalization
Chinese text normalization for speech processing
Language: Python 1 01
Fengdalu/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
Language: Python 2 0
Fengdalu/examples
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
Language: Python 1 0
Fengdalu/fairscale
PyTorch extensions for high performance and large scale training.
Language: Python 1 0
Fengdalu/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language: Python 0 0
Fengdalu/gpu-burn
Multi-GPU CUDA stress test
Language: C++ 1 0
Fengdalu/lightning-bolts
Toolbox of models, callbacks, and datasets for AI/ML researchers.
Language: Python 0 0
Fengdalu/LRW_ID
Speaker-label information for the LRW dataset, an outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Padding" (ECCV 2022)
0 0
Fengdalu/mdistiller
The official implementation of [CVPR2022] Decoupled Knowledge Distillation https://arxiv.org/abs/2203.08679
Language: Python 0 0
Fengdalu/nvcodec-python
Language: C 1 0
Fengdalu/nvjpeg-python
nvjpeg for python
Language: C 1 0
Fengdalu/RGB_HSV_HSL
A pure PyTorch implementation of color space conversion, including rgb2hsl, rgb2hsv, hsv2rgb, and hsl2rgb
Language: Python 1 0
Fengdalu/SCPapers
Must-read Papers on Sememe Computation
1 0
Fengdalu/Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Language: Python 1 0
Fengdalu/stanfordacm
Stanford ACM-ICPC related materials
Language: HTML 0 0
Fengdalu/stargan
StarGAN - Official PyTorch Implementation (CVPR 2018)
Fengdalu/torchnvjpeg
Decode JPEG image on GPU using PyTorch
Language: C++ 0 0
Fengdalu/VITS-Paimon
Language: Jupyter Notebook 0 0
Fengdalu/Wave-U-Net-for-Speech-Enhancement
A PyTorch implementation of Wave-U-Net, adapted for speech enhancement.
Language: Python 1 0
Fengdalu/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Fengdalu/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language: Jupyter Notebook 0 0
Fengdalu/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language: Python 0 0
Fengdalu/yolov5-face
Language: Python 1 0