kiwi1kkkkk's Stars
jsalt2020-asrdiar/jsalt2020_simulate
Training data simulation
dmort27/epitran
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
lingjzhu/CharsiuG2P
Multilingual G2P in 100 languages
LetheSec/HuggingFace-Download-Accelerator
利用HuggingFace的官方下载工具从镜像网站进行高速下载。
lingjzhu/clap-ipa
Keyword spotting and forced alignment in any language
QingruZhang/AdaLoRA
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
yeyupiaoling/Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
nachifur/RDDM
CVPR 2024: Residual Denoising Diffusion Models
dome272/Diffusion-Models-pytorch
Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
qute012/Wav2Keyword
Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.
lilianemomeni/KWS-Net
Seeing Wake Words: Audio-visual Keyword Spotting
AILab-CVC/UniRepLKNet
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
HolgerBovbjerg/data2vec-KWS
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining".
sovrasov/flops-counter.pytorch
Flops counter for convolutional networks in pytorch framework
TencentGameMate/chinese_speech_pretrain
chinese speech pretrained models
gusrud1103/LibriPhrase
Recipe for LibriPhrase
roman-vygon/triplet_loss_kws
Learning Efficient Representations for Keyword Spotting with Triplet Loss
pengzhiliang/Conformer
Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition
ArchitParnami/Few-Shot-KWS
Few-Shot Keyword Spotting
milesial/Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
datawhalechina/llm-cookbook
面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版
mli/paper-reading
深度学习经典、新论文逐段精读
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
A-suozhang/awesome-quantization-and-fixed-point-training
Neural Network Quantization & Low-Bit Fixed Point Training For Hardware-Friendly Algorithm Design