kiwi1kkkkk

kiwi1kkkkk's Stars

jsalt2020-asrdiar/jsalt2020_simulate
Training data simulation
Language:Python406
dmort27/epitran
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
Language:Python635121
lingjzhu/CharsiuG2P
Multilingual G2P in 100 languages
Language:Jupyter Notebook28124
LetheSec/HuggingFace-Download-Accelerator
利用HuggingFace的官方下载工具从镜像网站进行高速下载。
Language:Python76569
lingjzhu/clap-ipa
Keyword spotting and forced alignment in any language
Language:Python332
QingruZhang/AdaLoRA
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).
Language:Python25828
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language:Python10.4k668
yeyupiaoling/Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
Language:C831132
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language:Python16k1.6k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python21.8k2.1k
nachifur/RDDM
CVPR 2024: Residual Denoising Diffusion Models
Language:Python32329
dome272/Diffusion-Models-pytorch
Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
Language:Python1.1k260
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
17.8k2.6k
qute012/Wav2Keyword
Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.
Language:Python10028
lilianemomeni/KWS-Net
Seeing Wake Words: Audio-visual Keyword Spotting
Language:Python6212
AILab-CVC/UniRepLKNet
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
Language:Python90553
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
Language:Python1.4k133
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook25k3.2k
HolgerBovbjerg/data2vec-KWS
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining".
Language:Python255
sovrasov/flops-counter.pytorch
Flops counter for convolutional networks in pytorch framework
Language:Python2.8k308
TencentGameMate/chinese_speech_pretrain
chinese speech pretrained models
Language:Shell1k83
gusrud1103/LibriPhrase
Recipe for LibriPhrase
Language:Python234
roman-vygon/triplet_loss_kws
Learning Efficient Representations for Keyword Spotting with Triplet Loss
Language:Python9415
pengzhiliang/Conformer
Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition
Language:Jupyter Notebook53387
ArchitParnami/Few-Shot-KWS
Few-Shot Keyword Spotting
Language:Jupyter Notebook5416
milesial/Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
Language:Python9k2.5k
datawhalechina/llm-cookbook
面向开发者的 LLM 入门教程，吴恩达大模型系列课程中文版
Language:Jupyter Notebook11.5k1.4k
mli/paper-reading
深度学习经典、新论文逐段精读
26.5k2.4k
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
9.3k712
A-suozhang/awesome-quantization-and-fixed-point-training
Neural Network Quantization & Low-Bit Fixed Point Training For Hardware-Friendly Algorithm Design
15724