Pinned Repositories
3m-asr
Anomaly-Transformer
Code release for "Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy" (ICLR 2022 Spotlight), https://openreview.net/forum?id=LzQQ89U1qm_
audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
bark
🔊 Text-Prompted Generative Audio Model
best-rq-pytorch
Implementation of BEST-RQ, a model for self-supervised learning of speech signals using a random projection quantizer, in PyTorch.
client
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for Go, Java and Scala.
cnn
C++ neural network library
codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
ctcdecode
PyTorch CTC Decoder bindings
lvzhiqiang's Repositories
lvzhiqiang/3m-asr
lvzhiqiang/Anomaly-Transformer
Code release for "Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy" (ICLR 2022 Spotlight), https://openreview.net/forum?id=LzQQ89U1qm_
lvzhiqiang/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
lvzhiqiang/bark
🔊 Text-Prompted Generative Audio Model
lvzhiqiang/best-rq-pytorch
Implementation of BEST-RQ, a model for self-supervised learning of speech signals using a random projection quantizer, in PyTorch.
lvzhiqiang/client
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for Go, Java and Scala.
lvzhiqiang/codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
lvzhiqiang/datasets_emotion
This repository collects information about different data sets for Music Emotion Recognition.
lvzhiqiang/DeepAFx-ST
DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/
lvzhiqiang/deepvac
PyTorch python project standard.
lvzhiqiang/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
lvzhiqiang/espnet_onnx
ONNX wrapper for espnet inference model
lvzhiqiang/FastSpeech2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
lvzhiqiang/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
lvzhiqiang/gtn_applications
Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"
lvzhiqiang/HanLP
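Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing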
lvzhiqiang/laion-prepro
Get hundreds of millions of image+URL pairs from the crawling at home dataset and preprocess them
lvzhiqiang/llama.cpp
My development fork of llama.cpp. Currently working on an RK3588 NPU backend
lvzhiqiang/mfa
About how to use 'Montreal Forced Aligner'.
lvzhiqiang/MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
lvzhiqiang/Multi-Singer
PyTorch Implementation of Multi-Singer (ACM-MM'21)
lvzhiqiang/music_source_separation
lvzhiqiang/pkwrap
A pytorch wrapper for LF-MMI training and parallel training in Kaldi
lvzhiqiang/PSL
Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"
lvzhiqiang/riffusion-app
Stable diffusion for real-time music generation (web app)
lvzhiqiang/rVADfast
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
lvzhiqiang/SFANC-Window
Real-time Implementation of CNN-based selective fixed-filter active noise control and effectiveness analysis using explainable AI
lvzhiqiang/SpeechAlgorithms
Speech Algorithms Collections
lvzhiqiang/Squeezeformer
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
lvzhiqiang/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit