kobenaxie's Stars
CyC2018/CS-Notes
:books: 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
geekxh/hello-algorithm
🌍 针对小白的算法训练 | 包括四部分:①.大厂面经 ②.力扣图解 ③.千本开源电子书 ④.百张技术思维导图(项目花了上百小时,希望可以点 star 支持,🌹感谢~)推荐免费ChatGPT使用网站
afatcoder/LeetcodeTop
汇总各大互联网公司容易考察的高频leetcode题🔥
tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
chriskempson/tomorrow-theme
Tomorrow Theme
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
flashlight/flashlight
A C++ standalone library for machine learning
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
idiap/fast-transformers
Pytorch library for fast transformer implementations
resemble-ai/resemble-enhance
AI powered speech denoising and enhancement
QwenLM/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
awslabs/sockeye
Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch
mosaicml/streaming
A Data Streaming Library for Efficient Neural Network Training
bytedance/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
k2-fsa/icefall
TensorSpeech/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
wladradchenko/wunjo.wladradchenko.ru
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.
parlance/ctcdecode
PyTorch CTC Decoder bindings
jaywalnut310/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
LinkSoul-AI/LLaSM
第一个支持中英文双语语音-文本多模态对话的开源可商用对话模型。便捷的语音输入将大幅改善以文本为输入的大模型的使用体验,同时避免了基于 ASR 解决方案的繁琐流程以及可能引入的错误。
Z-yq/TensorflowASR
一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1
kssteven418/Squeezeformer
[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
k2-fsa/libriheavy
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
tencent-ailab/FRA-RIR
ferreirafabio/video2tfrecord
Easily convert RGB video data (e.g. .avi) to the TensorFlow tfrecords file format for training e.g. a NN in TensorFlow. This implementation allows to limit the number of frames per video to be stored in the tfrecords.
k2-fsa/fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
adobe-research/convmelspec
Convmelspec: Convertible Melspectrograms via 1D Convolutions
csukuangfj/optimized_transducer
Memory efficient transducer loss computation