jeremy110

bronciTaiwan

jeremy110's Stars

fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
Language:Python8k1.1k
myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Language:Python4.7k604
alxndrTL/mamba.py
A simple and efficient Mamba implementation in pure PyTorch and MLX.
Language:Python99690
johnma2006/mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
Language:Python2.6k191
egruttadauria98/SSpaVAlDo
Language:Jupyter Notebook272
datawhalechina/learn-nlp-with-transformers
we want to create a repo to illustrate usage of transformers in chinese
Language:Shell2.3k392
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
Language:Python13.6k855
OpenGVLab/OmniQuant
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
Language:Python72155
zju3dv/EasyVolcap
[SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research
Language:Python62644
NVlabs/FasterViT
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
Language:Python78462
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
4.6k489
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Language:Python4.7k407
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Language:Python11.4k1.9k
Xiaobin-Rong/gtcrn
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Language:Python20434
csteinmetz1/auraloss
Collection of audio-focused loss functions in PyTorch
Language:Python73567
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language:Python1.2k102
BUTSpeechFIT/DiaPer
Language:Python433
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Language:Jupyter Notebook64.6k32.9k
JaeBinCHA7/DEMUCS-for-Speech-Enhancement
We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS.
Language:Python203
p0p4k/pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
Language:Python21730
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Language:Python6.6k363
FL33TW00D/whisper-turbo
Cross-Platform, GPU Accelerated Whisper 🏎️
Language:TypeScript1.7k75
Audio-WestlakeU/FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
Language:Python814
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language:Python30.4k2.8k
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
2143
VoxBlink/ScriptsForVoxBlink
A repo containing download guidance and corresponding scripts of the VoxBlink dataset.
Language:Python21
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.2k773
FrenchKrab/IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
Language:Jupyter Notebook694
521xueweihan/HelloGitHub
:octocat: 分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
Language:Python92.6k9.6k
mkunes/w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
Language:Jupyter Notebook336

jeremy110

jeremy110's Stars

fishaudio/Bert-VITS2

myshell-ai/MeloTTS

alxndrTL/mamba.py

johnma2006/mamba-minimal

egruttadauria98/SSpaVAlDo

datawhalechina/learn-nlp-with-transformers

VikParuchuri/surya

OpenGVLab/OmniQuant

zju3dv/EasyVolcap

NVlabs/FasterViT

cmhungsteve/Awesome-Transformer-Attention

lucidrains/x-transformers

xmu-xiaoma666/External-Attention-pytorch

Xiaobin-Rong/gtcrn

csteinmetz1/auraloss

modelscope/3D-Speaker

BUTSpeechFIT/DiaPer

microsoft/generative-ai-for-beginners

JaeBinCHA7/DEMUCS-for-Speech-Enhancement

p0p4k/pflowtts_pytorch

mit-han-lab/streaming-llm

FL33TW00D/whisper-turbo

Audio-WestlakeU/FS-EEND

jax-ml/jax

DongKeon/Awesome-Speaker-Diarization

VoxBlink/ScriptsForVoxBlink

pyannote/pyannote-audio

FrenchKrab/IS2023-powerset-diarization

521xueweihan/HelloGitHub

mkunes/w2v2_audioFrameClassification