jeremy110

bronciTaiwan

jeremy110's Stars

lifeiteng/OmniSenseVoice
Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯
Language:Python66924
JusperLee/TIGER
Language:HTML8
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python6.3k711
nttcslab-sp/mamba-diarization
Official repository for Mamba-based Segmentation Model for Speaker Diarization
Language:Python183
Human9000/nd-Mamba2-torch
Only implemented through torch: "bi - mamba2" , "vision- mamba2 -torch". support 1d/2d/3d/nd and support export by jit.script/onnx;
Language:Python1646
nttcslab-sp-admin/mamba-diarization
111
lucidrains/minGRU-pytorch
Implementation of the proposed minGRU in Pytorch
Language:Python22214
yl4579/StyleTTS-ZS
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
1569
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
Language:Python40026
tw93/Pake
🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 利用 Rust 轻松构建轻量级多端桌面应用
Language:Rust32k5.6k
Ceelog/DictionaryByGPT4
一本 GPT4 生成的单词书📚，超过 8000 个单词分析，涵盖了词义、例句、词根词缀、变形、文化背景、记忆技巧和小故事
Language:HTML3.7k243
Lightning-AI/LitServe
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
Language:Python2.3k145
lovemefan/fsmn-vad
A enterprise-grade Voice Activity Detector from modelscope and funasr.
Language:Python586
opendatalab/MinerU
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具，支持PDF/网页/多格式电子书提取。
Language:JavaScript13.5k1k
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python6.8k717
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python6k644
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language:Python3.3k302
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Language:Python3k196
state-spaces/mamba
Mamba SSM architecture
Language:Python13.1k1.1k
dongrixinyu/jiojio
A convenient Chinese word segmentation tool 简便中文分词器
Language:Python467
huggingface/diarizers
Language:Python25416
tango4j/llm_speaker_tagging
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
Language:Python141
xi-j/Mamba-TasNet
Language:Jupyter Notebook533
KindXiaoming/pykan
Kolmogorov Arnold Networks
Language:Jupyter Notebook15k1.4k
JusperLee/SPMamba
Language:Python12415
edahelsinki/slisemap
SLISEMAP: Combining supervised dimensionality reduction with local explanations
Language:Python173
huggingface/parler-tts
Inference and training library for high-quality TTS models.
Language:Python4.5k458
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda24.3k2.7k
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language:Jupyter Notebook7.6k743
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Language:Python17.7k1.2k

jeremy110

jeremy110's Stars

lifeiteng/OmniSenseVoice

JusperLee/TIGER

SWivid/F5-TTS

nttcslab-sp/mamba-diarization

Human9000/nd-Mamba2-torch

nttcslab-sp-admin/mamba-diarization

lucidrains/minGRU-pytorch

yl4579/StyleTTS-ZS

FireRedTeam/FireRedTTS

tw93/Pake

Ceelog/DictionaryByGPT4

Lightning-AI/LitServe

lovemefan/fsmn-vad

opendatalab/MinerU

modelscope/FunASR

FunAudioLLM/CosyVoice

FunAudioLLM/SenseVoice

hustvl/Vim

state-spaces/mamba

dongrixinyu/jiojio

huggingface/diarizers

tango4j/llm_speaker_tagging

xi-j/Mamba-TasNet

KindXiaoming/pykan

JusperLee/SPMamba

edahelsinki/slisemap

huggingface/parler-tts

karpathy/llm.c

jasonppy/VoiceCraft

unslothai/unsloth