nukes

Edinburgh, UK

nukes's Stars

LAION-AI/natural_voice_assistant
Language:Python42033
fixie-ai/ultravox
Language:Python40215
liutaocode/TTS-arxiv-daily
Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)
Language:Python651
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Language:Python1.3k51
huggingface/parler-tts
Inference and training library for high-quality TTS models.
Language:Python2.7k281
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda20.5k2.2k
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language:Jupyter Notebook7k694
mekumiao/ssml-editor
基于wangeditor实现的支持SSML语法的编辑器
Language:TypeScript328
Vaibhavs10/insanely-fast-whisper
Language:Jupyter Notebook6.8k492
jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
1.6k222
AppFlowy-IO/AppFlowy
AppFlowy is an open-source alternative to Notion. You are in charge of your data and customizations. Built with Flutter and Rust.
Language:Dart49.8k3.3k
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Jupyter Notebook49.6k5.1k
resemble-ai/resemble-enhance
AI powered speech denoising and enhancement
Language:Python1k95
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python4.1k340
princeton-nlp/LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Language:Python48036
X-LANCE/UniCATS-CTX-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
Language:Python10315
collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
Language:Jupyter Notebook3.5k177
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
Language:Python2k171
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
Language:Python86641
turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
Language:Python3.1k228
modelscope/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Language:Python29624
haoheliu/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Language:Python93989
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Language:Python6.3k355
kakaobrain/magvlt
The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)
Language:Python23
yangdongchao/UniAudio
The Open Source Code of UniAudio
Language:Python46130
hyn2028/llm-cxr
Official code for "LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation"
Language:Python908
Computer-Vision-in-the-Wild/CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
1k53
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Language:Python2.5k256
hiyouga/LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
Language:Python24k3k
YuanGongND/whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
Language:Python27622

nukes

nukes's Stars

LAION-AI/natural_voice_assistant

fixie-ai/ultravox

liutaocode/TTS-arxiv-daily

Alpha-VLLM/Lumina-T2X

huggingface/parler-tts

karpathy/llm.c

jasonppy/VoiceCraft

mekumiao/ssml-editor

Vaibhavs10/insanely-fast-whisper

jim-schwoebel/voice_datasets

AppFlowy-IO/AppFlowy

labmlai/annotated_deep_learning_paper_implementations

resemble-ai/resemble-enhance

open-mmlab/Amphion

princeton-nlp/LLM-Shearing

X-LANCE/UniCATS-CTX-vec2wav

collabora/WhisperSpeech

lucidrains/vector-quantize-pytorch

google-research/magvit

turboderp/exllamav2

modelscope/FunCodec

haoheliu/versatile_audio_super_resolution

mit-han-lab/streaming-llm

kakaobrain/magvlt

yangdongchao/UniAudio

hyn2028/llm-cxr

Computer-Vision-in-the-Wild/CVinW_Readings

dvlab-research/LongLoRA

hiyouga/LLaMA-Factory

YuanGongND/whisper-at