tbright17

Ming Tu, research on speech recognition and NLU

tbright17's Stars

suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language: Jupyter Notebook 36.6k 332 4484.3k
babysor/MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time
Language: Python 35.6k 306 8865.2k
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
27.8k 291 432.3k
openai/chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
Language: Python 21.1k 320 2343.7k
BlinkDL/RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
Language: Python 13k 133 228880
bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Language: Python 9.3k 96 206527
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Language: Python 7.2k 77 648749
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language: Python 3.6k 59 71311
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
Language: Python 3k 88 98418
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language: Python 2.5k 62 175269
lifeiteng/vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
Language: Python 2.1k 49 127322
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
Language: Python 2k 40 43168
circlestarzero/EX-chatGPT
Let ChatGPT truly learn how to go online and call APIs! 'EX-ChatGPT' can rival and even surpass NewBing
Language: Python 2k 14 67329
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
1.9k 168 470
harlanhong/awesome-talking-head-generation
1.6k 83 4112
google/maxtext
A simple, performant and scalable Jax LLM!
Language: Python 1.5k 28 88274
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Language: Python 1.3k 55 31104
RUCAIBox/TextBox
TextBox 2.0 is a text generation library with pre-trained language models
Language: Python 1.1k 20 72117
facebookresearch/av_hubert
A self-supervised learning framework for audio-visual speech
Language: Python 865 15 111138
Shark-NLP/DiffuSeq
[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
Language: Python 750 25 8091
microsoft/ProphetNet
A research project for natural language generation, containing the official implementations by MSRA NLC team.
Language: Python 696 21 76110
LAION-AI/audio-dataset
Audio Dataset for training CLAP and other models
Language: Python 655 21 5955
yangdongchao/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Language: Python 607 31 4080
replicate/paint-by-text
A microsite for InstructPix2Pix
Language: JavaScript 446 20 898
danielgross/teleprompter
Language: Python 328 18 339
shizhediao/ChatGPTPapers
Must-read papers, related blogs and API tools on the pre-training and tuning methods for ChatGPT.
318 10 119
carlosholivan/DeepLearningMusicGeneration
State of the Art of Music Generation with Deep Learning and AI
280 12 025
archinetai/archisound
A collection of pre-trained audio models, in PyTorch.
Language: Python 111 5 24
sedrickkeh/PANCETTA
Dataset and code for PANCETTA: Phoneme Aware Neural Completion to Elicit Tongue Twisters Automatically
4 2 10
amazon-science/listen-know-spell-dataset
2 2 1
