tbright17's Stars
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
babysor/MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
openai/chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
BlinkDL/RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance that can also be trained directly like a GPT transformer (parallelizable). The current version is RWKV-7 "Goose". It combines the best of RNNs and transformers: great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA language modeling approach to audio generation from Google Research, in PyTorch
lifeiteng/vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
circlestarzero/EX-chatGPT
Let ChatGPT truly learn how to go online and call APIs! 'EX-ChatGPT' can rival and even surpass NewBing
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
harlanhong/awesome-talking-head-generation
google/maxtext
A simple, performant and scalable Jax LLM!
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, a zero-shot speech and singing synthesizer, in PyTorch
RUCAIBox/TextBox
TextBox 2.0 is a text generation library with pre-trained language models
facebookresearch/av_hubert
A self-supervised learning framework for audio-visual speech
Shark-NLP/DiffuSeq
[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
microsoft/ProphetNet
A research project for natural language generation, containing the official implementations by MSRA NLC team.
LAION-AI/audio-dataset
Audio Dataset for training CLAP and other models
yangdongchao/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
replicate/paint-by-text
A microsite for InstructPix2Pix
danielgross/teleprompter
shizhediao/ChatGPTPapers
Must-read papers, related blogs and API tools on the pre-training and tuning methods for ChatGPT.
carlosholivan/DeepLearningMusicGeneration
State of the Art of Music Generation with Deep Learning and AI
archinetai/archisound
A collection of pre-trained audio models, in PyTorch.
sedrickkeh/PANCETTA
Dataset and code for PANCETTA: Phoneme Aware Neural Completion to Elicit Tongue Twisters Automatically
amazon-science/listen-know-spell-dataset