vra's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
karpathy/llm.c
LLM training in simple, raw C/CUDA
roboflow/supervision
We write your reusable computer vision tools. 💜
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
apple/corenet
CoreNet: A library for training deep neural networks
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
idealo/imagededup
😎 Finding duplicate images made easy!
jbeder/yaml-cpp
A YAML parser and emitter in C++
sindresorhus/create-dmg
Create a good-looking DMG for your macOS app in seconds
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
siliconflow/onediff
OneDiff: An out-of-the-box acceleration library for diffusion models.
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
juanmc2005/diart
A python package to build AI-powered real-time audio applications
xcmyz/FastSpeech
The Implementation of FastSpeech based on pytorch.
huggingface/quanto
A pytorch Quantization Toolkit
mistralai/mistral-common
rhasspy/gruut
A tokenizer, text cleaner, and phonemizer for many human languages.
microsoft/onnxruntime-genai
Generative AI extensions for onnxruntime
metame-ai/awesome-audio-plaza
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
p0p4k/pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
chschock/textsplit
Segment documents into coherent parts using word embeddings.
wilsonzlin/hackerverse
Exploring Hacker News by mapping and analyzing 40 million posts and comments for fun
resemble-ai/monotonic_align
Monotonic Alignment Search
linjing7/ChatHuman
allenai/cached_path
A file utility for accessing both local and remote files through a unified interface.
sigmeta/g2p-kd
Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion
fakerybakery/txtsplit
A simple text splitter based on Tortoise for use in text-to-speech applications
WenetSpeech4TTS/wenetspeech4tts