altryne's Stars
gpt-engineer-org/gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
OpenInterpreter/open-interpreter
A natural language interface for computers
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
rhasspy/piper
A fast, local neural text to speech system
sczhou/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
vikhyat/moondream
tiny vision language model
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Arize-ai/phoenix
AI Observability & Evaluation
run-llama/sec-insights
A real world full-stack application using LlamaIndex
Doubiiu/DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
FL33TW00D/whisper-turbo
Cross-Platform, GPU Accelerated Whisper 🏎️
trevorhobenshield/twitter-api-client
Implementation of X/Twitter v1, v2, and GraphQL APIs
ufal/whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
YavorGIvanov/sam.cpp
dylanpdx/BetterTwitFix
Fix Twitter video embeds in Discord (and Telegram!)
wandb/weave
Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.
camenduru/MusicGen-colab
louisfb01/start-llms
A complete guide to start and improve your LLM skills in 2024 with little background in the field and stay up-to-date with the latest news and state-of-the-art techniques!
langchain-ai/langsmith-sdk
LangSmith Client SDK Implementations
meistrari/cursive-py
✦ The intuitive python LLM framework
teknium1/ShareGPT-Builder
smlum/scription
An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech
hedrergudene/asr-sd-pipeline
Speech recognition & diarisation solution with text alignment, deployed in AML pipelines
aimadnet/twttrapi-python
TwttrAPI Python - Unofficial Twitter API
UpstreetAI/upstreet-sdk
hyperaudio/hyperaudio-lite-editor
A lightweight transcript editor for editing and correcting STT generated timed transcripts
pgerhardt/DongaChairArticle