owenwp's Stars
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models with support for multiple inference backends.
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
PromtEngineer/localGPT
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
modelscope/agentscope
Start building LLM-empowered multi-agent applications in an easier way.
KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
cisco/openh264
Open Source H.264 Codec
serp-ai/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
lucidrains/musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
microsoft/PromptWizard
Task-Aware Agent-driven Prompt Optimization Framework
KeenSoftwareHouse/SpaceEngineers
KoljaB/RealtimeTTS
Converts text to speech in realtime
xuebinqin/DIS
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
googlevr/tilt-brush
rsxdalv/tts-generation-webui
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
princeton-nlp/MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
JonathanFly/bark
🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model
TheAiSingularity/graphrag-local-ollama
Local models support for Microsoft's graphrag using ollama (llama3, mistral, gemma2 phi3)- LLM & Embedding extraction
EricGuo5513/text-to-motion
Official implementation for "Generating Diverse and Natural 3D Human Motions from Texts (CVPR2022)."
neuml/rag
🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.
antibitcoin/ReflectionAnyLLM
This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model)
MozillaReality/servo-unity
INACTIVE - Servo for Unity - experimental
dobrado76/Stable-Diffusion-Unity-Integration
Stable-Diffusion-Unity-Integration
olrea/openai-cpp
OpenAI C++ is a community-maintained library for the Open AI API
DSprtn/GTFO_VR_Plugin
A plugin to add full roomscale Virtual Reality support to your favorite game!
sato-team/Stable-Text-to-Motion-Framework
SATO: Stable Text-to-Motion Framework
eastskykang/UnityMeshImportExample
Runtime mesh import example for Unity