outsider7's Stars
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
maxjiang153/BRClient
A cross-platform Bedrock client (Web / PWA / Linux / Win / MacOS) with support for the Claude3 model
mudler/LocalAI
🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. It can generate text, audio, video, and images, and also has voice cloning capabilities.
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app in one click.
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open source community will contribute to this project.
open-webui/open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
yanue/V2rayU
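V2rayU, a Mac client based on the v2ray core for bypassing internet censorship, written in Swift. Supports service protocols such as trojan, vmess, shadowsocks, and socks5, as well as subscriptions, QR code and clipboard import, manual configuration, QR code sharing, and more.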
chouaibMo/ChatGemini
A multiplatform chatbot app (Android, iOS and Desktop) built with Compose Multiplatform and powered by Gemini 1.5 Pro API.
awslabs/idf-modules
Industry Data Framework (IDF) IAC modules repository
RocketChat/Rocket.Chat.ReactNative
Rocket.Chat mobile clients
awslabs/aws-sdk-kotlin
Multiplatform AWS SDK for Kotlin
yeyupiaoling/Whisper-Finetune
Fine-tune the Whisper speech recognition model, with support for training without timestamp data, training with timestamp data, and training without speech data. Also supports accelerated inference and Web deployment, Windows desktop deployment, and Android deployment.
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
awslabs/scale-out-computing-on-aws
Scale-Out Computing on AWS is a solution that helps customers deploy and operate a multiuser environment for computationally intensive workflows.
aws-amplify/aws-sdk-android
AWS SDK for Android. For more information, see our web site:
jing332/tts-server-android
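An Android system TTS app with a built-in Microsoft demo interface. It supports custom HTTP requests, importing other local TTS engines, and simple narration/dialogue recognition for read-aloud based on Chinese double quotation marks, plus automatic retry, fallback configuration, text replacement, and more.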
aws/aws-parallelcluster-ui
awslabs/amazon-transcribe-streaming-sdk
The Amazon Transcribe Streaming SDK is an async Python SDK for converting audio into text via Amazon Transcribe.
awslabs/amazon-kinesis-video-streams-webrtc-sdk-android
Android SDK for interfacing with Amazon Kinesis Video Streams Signaling Service.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
aws/aws-pdk
The AWS PDK provides building blocks for common patterns together with development tools to manage and build your projects.
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
aws-samples/image-generator-with-stable-diffusion-on-amazon-bedrock-using-streamlit
A quick demonstration of deploying a Stable Diffusion web application with containers running on Amazon ECS. The model is provided by Amazon Bedrock in this example.
Ai-Austin/Bing-GPT-Voice-Assistant
This is a Python voice assistant that takes two different wake words: one prompts Bing AI using EdgeGPT, and the other prompts the GPT-3.5-Turbo API.
LearnedVector/A-Hackers-AI-Voice-Assistant
A hacker's AI voice assistant, built using Python and PyTorch.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
ggerganov/llama.cpp
LLM inference in C/C++
aws-samples/amazon-bedrock-workshop
This is a workshop designed for Amazon Bedrock, a foundational model service.