Azreal42's Stars
ggerganov/llama.cpp
LLM inference in C/C++
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
crewAIInc/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
THUDM/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
argmaxinc/WhisperKit
On-device Speech Recognition for Apple Silicon
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
yinkaisheng/Python-UIAutomation-for-Windows
🐍Python 3 wrapper of Microsoft UIAutomation. Support UIAutomation for MFC, WindowsForm, WPF, Modern UI(Metro UI), Qt, IE, Firefox, Chrome ...
KoljaB/RealtimeTTS
Converts text to speech in realtime
richardyc/Chrome-GPT
An AutoGPT agent that controls Chrome on your desktop
MeetKai/functionary
Chat language model that can use tools and interpret the results
a-real-ai/pywinassistant
The first open source Large Action Model generalist Artificial Narrow Intelligence agentic framework that controls completely human user interfaces by only using natural language. Based on Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models.
e2b-dev/code-interpreter
Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app
aymenfurter/microagents
Agents Capable of Self-Editing Their Prompts / Python Code
ILikeAI/AlwaysReddy
AlwaysReddy is a LLM voice assistant that is always just a hotkey away.
idiap/coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Abc-Arbitrage/ZeroLog
A high-performance, zero-allocation .NET logging library.
Abc-Arbitrage/Disruptor-cpp
Port of LMAX Disruptor to C++
Abc-Arbitrage/Zebus
A lightweight Peer to Peer Service Bus
shashikg/WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
Jonathan-Adly/AgentRun
The easiest, and fastest way to run AI-generated Python code safely
GoodAI/charlie-mnemonic
Charlie Mnemonic: The First Personal Assistant with Long-Term Memory
evalplus/repoqa
RepoQA: Evaluating Long-Context Code Understanding
ValyrianTech/OpenVoice_server
API server for Instant voice cloning by MyShell.
leumasme/copilot-cli-powershell
GitHub Copilot CLI integration for Powershell
peanutcocktail/ai-town
A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.