eruma

eruma's Stars

microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
Language:Python21.7k 150 5922.1k
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Language:Python20.3k 155 1701.8k
fishaudio/fish-speech
SOTA Open Source TTS
Language:Python18.4k 111 4931.4k
Shubhamsaboo/awesome-llm-apps
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
Language:Python12.7k 161 321.4k
actix/actix
Actor framework for Rust.
Language:Rust8.7k 139 260656
Cysharp/UniTask
Provides an efficient allocation free async/await integration for Unity.
Language:C#8.6k 116 443863
miurla/morphic
An AI-powered search engine with a generative UI
Language:TypeScript6.6k 55 1741.8k
pipecat-ai/pipecat
Open Source framework for voice and multimodal conversational AI
Language:Python4.3k 35 234455
vocodedev/vocode-core
🤖 Build voice-based LLM agents. Modular + open source.
Language:Python3.1k 46 179514
Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
Language:Jupyter Notebook2.6k 34 51215
BasedHardware/Friend
AI wearable necklace
Language:C2.5k 39 212298
IAHispano/Applio
A simple, high-quality voice conversion tool focused on ease of use and performance.
Language:Python2k 27 474321
apple/ml-4m
4M: Massively Multimodal Masked Modeling
Language:Python1.7k 34 26101
OS-Copilot/OS-Copilot
An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.
Language:Python1.6k 22 31177
DigitalPhonetics/IMS-Toucan
Controllable and fast Text-to-Speech for over 7000 languages!
Language:Python1.5k 23 170173
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Language:Python1.4k 33 9094
MontrealCorpusTools/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Language:Python1.4k 35 727251
TensorSpeech/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Language:Python953 33 208246
Neph0s/awesome-llm-role-playing-with-persona
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
655 17 330
EmulationAI/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
642 27 337
rtvi-ai/rtvi-web-demo
Example UI implementing the RTVI web client
Language:TypeScript471 9 1168
lucidrains/e2-tts-pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
Language:Python405 26 2638
ensan-hcl/azooKey
azooKey: A Japanese Keyboard iOS Application Fully Developed in Swift
Language:Swift322 5 17016
ikegami-yukino/neologdn
Japanese text normalizer for mecab-neologd
Language:Cython276 8 820
isi-nlp/uroman
Universal Romanizer that can convert any unicode script to roman (latin) script
Language:Perl169 13 1514
SALT-NLP/demonstrated-feedback
Language:Python115 1 414
Wataru-Nakata/miipher
Unofficial implementation of miipher
Language:Python114 4 816
feldberlin/timething
Timething is a library for aligning text transcripts with their audio recordings.
Language:Jupyter Notebook111 4 228
facebookresearch/MemoryMosaics
Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.
Language:Python37 4 33
zeyuxie29/AudioTime
Language:Python22 2 10