callmejacob's Stars
spring-projects/spring-boot
Spring Boot helps you to create Spring-powered, production-grade applications and services with absolute minimum fuss.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
RVC-Boss/GPT-SoVITS
1 min of voice data can also be used to train a good TTS model! (few-shot voice cloning)
2noise/ChatTTS
A generative speech model for daily dialogue.
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
deepset-ai/haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
princeton-nlp/SWE-agent
[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
neuml/txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
pathwaycom/llm-app
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳 Docker-friendly. ⚡ Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
Workiva/go-datastructures
A collection of useful, performant, and threadsafe Go datastructures.
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
jianchang512/ChatTTS-ui
A simple local web interface that uses ChatTTS to synthesize text into speech, and also exposes an API for external use.
SophonPlus/ChineseNlpCorpus
Collects, curates, and publishes Chinese natural language processing corpora and datasets, working with like-minded people to advance Chinese NLP.
myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
open-mmlab/mmyolo
OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOX, PPYOLOE, etc.
zyds/transformers-code
A hands-on, step-by-step practical course on Huggingface Transformers; the course videos are updated in sync on Bilibili and YouTube.
6drf21e/ChatTTS_colab
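🚀 One-click deployment (offline bundle included)! Based on ChatTTS, with support for streaming output, random voice-timbre draws, long-audio generation, and multi-role reading. Simple to use, with no complicated installation.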
0xAX/go-algorithms
Algorithms and data structures for golang
arnauddri/algorithms
Algorithms & Data Structures in Go
bytedance/gopkg
Universal Utilities for Go
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
lexburner/oauth2-demo
Re: Spring Security Oauth2 Starting from Zero
caijc00/programmer_bookshelf_cn
📚 A programmer's bookshelf
linuxserver/docker-ffmpeg
cjyyx/AI_Gen_Novel
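Explores the limits of AI novel writing, based on large language models (LLM) and multi-agent systems.
gaboolic/novel-gpt
The "controller" bot can write outlines, create AI characters, and read and write files, fully simulating the way a human mind writes a novel. The controller bot first generates an outline, then uses it to steer the plot and decide whether the next passage should be scene description, dialogue, and so on. The dialogue itself is produced by the AI characters. The controller bot's prompt consists of the outline, a summary of the preceding text, and the current context (for example, the most recent 1,000 characters). An AI character's prompt consists of the summary of the preceding text and the dialogue context.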
openapi-processor/openapi-processor-spring
An OpenAPI 3.0 & 3.1 interface & model Java code generator for Spring Boot