callmejacob

callmejacob's Stars

spring-projects/spring-boot
Spring Boot helps you to create Spring-powered, production-grade applications and services with absolute minimum fuss.
Language:Java75.7k 3.4k 36.5k40.8k
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python73.4k 599 08.8k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python37.4k 218 1.4k4.2k
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python33.2k 191 5813.6k
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language:Python26.3k 144 1.9k2.5k
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Language:C++25.5k 674 2.1k4k
deepset-ai/haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Language:Python18.3k 144 3.6k2k
princeton-nlp/SWE-agent
[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.
Language:Python13.8k 97 3871.4k
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
Language:Python10.2k 113 714895
neuml/txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Language:Python9.8k 95 803625
pathwaycom/llm-app
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
9k 31 16273
Workiva/go-datastructures
A collection of useful, performant, and threadsafe Go datastructures.
Language:Go7.7k 324 44837
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language:Python7.6k 63 157641
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language:Python6.6k 53 242653
jianchang512/ChatTTS-ui
一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
Language:Python6.4k 39 234765
SophonPlus/ChineseNlpCorpus
搜集、整理、发布中文自然语言处理语料/数据集，与有志之士共同促进中文自然语言处理的发展。
Language:Jupyter Notebook6k 116 251.4k
myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Language:Python5.1k 42 193677
open-mmlab/mmyolo
OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated,YOLOv5, YOLOv6, YOLOv7, YOLOv8,YOLOX, PPYOLOE, etc.
Language:Python3k 34 400549
zyds/transformers-code
手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube
Language:Jupyter Notebook2.3k 19 13317
6drf21e/ChatTTS_colab
🚀 一键部署（含离线整合包）！基于 ChatTTS ，支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用，无需复杂安装。
Language:Python2.2k 20 91274
0xAX/go-algorithms
Algorithms and data structures for golang
Language:Go1.9k 62 6352
arnauddri/algorithms
Algorithms & Data Structures in Go
Language:Go1.9k 88 10268
bytedance/gopkg
Universal Utilities for Go
Language:Go1.8k 30 42230
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Language:Python1.2k 15 9465
lexburner/oauth2-demo
Re：从零开始的Spring Security Oauth2
Language:Java1.2k 72 41588
caijc00/programmer_bookshelf_cn
📚 一个程序员的书架
Language:Python600 9 0376
linuxserver/docker-ffmpeg
Language:Dockerfile161 13 3348
cjyyx/AI_Gen_Novel
基于大语言模型(LLM)和多智能体(Multi-Agent)，探究AI写小说能力的边界
Language:Python148 5 728
gaboolic/novel-gpt
"主控"机器人拥有编写大纲、创建ai人物、读写文件的能力。也就是完全模拟人脑写小说的流程。由"主控"机器人先生成大纲，然后根据大纲来控制剧情走向，决定下文是环境描写还是对话等等。而真正的对话，是由ai人物自己产生。主控机器人的prompt由大纲、前文提要、当前上下文（可以是最近的1000个字）组成。 ai人物的prompt由前文提要、对话上下文组成。
Language:Vue44 3 26
openapi-processor/openapi-processor-spring
an OpenAPI 3.0 & 3.1 interface & model java code generator for Spring Boot
Language:Kotlin40 3 1049