marchNA's Stars
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
fishaudio/fish-speech
Brand new TTS solution
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
BytedanceSpeech/seed-tts-eval
libukai/Awesome-ChatTTS
官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
InternLM/HuixiangDou
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
ConnectAI-E/feishu-openai
🎒 飞书 ×(GPT-4 + GPT-4V + DALL·E-3 + Whisper)= 飞一般的工作体验 🚀 语音对话、角色扮演、多话题讨论、图片创作、表格分析、文档导出 🚀
kermitt2/grobid
A machine learning software for extracting information from scholarly documents
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
deepseek-ai/DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
QwenLM/CodeQwen1.5
CodeQwen1.5 is the code version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.
THUDM/CodeGeeX4
CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more.
tree-sitter/tree-sitter
An incremental parsing system for programming tools
tree-sitter/py-tree-sitter
Python bindings to the Tree-sitter parsing library
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
unit-mesh/auto-dev-vscode
AutoDev - 🧙the AI-powered coding wizard . Put the most loved AutoDev AI assistant into your VSCode, and have things done quickly
unit-mesh/auto-dev
🧙AutoDev: The AI-powered coding wizard(开源 AI 编程助手) with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀
openai/openai-python
The official Python library for the OpenAI API
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
karpathy/LLM101n
LLM101n: Let's build a Storyteller
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
neo4j-labs/llm-graph-builder
Neo4j graph construction from unstructured data using LLMs
turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
pdfminer/pdfminer.six
Community maintained fork of pdfminer - we fathom PDF
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
vanna-ai/vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
xhluca/bm25s
Fast lexical search library implementing BM25 in Python using Numpy and Scipy
esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。