marchNA

marchNA's Stars

netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language:Python7.1k604
fishaudio/fish-speech
Brand new TTS solution
Language:Python7.3k575
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
Language:Python3.7k642
BytedanceSpeech/seed-tts-eval
Language:Python89591
libukai/Awesome-ChatTTS
官方推荐的 ChatTTS 资源汇总项目，整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
94463
InternLM/HuixiangDou
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
Language:Python1.4k118
ConnectAI-E/feishu-openai
🎒 飞书 ×（GPT-4 + GPT-4V + DALL·E-3 + Whisper）= 飞一般的工作体验 🚀 语音对话、角色扮演、多话题讨论、图片创作、表格分析、文档导出 🚀
Language:Go5.5k955
kermitt2/grobid
A machine learning software for extracting information from scholarly documents
Language:Java3.4k443
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
Language:Python63.5k7.9k
deepseek-ai/DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
1.8k86
QwenLM/CodeQwen1.5
CodeQwen1.5 is the code version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.
Language:Python41725
THUDM/CodeGeeX4
CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more.
Language:Python1.1k84
tree-sitter/tree-sitter
An incremental parsing system for programming tools
Language:Rust17.8k1.3k
tree-sitter/py-tree-sitter
Python bindings to the Tree-sitter parsing library
Language:C80094
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Language:Python8.1k584
unit-mesh/auto-dev-vscode
AutoDev - 🧙‍the AI-powered coding wizard . Put the most loved AutoDev AI assistant into your VSCode, and have things done quickly
Language:TypeScript22735
unit-mesh/auto-dev
🧙‍AutoDev: The AI-powered coding wizard（开源 AI 编程助手） with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀
Language:Kotlin2.7k310
openai/openai-python
The official Python library for the OpenAI API
Language:Python21.8k3k
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Language:C++7.8k401
karpathy/LLM101n
LLM101n: Let's build a Storyteller
27.3k1.5k
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Language:Python3.6k383
neo4j-labs/llm-graph-builder
Neo4j graph construction from unstructured data using LLMs
Language:Jupyter Notebook1.9k266
turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
Language:Python3.4k254
pdfminer/pdfminer.six
Community maintained fork of pdfminer - we fathom PDF
Language:Python5.8k915
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
3.3k124
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
Language:Python6.4k454
vanna-ai/vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
Language:Python10.5k801
opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Language:Python4.4k288
xhluca/bm25s
Fast lexical search library implementing BM25 in Python using Numpy and Scipy
Language:Python74025
esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
3.3k231