bgl0703's Stars
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
BMPixel/moffee
moffee: Make Markdown Ready to Present
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
sherlcok314159/ChineseMRC-Data
收集了目前为止中文领域的MRC抽取式数据集
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
wangcunxiang/LLM-Factuality-Survey
The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
ztxz16/fastllm
纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
MetaGLM/FinGLM
FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目,利用开源开放来促进「AI+金融」。
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
CLUEbenchmark/SuperCLUElyb
SuperCLUE琅琊榜:中文通用大模型匿名对战评价基准
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
DA-southampton/NLP_ability
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
houking-can/CCKS2019-Task5
CCKS2019评测任务五-公众公司公告信息抽取,第3名
itcharge/LeetCode-Py
⛽️「算法通关手册」:超详细的「算法与数据结构」基础讲解教程,从零基础开始学习算法知识,850+ 道「LeetCode 题目」详细解析,200 道「大厂面试热门题目」。
wyu97/KENLG-Reading
Author: Wenhao Yu (wyu1@nd.edu). ACM Computing Survey'22. Reading list for knowledge-enhanced text generation, with a survey.
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
liang8qi/Data-to-Text-Generation
Some papers and datasets about Data-To-Text Generation
thunlp/OpenPrompt
An Open-Source Framework for Prompt-Learning.
TingFree/NLPer-Arsenal
收录NLP竞赛策略实现、各任务baseline、相关竞赛经验贴(当前赛事、往期赛事、训练赛)、NLP会议时间、常用自媒体、GPU推荐等,持续更新中
649453932/Bert-Chinese-Text-Classification-Pytorch
使用Bert,ERNIE,进行中文文本分类
IDEA-CCNL/Fengshenbang-LM
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
graykode/nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
liaogx/fastapi-tutorial
整体的介绍 FastAPI,快速上手开发,结合 API 交互文档逐个讲解核心模块的使用。视频学习地址: