Excuses123's Stars
Excuses123/2024BDC
JinpengLI/deep_ocr
Make a better Chinese character recognition OCR than Tesseract
HumanAIGC/OutfitAnyone
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
HqWu-HITCS/Awesome-Chinese-LLM
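A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, domain-specific fine-tuning and applications, datasets, and tutorials.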
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
xlang-ai/xlang-paper-reading
Paper collection on building and evaluating language model agents via executable language grounding
zhiqix/NL2GQL
The LLM of NL2GQL with NebulaGraph or Neo4j
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Duxiaoman-DI/XuanYuan
XuanYuan: Du Xiaoman's Chinese financial dialogue large language model
k2-fsa/icefall
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Shawn-IEITSystems/Yuan-1.0
Yuan 1.0 Large pretrained LM
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese dialogue large language model)
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
SophonPlus/ChineseNlpCorpus
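Collects, organizes, and publishes Chinese natural language processing corpora and datasets, working with like-minded contributors to advance Chinese NLP.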
txsun1997/MOSS
MOSS is a conversational language model like ChatGPT.
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
km1994/recommendation_advertisement_search
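Introductory notes, paper-reading notes, and interview materials for AI fields such as natural language processing, recommender systems, and search engines (What You Don't Know About NLP, What You Don't Know About Recommender Systems, 100 NLP Interview Questions, 100 Recommender System Interview Questions, 100 Search Engine Interview Questions)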
Jacen789/relation-extraction
Chinese relation extraction
easezyc/WSDM2022-PTUPCDR
This is the official implementation of our paper Personalized Transfer of User Preferences for Cross-domain Recommendation (PTUPCDR), which has been accepted by WSDM2022.
liuhuanyong/PersonGraphDataSet
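PersonGraphDataSet, nearly 10 thousand person2person relationship facts. A person knowledge graph dataset: a database of nearly 100,000 person-relationship facts, built by algorithmic person-relation extraction plus manual curation, providing base data for person-relationship search and querying, multi-hop question answering over person relationships, and person-relationship reasoning.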
nju-websoft/OpenEA
A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs, VLDB 2020
knowitall/openie
Quality information extraction at web scale.
kangvcar/InfoSpider
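INFO-SPIDER is a crawler toolbox 🧰 that brings many data sources together, aiming to help users retrieve their own data safely and quickly. The tool's code is open source and its process is transparent. Supported data sources include GitHub, QQ Mail, NetEase Mail, Alibaba Mail, Sina Mail, Hotmail, Outlook, JD.com, Taobao, Alipay, ** Mobile, ** Unicom, ** Telecom, Zhihu, Bilibili, NetEase Cloud Music, QQ friends, QQ groups, WeChat Moments album generation, browser history, 12306, Cnblogs, CSDN blogs, Open Source ** blogs, and Jianshu.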
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
km1994/NLP-Interview-Notes
This repository mainly collects interview questions for NLP algorithm engineers.
wzhe06/Reco-papers
Classic papers and resources on recommendation