haojiepan1's Stars
zejunwang1/CSTS
Chinese natural language inference and semantic similarity datasets
CLUEbenchmark/SimCLUE
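A semantic understanding and matching dataset with 3,000,000+ entries. Can be used with unsupervised contrastive learning, semi-supervised learning, and similar methods to build the best-performing pretrained models for Chinese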
charent/Phi2-mini-Chinese
Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for plugging into langchain to load a local knowledge base for retrieval-augmented generation (RAG).
charent/ChatLM-mini-Chinese
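A small 0.2B Chinese chat model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pretraining, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks, with a fine-tuning example for triple information extraction.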
Doragd/PaperReading
Paper reading log blog (built on GitHub Actions and GitHub Issues).
Doragd/Algorithm-Practice-in-Industry
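A collection of industry practice articles on search, recommendation, advertising, user growth, and related topics (sources: Zhihu, Datafuntalk, technical WeChat official accounts)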
yuleiqin/fantastic-data-engineering
Fantastic Data Engineering for Large Language Models
mli/paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
borisdayma/dalle-mini
DALL·E Mini - Generate images from a text prompt
Saurav6789/Books-
Books for Data Science
taishan1994/pytorch_GlobalPointer_Ner
Chinese named entity recognition with GlobalPointer, implemented in PyTorch.
zwkkk/wentian-rank2
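Second place in the "Ali Lingjie" Wentian Engine e-commerce search algorithm competition. A two-stage text matching algorithm for the e-commerce domain.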
HUSTAI/uie_pytorch
PyTorch implementation of the PaddleNLP UIE model
taishan1994/pytorch_uie_ner
Named entity recognition with Baidu UIE, implemented in PyTorch.
AetherCortex/Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
SuperBruceJia/Awesome-LLM-Self-Consistency
Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models
princeton-nlp/ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
OpenBMB/AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
aiwaves-cn/agents
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
microsoft/autogen
A programming framework for agentic AI 🤖
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
THU-KEG/EvaluationPapers4ChatGPT
Resource, Evaluation and Detection Papers for ChatGPT
netease-youdao/QAnything
Question and Answer based on Anything.
openai/weak-to-strong
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
LC1332/Chat-Haruhi-Suzumiya
Chat Haruhi Suzumiya, an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
THUDM/LongBench
[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
tau-nlp/scrolls
The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets