Pinned Repositories
2018-CCF-BDCI-China-Unicom-Research-Institute-top2
2018-CCF大数据与计算智能大赛-面向电信行业存量用户的智能套餐个性化匹配模型联通赛-复赛第二名
2018-iFLYTEK-Marketing-Algorithms-Competition-Finals-Rank1
2018科大讯飞营销算法大赛(冠军方案)
2019-CCF-BDCI-OCR-MCZJ-OCR-IdentificationIDElement
2019CCF-BDCI大赛 最佳创新探索奖获得者 基于OCR身份证要素提取赛题冠军 天晨破晓团队 赛题源码
2019Baai-zhihu-Cup-findexp-4th
2019年知乎看山杯第四名
Chatbot_CN
基于金融-司法领域(兼有闲聊性质)的聊天机器人,其中的主要模块有信息抽取、NLU、NLG、知识图谱等,并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口
CrossWOZ
A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset
DIAC
问题等价性判断数据预处理,包含添加对抗样本(同音字、近义词替换等)、获取样本的pattern(用通配符替换相同词汇,提取相同和不同词汇)
DrQA
Reading Wikipedia to Answer Open-Domain Questions
lmft
Language Model Fine-Tuning, for ChatGLM, BELLE, LLaMA fine-tuning.
MacBERT
Revisiting Pre-trained Models for Chinese Natural Language Processing (Findings of EMNLP)
haojiepan1's Repositories
haojiepan1/lmft
Language Model Fine-Tuning, for ChatGLM, BELLE, LLaMA fine-tuning.
haojiepan1/awesome-chatgpt-1
Curated list of awesome tools, demos, docs for ChatGPT and GPT-3
haojiepan1/awesome-chatgpt-prompts-zh
ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。
haojiepan1/awesome-totally-open-chatgpt
A list of totally open alternatives to ChatGPT
haojiepan1/BELLE
BELLE: Bloom-Enhanced Large Language model Engine(开源中文对话大模型-70亿参数)
haojiepan1/bert4torch
pytorch implement of transformers refer to bert4keras
haojiepan1/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
haojiepan1/ChatGLM-Finetuning
基于ChatGLM-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning等
haojiepan1/ChatGLM-Tuning
一种平价的chatgpt实现方案, 基于ChatGLM-6B + LoRA
haojiepan1/chatgpt-corpus
ChatGPT 中文语料库 对话语料 小说语料 客服语料
haojiepan1/Chinese-ChatLLaMA
中文LLaMA基础模型;中文ChatLLaMA对话模型;NLP预训练/指令微调数据集
haojiepan1/Chinese-Frame-Semantic-Parsing
汉语框架语义解析
haojiepan1/ColossalAI
Making large AI models cheaper, faster and more accessible
haojiepan1/DeepSpeedExamples
Example models using DeepSpeed
haojiepan1/geektime-ai-course
Jupyter Notebooks for Geektime AI Course
haojiepan1/GLM
GLM (General Language Model)
haojiepan1/GlobalPointer_pytorch
全局指针统一处理嵌套与非嵌套NER的Pytorch实现
haojiepan1/hcgf
Humanable ChatGLM/GPT Fine-tuning | ChatGLM微调
haojiepan1/InstructGLM
ChatGLM-6B 指令学习|指令数据|Instruct
haojiepan1/langchain
⚡ Building applications with LLMs through composability ⚡
haojiepan1/lightning
Deep learning framework to train, deploy, and ship AI products Lightning fast.
haojiepan1/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
haojiepan1/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
haojiepan1/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
haojiepan1/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
haojiepan1/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
haojiepan1/trl
Train transformer language models with reinforcement learning.
haojiepan1/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
haojiepan1/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
haojiepan1/zero_nlp
中文nlp应用(数据、模型、训练、推理)