dyh1996's Stars
Social-AI-Studio/ToxiCloakCN
Official repository for EMNLP'24 paper "ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations"
ArthurDarkstone/vueuse-template
template for vueuse. 以vueuse 项目作为模板的utils库项目模板
deepseek-ai/DeepSeek-LLM
DeepSeek LLM: Let there be answers
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
Schlampig/Knowledge_Graph_Wander
A collection of papers, codes, projects, tutorials ... for Knowledge Graph and other NLP methods
InternLM/InternLM-techreport
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
MLGroupJLU/LLM-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
fiyen/PaddlePaddle-Knover
Knover的使用教程,用以训练一个完整的PLATO-2模型,还包括了两个快速实现多轮对话的现成模块。A manual to use Knover to train Plato-2, including two modules to achieve bot-dialog.
BI4O/rasa_milktea_chatbot
Chatbot with bert chinese model, base on rasa framework(中文聊天机器人,结合bert意图分析,基于rasa框架)
benchi/big_file_sort
Python library to sort large files by breaking them into smaller chunks, writing those to temporary files, and merging.
coder-duibai/Contrastive-Learning-Papers-Codes
A comprehensive list of Awesome Contrastive Learning Papers&Codes.Research include, but are not limited to: CV, NLP, Audio, Video, Multimodal, Graph, Language, etc.
thunlp/OpenPrompt
An Open-Source Framework for Prompt-Learning.
princeton-nlp/LM-BFF
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
ucinlp/autoprompt
AutoPrompt: Automatic Prompt Construction for Masked Language Models.
timoschick/pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
sunyilgdx/NSP-BERT
The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
dbiir/UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
425776024/nlpcda
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
thunlp/OpenAttack
An Open-Source Package for Textual Adversarial Attack.
pris-nlp/nlp-paper-reading-list
motivation: 系统整理NLP各个方向需要阅读的论文
fwwdn/sensitive-stop-words
互联网常用敏感词、停止词词库
DA-southampton/NLP_ability
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
songyingxin/NLPer-Interview
该仓库主要记录 NLP 算法工程师相关的面试题
afatcoder/LeetcodeTop
汇总各大互联网公司容易考察的高频leetcode题🔥
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
unbug/codelf
A search tool helps dev to solve the naming things problem.
OpenNMT/OpenNMT-py
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
Maluuba/nlg-eval
Evaluation code for various unsupervised automated metrics for Natural Language Generation.