pre-trained-language-models
There are 71 repositories under the pre-trained-language-models topic.
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment.
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
thunlp/OpenPrompt
An Open-Source Framework for Prompt-Learning (see the quick-start sketch after this list).
thunlp/PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document, and word vectors (see the usage sketch after this list).
brightmart/roberta_zh
Pre-trained RoBERTa models for Chinese (RoBERTa for Chinese).
zjunlp/KnowLM
An open-source, knowledgeable large language model framework.
cedrickchee/awesome-transformer-nlp
A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.
zjunlp/KnowledgeEditingPapers
Must-read Papers on Knowledge Editing for Large Language Models.
THUDM/P-tuning
A novel method for tuning language models. Code and datasets for the paper "GPT Understands, Too".
txsun1997/LMaaS-Papers
Awesome papers on Language-Model-as-a-Service (LMaaS)
sunyilgdx/SIFRank_zh
Keyphrase/keyword extraction for Chinese based on pre-trained models (the Chinese-language version of the code for the paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model").
airaria/TextPruner
A PyTorch-based model pruning toolkit for pre-trained language models
wjn1996/HugNLP
HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformers. Start hugging for NLP now! 😊 HugNLP will be released under @HugAILab.
zjunlp/MolGen
[ICLR 2024] Domain-Agnostic Molecular Generation with Chemical Feedback
zjunlp/DART
[ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners
sunyilgdx/SIFRank
Code for our paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model".
zjunlp/MKG_Analogy
[ICLR 2023] Multimodal Analogical Reasoning over Knowledge Graphs
TobiasLee/Awesome-Efficient-PLM
Must-read papers on improving efficiency for pre-trained language models.
lyy1994/awesome-data-contamination
A paper list on data contamination in the evaluation of large language models.
Hanlard/Electra_CRF_NER
We tackle a company-name recognition task with small-scale, low-quality training data, using techniques that improve training speed and prediction performance with minimal manual effort. Our approach combines lightweight pre-trained models such as ALBERT-small and ELECTRA-small trained on a financial corpus with knowledge distillation and multi-stage learning. As a result, we improve the recall of company-name recognition from 0.73 to 0.92 and run about 4x as fast as a BERT-BiLSTM-CRF model.
Victorwz/VaLM
VaLM: Visually-augmented Language Modeling. ICLR 2023.
pat-jj/TagReal
[ACL'23] Open KG Completion with PLM (Bridging Text Mining and Prompt Engineering)
WangRongsheng/Chinese-LLaMA-Alpaca-Usage
📔 Usage guide and core-code annotations for Chinese-LLaMA-Alpaca.
zjunlp/ChatCell
ChatCell: Facilitating Single-Cell Analysis with Natural Language
lancopku/DynamicKD
Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"
anas-zafar/LLM-Survey
The official GitHub page for the survey paper "Large language models: a comprehensive survey of its applications, challenges, limitations, and future prospects"
lanwuwei/GigaBERT
Zero-shot Transfer Learning from English to Arabic
ai2-ner-project/pytorch-ko-ner
PLM-based Korean named entity recognition (NER).
xuanyuan14/ARES
SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search
nkcs-iclab/linglong
LingLong (玲珑): a small-scale Chinese pretrained language model
XingLuxi/Cal-FLOPs-for-PLM
Calculating FLOPs of Pre-trained Models in NLP
cliang1453/super-structured-lottery-tickets
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)
zjunlp/knowledge-rumination
[EMNLP 2023] Knowledge Rumination for Pre-trained Language Models
yuzhimanhua/SeeTopic
Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds (NAACL'22)
Navy10021/SLS
SLS: a neural information retrieval (IR)-based semantic search model.
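
To give a feel for thunlp/OpenPrompt above, here is a condensed sketch of its documented quick-start for prompt-based classification. It follows the class names and flow in the project README; the backbone model, template text, label words, and example sentence are illustrative choices, not prescribed by the library.

    from openprompt import PromptDataLoader, PromptForClassification
    from openprompt.data_utils import InputExample
    from openprompt.plms import load_plm
    from openprompt.prompts import ManualTemplate, ManualVerbalizer

    # Load a backbone pre-trained language model and its tokenizer.
    plm, tokenizer, model_config, WrapperClass = load_plm("bert", "bert-base-cased")

    # A manual cloze template: the placeholder holds the input text and
    # the PLM predicts the token at the mask position.
    template = ManualTemplate(
        tokenizer=tokenizer,
        text='{"placeholder":"text_a"} It was {"mask"}.',
    )

    # The verbalizer maps vocabulary words predicted at the mask to class labels.
    verbalizer = ManualVerbalizer(
        tokenizer=tokenizer,
        classes=["negative", "positive"],
        label_words={"negative": ["bad"], "positive": ["good", "wonderful"]},
    )

    model = PromptForClassification(plm=plm, template=template, verbalizer=verbalizer)

    # Wrap one illustrative example and run a forward pass.
    dataset = [InputExample(guid=0, text_a="A beautifully shot, quietly moving film.")]
    loader = PromptDataLoader(dataset=dataset, template=template,
                              tokenizer=tokenizer,
                              tokenizer_wrapper_class=WrapperClass)
    for batch in loader:
        logits = model(batch)  # shape: [batch_size, num_classes]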
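And a minimal usage sketch for ddangelov/Top2Vec, following its documented quick-start: train on a corpus of raw documents, then inspect the discovered topics. The 20 Newsgroups corpus and the speed/workers settings are illustrative; Top2Vec needs a reasonably large document set for its clustering step to find topics.

    from sklearn.datasets import fetch_20newsgroups
    from top2vec import Top2Vec

    # Load a public corpus of raw text documents.
    newsgroups = fetch_20newsgroups(subset="all",
                                    remove=("headers", "footers", "quotes"))

    # Jointly embed documents, words, and topics, then cluster documents
    # to discover topics; no preset number of topics is required.
    model = Top2Vec(documents=newsgroups.data, speed="learn", workers=8)

    print(model.get_num_topics())
    topic_words, word_scores, topic_nums = model.get_topics(5)
    for words, num in zip(topic_words, topic_nums):
        print(num, words[:10])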