pretrained-language-model

There are 131 repositories under pretrained-language-model topic.

wenge-research/YAYI2
YAYI 2 是中科闻歌研发的新一代开源大语言模型，采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)
Language:Python3.4k 7 1019
microsoft/torchscale
Foundation Architecture for (M)LLMs
Language:Python3.1k 44 86219
Separius/awesome-sentence-embedding
A curated list of pretrained sentence and word embedding models
Language:Python2.3k 77 19262
THUDM/P-tuning-v2
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
Language:Python2.1k 29 78204
thunlp/OpenDelta
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
Language:Python1k 19 6483
xcfcode/Summarization-Papers
Summarization Papers
Language:TeX1k 23 3146
AndrewZhe/lawyer-llama
中文法律LLaMA (LLaMA for Chinese legel domain)
Language:Python957 11 67131
gaoisbest/NLP-Projects
word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding
Language:OpenEdge ABL558 21 1153
allenai/dont-stop-pretraining
Code associated with the Don't Stop Pretraining ACL 2020 paper
Language:Python535 8 3973
OpenBMB/CPM-Live
Live Training for Open-source Big Models
Language:Python507 20 3839
RenzeLou/awesome-instruction-learning
Papers and Datasets on Instruction Tuning and Following. ✨✨✨
Language:Python499 7 025
Hzfinfdu/Diffusion-BERT
ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
Language:Python324 11 3326
LYH-YF/MWPToolkit
MWPToolkit is an open-source framework for math word problem(MWP) solvers.
Language:Python163 3 2637
yueyu1030/AttrPrompt
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
Language:Python153 3 613
zzz47zzz/awesome-lifelong-learning-methods-for-llm
[ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Models. (Updated Regularly)
146 3 16
hyintell/awesome-refreshing-llms
EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.
134 5 011
ZhengZixiang/ATPapers
Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. 值得一读的注意力机制、Transformer和预训练语言模型论文与相关资源集合
133 7 013
DC-research/TEMPO
The official code for "TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting (ICLR 2024)". TEMPO is one of the very first open source Time Series Foundation Models for forecasting task v1.0 version.
Language:Python120 2 1414
microsoft/COCO-LM
[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
Language:Python117 3 713
SuperBruceJia/Awesome-LLM-Self-Consistency
Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models
108 4 17
git-disl/BERT4ETH
BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection (WWW23)
Language:Python102 5 1120
thunlp/Prompt-Transferability
On Transferability of Prompt Tuning for Natural Language Processing
Language:Python100 6 811
SJTU-IPADS/Bamboo
Bamboo-7B Large Language Model
93 10 11
RUCAIBox/UniCRS
[KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".
Language:Python87 1 015
yumeng5/TopClus
[WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations
Language:Python87 2 611
GanjinZero/CODER
CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]
Language:Python79 1 55
EagleW/Scientific-Inspiration-Machines-Optimized-for-Novelty
Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty
Language:Python77 2 211
ChangwenXu98/TransPolymer
Implementation of "TransPolymer: a Transformer-based language model for polymer property predictions" in PyTorch
Language:Python68 2 1421
azminewasi/Awesome-LLMs-ICLR-24
It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024.
64 1 03
yumeng5/SuperGen
[NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
Language:Python64 2 314
FranxYao/PoincareProbe
Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces
Language:Jupyter Notebook58 6 05
SKplanet/Dialog-KoELECTRA
ELECTRA기반 한국어 대화체 언어모델
Language:Python54 7 37
GanjinZero/BioBART
BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]
Language:Python52 2 54
heraclex12/NLP2SPARQL
Translate Natural Language Processing to SPARQL Query and vice versa
Language:Python52 2 612
zjukg/DUET
[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
Language:Python52 3 38
OpenMatch/COCO-DR
[EMNLP 2022] This is the code repo for our EMNLP‘22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning".
Language:Python50 4 44

pretrained-language-model

wenge-research/YAYI2

microsoft/torchscale

Separius/awesome-sentence-embedding

THUDM/P-tuning-v2

thunlp/OpenDelta

xcfcode/Summarization-Papers

AndrewZhe/lawyer-llama

gaoisbest/NLP-Projects

allenai/dont-stop-pretraining

OpenBMB/CPM-Live

RenzeLou/awesome-instruction-learning

Hzfinfdu/Diffusion-BERT

LYH-YF/MWPToolkit

yueyu1030/AttrPrompt

zzz47zzz/awesome-lifelong-learning-methods-for-llm

hyintell/awesome-refreshing-llms

ZhengZixiang/ATPapers

DC-research/TEMPO

microsoft/COCO-LM

SuperBruceJia/Awesome-LLM-Self-Consistency

git-disl/BERT4ETH

thunlp/Prompt-Transferability

SJTU-IPADS/Bamboo

RUCAIBox/UniCRS

yumeng5/TopClus

GanjinZero/CODER

EagleW/Scientific-Inspiration-Machines-Optimized-for-Novelty

ChangwenXu98/TransPolymer

azminewasi/Awesome-LLMs-ICLR-24

yumeng5/SuperGen

FranxYao/PoincareProbe

SKplanet/Dialog-KoELECTRA

GanjinZero/BioBART

heraclex12/NLP2SPARQL

zjukg/DUET

OpenMatch/COCO-DR