pre-training
There are 161 repositories under the pre-training topic.
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
modelscope/ms-swift
Use PEFT or full-parameter training to fine-tune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
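For context, a minimal sketch of LoRA-style PEFT fine-tuning with the Hugging Face `peft` library, which ms-swift wraps behind its own CLI; the checkpoint name and hyperparameters below are illustrative assumptions, not ms-swift defaults.

```python
# A minimal sketch of LoRA-style PEFT fine-tuning with Hugging Face `peft`.
# The checkpoint name and hyperparameters are illustrative assumptions.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")  # example checkpoint
lora = LoraConfig(
    r=8,                                   # low-rank adapter dimension
    lora_alpha=16,                         # adapter scaling factor
    target_modules=["q_proj", "v_proj"],   # attention projections to adapt
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()         # only the adapter weights are trainable
```

From here the wrapped model trains like any other transformers model; only the low-rank adapter matrices receive gradients.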
modelscope/data-juicer
Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷
dbiir/UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
ChandlerBang/awesome-self-supervised-gnn
Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).
EgoAlpha/prompt-in-context-learning
Awesome resources for in-context learning and prompt engineering: mastering LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date, cutting-edge updates.
LirongWu/awesome-graph-self-supervised-learning
Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"
zjunlp/KnowLM
An open-source knowledgeable large language model framework.
yzhuoning/Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
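As background for CLIP-related entries, a minimal PyTorch sketch of CLIP's symmetric contrastive objective, adapted from the pseudocode in the CLIP paper; `image_emb` and `text_emb` are assumed to come from separate image and text encoders not shown here.

```python
# A minimal PyTorch sketch of CLIP's symmetric contrastive loss, adapted
# from the pseudocode in the CLIP paper; the encoders producing the
# embeddings are assumed and not shown.
import torch
import torch.nn.functional as F

def clip_loss(image_emb, text_emb, temperature=0.07):
    image_emb = F.normalize(image_emb, dim=-1)        # cosine-similarity space
    text_emb = F.normalize(text_emb, dim=-1)
    logits = image_emb @ text_emb.t() / temperature   # [N, N] similarity matrix
    labels = torch.arange(len(logits), device=logits.device)  # matches on the diagonal
    # Cross-entropy in both directions: image->text and text->image
    return (F.cross_entropy(logits, labels) + F.cross_entropy(logits.t(), labels)) / 2
```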
Tencent/TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
microsoft/Oscar
Oscar and VinVL
SalesforceAIResearch/uni2ts
Unified Training of Universal Time Series Forecasting Transformers
brightmart/bert_language_understanding
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
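For reference, a rough sketch of the BERT-style masked-token selection underlying this kind of pre-training (mask 15% of tokens; of those, 80% become [MASK], 10% become random tokens, 10% stay unchanged); `mask_token_id` and `vocab_size` are placeholders for a real tokenizer's values.

```python
# BERT-style masked-token selection (15% masking with the 80/10/10 scheme
# from the BERT paper); mask_token_id and vocab_size are placeholders.
import torch

def mask_tokens(input_ids, mask_token_id, vocab_size, mlm_prob=0.15):
    input_ids = input_ids.clone()
    labels = input_ids.clone()
    chosen = torch.bernoulli(torch.full(input_ids.shape, mlm_prob)).bool()
    labels[~chosen] = -100                 # loss is computed on masked positions only
    to_mask = torch.bernoulli(torch.full(input_ids.shape, 0.8)).bool() & chosen
    input_ids[to_mask] = mask_token_id     # 80% of chosen tokens -> [MASK]
    to_rand = torch.bernoulli(torch.full(input_ids.shape, 0.5)).bool() & chosen & ~to_mask
    input_ids[to_rand] = torch.randint(vocab_size, input_ids.shape)[to_rand]
    return input_ids, labels               # remaining 10% are left unchanged
```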
qingsongedu/Awesome-TimeSeries-SpatioTemporal-LM-LLM
A professionally curated list of Large (Language) Models and Foundation Models (LLM, LM, FM) for time series, spatiotemporal, and event data.
ChenRocks/UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
jackroos/VL-BERT
Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
nancheng58/Awesome-LLM4RS-Papers
Large Language Model-enhanced Recommender System Papers
princeton-nlp/LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Shen-Lab/GraphCL
[NeurIPS 2020] "Graph Contrastive Learning with Augmentations" by Yuning You, Tianlong Chen, Yongduo Sui, Ting Chen, Zhangyang Wang, Yang Shen
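To illustrate the objective behind graph contrastive learning, a simplified InfoNCE sketch over two augmented views; the paper's full NT-Xent loss also treats intra-view pairs as negatives, which this illustrative version omits.

```python
# Simplified InfoNCE sketch of a GraphCL-style contrastive objective:
# z1 and z2 are embeddings of two augmented views of the same batch of
# graphs. The full NT-Xent also uses intra-view negatives (omitted here).
import torch
import torch.nn.functional as F

def info_nce(z1, z2, temperature=0.5):
    z1, z2 = F.normalize(z1, dim=-1), F.normalize(z2, dim=-1)
    sim = z1 @ z2.t() / temperature                     # [N, N] cross-view similarities
    labels = torch.arange(len(sim), device=sim.device)  # positives on the diagonal
    return F.cross_entropy(sim, labels)
```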
acbull/GPT-GNN
Code for KDD'20 "Generative Pre-Training of Graph Neural Networks"
microsoft/XPretrain
Multi-modal pre-training.
GAIR-NLP/MathPile
[NeurIPS D&B 2024] Generative AI for Math: MathPile
linwhitehat/ET-BERT
The repository for ET-BERT, a network traffic classification model for encrypted traffic, accepted at The Web Conference (WWW) 2022.
google-research-datasets/conceptual-12m
Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.
GestaltCogTeam/STEP
Code for our SIGKDD'22 paper "Pre-training-Enhanced Spatial-Temporal Graph Neural Network for Multivariate Time Series Forecasting".
THUDM/GCC
GCC: Graph Contrastive Coding for Graph Neural Network Pre-Training @ KDD 2020
westlake-repl/Recommendation-Systems-without-Explicit-ID-Features-A-Literature-Review
Paper List of Pre-trained Foundation Recommender Models
sayakpaul/probing-vits
Probing the representations of Vision Transformers.
Lupin1998/Awesome-MIM
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
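As a sketch of the masked-modeling recipe this survey covers, an MAE-style training step that keeps a random subset of patch tokens, encodes only those, and regresses the pixels of the masked patches; `encoder` and `decoder` are hypothetical modules, and the 75% mask ratio follows the MAE paper's default.

```python
# An MAE-style masked image modeling step. `encoder` and `decoder` are
# hypothetical modules; the 75% mask ratio follows the MAE default.
import torch

def mim_step(patches, encoder, decoder, mask_ratio=0.75):
    n, l, d = patches.shape                          # batch, num patches, patch dim
    num_keep = int(l * (1 - mask_ratio))
    perm = torch.rand(n, l).argsort(dim=1)           # random patch order per image
    keep, masked = perm[:, :num_keep], perm[:, num_keep:]
    visible = torch.gather(patches, 1, keep.unsqueeze(-1).expand(-1, -1, d))
    latent = encoder(visible)                        # encode visible patches only
    pred = decoder(latent, masked)                   # predict the masked patches
    target = torch.gather(patches, 1, masked.unsqueeze(-1).expand(-1, -1, d))
    return ((pred - target) ** 2).mean()             # pixel regression on masked patches
```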
ZigeW/data_management_LLM
A collection of explorations in training-data management for large language models.
ViTAE-Transformer/SAMRS
The official repo for [NeurIPS'23] "SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model"
OpenDriveLab/ViDAR
[CVPR 2024 Highlight] Visual Point Cloud Forecasting
wangxiao5791509/MultiModal_BigModels_Survey
[MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models
showlab/all-in-one
[CVPR 2023] All in One: Exploring Unified Video-Language Pre-training
DeepGraphLearning/GearNet
GearNet and Geometric Pretraining Methods for Protein Structure Representation Learning, ICLR'2023 (https://arxiv.org/abs/2203.06125)
mczhuge/Kaleido-BERT
💐Kaleido-BERT: Vision-Language Pre-training on Fashion Domain