Pinned Repositories
TubeDETR
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
Paddle_cvt
Paddle_gdiT
T2T_paddle
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
KoPA
[Paper][ACM MM 2024] Making Large Language Models Perform Better in Knowledge Graph Completion
KnowLM
An Open-sourced Knowledgable Large Language Model Framework.
zhl98's Repositories
zhl98/T2T_paddle
zhl98/Paddle_cvt
zhl98/Paddle_gdiT
zhl98/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch