SLTK1

Shenzhen, China

SLTK1's Stars

2noise/ChatTTS
A generative speech model for daily dialogue.
Language: Python 33.2k 191 5813.6k
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language: Python 18.6k 159 0 1.3k
optuna/optuna
A hyperparameter optimization framework
Language: Python 11.1k 117 1.7k 1.1k
NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Language: Python 8.5k 100 1.2k 1.4k
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language: Python 4.9k 40 1.6k 445
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language: Jupyter Notebook 3.7k 42 182319
moskomule/senet.pytorch
PyTorch implementation of SENet
Language: Python 2.3k 16 0 442
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Language: C++ 2.3k 27 33130
yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
Language: Python 2.1k 8 24535
facebookresearch/Detic
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
Language: Python 1.9k 21 103209
QiuChenly/InjectLib
You know what I'm going to say
Language: Python 1.4k 21 59172
IDEA-Research/Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Language: Jupyter Notebook 1.4k 11 57128
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Language: Python 1.1k 15 4746
VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
Language: Python 1.1k 38 6764
jianzongwu/Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
870 26 1451
lucidrains/transfusion-pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Language: Python 839 35 2634
youngwanLEE/centermask2
[CVPR 2020] CenterMask : Real-time Anchor-Free Instance Segmentation
Language: Python 783 16 99157
thu-coai/CrossWOZ
A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset
Language: Python 660 17 32114
AIGText/Glyph-ByT5
[ECCV2024] This is the official inference code for the papers "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering"
Language: Jupyter Notebook 527 18 1723
HOST-Oman/libraqm
A library for complex text layout
Language: C 272 20 8863
showlab/videollm-online
VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)
Language: Python 269 8 4731
baaivision/EVE
[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models
Language: Python 254 9 164
OFA-Sys/InsTag
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
229 4 107
MaxKinny222/TabRecSet
A large scale camera-taken table detection and recognition dataset.
Language: Python 104 5 98
LDLINGLINGLING/MiniCPM_Series_Tutorial
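Projects and tutorials for MiniCPM and MiniCPM-V, covering six topics: inference, quantization, edge deployment, fine-tuning, technical reports, and applications.
Language: Python 90 3 43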
princeton-nlp/CharXiv
[NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Language: Python 87 3 109
nttmdlab-nlp/SlideVQA
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
Language: Python 81 1 57
JianshuZhang/TAP
Track, Attend and Parse for Online Handwritten Mathematical Expression Recognition
Language: Python 71 6 1327
LayTextLLM/LayTextLLM
Language: Jupyter Notebook 68 3 139
TianheWu/CoSeR
An unofficial implementation for "CoSeR: Bridging Image and Language for Cognitive Super-Resolution (CVPR 2024)"
Language: Python 64 3 85
