mashijie1028
Ph.D. student @ Institute of Automation, Chinese Academy of Sciences (CASIA). Previously B.E. @ Tsinghua University.
CASIA · Beijing
mashijie1028's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
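A minimal sketch of the library's high-level `pipeline` API (the task name and example text are illustrative; the concrete checkpoint is chosen and downloaded by the library):

```python
from transformers import pipeline

# Load a default pretrained model for the sentiment-analysis task.
classifier = pipeline("sentiment-analysis")

print(classifier("Starred repositories make a great reading list."))
# e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```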
coder/code-server
VS Code in the browser
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
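A rough sketch of how DeepSpeed typically wraps an existing PyTorch model for ZeRO-style training; the model and config values are placeholders, and the launch flow (e.g. the `deepspeed` launcher) is omitted:

```python
import torch
import deepspeed

# Placeholder model; in practice this is your own nn.Module.
model = torch.nn.Linear(1024, 1024)

# Minimal DeepSpeed config (values are illustrative, not recommendations).
ds_config = {
    "train_batch_size": 8,
    "fp16": {"enabled": True},  # assumes a GPU with fp16 support
    "zero_optimization": {"stage": 2},
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
}

# deepspeed.initialize returns an engine that manages the optimizer,
# gradient accumulation, and ZeRO partitioning behind the scenes.
model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)
```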
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
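A minimal sketch of a Gradio app, assuming a toy `greet` function as the callable being served:

```python
import gradio as gr

def greet(name: str) -> str:
    # Toy function to demonstrate wiring a Python callable to a web UI.
    return f"Hello, {name}!"

# gr.Interface maps typed inputs/outputs to UI components and serves the app.
demo = gr.Interface(fn=greet, inputs="text", outputs="text")

if __name__ == "__main__":
    demo.launch()  # starts a local web server with a shareable UI
```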
Delgan/loguru
Python logging made (stupidly) simple
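A minimal sketch of loguru's single-logger API; the file name and rotation policy are illustrative:

```python
from loguru import logger

# Add a rotating file sink on top of the default stderr sink.
logger.add("app.log", rotation="10 MB", level="DEBUG")

logger.debug("debug details")
logger.info("something happened")
logger.warning("something looks off")

try:
    1 / 0
except ZeroDivisionError:
    # logger.exception records the message together with the traceback.
    logger.exception("division failed")
```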
black-forest-labs/flux
Official inference repo for FLUX.1 models
mlfoundations/open_clip
An open source implementation of CLIP.
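A minimal zero-shot classification sketch with open_clip; the model name, pretrained tag, and image path are assumptions taken from common usage (see `open_clip.list_pretrained()` for the tags actually available):

```python
import torch
from PIL import Image
import open_clip

model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-B-32", pretrained="laion2b_s34b_b79k"
)
tokenizer = open_clip.get_tokenizer("ViT-B-32")

image = preprocess(Image.open("cat.jpg")).unsqueeze(0)
text = tokenizer(["a photo of a cat", "a photo of a dog"])

with torch.no_grad():
    image_features = model.encode_image(image)
    text_features = model.encode_text(text)
    # Normalize, then take softmax over image-text similarities.
    image_features /= image_features.norm(dim=-1, keepdim=True)
    text_features /= text_features.norm(dim=-1, keepdim=True)
    probs = (100.0 * image_features @ text_features.T).softmax(dim=-1)

print(probs)
```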
kohya-ss/sd-scripts
ostris/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
jingyi0000/VLM_survey
Collection of AWESOME vision-language models for vision tasks
baaivision/Emu3
Next-Token Prediction is All You Need
XLabs-AI/x-flux
Stability-AI/sd3.5
TencentARC/Open-MAGVIT2
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
LAION-AI/CLIP_benchmark
CLIP-like model evaluation
TencentARC/LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
AILab-CVC/SEED-X
Multimodal Models in the Real World
AILab-CVC/SEED-Bench
(CVPR 2024) A benchmark for evaluating Multimodal LLMs using multiple-choice questions.
Atomic-man007/Awesome_Multimodel_LLM
Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-context learning, visual reasoning, foundational models, and more. Stay updated with the latest advancements.
baaivision/DIVA
Diffusion Feedback Helps CLIP See Better
SHI-Labs/CuMo
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Hsu1023/DuQuant
[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.
Understanding-Visual-Datasets/VisDiff
Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)
TencentARC/mllm-npu
mllm-npu: training multimodal large language models on Ascend NPUs
dongzhuoyao/Diffusion-Representation-Learning-Survey-Taxonomy
Baijiong-Lin/LoRA-Torch
PyTorch reimplementation of LoRA (with support for nn.MultiheadAttention)
mashijie1028/Happy-CGCD
Official code for NeurIPS 2024 paper "Happy: A Debiased Learning Framework for Continual Generalized Category Discovery"
KyanChen/MakeMultiHeadNaive
A naive MultiheadAttention implementation to replace nn.MultiheadAttention in PyTorch
mashijie1028/TrustDD
Code for our paper "Towards Trustworthy Dataset Distillation" (Pattern Recognition 2025)
mashijie1028/UCAS-CASIA-Beamer-Theme
Beamer template for UCAS and CASIA.