zhaoshitian

Intern in Shanghai ailab.

East China Normal UniversityShanghai, China

zhaoshitian's Stars

facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook46.7k 306 6595.5k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python26.7k 223 4.4k3.9k
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Language:Python11.2k 160 2901k
Stability-AI/StableCascade
Official Code for Stable Cascade
Language:Jupyter Notebook6.5k 61 121530
DepthAnything/Depth-Anything-V2
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Language:Python3.3k 30 138259
lllyasviel/Paints-UNDO
Understand Human Behavior to Align True Needs
Language:Python3.3k 16 54286
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Language:Python3.2k 27 130279
lm-sys/RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
Language:Python2.9k 25 45218
AiuniAI/Unique3D
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Language:Python2.8k 35 96219
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
Language:Python2.4k 30 116196
thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
Language:Python2.2k 39 30113
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Language:C++1.4k 32 161164
open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
Language:Python1k 10 151143
GAIR-NLP/anole
Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation
Language:Python640 8 3936
Alpha-VLLM/Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
Language:Python452 6 2319
catcathh/UltraPixel
Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks
Language:Python44316
google-research/maskgit
Official Jax Implementation of MaskGIT
Language:Jupyter Notebook428 17 1250
huggingface/cosmopedia
Language:Python417 11 1042
dome272/MaskGIT-pytorch
Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)
Language:Python401 15 1834
kyegomez/CM3Leon
An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images
Language:Python358 21 1518
huggingface/open-muse
Open reproduction of MUSE for fast text2image generation.
Language:Python321 38 2726
frank-xwang/UnSAM
Code release for "Segment Anything without Supervision"
Language:Jupyter Notebook278 5 1118
Jyouhou/UnrealText
Synthetic Scene Text from 3D Engines
Language:C++240 10 2839
zhuyr97/WGWS-Net
Language:Python69 1 134
duchenzhuang/FSQ-pytorch
A Pytorch Implementation of Finite Scalar Quantization
Language:Python67 5 44
CodeGoat24/DreamText
Official implementation of High Fidelity Scene Text Synthesis.
Language:Python33 2 30
iiclab/DecompST
130
99Franklin/DiffText
11
agneet42/revision
[ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"
Language:Python80
cloneofsimo/compare_aura_sd3
Vibe check Imagegen models (AuraFlow vs Others)
Language:HTML2

zhaoshitian

zhaoshitian's Stars

facebookresearch/segment-anything

vllm-project/vllm

PKU-YuanGroup/Open-Sora-Plan

Stability-AI/StableCascade

DepthAnything/Depth-Anything-V2

lllyasviel/Paints-UNDO

dvlab-research/MGM

lm-sys/RouteLLM

AiuniAI/Unique3D

lucidrains/vector-quantize-pytorch

thunlp/UltraChat

AlibabaResearch/AdvancedLiterateMachinery

open-compass/VLMEvalKit

GAIR-NLP/anole

Alpha-VLLM/Lumina-mGPT

catcathh/UltraPixel

google-research/maskgit

huggingface/cosmopedia

dome272/MaskGIT-pytorch

kyegomez/CM3Leon

huggingface/open-muse

frank-xwang/UnSAM

Jyouhou/UnrealText

zhuyr97/WGWS-Net

duchenzhuang/FSQ-pytorch

CodeGoat24/DreamText

iiclab/DecompST

99Franklin/DiffText

agneet42/revision

cloneofsimo/compare_aura_sd3