xiaojieli0903
Ph.D. candidate at the School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen).
HIT (Shenzhen)Shenzhen
xiaojieli0903's Stars
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
borisdayma/dalle-mini
DALL·E Mini - Generate images from a text prompt
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
modelscope/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
lucidrains/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
lucidrains/DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
open-mmlab/mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
ahmetbersoz/chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
lucidrains/byol-pytorch
Usable Implementation of "Bootstrap Your Own Latent" self-supervised learning, from Deepmind, in Pytorch
chongzhou96/EdgeSAM
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
IrisRainbowNeko/HCP-Diffusion
A universal Stable-Diffusion toolbox
jianzongwu/Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
EdisonLeeeee/Awesome-Masked-Autoencoders
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).
yxuansu/PandaGPT
[TLLM'23] PandaGPT: One Model To Instruction-Follow Them All
kakaobrain/karlo
LambdaLabsML/lambda-diffusers
facebookresearch/muavic
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation
Lupin1998/Awesome-MIM
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
xyupeng/ContrastiveCrop
[CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning
brandontrabucco/da-fusion
Effective Data Augmentation With Diffusion Models
X-PLUG/mPLUG-2
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
yzd-v/cls_KD
'NKD and USKD' (ICCV 2023) and 'ViTKD' (CVPRW 2024)
FutureXiang/ddae
[ICCV 2023 Oral] Official Implementation of "Denoising Diffusion Autoencoders are Unified Self-supervised Learners"
quanlin-wu/dmae
Denoising Masked Autoencoders Help Robust Classification.
zangzelin/DiffAug