Czm369's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Lightning-AI/pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
borisdayma/dalle-mini
DALL·E Mini - Generate images from a text prompt
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
kornia/kornia
Geometric Computer Vision Library for Spatial AI
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
open-mmlab/mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.
NVlabs/stylegan2-ada-pytorch
StyleGAN2-ADA - Official PyTorch implementation
juncongmoo/pyllama
LLaMA: Open and Efficient Foundation Language Models
datawhalechina/learn-nlp-with-transformers
we want to create a repo to illustrate usage of transformers in chinese
ytongbai/LVM
OFA-Sys/ONE-PEACE
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
shalfun/DrivingDiffusion
Layout-Guided multi-view driving scene video generation with latent diffusion model
tatp22/multidim-positional-encoding
An implementation of 1D, 2D, and 3D positional encoding in Pytorch and TensorFlow
MuQiuJun-AI/bert4pytorch
超轻量级bert的pytorch版本,大量中文注释,容易修改结构,持续更新
google-research/syn-rep-learn
Learning from synthetic data - code and models
BraveGroup/Drive-WM
[CVPR 2024] A world model for autonomous driving.
fudan-zvg/PolarFormer
[AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers
vlm-driver/Dolphins
wayveai/LingoQA
Official GitHub repository for the paper "LingoQA: Video Question Answering for Autonomous Driving"
LLVM-AD/MAPLM
[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding