speedinghzl's Stars
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
moymix/TaskMatrix
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
microsoft/JARVIS
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
togethercomputer/OpenChatKit
nebuly-ai/nebuly
The user analytics platform for LLMs
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
runwayml/stable-diffusion
Latent Text-to-Image Diffusion
ajbrock/BigGAN-PyTorch
The author's officially unofficial PyTorch BigGAN implementation.
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
microsoft/DialoGPT
Large-scale pretraining for dialogue
fudan-zvg/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
lucidrains/gigagan-pytorch
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
AetherCortex/Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
thu-ml/unidiffuser
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
Shiriluz/Word-As-Image
PiotrNawrot/nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
allenai/mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
NVlabs/denoising-diffusion-gan
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs https://arxiv.org/abs/2112.07804
Zhendong-Wang/Diffusion-GAN
Official PyTorch implementation for paper: Diffusion-GAN: Training GANs with Diffusion
sihyun-yu/PVDM
Official PyTorch implementation of Video Probabilistic Diffusion Models in Projected Latent Space (CVPR 2023).
thuanz123/enhancing-transformers
An unofficial implementation of both ViT-VQGAN and RQ-VAE in PyTorch
fudan-zvg/GSS
[CVPR 2023] Official repository of Generative Semantic Segmentation
speedinghzl/TopFormer
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR 2022