yyChen233's Stars
cs230-stanford/cs230-code-examples
Code examples in pyTorch and Tensorflow for CS230
hindupuravinash/the-gan-zoo
A list of all named GANs!
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
hsinyilin19/ResNetVAE
Variational AutoEncoder + ResNet Transfer Learning
openai/DALL-E
PyTorch package for the discrete VAE used for DALL·E.
dome272/VQGAN-pytorch
Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)
OpenBMB/XAgent
An Autonomous LLM Agent for Complex Task Solving
OpenBMB/BMTools
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
BradyFU/Woodpecker
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
luca-medeiros/lightning-sam
Fine-tune Segment-Anything Model with Lightning Fabric.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
hwchase17/langchain-gradio-template
XiaoRobb/CarTeller
汽车识别(包括车牌、车型、车品牌、属性、及驾驶员违规行为识别检测)
facebookresearch/grounded-video-description
Video Grounding and Captioning
Marinto-Richee/YOLOv8-and-GroundingDINO-for-Real-Time-License-Plate-Detection
A project using YoloV8 to detect License Plates
longzw1997/Open-GroundingDino
This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.
VainF/Awesome-Anything
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
baaivision/tokenize-anything
[ECCV 2024] Tokenize Anything via Prompting
fudan-zvg/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
baaivision/Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
LetheSec/HuggingFace-Download-Accelerator
利用HuggingFace的官方下载工具从镜像网站进行高速下载。
xverse-ai/XVERSE-13B
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.