iamxiaoyubei

Just do it.

SYSUChina

iamxiaoyubei's Stars

AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language:Python146k 1.1k 7.7k27.4k
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
27.9k 293 432.3k
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook27.1k 327 4073.4k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python21.2k 158 1.6k2.3k
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python13k 213 2.4k2.6k
CASIA-IVA-Lab/FastSAM
Fast Segment Anything
Language:Python7.7k 55 210713
vladmandic/automatic
SD.Next: All-in-one for AI generative image
Language:Python5.9k 63 2.3k453
mnotgod96/AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Language:Python5.4k 69 88594
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Language:Python4.8k 37 342481
openai/glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
Language:Python3.6k 162 44506
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Language:Python3.2k 80 164242
Tencent/FaceDetection-DSFD
腾讯优图高精度双分支人脸检测器
Language:Python2.9k 106 89728
pharmapsychotic/clip-interrogator
Image to prompt with BLIP and CLIP
Language:Python2.8k 31 99432
Daisy-Zhang/Awesome-Deepfakes-Detection
A list of tools, papers and code related to Deepfake Detection.
1.2k 17 20111
DirtyHarryLYL/LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
851 53 1436
CircleRadon/Osprey
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
Language:Python783 15 4542
penghao-wu/vstar
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
Language:Python559 11 2038
shansongliu/M2UGen
This is the official repository for M2UGen
Language:Jupyter Notebook453 10 1138
jbohnslav/opencv_transforms
OpenCV implementation of Torchvision's image augmentations
Language:Python377 13 1946
phellonchen/X-LLM
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
Language:Python307 9 1517
brandontrabucco/da-fusion
Effective Data Augmentation With Diffusion Models
Language:Python231 4 3218
hityzy1122/opencv_transforms_torchvision
opencv reimplement for transforms in torchvision
Language:Python194 3 1329
CVMI-Lab/SyntheticData
Is synthetic data from generative models ready for image recognition?
Language:Python179 13 96
guozix/TaI-DPT
Language:Python89 1 118
yossigandelsman/clip_prs
official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"
Language:Jupyter Notebook71 3 17
Yuheng-Li/PACGen
65 20 24
sunxm2357/DualCoOp
Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022))
Language:Python54 6 157
kodenii/ImaginaryNet
ImaginaryNet: Learning Object Detectors without Real Images and Annotations
Language:Jupyter Notebook26 2 01
Chen94yue/Torchvision.TransformsbyOpencv
Opencv based implementation of Torchvision.Transforms
Language:Python12 1 04
Tma2333/StableDiffusionProject
Multiple Stable Diffusion Projects.
Language:Python6 2 01

iamxiaoyubei

iamxiaoyubei's Stars

AUTOMATIC1111/stable-diffusion-webui

google-research/tuning_playbook

openai/CLIP

haotian-liu/LLaVA

NVIDIA/NeMo

CASIA-IVA-Lab/FastSAM

vladmandic/automatic

mnotgod96/AppAgent

OFA-Sys/Chinese-CLIP

openai/glide-text2im

Luodian/Otter

Tencent/FaceDetection-DSFD

pharmapsychotic/clip-interrogator

Daisy-Zhang/Awesome-Deepfakes-Detection

DirtyHarryLYL/LLM-in-Vision

CircleRadon/Osprey

penghao-wu/vstar

shansongliu/M2UGen

jbohnslav/opencv_transforms

phellonchen/X-LLM

brandontrabucco/da-fusion

hityzy1122/opencv_transforms_torchvision

CVMI-Lab/SyntheticData

guozix/TaI-DPT

yossigandelsman/clip_prs

Yuheng-Li/PACGen

sunxm2357/DualCoOp

kodenii/ImaginaryNet

Chen94yue/Torchvision.TransformsbyOpencv

Tma2333/StableDiffusionProject