Pinned Repositories
ADE20K
ADE20K Dataset
ChatGPT_Trading_Bot
This is the code for the "ChatGPT Trading Bot" Video by Siraj Raval on Youtube
CLIP-self-attention-visualization
Plotting heatmaps with the self-attention of the [CLS] tokens in the last layer.
CLIP_Attention
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
CLIP_prefix_caption
Simple image captioning model
docker-course-remastered
DoctorGPT
DoctorGPT is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private.
efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
yiluzhou's Repositories
yiluzhou/ADE20K
ADE20K Dataset
yiluzhou/ChatGPT_Trading_Bot
This is the code for the "ChatGPT Trading Bot" Video by Siraj Raval on Youtube
yiluzhou/CLIP_Attention
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
yiluzhou/CLIP_prefix_caption
Simple image captioning model
yiluzhou/docker-course-remastered
yiluzhou/DoctorGPT
DoctorGPT is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private.
yiluzhou/efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
yiluzhou/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
yiluzhou/InDuDoNet_plus
【MedIA2023】Extend the InDuDoNet with Knowledge-Driven Prior-Net
yiluzhou/lang-seg
Language-Driven Semantic Segmentation
yiluzhou/large_laguage_models
yiluzhou/lazy_import
A module for lazy loading of Python modules
yiluzhou/Learn-Modern-Advanced-Cpp
The source code for the examples in my Udemy course "Learn Advanced Modern C++"
yiluzhou/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
yiluzhou/llama
Inference code for LLaMA models
yiluzhou/medclip
A multi-modal CLIP model trained on the medical dataset ROCO
yiluzhou/MedCLIP1
EMNLP'22 | MedCLIP: Contrastive Learning from Unpaired Medical Images and Texts
yiluzhou/mednerf
yiluzhou/odl
Operator Discretization Library https://odlgroup.github.io/odl/
yiluzhou/open-clip
Test out OpenCLIP for Image Search and Automatic Captioning
yiluzhou/open_clip
An open source implementation of CLIP.
yiluzhou/ov-seg
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
yiluzhou/pytorch-deep-learning
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
yiluzhou/PyTorch-Encoding
A CV toolkit for my papers.
yiluzhou/Retinexformer
"Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement" (ICCV 2023)
yiluzhou/roco-dataset
Radiology Objects in COntext (ROCO): A Multimodal Image Dataset
yiluzhou/Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
yiluzhou/UniDetector
Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".
yiluzhou/VIGC
Visual Instruction Generation and Correction
yiluzhou/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.