Wykay's Stars
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
Stability-AI/generative-models
Generative Models by Stability AI
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
triton-lang/triton
Development repository for the Triton language and compiler
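A minimal sketch of what a Triton kernel looks like, following the standard vector-add pattern from the public `triton` / `triton.language` API; the kernel name and block size here are arbitrary choices for illustration.

```python
# Minimal Triton kernel sketch: element-wise vector add.
# Requires a CUDA GPU; BLOCK_SIZE and the kernel name are illustrative.
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    pid = tl.program_id(axis=0)                       # which block this program handles
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements                       # guard against out-of-range lanes
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

x = torch.randn(4096, device="cuda")
y = torch.randn(4096, device="cuda")
out = torch.empty_like(x)
grid = (triton.cdiv(x.numel(), 1024),)
add_kernel[grid](x, y, out, x.numel(), BLOCK_SIZE=1024)
```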
BradyFU/Awesome-Multimodal-Large-Language-Models
Latest Advances on Multimodal Large Language Models
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
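For context, a minimal sketch of the low-rank adaptation idea the repo implements: freeze the pretrained weight and learn a small rank-`r` update. Class and parameter names below are illustrative, not loralib's actual API.

```python
# Sketch of LoRA: y = x W^T + (alpha / r) * x A^T B^T, with W frozen.
# Names (LoRALinear, r, alpha) are illustrative, not loralib's API.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, in_features, out_features, r=8, alpha=16):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        self.base.weight.requires_grad_(False)          # frozen pretrained weight
        self.lora_A = nn.Parameter(torch.randn(r, in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(out_features, r))  # zero init: no change at start
        self.scaling = alpha / r

    def forward(self, x):
        return self.base(x) + self.scaling * (x @ self.lora_A.T @ self.lora_B.T)

layer = LoRALinear(768, 768)
y = layer(torch.randn(2, 768))                          # only lora_A / lora_B receive gradients
```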
apple/ml-ferret
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
IDEA-Research/DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
facebookresearch/Detic
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
ytongbai/LVM
Hedlen/awesome-segment-anything
Tracking and collecting papers, projects, and other resources related to Segment Anything.
IDEA-Research/awesome-detection-transformer
A collection of papers on transformers for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV).
facebookresearch/MetaCLIP
ICLR 2024 Spotlight: curation/training code, metadata, distribution, and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
sfzhang15/ATSS
Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection (CVPR 2020, Oral)
luo3300612/Visualizer
Helper tools for visualizing attention in deep learning models
airsplay/lxmert
PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".
henghuiding/ReLA
[CVPR 2023 Highlight] GRES: Generalized Referring Expression Segmentation
OpenGVLab/all-seeing
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition & Understanding and General Relation Comprehension of the Open World
amazon-science/bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
lucidrains/memory-efficient-attention-pytorch
Implementation of memory-efficient multi-head attention as proposed in the paper "Self-attention Does Not Need O(n²) Memory"
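A rough sketch of the underlying idea: process queries in chunks so the full n × n score matrix is never materialized at once. This is a simplification of the paper's streaming softmax (which also chunks over keys with running normalizers) and is not the repo's implementation; all names are illustrative.

```python
# Query-chunked attention: peak memory O(chunk_size * n) instead of O(n^2).
# Simplified illustration only; not the lucidrains implementation.
import torch

def chunked_attention(q, k, v, chunk_size=128):
    # q, k, v: (batch, seq_len, dim)
    scale = q.shape[-1] ** -0.5
    outputs = []
    for start in range(0, q.shape[1], chunk_size):
        q_blk = q[:, start:start + chunk_size]                       # (b, c, d)
        scores = torch.einsum("bqd,bkd->bqk", q_blk, k) * scale      # (b, c, n)
        attn = scores.softmax(dim=-1)
        outputs.append(torch.einsum("bqk,bkd->bqd", attn, v))        # (b, c, d)
    return torch.cat(outputs, dim=1)

q = k = v = torch.randn(1, 1024, 64)
out = chunked_attention(q, k, v)

# Matches dense attention up to floating-point error.
ref = (torch.einsum("bqd,bkd->bqk", q, k) * 64 ** -0.5).softmax(-1) @ v
assert torch.allclose(out, ref, atol=1e-4)
```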
jozhang97/DETA
Detection Transformers with Assignment
alirezazareian/ovr-cnn
A new framework for open-vocabulary object detection, based on maskrcnn-benchmark
xk-huang/segment-caption-anything
[CVPR 2024] Code for inference and training of "Segment and Caption Anything" (SCA), links to the trained model checkpoints, and example notebooks / a Gradio demo showing how to use the model.
198808xc/Vision-AGI-Survey
A temporary webpage for our survey on AGI for computer vision
jianzongwu/betrayed-by-captions
(ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
yangbang18/MultiCapCLIP
(ACL 2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
TencentARC/FLM
Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
xk-huang/Promptable-GRiT
Promptable GRiT: supports inference with both automatic proposal generation and custom point/box prompts.