Pinned Repositories
Divide-and-Co-training
[TIP 2022] Towards Better Accuracy-Efficiency Trade-offs: Divide and Co-training. Plus, an image classification toolbox that includes ResNet, Wide-ResNet, ResNeXt, ResNeSt, ResNeXSt, SENet, Shake-Shake, DenseNet, PyramidNet, and EfficientNet.
CenterCLIP
[SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based on CLIP + fast pyav video decoding.
CLIP4STR
[TIP 2024] CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model.
RLCF
[ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.
simple_flann
simple_flann includes a KD-tree retrieval algorithm and an LSH based on p-stable distributions.
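The p-stable LSH mentioned above can be sketched as follows. This is a minimal illustration of the classic scheme (Gaussian projections are 2-stable), not the repo's actual implementation; all names here are illustrative.

```python
import math
import random

class PStableLSH:
    """Locality-sensitive hashing via p-stable (Gaussian, p=2) projections.

    Each hash function is h(v) = floor((a . v + b) / w), where a is a
    Gaussian random vector and b is uniform in [0, w). Vectors that are
    close in Euclidean distance tend to fall into the same bucket.
    """

    def __init__(self, dim, num_hashes=4, w=4.0, seed=0):
        rng = random.Random(seed)
        self.w = w
        # One Gaussian projection vector a and offset b per hash function.
        self.a = [[rng.gauss(0.0, 1.0) for _ in range(dim)]
                  for _ in range(num_hashes)]
        self.b = [rng.uniform(0.0, w) for _ in range(num_hashes)]

    def hash(self, v):
        # Concatenate the individual hashes into one bucket key.
        return tuple(
            math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / self.w)
            for a, b in zip(self.a, self.b)
        )

lsh = PStableLSH(dim=3)
bucket = lsh.hash([1.0, 2.0, 3.0])  # bucket key for a query vector
```

In a full index, vectors would be stored in a dict keyed by these bucket tuples, and a query would only compare against vectors sharing its bucket.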
SlimCLR
[IJCV 2024] Slimmable Networks for Contrastive Self-supervised Learning.
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
CLIP4STR
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
RMI
This is the code for the NeurIPS 2019 paper Region Mutual Information Loss for Semantic Segmentation.
mzhaoshuai's Repositories
mzhaoshuai/CenterCLIP
[SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based on CLIP + fast pyav video decoding.
mzhaoshuai/RLCF
[ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.
mzhaoshuai/simple_flann
simple_flann includes a KD-tree retrieval algorithm and an LSH based on p-stable distributions.
mzhaoshuai/SlimCLR
[IJCV 2024] Slimmable Networks for Contrastive Self-supervised Learning.
mzhaoshuai/CLIP4STR
[TIP 2024] CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model.
mzhaoshuai/GLaDOS-CheckIn
GLaDOS AutoCheckIn: scheduled automatic check-in.