Pinned Repositories
CLIPA
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
EVP
[TMLR'24] This repository includes the official implementation of our paper "Unleashing the Power of Visual Prompting At the Pixel Level"
Image-Pretraining-for-Video
[ECCV 2022] This repository includes the official implementation of our paper "In Defense of Image Pre-Training for Spatiotemporal Recognition".
awesome-readme
A guide to writing an Awesome README. Read the full article in Towards Data Science.
big_vision
Official codebase used to develop Vision Transformer, MLP-Mixer, LiT and more.
C2D
PyTorch implementation of "Contrast to Divide: self-supervised pre-training for learning with noisy labels"
deit_repo
Official DeiT repository
Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
SmallBigNet
SmallBigNet: Integrating Core and Contextual Views for Video Classification (CVPR2020)
L2B
This repository includes the official project of L2B, from our paper "Learning to Bootstrap for Combating Label Noise".
xhl-video's Repositories
xhl-video/SmallBigNet
SmallBigNet: Integrating Core and Contextual Views for Video Classification (CVPR2020)
xhl-video/awesome-readme
A guide to writing an Awesome README. Read the full article in Towards Data Science.
xhl-video/big_vision
Official codebase used to develop Vision Transformer, MLP-Mixer, LiT and more.
xhl-video/C2D
PyTorch implementation of "Contrast to Divide: self-supervised pre-training for learning with noisy labels"
xhl-video/deit_repo
Official DeiT repository
xhl-video/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
xhl-video/EasyLM
Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.
xhl-video/faster-rcnn.pytorch
A faster PyTorch implementation of Faster R-CNN
xhl-video/lorax
LoRA for arbitrary JAX models and functions
xhl-video/MLC
Meta Label Correction for Noisy Label Learning
xhl-video/open_flamingo
An open-source framework for training large multimodal models.
xhl-video/PGDF
xhl-video/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
xhl-video/xianhangli
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
xhl-video/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
xhl-video/interview_note
DL
xhl-video/maxtext
A simple, performant, and scalable JAX LLM!
xhl-video/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
xhl-video/UNICON-Noisy-Label
Official Implementation of the CVPR 2022 paper "UNICON: Combating Label Noise Through Uniform Selection and Contrastive Learning"
xhl-video/vision_transformer
xhl-video/vit-vqgan
JAX implementation of ViT-VQGAN