luuuyi
Study in DeepLearning, Computer Vision. Object Detection, Face Detection, Person ReID, Semantic Segmentation for now.
Zhejiang UniversityHangzhou, Zhejiang
luuuyi's Stars
ultralytics/ultralytics
Ultralytics YOLO11 🚀
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
WongKinYiu/yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
mlfoundations/open_clip
An open source implementation of CLIP.
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
facebookresearch/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
facebookresearch/ConvNeXt
Code release for ConvNeXt model
LANDrop/LANDrop
Drop any files to any devices on your LAN.
open-mmlab/mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
open-mmlab/mmdeploy
OpenMMLab Model Deployment Framework
microsoft/Cream
This is a collection of our NAS and Vision Transformer work.
visual-layer/fastdup
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability.
JunMa11/SOTA-MedSeg
SOTA medical image segmentation methods based on various challenges
vturrisi/solo-learn
solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
czczup/ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
cvzone/cvzone
This is a Computer vision package that makes its easy to run Image processing and AI functions. At the core it uses OpenCV and Mediapipe libraries.
open-mmlab/mmengine
OpenMMLab Foundational Library for Training Deep Learning Models
joyycom/VNN
VNN是由欢聚集团(Joyy Inc.)推出的高性能、轻量级神经网络部署框架。目前已为Hago、VOO、VFly、马克相机等App提供20余种AI能力的支持,覆盖直播、短视频、视频编辑等泛娱乐场景和工程场景
bytedance/ibot
iBOT :robot:: Image BERT Pre-Training with Online Tokenizer (ICLR 2022)
Sense-GVT/DeCLIP
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
bytedance/matxscript
A high-performance, extensible Python AOT compiler.
VDIGPKU/CBNetV2
[TIP 2022] CBNetV2: A Composite Backbone Network Architecture for Object Detection
iwatake2222/InferenceHelper
C++ Helper Class for Deep Learning Inference Frameworks: TensorFlow Lite, TensorRT, OpenCV, OpenVINO, ncnn, MNN, SNPE, Arm NN, NNabla, ONNX Runtime, LibTorch, TensorFlow
yukimasano/PASS
The PASS dataset: pretrained models and how to get the data
implus/UM-MAE
Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
billjie1/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Yuting-Gao/DisCo-pytorch
Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning
Kylin9511/CRNet
Channel Reconstruction Network implemented in PyTorch
Visual-Computing/GPR1200
Official repository of the paper "GPR1200: A Benchmark for General-PurposeContent-Based Image Retrieval"