xyc2690's Stars
lllyasviel/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
yipoh/AesBench
An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.
OPPOMKLab/u-LLaVA
u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model
apple/ml-autofocusformer-segmentation
This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".
berkeley-hipie/HIPIE
[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"
FeiGeChuanShu/ncnn-android-yolov8
Real time yolov8 Android demo by ncnn
microsoft/FIBER
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Computer-Vision-in-the-Wild/CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
mli/paper-reading
深度学习经典、新论文逐段精读
facebookresearch/mobile-vision
Mobile vision models and code
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
obss/sahi
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
naver-ai/vidt
wpeebles/gangealing
Official PyTorch Implementation of "GAN-Supervised Dense Visual Alignment" (CVPR 2022 Oral, Best Paper Finalist)
wang-xinyu/tensorrtx
Implementation of popular deep learning networks with TensorRT network definition API
facebookresearch/swav
PyTorch implementation of SwAV https//arxiv.org/abs/2006.09882
tier4/AutowareArchitectureProposal.proj
This is the source code of the feasibility study for Autoware architecture proposal.
hustvl/YOLOP
You Only Look Once for Panopitic Driving Perception.(MIR2022)
zhanxlin/Product1M
Product1M
liuruijin17/LSTR
This is an official repository of End-to-end Lane Shape Prediction with Transformers.
voxel51/fiftyone
Refine high-quality datasets and visual AI models
qxiaofan/awesome_3d_slam_resources
记录3D视觉、VSLAM、计算机视觉的干货资料。
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
labuladong/fucking-algorithm
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
qiguming/MLAPP_CN_CODE
《Machine Learning: A Probabilistic Perspective》(Kevin P. Murphy)中文翻译和书中算法的Python实现。
probml/pyprobml
Python code for "Probabilistic Machine learning" book by Kevin Murphy
dk-liang/Awesome-Visual-Transformer
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
hpc203/nanodet-opncv-dnn-cpp-python
用opencv部署nanodet目标检测,包含C++和Python两种版本程序的实现
JialeCao001/PedSurvey
From Handcrafted to Deep Features for Pedestrian Detection: A Survey (TPAMI 2021)
xyc2690/SRNTT_Pytorch
Pytorch implementation of paper 'Image Super-Resolution by Neural Texture Transfer'