xyc2690

xyc2690's Stars

lllyasviel/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
2.1k 29
yipoh/AesBench
An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.
Language: Python 2216
OPPOMKLab/u-LLaVA
u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model
Language: Python 1266
apple/ml-autofocusformer-segmentation
This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".
Language: Python 656
berkeley-hipie/HIPIE
[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"
Language: Jupyter Notebook 27621
FeiGeChuanShu/ncnn-android-yolov8
Real-time YOLOv8 Android demo using ncnn
Language: C++ 45989
microsoft/FIBER
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Language: Python 12711
Computer-Vision-in-the-Wild/CVinW_Readings
A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"
1.2k 58
mli/paper-reading
Paragraph-by-paragraph close readings of classic and new deep learning papers
27.8k 2.5k
facebookresearch/mobile-vision
Mobile vision models and code
Language: Python 904160
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language: Python 32.9k 4.8k
obss/sahi
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
Language: Python 4.3k 614
naver-ai/vidt
Language: Python 30840
wpeebles/gangealing
Official PyTorch Implementation of "GAN-Supervised Dense Visual Alignment" (CVPR 2022 Oral, Best Paper Finalist)
Language: Python 1k 120
wang-xinyu/tensorrtx
Implementation of popular deep learning networks with TensorRT network definition API
Language: C++ 7.1k 1.8k
facebookresearch/swav
PyTorch implementation of SwAV https://arxiv.org/abs/2006.09882
Language: Python 2k 283
tier4/AutowareArchitectureProposal.proj
This is the source code of the feasibility study for Autoware architecture proposal.
Language: Shell 663238
hustvl/YOLOP
You Only Look Once for Panoptic Driving Perception. (MIR2022)
Language: Python 2k 419
zhanxlin/Product1M
Product1M
Language: Python 876
liuruijin17/LSTR
This is an official repository of End-to-end Lane Shape Prediction with Transformers.
Language: Python 644130
voxel51/fiftyone
Refine high-quality datasets and visual AI models
Language: Python 9.1k 591
qxiaofan/awesome_3d_slam_resources
A collection of practical resources on 3D vision, VSLAM, and computer vision.
415122
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Language: Python 2.5k 458
labuladong/fucking-algorithm
Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.
Language: Markdown 127k 23.3k
qiguming/MLAPP_CN_CODE
Chinese translation of "Machine Learning: A Probabilistic Perspective" (Kevin P. Murphy), with Python implementations of the algorithms in the book.
Language: Jupyter Notebook 582137
probml/pyprobml
Python code for "Probabilistic Machine learning" book by Kevin Murphy
Language: Jupyter Notebook 6.6k 1.6k
dk-liang/Awesome-Visual-Transformer
A collection of papers on transformers for vision. Awesome Transformer with Computer Vision (CV)
3.4k 401
hpc203/nanodet-opncv-dnn-cpp-python
Deploys the NanoDet object detector with OpenCV, with implementations in both C++ and Python.
Language: C++ 10530
JialeCao001/PedSurvey
From Handcrafted to Deep Features for Pedestrian Detection: A Survey (TPAMI 2021)
17128
xyc2690/SRNTT_Pytorch
PyTorch implementation of the paper 'Image Super-Resolution by Neural Texture Transfer'
Language: Python 6