HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations
- Arxiv : https://arxiv.org/pdf/2401.00271.pdf
- Github : https://github.com/HCVLab/HybridGait
A Large-Scale Re-identification Analysis in Sporting Scenarios: the Betrayal of Reaching a Critical Point
- Arxiv : https://arxiv.org/pdf/2401.00080.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2312.05281.pdf
- Github : https://github.com/xujiamu123/X2-Softmax/tree/main
- Arxiv : https://arxiv.org/pdf/2312.05349.pdf
- Github : https://github.com/diegobonilla98/PixLore?tab=readme-ov-file
- Arxiv : https://arxiv.org/pdf/2312.05391.pdf
- Github : https://github.com/YilmazKadir/Segmentation_Losses
- Arxiv : https://arxiv.org/pdf/2312.05634.pdf
- Github : https://github.com/huyquoctrinh/PGS
SSPNet: Scale and spatial priors guided generalizable and interpretable pedestrian attribute recognition
- Arxiv : https://arxiv.org/pdf/2312.06049.pdf
- Github : https://github.com/guotengg/SSPNet
- Arxiv : https://arxiv.org/pdf/2312.06052.pdf
- Github : https://github.com/tensorflow/models/tree/master/official/projects/maskconver
- Arxiv : https://arxiv.org/pdf/2312.06059.pdf
- Github : https://conform-diffusion.github.io/
NutritionVerse-Synth: An Open Access Synthetically Generated 2D Food Scene Dataset for Dietary Intake Estimation
- Arxiv : https://arxiv.org/pdf/2312.06420.pdf
- Github : https://github.com/LiljaAdam/geographical-splits
- Arxiv : https://arxiv.org/pdf/2312.06495.pdf
- Github : -
Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras
- Arxiv : https://arxiv.org/pdf/2312.00500.pdf
- Github : -
A knowledge-based data-driven (KBDD) framework for all-day identification of cloud types using satellite remote sensing
- Arxiv : https://arxiv.org/pdf/2312.00308.pdf
- Github : https://github.com/rsai0/PMD/tree/main/CldNetV1_0_0
- Arxiv : https://arxiv.org/pdf/2311.18537.pdf
- Github : https://github.com/TACJu/MaXTron
- Arxiv : https://arxiv.org/pdf/2311.17960.pdf
- Github : https://github.com/dair-iitd/Guided-Prompting-SAM
- Arxiv : https://arxiv.org/pdf/2311.18082.pdf
- Github : https://github.com/allenai/satlas-super-resolution/tree/main
- Arxiv : https://arxiv.org/pdf/2311.18257.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.16843.pdf
- Github : https://github.com/tim-learn/GeoNet23_casia_tim
- Arxiv : https://arxiv.org/pdf/2311.16346.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.16497.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.15679.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.15937.pdf
- Github : https://github.com/serizba/salad
Multi-Task Faces (MTF) Data Set: A Legally and Ethically Compliant Collection of Face Images for Various Classification Tasks
- Arxiv : https://arxiv.org/pdf/2311.11888.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.11904.pdf
- Github : https://github.com/zhuole1025/LLMs_as_Visual_Explainers
Exchanging Dual Encoder-Decoder: A New Strategy for Change Detection with Semantic Guidance and Spatial Localization
- Arxiv : https://arxiv.org/pdf/2311.10296.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.10476.pdf
- Github : https://frcsyn.github.io/
- Arxiv : https://arxiv.org/pdf/2311.10572.pdf
- Github : https://github.com/YUE-FAN/SSB
- Arxiv : https://arxiv.org/pdf/2311.10605.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.10709.pdf
- Github : https://emu-video.metademolab.com/
- Arxiv : https://arxiv.org/pdf/2311.09939.pdf
- Github : https://github.com/stevejpapad/relevant-evidence-detection
- Arxiv : https://arxiv.org/pdf/2311.09240.pdf
- Github : -
MUDD: A New Re-Identification Dataset with Efficient Annotation for Off-Road Racers in Extreme Conditions
- Arxiv : https://arxiv.org/pdf/2311.08488.pdf
- Github : https://github.com/JacobTyo/MUDD
- Arxiv : https://arxiv.org/pdf/2311.08557.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.09064.pdf
- Github : https://systematic-visual-imagination.github.io/
- Arxiv : https://arxiv.org/pdf/2311.09084.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.09118.pdf
- Github : https://github.com/WildlifeDatasets/wildlife-datasets
- Arxiv : https://arxiv.org/pdf/2303.17368.pdf
- Github : https://story2motion.github.io/
- Arxiv : https://arxiv.org/pdf/2311.07514.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.07407.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.07002.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.06772.pdf
- Github : https://chatanything.github.io/
- Arxiv : https://arxiv.org/pdf/2311.06222.pdf
- Github : https://zenodo.org/records/8144238
- Arxiv : https://arxiv.org/pdf/2311.06224.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.06231.pdf
- Github : https://github.com/howardzh01/PPMA
- Arxiv : https://arxiv.org/pdf/2311.06242.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.05725.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.03970.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.03828.pdf
- Github : https://github.com/nengdong96/MVIIP
- Arxiv : https://arxiv.org/pdf/2311.03572.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.03082.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.02803.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.02733.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.02538.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.02523.pdf
- Github : https://github.com/CVI-SZU/UniTSFace
- Arxiv : https://arxiv.org/pdf/2311.02122.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2311.01702.pdf
- Github : -
- Arxiv : https://arxiv.org/pdf/2310.16667v1.pdf
- Github : https://github.com/CVMI-Lab/CoDet
Land-cover change detection using paired OpenStreetMap data and optical high-resolution imagery via object-guided Transformer
- Arxiv : https://arxiv.org/pdf/2310.02674v2.pdf
- Github : -
-
Github : https://github.com/YilongLv/AID
AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition
-
Github : https://github.com/Lu-Feng/AANet
-
Github : -
Land-cover change detection using paired OpenStreetMap data and optical high-resolution imagery via object-guided Transformer
-
Github : -
-
Github : -
-
Github : https://mathvista.github.io/
-
Github : -
-
Github : https://github.com/mwxely/AIGS
-
Arxiv :
-
Github : -
-
Arxiv :
-
Github : -
-
Arxiv :
-
Github : -
-
Arxiv :
-
Github : -
-
Arxiv :
-
Github : -
-
Arxiv :
-
Github : -
-
Arxiv :
-
Github : -
-
Github : https://holoassist.github.io/
Scalable Multi-Temporal Remote Sensing Change Data Generation via Simulating Stochastic Change Process
-
Github : https://github.com/Z-Zheng/Changen
-
Github : https://www.retail-786k.org/
-
Github : -
-
Github : -
-
Github : -
ENIGMA-51: Towards a Fine-Grained Understanding of Human-Object Interactions in Industrial Scenarios
-
Github : -
-
Github : -
-
Github : https://zenodo.org/record/8144361
-
Github : https://species-dataset.github.io/
-
Github : https://github.com/mlzxy/devit
FUTURE-AI: International consensus guideline for trustworthy and deployable artificial intelligence in healthcare
-
Github : -
-
Github : -
DIOR: Dataset for Indoor-Outdoor Reidentification - Long Range 3D/2D Skeleton Gait Collection Pipeline, Semi-Automated Gait Keypoint Labeling and Baseline Evaluation Methods
-
Github : -
-
Github : https://github.com/ffi-no.
-
Github : -
Exploring Different Levels of Supervision for Detecting and Localizing Solar Panels on Remote Sensing Imagery
-
Github : -
-
Github : -
Rethinking Cross-Domain Pedestrian Detection: A Background-Focused Distribution Alignment Framework for Instance-free One-Stage Detectors
-
Github : https://github.com/caiyancheng/BFDA
IMPROVED BREAST CANCER DIAGNOSIS THROUGH TRANSFER LEARNING ON HEMATOXYLIN AND EOSIN STAINED HISTOLOGY IMAGES
-
Github : https://www.bracs.icar.cnr.it/
Double Domain Guided Real-Time Low-Light Image Enhancement for Ultra-High-Definition Transportation Surveillance
-
Github : https://github.com/QuJX/DDNet
-
Github : https://github.com/MathLee/GeleNet
-
Github : -
TOWARDS LARGE-SCALE BUILDING ATTRIBUTE MAPPING USING CROWDSOURCED IMAGES: SCENE TEXT RECOGNITION ON FLICKR AND PROBLEMS TO BE SOLVED
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
Do We Still Need Non-Maximum Suppression? Accurate Confidence Estimates and Implicit Duplication Modeling with IoU-Aware Calibration
-
Github : -
Prompt me a Dataset: An investigation of text-image prompting for historical image dataset creation using foundation models
Towards Universal Image Embeddings: A Large-Scale Dataset and Challenge for Generic Image Representations
-
Github : https://cmp.felk.cvut.cz/univ_emb/
-
Github : https://nice.lgresearch.ai/
-
Github : -
-
Github : -
-
Github : https://lorjul.github.io/haystack/
-
Github : -
-
Github : https://github.com/Nanne/ProtoSim
-
Github : https://github.com/csccsccsccsc/DARC
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : https://github.com/aimagelab/PMA-Net
-
Github : -
-
Github : -
-
Github : https://github.com/xq141839/SPPNet
-
Github : -
HarvestNet: A Dataset for Detecting Smallholder Farming Activity Using Harvest Piles and Remote Sensing
-
Github : -
SwinV2DNet: Pyramid and Self-Supervision Compounded Feature Learning for Remote Sensing Images Change Detection
A three in one bottom-up framework for simultaneous semantic segmentation, instance segmentation and classification of multi-organ nuclei in digital cancer histology
-
Github : -
DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge
-
Github : -
-
Github : -
-
Github : -
-
Github : https://huangyangyi.github.io/tech
-
Github : https://github.com/neu-vi/Diag-HOI
-
Github : https://github.com/Parskatt/DeDoDe
-
Github : https://github.com/deepglint/ALIP
-
Github : https://github.com/yfguo91/MPBN
-
Github : -
CARE: A Large Scale CT Image Dataset and Clinical Applicable Benchmark Model for Rectal Cancer Segmentation
-
Github : -
-
Github : https://github.com/OSVAI/Ske2Grid
-
Github : -
-
Github : -
Beyond Geo-localization: Fine-grained Orientation of Street-view Images by Cross-view Matching with Satellite Imagery
-
Github : -
-
Github : -
To pretrain or not to pretrain? A case study of domain-specific pretraining for semantic segmentation in histopathology
-
Github : -
PseudoCell: Hard Negative Mining as Pseudo Labeling for Deep Learning-Based Centroblast Cell Detection
CityTrack: Improving City-Scale Multi-Camera Multi-Target Tracking by Location-Aware Tracking and Box-Grained Matching
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
Rethinking Multiple Instance Learning for Whole Slide Image Classification: A Good Instance Classifier is All You Need
-
Github : -
-
Github : https://github.com/siyi-wind/MDViT
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : https://github.com/k4ntz/OC_Atari
What can a cook in Italy teach a mechanic in India? Action Recognition Generalisation Over Scenarios and Locations
-
Github : -
-
Github : -
-
Github : https://dreamsim-nights.github.io/
-
Github : https://github.com/Anima-Lab/MaskDiT
-
Github : -
-
Github : https://zju3dv.github.io/neusc/
-
Github : -
-
Github : -
-
Github : -
MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images
-
Github : -
PhenoBench — A Large Dataset and Benchmarks for Semantic Image Interpretation in the Agricultural Domain
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : https://github.com/VITA-Group/SLaK
-
Github : -
-
Github : https://github.com/YifanXu74/MQ-Det
-
Github : -
Ghost in the Minecraft: Generally Capable Agents for Open-World Enviroments via Large Language Models with Text-based Knowledge and Memory
-
Github : https://github.com/OpenGVLab/GITM
-
Github : -
-
Github : -
-
Github : http://cnceleb.org/#portfolio
-
Github : https://detgpt.github.io/
-
Github : https://mipi-challenge.org/MIPI2023/
MaskCL: Semantic Mask-Driven Contrastive Learning for Unsupervised Person Re-Identification with Clothes Change
-
Github : -
-
Github : -
-
Github : https://github.com/yuezih/Movie101
-
Github : https://rrc.cvc.uab.es/?ch=18
ON THE HIDDEN MYSTERY OF OCR IN LARGE MULTIMODAL MODELS
-
Github : https://github.com/Zplusdragon/PLIP
-
Github : -
CLIP-VG: Self-paced Curriculum Adapting of CLIP via Exploiting Pseudo-Language Labels for Visual Grounding
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis
-
Github : https://github.com/HCIILAB/M6Doc
-
Github : https://github.com/jfkuang/CFAM
-
Github : -
-
Github : -
-
Github : https://github.com/CyrilSterling/LPV
-
Github : https://github.com/simplify23/TPS_PP
Restormer-Plus for Real World Image Deraining: One State-of-the-Art Solution to the GT-RAIN Challenge (CVPR 2023 UG2+ Track 3)
-
Github : -
-
Github : https://github.com/Royalvice/DocDiff
FEW SHOT LEARNING FOR MEDICAL IMAGING: A COMPARATIVE ANALYSIS OF METHODOLOGIES AND FORMAL MATHEMATICAL FRAMEWORK
-
Github : -
-
Github : -
SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : https://bci.grand-challenge.org/
-
Github : http://attentionviz.com/
-
Github : -
Development of a Realistic Crowd Simulation Environment for Fine-grained Validation of People Tracking Methods
-
Github : -
-
Github : https://datalab-groupe.github.io/
-
Github : https://haian-jin.github.io/TensoIR/
-
Github : -
-
Github : -
-
Github : -
-
Github : -
Survey on Unsupervised Domain Adaptation for Semantic Segmentation for Visual Perception in Automated Driving
-
Github : -
-
Github : -
-
Github : https://github.com/bowang-lab/MedSAM
-
Github : https://github.com/DengPingFan/CSU
-
Github : https://www.omnilabel.org/
-
Github : https://airbirdsdata.github.io/
-
Github : -
-
Github : -
-
Github : https://github.com/NerdFNY/PGVTON
DO HUMANS AND MACHINES HAVE THE SAME EYES? HUMAN-MACHINE PERCEPTUAL DIFFERENCES ON IMAGE CLASSIFICATION
-
Github : -
-
Github : https://github.com/Liuxinyv/SAZS
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : -
-
Github : https://github.com/xwf199/PARFormer
-
Github : -
-
Github : https://github.com/JJGO/UniverSeg