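For the 2024 survey papers, click here ↘️ 2024-CV-Surveys
Computer vision surveys published in 2024, covering object detection, tracking, and more.
📗📗📗 Reply "CV综述" in the backend of the 【我爱计算机视觉】 WeChat official account to receive a bundled download of all the papers listed here. As of July 11, 251+5 papers have been made public.
January: 44 papers.
February: 36 papers.
March: 25 papers.
April: 33 papers.
May: 50 papers.
Total: 188 papers.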
🐱 | 🐶 | 🐯 | 🐺 |
---|---|---|---|
1.Unknown (uncategorized) |
- A Comprehensive Overview of Fish-Eye Camera Distortion Correction Methods
[2024-01-02] - Surround-View Fisheye Optics in Computer Vision and Simulation: Survey and Challenge
[2024-02-20]
- Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities
[2024-06-12]
- Deepfake Generation and Detection: A Benchmark and Survey
[2024-03-27]
⭐code - A Timely Survey on Vision Transformer for Deepfake Detection
[2024-05-15] - Media Forensics and Deepfake Systematic Survey
[2024-06-21] - The Tug-of-War Between Deepfake Generation and Detection
[2024-07-09]
- A Systematic Review of Available Datasets in Additive Manufacturing
[2024-01-30] - A Comprehensive Survey on Machine Learning Driven Material Defect Detection: Challenges, Solutions, and Future Prospects
[2024-06-13] - A PRISMA Driven Systematic Review of Publicly Available Datasets for Benchmark and Model Developments for Industrial Defect Detection
[2024-06-13] - VAD
- 3D defect detection and classification in industrial systems using point clouds
- A Comprehensive Review of Machine Learning Advances on Data Change: A Cross-Field Perspective
[2024-02-21] - Open-world Machine Learning: A Review and New Outlooks
[2024-03-06] (no PDF) - Inference Attacks in Machine Learning as a Service: A Taxonomy, Review, and Promising Directions
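[2024-06-05] - Continual learning
- Transfer learning
- Federated learning
- Object re-identification
- Object pose estimation
- Self-supervised learning
- Unsupervised learning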
- Neural Radiance Field-based Visual Rendering: A Comprehensive Review
[2024-04-02] - Dynamic NeRF: A Review
[2024-05-15]
- How Can Large Language Models Enable Better Socially Assistive Human-Robot Interaction: A Brief Survey
[2024-04-02]
- SLAM
- VR
- Geo-localization
- Robotics
- PR
- A Survey on Autonomous Driving Datasets: Data Statistic, Annotation, and Outlook
[2024-01-04] - Robustness-Aware 3D Object Detection in Autonomous Driving: A Review and Outlook
[2024-01-15] - Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop Technologies
[2024-01-24] - A Survey for Foundation Models in Autonomous Driving
[2024-02-05] - Review of the Learning-based Camera and Lidar Simulation Methods for Autonomous Driving Systems
[2024-02-16] - Explainable AI for Safe and Trustworthy Autonomous Driving: A Systematic Review
[2024-02-16] - A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions
[2024-03-13] - Monocular 3D lane detection for Autonomous Driving: Recent Achievements, Challenges, and Outlooks
[2024-04-11] - Neural Radiance Field in Autonomous Driving: A Survey
[2024-04-23] - Collaborative Perception Datasets in Autonomous Driving: A Survey
[2024-04-23] - A Survey on Intermediate Fusion Methods for Collaborative Perception Categorized by Real World Challenges
[2024-04-26] - Vision-based 3D occupancy prediction in autonomous driving: a review and outlook
⭐code
[2024-05-07] - Deep Event-based Object Detection in Autonomous Driving: A Survey
[2024-05-08] - A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective
⭐code
[2024-05-09] - Cooperative Visual-LiDAR Extrinsic Calibration Technology for Intersection Vehicle-Infrastructure: A review
[2024-05-17] - Collective Perception Datasets for Autonomous Driving: A Comprehensive Review
[2024-05-28] - Vehicle re-identification
- Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review
[2024-02-20]
- Assistive Image Annotation Systems with Deep Learning and Natural Language Capabilities: A Review
[2024-07-02]
- A Survey on Hallucination in Large Vision-Language Models
[2024-02-02] - Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions
[2024-04-12] - A Survey on Visual Mamba
[2024-04-25] - Vision Mamba: A Comprehensive Survey and Taxonomy
⭐code
[2024-05-08] - A Survey on Vision-Language-Action Models for Embodied AI
[2024-05-24] - JailbreakZoo: Survey, Landscapes, and Horizons in Jailbreaking Large Language and Vision-Language Models
🏠project
[2024-07-03] - Foundation models
- MLLM
- The (R)Evolution of Multimodal Large Language Models: A Survey
[2024-02-21] - Efficient Multimodal Large Language Models: A Survey
⭐code
[2024-05-20] - A Survey of Multimodal Large Language Model from A Data-centric Perspective
[2024-05-28] - The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective
⭐code
[2024-07-12]
- VLN
- LLM
- Large Multimodal Agents: A Survey
⭐code
[2024-02-26] - Unbridled Icarus: A Survey of the Potential Perils of Image Inputs in Multimodal Large Language Model Security
[2024-04-09] - Hallucination of Multimodal Large Language Models: A Survey
⭐code
[2024-04-30] - Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey
⭐code
[2024-06-04] - A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends
⭐code
[2024-07-11]
- Exploring the Synergies of Hybrid CNNs and ViTs Architectures for Computer Vision: A survey
[2024-02-06] - Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges
⭐code
[2024-04-26] - A Comparative Survey of Vision Transformers for Feature Extraction in Texture Analysis
[2024-06-11]
- Evaluation in Neural Style Transfer: A Review
[2024-01-31]
- A Comprehensive Survey and Taxonomy on Point Cloud Registration Based on Deep Learning
⭐code
[2024-04-23]
- Transformers and Language Models in Form Understanding: A Comprehensive Review of Scanned Document Analysis
[2024-03-08] - A short review on graphonometric evaluation tools in children.
[2024-06-11] - Text image processing
- Chart understanding
- Handwriting recognition
- Video Diffusion Models: A Survey
⭐code
[2024-05-07] - Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
⭐code
[2024-05-07] - Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization
[2024-05-24] - LLMs Meet Multimodal Generation and Editing: A Survey
⭐code
[2024-05-30] - Diffusion Models and Representation Learning: A Survey
⭐code
[2024-07-02] - Text-to-image generation
- Text-to-Image Cross-Modal Generation: A Systematic Review
[2024-01-23] - Controllable Generation with Text-to-Image Diffusion Models: A Survey
⭐code
[2024-03-08] - Evaluating Text to Image Synthesis: Survey and Taxonomy of Image Quality Metrics
[2024-03-19] - Survey of Bias In Text-to-Image Generation: Definition, Evaluation, and Mitigation
[2024-04-02] - Theoretical research on generative diffusion models: an overview
[2024-04-16] - Exploring Feedback Generation in Automated Skeletal Movement Assessment: A Comprehensive Overview
[2024-04-16]
- Content generation
- A Survey on Personalized Content Synthesis with Diffusion Models
[2024-05-10] - Text-to-3D
- 3D content generation
- A Comprehensive Survey on 3D Content Generation
[2024-02-05]
- AIGC
- Image editing
- Text-to-video
- Video generation
- Video editing
- Diffusion Model-Based Video Editing: A Survey
⭐code
[2024-07-11]
- GAN
- Street-view synthesis
- Bird's-Eye View to Street-View: A Survey
[2024-05-16]
- Human emotion recognition
- ReID
- Pedestrian detection
- Body-Area Capacitive or Electric Field Sensing for Human Activity Recognition and Human-Computer Interaction: A Comprehensive Survey
[2024-01-12] - A Survey of IMU Based Cross-Modal Transfer Learning in Human Activity Recognition
[2024-03-26] - A Survey on Backbones for Deep Video Action Recognition
[2024-05-10] - From CNNs to Transformers in Multimodal Human Action Recognition: A Survey
[2024-05-28] - Self-Supervised Skeleton Action Representation Learning: A Benchmark and Beyond
[2024-06-06] - RNNs, CNNs and Transformers in Human Action Recognition: A Survey and A Hybrid Model
[2024-07-09] - Fall detection
- In-Bed Pose Estimation: A Review
[2024-02-02] - Survey of 3D Human Body Pose and Shape Estimation Methods for Contemporary Dance Applications
[2024-01-05] - Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey
⭐code
[2024-03-01] - A Survey on 3D Egocentric Human Pose Estimation
[2024-03-27] - Human Modelling and Pose Estimation Overview
[2024-06-28] - Markerless Multi-view 3D Human Pose Estimation: a survey
[2024-07-08] - 3D human body
- Deep video representation learning: a survey
[2024-05-13] - Video understanding
- Video Understanding with Large Language Models: A Survey
⭐code
[2024-01-01] - A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming
[2024-04-26] - Foundation Models for Video Understanding: A Survey
⭐code
[2024-05-08] - A Survey of Video Datasets for Grounded Event Understanding
[2024-06-17]
- Video prediction
- Video production
- Video anomaly detection
- Beyond Traditional Single Object Tracking: A Survey
[2024-05-20] - The Progression of Transformers from Language to Vision to MOT: A Literature Review on Multi-Object Tracking with Transformers
[2024-06-25] - Multi-modal object tracking
- Awesome Multi-modal Object Tracking
⭐code
[2024-05-24]
- Agricultural Object Detection with You Look Only Once (YOLO) Algorithm: A Bibliometric and Systematic Literature Review
[2024-01-22] - YOLOv1 to YOLOv10: A comprehensive review of YOLO variants and their application in the agricultural domain
[2024-06-17] - YOLOv10 to Its Genesis: A Decadal and Comprehensive Review of The You Only Look Once Series
[2024-07-01] - Semi-Supervised Object Detection: A Survey on Progress from CNN to Transformer
[2024-07-12] - Marine debris detection
- 3D object recognition
- Image Fusion in Remote Sensing: An Overview and Meta Analysis
[2024-01-18] - UAV-borne Mapping Algorithms for Canopy-Level and High-Speed Drone Applications
[2024-01-15] - Solid Waste Detection in Remote Sensing Images: A Survey
[2024-02-15] - A Comprehensive Review on Computer Vision Analysis of Aerial Data
[2024-02-16] - Deep Learning for Satellite Image Time Series Analysis: A Review
[2024-04-08] - A Review on Machine Learning Algorithms for Dust Aerosol Detection using Satellite Data
[2024-04-16] - Sugarcane Health Monitoring With Satellite Spectroscopy and Machine Learning: A Review
[2024-04-29] - Wildfire Risk Prediction: A Review
[2024-05-06] - Dehazing Remote Sensing and UAV Imagery: A Review of Deep Learning, Prior-based, and Hybrid Approaches
[2024-05-14] - Visual place recognition for aerial imagery: A survey
⭐code
[2024-06-04] - Deep Learning for Slum Mapping in Remote Sensing Images: A Meta-analysis and Review
[2024-06-13] - Hyperspectral Pansharpening: Critical Review, Tools and Future Perspectives
⭐code
[2024-07-02] - Cross-view geo-localization
- Cross-view geo-localization: a survey
[2024-06-17]
- Aerospace
- Empowering Medical Imaging with Artificial Intelligence: A Review of Machine Learning Approaches for the Detection, and Segmentation of COVID-19 Using Radiographic and Tomographic Images
[2024-01-17] - Advancing Low-Rank and Local Low-Rank Matrix Approximation in Medical Imaging: A Systematic Literature Review and Future Directions
[2024-02-23] - When Eye-Tracking Meets Machine Learning: A Systematic Review on Applications in Medical Image Analysis
[2024-03-33] - Out-of-distribution Detection in Medical Image Analysis: A survey
[2024-04-30] - Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey
⭐code
[2024-05-06] - Continual Learning in Medical Imaging from Theory to Practice: A Survey and Practical Analysis
[2024-05-24] - Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis
⭐code
[2024-06-06] - Solving the Inverse Problem of Electrocardiography for Cardiac Digital Twins: A Survey
[2024-06-18] - A Comprehensive Survey of Foundation Models in Medicine
[2024-06-18] - Review of Zero-Shot and Few-Shot AI Algorithms in The Medical Domain
[2024-06-25] - Applications of interpretable deep learning in neuroimaging: a comprehensive review
[2024-06-27] - Foundational Models for Pathology and Endoscopy Images: Application for Gastric Inflammation
[2024-06-27] - A Review of Image Processing Methods in Prostate Ultrasound
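[2024-07-02] - Polyp segmentation
- Biomedical image segmentation
- Minimally invasive surgical vision
- Dental X-ray image segmentation
- Glioma histology slide analysis
- Surgery
- Cochlear implants
- Medical image registration
- Stroke segmentation
- CT
- Medical image classification
- Medical image segmentation
- Anomaly detection in neuroimaging
- Report generation
- A survey of artificial intelligence in gait-based diagnosis of neurodegenerative diseases
- Object detection
- Cancer detection
- MRI reconstruction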
- High-energy physics image classification: A Survey of Jet Applications
[2024-03-19] - Noisy Label Processing for Classification: A Survey
[2024-04-08] - Traditional to Transformers: A Survey on Current Trends and Future Prospects for Hyperspectral Image Classification
⭐code
[2024-04-24] - Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review
[2024-06-06]
- Semantic segmentation
- Inpainting
- Denoising
- Deblurring
- Image enhancement
- SoK: Facial Deepfake Detectors
[2024-01-10] - Neuromorphic Face Analysis: a Survey
[2024-02-20] - A Comprehensive Survey of Masked Faces: Recognition, Detection, and Unmasking
[2024-05-10] - Evolving from Single-modal to Multi-modal Facial Deepfake Detection: A Survey
⭐code
[2024-06-12] - Artificial Immune System of Secure Face Recognition Against Adversarial Attacks
⭐code
[2024-06-27]
- 3D Scene Geometry Estimation from 360° Imagery: A Survey
[2024-01-18] - Survey on Modeling of Articulated Objects
[2024-03-25] - RGB Guided ToF Imaging System: A Survey of Deep Learning-based Methods
[2024-05-20] - A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions
⭐code
[2024-06-11] - 3D reconstruction
- 3D generation
- Advances in 3D Generation: A Survey
[2024-02-01]
- 3D dense captioning
- Depth estimation
- 3D scene understanding
- Stereo Matching
- A Survey on Deep Stereo Matching in the Twenties
⭐code
[2024-07-11]
- Comprehensive Exploration of Synthetic Data Generation: A Survey
[2024-01-08] - Image-based Deep Learning for Smart Digital Twins: a Review
[2024-01-08] - A Survey on 3D Gaussian Splatting
[2024-01-09] - A Survey on African Computer Vision Datasets, Topics and Researchers
⭐code
[2024-01-23] - Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey
⭐code
[2024-02-06] - A Literature Review of Literature Reviews in Pattern Analysis and Machine Intelligence
[2024-02-21] - Asphalt Concrete Characterization Using Digital Image Correlation: A Systematic Review of Best Practices, Applications, and Future Vision
[2024-02-28] - Lightweight Deep Learning for Resource-Constrained Environments: A Survey
[2024-04-12] - A Survey of Neural Network Robustness Assessment in Image Recognition
[2024-04-15] - State Space Model for New-Generation Network Alternative to Transformers: A Survey
⭐code
[2024-04-16] - A Survey on Vision Mamba: Models, Applications and Challenges
⭐code
[2024-04-30] - Generative Artificial Intelligence: A Systematic Review and Applications
[2024-05-21] - A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing
[2024-06-04] - Exploring the Potential of Polynomial Basis Functions in Kolmogorov-Arnold Networks: A Comparative Study of Different Groups of Polynomials
[2024-06-06] - Deep learning for precipitation nowcasting: A survey from the perspective of time series forecasting
[2024-06-11] - Diffusion Models in Low-Level Vision: A Survey
⭐code
[2024-06-18] - Public Computer Vision Datasets for Precision Livestock Farming: A Systematic Survey
[2024-06-18] - Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI
⭐code
[2024-07-10] - Event-based vision on FPGAs -- a survey
[2024-07-12]