Usage instructions: here
Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling | Mahdi Karami et.al. | 2402.18508 | null |
2024-02-28 | Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization | Deng Li et.al. | 2402.18447 | null |
2024-02-28 | A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation | Francesco Barbato et.al. | 2402.18402 | null |
2024-02-28 | A Multimodal Handover Failure Detection Dataset and Baselines | Santosh Thoduka et.al. | 2402.18319 | null |
2024-02-28 | Classes Are Not Equal: An Empirical Study on Image Recognition Fairness | Jiequan Cui et.al. | 2402.18133 | null |
2024-02-27 | Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers | Yiwei Lu et.al. | 2402.17710 | null |
2024-02-27 | SDF2Net: Shallow to Deep Feature Fusion Network for PolSAR Image Classification | Mohammed Q. Alkhatib et.al. | 2402.17672 | link |
2024-02-27 | Predict the Next Word: | Evgenia Ilia et.al. | 2402.17527 | null |
2024-02-27 | Scaling Supervised Local Learning with Augmented Auxiliary Networks | Chenxiang Ma et.al. | 2402.17318 | link |
2024-02-26 | Offline Writer Identification Using Convolutional Neural Network Activation Features | Vincent Christlein et.al. | 2402.17029 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | UniMODE: Unified Monocular 3D Object Detection | Zhuoling Li et.al. | 2402.18573 | null |
2024-02-28 | Detection of Micromobility Vehicles in Urban Traffic Videos | Khalil Sabri et.al. | 2402.18503 | link |
2024-02-28 | Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object Detection | Xun Huang et.al. | 2402.18493 | null |
2024-02-28 | Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization | Deng Li et.al. | 2402.18447 | null |
2024-02-28 | Unveiling novel insights into Kirchhoff migration for effective object detection using experimental Fresnel dataset | Won-Kwang Park et.al. | 2402.18322 | null |
2024-02-28 | Zero-Shot Aerial Object Detection with Visual Description Regularization | Zhengqing Zang et.al. | 2402.18233 | null |
2024-02-28 | VulMCI : Code Splicing-based Pixel-row Oversampling for More Continuous Vulnerability Image Generation | Tao Peng et.al. | 2402.18189 | null |
2024-02-27 | SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection | Junsu Kim et.al. | 2402.17323 | null |
2024-02-27 | A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track | Zehui Chen et.al. | 2402.17319 | null |
2024-02-27 | Probing Multimodal Large Language Models for Global and Local Semantic Representation | Mingxu Tao et.al. | 2402.17304 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation | Zhiwei Yang et.al. | 2402.18467 | link |
2024-02-28 | A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation | Francesco Barbato et.al. | 2402.18402 | null |
2024-02-28 | Enhancing Roadway Safety: LiDAR-based Tree Clearance Analysis | Miriam Louise Carnot et.al. | 2402.18309 | null |
2024-02-28 | Feature Denoising For Low-Light Instance Segmentation Using Weighted Non-Local Blocks | Joanne Lin et.al. | 2402.18307 | null |
2024-02-28 | Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis | Bashir Kazimi et.al. | 2402.18286 | null |
2024-02-28 | PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation | Haoyu Xie et.al. | 2402.18117 | null |
2024-02-28 | Spannotation: Enhancing Semantic Segmentation for Autonomous Navigation with Efficient Image Annotation | Samuel O. Folorunsho et.al. | 2402.18084 | link |
2024-02-27 | Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation | Xinyu Yang et.al. | 2402.17891 | link |
2024-02-27 | Mitigating Distributional Shift in Semantic Segmentation via Uncertainty Estimation from Unlabelled Data | David S. W. Williams et.al. | 2402.17653 | null |
2024-02-27 | Masked Gamma-SSL: Learning Uncertainty Estimation via Masked Image Modeling | David S. W. Williams et.al. | 2402.17622 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | Estimation of railway vehicle response for track geometry evaluation using branch Fourier neural operator | Qingjing Wang et.al. | 2402.18366 | null |
2024-02-28 | EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving | Jiacheng Lin et.al. | 2402.18302 | link |
2024-02-28 | Enhancing Tracking Robustness with Auxiliary Adversarial Defense Networks | Zhewei Wu et.al. | 2402.17976 | null |
2024-02-27 | SWTrack: Multiple Hypothesis Sliding Window 3D Multi-Object Tracking | Sandro Papais et.al. | 2402.17892 | null |
2024-02-27 | In Defense and Revival of Bayesian Filtering for Thermal Infrared Object Tracking | Peng Gao et.al. | 2402.17098 | null |
2024-02-26 | Searching a Lightweight Network Architecture for Thermal Infrared Pedestrian Tracking | Peng Gao et.al. | 2402.16570 | null |
2024-02-26 | SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking | Yu Lin et.al. | 2402.16249 | null |
2024-02-26 | Real-Time Vehicle Detection and Urban Traffic Behavior Analysis Based on UAV Traffic Videos on Mobile Devices | Yuan Zhu et.al. | 2402.16246 | null |
2024-02-24 | Multi-Object Tracking by Hierarchical Visual Representations | Jinkun Cao et.al. | 2402.15895 | null |
2024-02-24 | Detection Is Tracking: Point Cloud Multi-Sweep Deep Learning Models Revisited | Lingji Chen et.al. | 2402.15756 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-23 | Multimodal Transformer With a Low-Computational-Cost Guarantee | Sungjin Park et.al. | 2402.15096 | null |
2024-02-17 | Implementation of a Model of the Cortex Basal Ganglia Loop | Naoya Arakawa et.al. | 2402.13275 | null |
2024-02-20 | Radar-Based Recognition of Static Hand Gestures in American Sign Language | Christian Schuessler et.al. | 2402.12800 | null |
2024-02-20 | Learning Domain-Invariant Temporal Dynamics for Few-Shot Action Recognition | Yuke Li et.al. | 2402.12706 | null |
2024-02-19 | Comprehensive Cognitive LLM Agent for Smartphone GUI Automation | Xinbei Ma et.al. | 2402.11941 | null |
2024-02-15 | Hand Shape and Gesture Recognition using Multiscale Template Matching, Background Subtraction and Binary Image Analysis | Ketan Suhaas Saichandran et.al. | 2402.09663 | null |
2024-02-14 | TikTokActions: A TikTok-Derived Video Dataset for Human Action Recognition | Yang Qian et.al. | 2402.08875 | null |
2024-02-13 | BdSLW60: A Word-Level Bangla Sign Language Dataset | Husne Ara Rubaiyeat et.al. | 2402.08635 | link |
2024-02-13 | Vision-Based Hand Gesture Customization from a Single Demonstration | Soroush Shahi et.al. | 2402.08420 | null |
2024-02-12 | PBADet: A One-Stage Anchor-Free Approach for Part-Body Association | Zhongpai Gao et.al. | 2402.07814 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting | Taeho Kang et.al. | 2402.18330 | link |
2024-02-28 | Location-guided Head Pose Estimation for Fisheye Image | Bing Li et.al. | 2402.18320 | null |
2024-02-28 | NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images | Jingrui Yu et.al. | 2402.18196 | null |
2024-02-28 | Six-Point Method for Multi-Camera Systems with Reduced Solution Space | Banglei Guan et.al. | 2402.18066 | null |
2024-02-27 | Real-Time Estimation of Relative Pose for UAVs Using a Dual-Channel Feature Association | Zhaoying Wang et.al. | 2402.17504 | null |
2024-02-26 | HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields | Haozhe Qi et.al. | 2402.17062 | link |
2024-02-26 | DRSI-Net: Dual-Residual Spatial Interaction Network for Multi-Person Pose Estimation | Shang Wu et.al. | 2402.16640 | null |
2024-02-26 | GEA: Reconstructing Expressive 3D Gaussian Avatar from Monocular Video | Xinqi Liu et.al. | 2402.16607 | null |
2024-02-26 | DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer | Yizhe Wu et.al. | 2402.16308 | null |
2024-02-25 | XAI-based gait analysis of patients walking with Knee-Ankle-Foot orthosis using video cameras | Arnav Mishra et.al. | 2402.16175 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation | Jiahao Huang et.al. | 2402.18451 | null |
2024-02-28 | FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes | Ziying Pan et.al. | 2402.18331 | null |
2024-02-28 | Balancing Act: Distribution-Guided Debiasing in Diffusion Models | Rishubh Parihar et.al. | 2402.18206 | null |
2024-02-28 | Misalignment-Robust Frequency Distribution Loss for Image Transformation | Zhangkai Ni et.al. | 2402.18192 | null |
2024-02-28 | VulMCI : Code Splicing-based Pixel-row Oversampling for More Continuous Vulnerability Image Generation | Tao Peng et.al. | 2402.18189 | null |
2024-02-28 | Block and Detail: Scaffolding Sketch-to-Image Generation | Vishnu Sarukkai et.al. | 2402.18116 | null |
2024-02-28 | Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis | Yanzuo Lu et.al. | 2402.18078 | link |
2024-02-28 | SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model | Bin Cao et.al. | 2402.18068 | null |
2024-02-28 | Breaking the Black-Box: Confidence-Guided Model Inversion Attack for Distribution Shift | Xinhao Liu et.al. | 2402.18027 | null |
2024-02-27 | CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing | Chufeng Xiao et.al. | 2402.17624 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards | Haoxiang Wang et.al. | 2402.18571 | link |
2024-02-28 | A Categorization of Complexity Classes for Information Retrieval and Synthesis Using Natural Logic | Gregory Coppola et.al. | 2402.18566 | null |
2024-02-28 | Implicit Bias of Next-Token Prediction | Christos Thrampoulidis et.al. | 2402.18551 | null |
2024-02-28 | Few-Shot Fairness: Unveiling LLM's Potential for Fairness-Aware Classification | Garima Chhikara et.al. | 2402.18502 | null |
2024-02-28 | Take It, Leave It, or Fix It: Measuring Productivity and Trust in Human-AI Collaboration | Crystal Qian et.al. | 2402.18498 | null |
2024-02-28 | Language Models Represent Beliefs of Self and Others | Wentao Zhu et.al. | 2402.18496 | null |
2024-02-28 | Meta-Task Prompting Elicits Embedding from Large Language Models | Yibin Lei et.al. | 2402.18458 | null |
2024-02-28 | Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication | Weize Chen et.al. | 2402.18439 | link |
2024-02-28 | Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport | Bin Li et.al. | 2402.18411 | link |
2024-02-28 | A Cognitive Evaluation Benchmark of Image Reasoning and Description for Large Vision Language Models | Xiujie Song et.al. | 2402.18409 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | Windowed-FourierMixer: Enhancing Clutter-Free Room Modeling with Fourier Transform | Bruno Henriques et.al. | 2402.18287 | null |
2024-02-27 | LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment | Yiming Ren et.al. | 2402.17171 | null |
2024-02-27 | Efficiently Leveraging Linguistic Priors for Scene Text Spotting | Nguyen Nguyen et.al. | 2402.17134 | null |
2024-02-26 | DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer | Yizhe Wu et.al. | 2402.16308 | null |
2024-02-24 | Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition | Mingkun Yang et.al. | 2402.15806 | null |
2024-02-23 | OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding | Francis Engelmann et.al. | 2402.15321 | null |
2024-02-22 | S^2Former-OR: Single-Stage Bimodal Transformer for Scene Graph Generation in OR | Jialun Pei et.al. | 2402.14461 | null |
2024-02-22 | Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding | Yu-Qi Yang et.al. | 2402.14215 | link |
2024-02-21 | Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition | Mingkun Yang et.al. | 2402.13643 | link |
2024-02-25 | DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models | Xiaoyu Tian et.al. | 2402.12289 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | CFDNet: A Generalizable Foggy Stereo Matching Network with Contrastive Feature Distillation | Zihua Liu et.al. | 2402.18181 | null |
2024-02-28 | Self-Supervised Spatially Variant PSF Estimation for Aberration-Aware Depth-from-Defocus | Zhuofeng Wu et.al. | 2402.18175 | null |
2024-02-28 | Passive Snapshot Coded Aperture Dual-Pixel RGB-D Imaging | Bhargav Ghanekar et.al. | 2402.18102 | null |
2024-02-27 | A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track | Zehui Chen et.al. | 2402.17319 | null |
2024-02-26 | Automated Floodwater Depth Estimation Using Large Multimodal Model for Rapid Flood Mapping | Temitope Akinboyewa et.al. | 2402.16684 | null |
2024-02-22 | GAM-Depth: Self-Supervised Indoor Depth Estimation Leveraging a Gradient-Aware Mask and Semantic Constraints | Anqi Cheng et.al. | 2402.14354 | null |
2024-02-22 | TIE-KD: Teacher-Independent and Explainable Knowledge Distillation for Monocular Depth Estimation | Sangwon Choi et.al. | 2402.14340 | link |
2024-02-21 | Zero-BEV: Zero-shot Projection of Any First-Person Modality to BEV Maps | Gianluca Monaci et.al. | 2402.13848 | null |
2024-02-19 | An Endoscopic Chisel: Intraoperative Imaging Carves 3D Anatomical Models | Jan Emily Mangulabnan et.al. | 2402.11840 | null |
2024-02-19 | Unveiling the Depths: A Multi-Modal Fusion Framework for Challenging Scenarios | Jialei Xu et.al. | 2402.11826 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | Exploration of Adapter for Noise Robust Automatic Speech Recognition | Hao Shi et.al. | 2402.18275 | null |
2024-02-28 | Multilingual Speech Models for Automatic Speech Recognition Exhibit Gender Performance Gaps | Giuseppe Attanasio et.al. | 2402.17954 | null |
2024-02-24 | ByteComposer: a Human-like Melody Composition Method based on Language Model Agent | Xia Liang et.al. | 2402.17785 | null |
2024-02-27 | High-Fidelity Neural Phonetic Posteriorgrams | Cameron Churchwell et.al. | 2402.17735 | null |
2024-02-27 | Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey | Dinh-Viet-Toan Le et.al. | 2402.17467 | null |
2024-02-27 | An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement | Tzu-Ting Yang et.al. | 2402.17189 | null |
2024-02-27 | Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models | Rohit Prabhavalkar et.al. | 2402.17184 | null |
2024-02-26 | Towards Decoding Brain Activity During Passive Listening of Speech | Milán András Fodor et.al. | 2402.16996 | link |
2024-02-26 | Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods | Ivan Magrin-Chagnolleau et.al. | 2402.16429 | null |
2024-02-24 | ArEEG_Chars: Dataset for Envisioned Speech Recognition using EEG for Arabic Characters | Hazem Darwish et.al. | 2402.15733 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | Multimodal Learning To Improve Cardiac Late Mechanical Activation Detection From Cine MR Images | Jiarui Xing et.al. | 2402.18507 | null |
2024-02-28 | DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning | Jianxiong Li et.al. | 2402.18137 | null |
2024-02-27 | Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control | Thong Nguyen et.al. | 2402.17535 | link |
2024-02-27 | Curriculum Learning Meets Directed Acyclic Graph for Multimodal Emotion Recognition | Cam-Van Thi Nguyen et.al. | 2402.17269 | null |
2024-02-26 | GROUNDHOG: Grounding Large Language Models to Holistic Segmentation | Yichi Zhang et.al. | 2402.16846 | null |
2024-02-26 | Gradient-Guided Modality Decoupling for Missing-Modality Robustness | Hao Wang et.al. | 2402.16318 | null |
2024-02-24 | FedMM: Federated Multi-Modal Learning with Modality Heterogeneity in Computational Pathology | Yuanzhe Peng et.al. | 2402.15858 | null |
2024-02-20 | GRAFFORD: A Benchmark Dataset for Testing the Knowledge of Object Affordances of Language and Vision Models | Sayantan Adak et.al. | 2402.12881 | link |
2024-02-19 | Multimodal Emotion Recognition from Raw Audio with Sinc-convolution | Xiaohui Zhang et.al. | 2402.11954 | null |
2024-02-18 | Efficient Multimodal Learning from Data-centric Perspective | Muyang He et.al. | 2402.11530 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model | Sangjoon Park et.al. | 2402.18362 | null |
2024-02-28 | Grid-Based Continuous Normal Representation for Anomaly Detection | Joo Chan Lee et.al. | 2402.18293 | link |
2024-02-28 | A Compact Anomaly Detection Solution for Science Instruments | Alfonso Lagares de Toledo et.al. | 2402.17961 | null |
2024-02-27 | Outlier-Detection for Reactive Machine Learned Potential Energy Surfaces | Luis Itza Vazquez-Salazar et.al. | 2402.17686 | null |
2024-02-27 | Fraud Detection with Binding Global and Local Relational Interaction | Haolin Li et.al. | 2402.17472 | null |
2024-02-27 | CGGM: A conditional graph generation model with adaptive sparsity for node anomaly detection in IoT networks | Xianshi Su et.al. | 2402.17363 | null |
2024-02-27 | Structural Teacher-Student Normality Learning for Multi-Class Anomaly Detection and Localization | Hanqiu Deng et.al. | 2402.17091 | null |
2024-02-26 | Deep Learning Algorithms Used in Intrusion Detection Systems -- A Review | Richard Kimanzi et.al. | 2402.17020 | null |
2024-02-25 | An Adversarial Robustness Benchmark for Enterprise Network Intrusion Detection | João Vitorino et.al. | 2402.16912 | null |
2024-02-26 | Uncertainty Quantification in Anomaly Detection with Cross-Conformal |
Oliver Hennhöfer et.al. | 2402.16388 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding | Zhihao Zhang et.al. | 2402.18490 | null |
2024-02-28 | Universal neural network potentials as descriptors: Towards scalable chemical property prediction using quantum and classical computers | Tomoya Shiota et.al. | 2402.18433 | null |
2024-02-28 | Emotion Classification in Low and Moderate Resource Languages | Shabnam Tafreshi et.al. | 2402.18424 | null |
2024-02-28 | A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation | Francesco Barbato et.al. | 2402.18402 | null |
2024-02-28 | Exploration of Adapter for Noise Robust Automatic Speech Recognition | Hao Shi et.al. | 2402.18275 | null |
2024-02-28 | Challenges in Pre-Training Graph Neural Networks for Context-Based Fake News Detection: An Evaluation of Current Strategies and Resource Limitations | Gregor Donabauer et.al. | 2402.18179 | null |
2024-02-28 | Diffusion-based Neural Network Weights Generation | Bedionita Soro et.al. | 2402.18153 | null |
2024-02-28 | Automated Testing of Spatially-Dependent Environmental Hypotheses through Active Transfer Learning | Nicholas Harrison et.al. | 2402.18064 | null |
2024-02-28 | OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine | Xiaosong Wang et.al. | 2402.18028 | null |
2024-02-28 | Collaborative decoding of critical tokens for boosting factuality of large language models | Lifeng Jin et.al. | 2402.17982 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | Digging Into Normal Incorporated Stereo Matching | Zihua Liu et.al. | 2402.18171 | link |
2024-02-28 | 3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling | Chaokang Jiang et.al. | 2402.18146 | link |
2024-02-27 | ICP-Flow: LiDAR Scene Flow Estimation with ICP | Yancong Lin et.al. | 2402.17351 | null |
2024-02-25 | LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding | Yuxuan Wang et.al. | 2402.16050 | link |
2024-02-18 | TDE-3: An improved prior for optical flow computation in spiking neural networks | Matthew Yedutenko et.al. | 2402.11662 | null |
2024-02-17 | Dense Matchers for Dense Tracking | Tomáš Jelínek et.al. | 2402.11287 | null |
2024-02-16 | Multi-Model 3D Registration: Finding Multiple Moving Objects in Cluttered Point Clouds | David Jin et.al. | 2402.10865 | null |
2024-02-14 | Moving Object Proposals with Deep Learned Optical Flow for Video Object Segmentation | Ge Shi et.al. | 2402.08882 | null |
2024-02-12 | A Flow-based Credibility Metric for Safety-critical Pedestrian Detection | Maria Lyssenko et.al. | 2402.07642 | null |
2024-02-09 | Image-based Deep Learning for the time-dependent prediction of fresh concrete properties | Max Meyer et.al. | 2402.06611 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards | Haoxiang Wang et.al. | 2402.18571 | link |
2024-02-28 | Unifying F1TENTH Autonomous Racing: Survey, Methods and Benchmarks | Benjamin David Evans et.al. | 2402.18558 | null |
2024-02-28 | Human-Centric Aware UAV Trajectory Planning in Search and Rescue Missions Employing Multi-Objective Reinforcement Learning with AHP and Similarity-Based Experience Replay | Mahya Ramezani et.al. | 2402.18487 | null |
2024-02-28 | FinAgent: A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist | Wentao Zhang et.al. | 2402.18485 | null |
2024-02-28 | Implementing Online Reinforcement Learning with Clustering Neural Networks | James E. Smith et.al. | 2402.18472 | null |
2024-02-28 | Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning | Jin Hwa Lee et.al. | 2402.18361 | null |
2024-02-28 | Solving Multi-Entity Robotic Problems Using Permutation Invariant Neural Networks | Tianxu An et.al. | 2402.18345 | null |
2024-02-28 | Whole-body Humanoid Robot Locomotion with Human Reference | Qiang Zhang et.al. | 2402.18294 | null |
2024-02-28 | Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization | Shuo Yang et.al. | 2402.18284 | null |
2024-02-28 | Reinforcement Learning and Graph Neural Networks for Probabilistic Risk Assessment | Joachim Grimstad et.al. | 2402.18246 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-02-28 | Graph Regularized Encoder Training for Extreme Classification | Anshul Mittal et.al. | 2402.18434 | null |
2024-02-28 | Universal neural network potentials as descriptors: Towards scalable chemical property prediction using quantum and classical computers | Tomoya Shiota et.al. | 2402.18433 | null |
2024-02-28 | CafkNet: GNN-Empowered Forward Kinematic Modeling for Cable-Driven Parallel Robots | Zeqing Zhang et.al. | 2402.18420 | null |
2024-02-28 | Recursive GNNs for Learning Precoding Policies with Size-Generalizability | Jia Guo et.al. | 2402.18332 | null |
2024-02-28 | A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames | Hongshen Xu et.al. | 2402.18258 | link |
2024-02-28 | Reinforcement Learning and Graph Neural Networks for Probabilistic Risk Assessment | Joachim Grimstad et.al. | 2402.18246 | null |
2024-02-28 | Challenges in Pre-Training Graph Neural Networks for Context-Based Fake News Detection: An Evaluation of Current Strategies and Resource Limitations | Gregor Donabauer et.al. | 2402.18179 | null |
2024-02-28 | Hierarchical Multi-Relational Graph Representation Learning for Large-Scale Prediction of Drug-Drug Interactions | Mengying Jiang et.al. | 2402.18127 | link |
2024-02-27 | Using Graph Neural Networks to Predict Local Culture | Thiago H Silva et.al. | 2402.17905 | null |
2024-02-27 | Learning Topological Representations with Bidirectional Graph Attention Network for Solving Job Shop Scheduling Problem | Cong Zhang et.al. | 2402.17606 | null |