Pinned Repositories
3DConvCaps
[ICPR 2022] 3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image Segmentation
AerialFormer
[Remote Sensing] AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation
AISFormer
[BMVC 2022] AISFormer: Amodal Instance Segmentation with Transformer
ECG_SSL_12Lead
[IEEE BHI 2022] Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning
Embryos
[WACV 2023] EmbryosFormer: Deformable Transformer and Collaborative Encoding-Decoding for Embryos Stage Development Classification
MEGANet
[WACV 2024] An implementation of MEGANet for polyp segmentation with multi-scale edge-guided attention
OpenFusion
[ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
SAM3D
[ISBI 2024] An implementation of SAM3D which adapts Segment Anything Model for Volumetric Medical Image Segmentation
VLCAP
[ICIP 2022] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
VLTinT
[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
AICV Lab's Repositories
UARK-AICV/OpenFusion
[ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
UARK-AICV/VLTinT
[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
UARK-AICV/MEGANet
[WACV 2024] An implementation of MEGANet for polyp segmentation with multi-scale edge-guided attention
UARK-AICV/SAM3D
[ISBI 2024] An implementation of SAM3D which adapts Segment Anything Model for Volumetric Medical Image Segmentation
UARK-AICV/AerialFormer
[Remote Sensing] AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation
UARK-AICV/3DConvCaps
[ICPR 2022] 3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image Segmentation
UARK-AICV/ECG_SSL_12Lead
[IEEE BHI 2022] Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning
UARK-AICV/AISFormer
[BMVC 2022] AISFormer: Amodal Instance Segmentation with Transformer
UARK-AICV/VLCAP
[ICIP 2022] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
UARK-AICV/Embryos
[WACV 2023] EmbryosFormer: Deformable Transformer and Collaborative Encoding-Decoding for Embryos Stage Development Classification
UARK-AICV/AOE-Net
[IJCV] AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation
UARK-AICV/TSRNet
[ISBI 2024] An implementation of TSRNet for ECG Anomaly Detection
UARK-AICV/TAPG-AgentEnvInteration
[BMVC 2021 Oral] AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposal Generation
UARK-AICV/TrackGUI
UARK-AICV/UARK-AICV.github.io
[Lab] lab website
UARK-AICV/IAI
[WACV 2024] Decoding Radiologists’ Intense Focus for Accurate CXR Diagnoses: A Controllable & Interpretable AI System
UARK-AICV/Video_Representation
[Asilomar 2022] Contextual Explainable Video Representation: Human Perception-based Understanding
UARK-AICV/CattleFace-RGBT-benchmark
UARK-AICV/FG-CXR
The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Generation"
UARK-AICV/ZEETAD
[WACV2024] ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection
UARK-AICV/SS-3DCapsNet
[ISBI 2022] SS-3DCapsNet: Self-supervised 3D Capsule Networks for Medical Segmentation on Less Labeled Data
UARK-AICV/CarcassFormer
Poultry Science Journal - CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect
UARK-AICV/HENASY
HENASY: Learning to Assemble Scene-Entities for Interpretable Egocentric Video-Language Model
UARK-AICV/ItpCtrl-AI
ItpCtrl-AI: End-to-End Interpretable and Controllable Artificial Intelligence by Modeling Radiologists’ Intentions
UARK-AICV/ShapeFormer
ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation