multimodal-fusion
There are 31 repositories under the multimodal-fusion topic.
icey-zhang/SuperYOLO
SuperYOLO has been accepted by TGRS
v-iashin/BMT
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
declare-lab/Multimodal-Infomax
This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.
mahmoodlab/MCAT
Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images - ICCV 2021
thuiar/MIntRec
MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)
akashe/Multimodal-action-recognition
Code for selecting an action based on multimodal inputs. In this case, the inputs are voice and text.
icey-zhang/E2E-MFD
E2E-MFD-OOD
ai-forever/fusion_brain_aij2021
Creating multimodal multitask models
declare-lab/hfusion
Multimodal sentiment analysis using hierarchical fusion with context modeling
gholste/breast_mri_fusion
[CVAMD 2021] "End-to-End Learning of Fused Image and Non-Image Features for Improved Breast Cancer Classification from MRI"
Asichurter/MalFusionFSL
Few-shot malware classification framework using fused features from static and dynamic analysis
declare-lab/M2H2-dataset
This repository contains the dataset and baselines described in the paper "M2H2: A Multimodal Multiparty Hindi Dataset For Humor Recognition in Conversations"
ai-forever/fbc2_aij2022
FusionBrain Challenge 2.0: creating a multimodal multitask model
imadhou/multimodal-sentiment-analysis
Multimodal sentiment analysis
icey-zhang/E2E-MFD-HOD
E2E-MFD-HOD
shengyangsun/MSBT
Official implementation of "Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence Detection"
sverma88/Deep-HOSeq--ICDM-2020
Deep-HOSeq: Deep Higher-Order Sequence Fusion for Multimodal Sentiment Analysis.
AlfredsLapkovskis/MultimodalPlantClassifier
Source code for the paper "Automatic Fused Multimodal Deep Learning for Plant Identification" (Alfreds Lapkovskis, Natalia Nefedova & Ali Beikmohammadi, 2024)
marcomoldovan/multimodal-self-distillation
A generalized self-supervised training paradigm for unimodal and multimodal alignment and fusion.
zzbn12345/Climate_Stance_Multimodal
The code and data for the paper "Inferring Climate Change Stances from Multimodal Tweets", accepted to the Short Paper track of SIGIR 2024
AlfredsLapkovskis/MultimodalPlantClassifier-iOS
Source code of a sample iOS app for the paper "Automatic Fused Multimodal Deep Learning for Plant Identification" (Alfreds Lapkovskis, Natalia Nefedova & Ali Beikmohammadi, 2024)
Clealiya/Multimodal-model
[FR|EN - Trio] 2023 - 2024 Centrale Méditerranée AI Master | Multimodal retranscription with text, audio and video
kasunweerkoon/VAPOR
VAPOR: Legged Robot Navigation in Outdoor Vegetation using Offline Reinforcement Learning (ICRA 2024)
sustainable-computing/Centaur
Repo for "Centaur: Robust Multimodal Fusion for Human Activity Recognition"
brian-zZZ/Guided-PLI
A Transferability-guided Protein-Ligand Interaction Prediction Method
EesunMoon/On-device_Multimodal_ER
[Research] Multimodal Emotion Recognition for On-device AI
usc-sail/mica-context-emotion-recognition
Repository for context based emotion recognition
anisha0325/MM-CliConSummation
The codebase for our paper on Multi-modal Medical Dialogue Summarization
fatemafaria142/BanglaCalamityMMD-A-Comprehensive-Benchmark-Dataset-for-Multimodal-Disaster-Identification
This study presents a novel multimodal fusion technique for disaster identification in Bangla, combining text and image data using the "BanglaCalamityMMD" dataset. Employing DisasterTextNet, DisasterImageNet, and DisasterMultFusionNet, the approach addresses a key gap in Bangla disaster research.
kaykobad/MMSFormer
We propose Multi-Modal Segmentation TransFormer (MMSFormer) that incorporates a novel fusion strategy to perform multimodal material segmentation.
ivanovsdesign/information_retrieval
Web scraper for Wildberries + simple vectorization/multimodal embedding workflow