multimodal-fusion

There are 31 repositories under the multimodal-fusion topic.

icey-zhang/SuperYOLO
SuperYOLO is accepted by TGRS
Language: Python 333 2 13454
v-iashin/BMT
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
Language: Jupyter Notebook 227 6 4857
declare-lab/Multimodal-Infomax
This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.
Language: Python 167 4 2134
mahmoodlab/MCAT
Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images - ICCV 2021
Language: Jupyter Notebook 167 4 2236
thuiar/MIntRec
MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)
Language: Python 78 2 1414
akashe/Multimodal-action-recognition
Code for selecting an action based on multimodal inputs. In this case, the inputs are voice and text.
Language: Python 69 1 613
icey-zhang/E2E-MFD
E2E-MFD-OOD
Language: Jupyter Notebook 51 1 152
ai-forever/fusion_brain_aij2021
Creating multimodal multitask models
Language: Jupyter Notebook 50 5 115
declare-lab/hfusion
Multimodal sentiment analysis using hierarchical fusion with context modeling
Language: Python 44 4 522
gholste/breast_mri_fusion
[CVAMD 2021] "End-to-End Learning of Fused Image and Non-Image Feature for Improved Breast Cancer Classification from MRI"
Language: Python 33 2 17
Asichurter/MalFusionFSL
Few-shot malware classification using fused features of static analysis and dynamic analysis (a few-shot malware classification framework based on hybrid static and dynamic analysis features)
Language: Python 29 3 22
declare-lab/M2H2-dataset
This repository contains the dataset and baselines explained in the paper: M2H2: A Multimodal Multiparty Hindi Dataset For Humor Recognition in Conversations
Language: Python 18 4 212
ai-forever/fbc2_aij2022
FusionBrain Challenge 2.0: creating a multimodal multitask model
Language: Python 16 2 11
imadhou/multimodal-sentiment-analysis
Multimodal sentiment analysis
Language: Jupyter Notebook 14 1 02
icey-zhang/E2E-MFD-HOD
E2E-MFD-HOD
Language: Python 11 2 11
shengyangsun/MSBT
Official implementation of "Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence Detection"
Language: Python 11 2 22
sverma88/Deep-HOSeq--ICDM-2020
Deep-HOSeq: Deep Higher-Order Sequence Fusion for Multimodal Sentiment Analysis.
Language: Python 11 2 13
AlfredsLapkovskis/MultimodalPlantClassifier
Source code for the paper "Automatic Fused Multimodal Deep Learning for Plant Identification" (Alfreds Lapkovskis, Natalia Nefedova & Ali Beikmohammadi, 2024)
Language: Jupyter Notebook 5 2 01
marcomoldovan/multimodal-self-distillation
A generalized self-supervised training paradigm for unimodal and multimodal alignment and fusion.
Language: Python 5 2 02
zzbn12345/Climate_Stance_Multimodal
The code and data for the paper 'Inferring Climate Change Stances from Multimodal Tweets', accepted by the Short Paper track of SIGIR 2024
Language: Jupyter Notebook 4 1 00
AlfredsLapkovskis/MultimodalPlantClassifier-iOS
Source code of a sample iOS app for the paper "Automatic Fused Multimodal Deep Learning for Plant Identification" (Alfreds Lapkovskis, Natalia Nefedova & Ali Beikmohammadi, 2024)
Language: Swift 3 2 01
Clealiya/Multimodal-model
[FR|EN - Trio] 2023 - 2024 Centrale Méditerranée AI Master | Multimodal retranscription with text, audio and video
Language: Python 3 1 00
kasunweerkoon/VAPOR
VAPOR: Legged Robot Navigation in Outdoor Vegetation using Offline Reinforcement Learning (ICRA2024)
Language: Python 3 2 01
sustainable-computing/Centaur
Repo for "Centaur: Robust Multimodal Fusion for Human Activity Recognition"
Language: Jupyter Notebook 3 1 01
brian-zZZ/Guided-PLI
A Transferability-guided Protein-Ligand Interaction Prediction Method
Language: Python 1 2 00
EesunMoon/On-device_Multimodal_ER
[Research] Multimodal Emotion Recognition for On-device AI
Language: Python 1
usc-sail/mica-context-emotion-recognition
Repository for context-based emotion recognition
Language: Python 1 4 00
anisha0325/MM-CliConSummation
The codebase for our paper on Multi-modal Medical Dialogue Summarization
Language: Python 0 0 00
fatemafaria142/BanglaCalamityMMD-A-Comprehensive-Benchmark-Dataset-for-Multimodal-Disaster-Identification
This study presents a novel multimodal fusion technique for disaster identification in Bangla, combining text and image data using the "BanglaCalamityMMD" dataset. Employing DisasterTextNet, DisasterImageNet, and DisasterMultFusionNet, the approach addresses a key gap in Bangla disaster research.
Language: Jupyter Notebook 0 0 00
kaykobad/MMSFormer
We propose Multi-Modal Segmentation TransFormer (MMSFormer) that incorporates a novel fusion strategy to perform multimodal material segmentation.
Language: Python 0 0 00
ivanovsdesign/information_retrieval
Web scraper for Wildberries + simple vectorization/multimodal embedding workflow
Language: Jupyter Notebook 1 0
