This repository contains the code and pre-trained models for our paper Multimodal Cross-lingual Phrase Retrieval.
**************************** Updates ****************************
- 2/16 Our paper has been accepted to LREC-COLING 2024.
We propose a method for retrieving parallel phrases across languages from multimodal data, termed Multimodal Cross-lingual Phrase Retrieval.
The following sections describe how to install and use MXPR.
- First, install PyTorch by following the instructions from the official website. To faithfully reproduce our results, please use torch==1.8.1+cu111, or the 1.8.1 build that corresponds to your platform and CUDA version. PyTorch versions higher than 1.8.1 should also work.
- Then, run the following script to fetch the repo and install the remaining dependencies.
git clone https://github.com/sdongchuanqi/MXPR.git
cd MXPR
pip install -r requirements.txt
mkdir data
mkdir model
mkdir result
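As an optional sanity check after installation, the sketch below re-creates the working directories in a way that is safe to re-run and prints the installed PyTorch version. It is not part of the MXPR repo. It assumes you are in the repository root and that a python3 interpreter is on your PATH.

```shell
# Create the working directories; -p makes the command safe to re-run
# if some of them already exist.
mkdir -p data model result

# Print the installed PyTorch version and the CUDA version it was built with,
# so you can compare against 1.8.1 / cu111. Falls back to a message if
# PyTorch (or python3) is not available in this environment.
python3 -c "import torch; print(torch.__version__, torch.version.cuda)" \
    || echo "PyTorch is not importable in this environment" >&2
```

If the printed version differs from 1.8.1, training may still work, but results may not match the paper exactly.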
Before using MXPR, please process the dataset by following the steps below.
- Download our dataset here: link
- Unzip the dataset and move it into the data folder. (Make sure the path in the bash file matches the path of the dataset.)
- Get the relation text from m-plug: link
- We also provide our labeled data here: link
  Link: https://pan.baidu.com/s/1QQ6eRMkLNJAzr75VoZm7dg Extraction code: 0enw
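Before training, it can help to confirm that the dataset actually landed in the data folder. The following is a minimal sketch of such a check. The helper name check_data_dir is hypothetical and not part of the MXPR repo, and the data path is an assumption based on the folder created during installation; if the bash file points somewhere else, pass that path instead.

```shell
# Hypothetical helper (not part of the MXPR repo): succeeds only if the
# given directory exists and contains at least one entry.
check_data_dir() {
    dir="${1:-data}"
    if [ -d "$dir" ] && [ -n "$(ls -A "$dir" 2>/dev/null)" ]; then
        echo "Found dataset files in $dir"
    else
        echo "No dataset files found in $dir" >&2
        return 1
    fi
}

# Assumed location: the data folder created during installation.
check_data_dir data || echo "Unzip the dataset into data/ before training" >&2
```

The check only looks for the presence of files; it does not validate their contents or format.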
To train the model, run:
bash train.sh