Multimodal Cross-lingual Phrase Retrieval

This repository contains the code and pre-trained models for our paper Multimodal Cross-lingual Phrase Retrieval.

**************************** Updates ****************************

Overview

We propose a method for retrieving parallel phrases across languages from multimodal data, termed Multimodal Cross-lingual Phrase Retrieval (MXPR).

In the following sections, we describe how to use MXPR.

First, install PyTorch by following the instructions on the official website. To faithfully reproduce our results, please use torch==1.8.1+cu111, or the 1.8.1 build that matches your platform and CUDA version. PyTorch versions higher than 1.8.1 should also work.
Then, run the following commands to clone the repo and install the remaining dependencies.

git clone https://github.com/sdongchuanqi/MXPR.git
cd MXPR
pip install -r requirements.txt
mkdir data
mkdir model
mkdir result
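
As a quick sanity check after installation, the sketch below compares the installed PyTorch version against the 1.8.1 minimum mentioned above. It is not part of the repository; the helper names (parse_version, meets_minimum, installed_torch_version) are hypothetical, and it only uses the Python standard library (importlib.metadata, available in Python 3.8+).

```python
import re
from importlib import metadata

# Minimum PyTorch version stated in this README.
MIN_TORCH = (1, 8, 1)


def parse_version(version: str) -> tuple:
    """Turn a string such as '1.8.1+cu111' into (1, 8, 1).

    The local build tag after '+' (for example 'cu111') is ignored.
    """
    public = version.split("+", 1)[0]
    parts = []
    for piece in public.split(".")[:3]:
        match = re.match(r"\d+", piece)
        parts.append(int(match.group()) if match else 0)
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def meets_minimum(version: str, minimum: tuple = MIN_TORCH) -> bool:
    """Return True if `version` is at least `minimum`."""
    return parse_version(version) >= minimum


def installed_torch_version():
    """Return the installed torch version string, or None if torch is absent."""
    try:
        return metadata.version("torch")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_torch_version()
    if found is None:
        print("torch is not installed")
    elif meets_minimum(found):
        print(f"torch {found} satisfies the >= 1.8.1 requirement")
    else:
        print(f"torch {found} is older than 1.8.1")
```

The check reads package metadata rather than importing torch, so it does not say whether the build matches your CUDA driver.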

Before using MXPR, please process the dataset by following the steps below.

Download our dataset here: link
Unzip the dataset and move it into the data folder. (Make sure the dataset path in the bash script matches its actual location.)
Get the relation text from m-plug: link
We also provide our labeled data here: link
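
The folder setup described above can be checked before training with a minimal sketch like the one below. It only verifies that the data, model, and result folders created during installation exist and that data is not empty; it assumes nothing about the file names inside the dataset. The function name check_layout is hypothetical and not part of the repository.

```python
from pathlib import Path


def check_layout(root="."):
    """Return a list of problems found in the expected folder layout.

    An empty list means data/, model/ and result/ exist under `root`
    and data/ contains at least one entry.
    """
    root = Path(root)
    problems = []
    for name in ("data", "model", "result"):
        if not (root / name).is_dir():
            problems.append(f"missing folder: {name}/")
    data_dir = root / "data"
    if data_dir.is_dir() and not any(data_dir.iterdir()):
        problems.append("data/ is empty; unzip the dataset into it")
    return problems


if __name__ == "__main__":
    # Run from the repository root (the MXPR folder).
    issues = check_layout(".")
    if issues:
        for issue in issues:
            print(issue)
    else:
        print("folder layout looks ready")
```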

Once the dataset is in place, run the training script:

bash train.sh