
Multimodal and Multilingual Georeferencing and News Retrieval


MLM-Geo

MLM-Geo is a web application built around two tasks: information retrieval and location estimation. It currently takes an image as input and performs both tasks. For location estimation, the top 10 predicted locations are displayed on a map. For information retrieval, the top 10 visually similar human settlement entities from Wikidata are returned. MLM-Geo also shows current news about any of the retrieved entities, using EventRegistry.

MLM-Geo is built on top of the MLM dataset, specifically an extension of it with 7 more languages. For more details about the MLM dataset, click here!

MLM-Geo is currently online and you can give it a try here! For more details on how to use it, click the info button on the page.

Run MLM-Geo locally

Requirements and Setup

Python version >= 3.7

PyTorch version = 1.5.1

# clone the repository
git clone https://github.com/GOALCLEOPATRA/MLM_Geo.git
cd MLM_Geo
pip install -r requirements.txt

Download checkpoints and dataset files

To work with MLM-Geo locally, you will need to download the model checkpoints and dataset files. Checkpoints should be placed in a folder named checkpoints under the models directory. All other files should be placed in a folder named data.

Link for the checkpoints and dataset files will be provided soon!
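Before starting the server, it can help to confirm that the folders described above are in place. The following is a minimal sketch, not part of the repository: the helper name `missing_dirs` is hypothetical, and it assumes that both `models/checkpoints` and `data` sit directly under the repository root.

```python
from pathlib import Path

# Layout as described in this README. Their location relative to the
# repository root is an assumption.
EXPECTED_DIRS = ("models/checkpoints", "data")


def missing_dirs(repo_root="."):
    """Return the expected directories that are absent or empty."""
    root = Path(repo_root)
    missing = []
    for rel in EXPECTED_DIRS:
        path = root / rel
        # A directory that exists but holds no files is treated as missing.
        if not path.is_dir() or not any(path.iterdir()):
            missing.append(rel)
    return missing


if __name__ == "__main__":
    problems = missing_dirs()
    if problems:
        print("Missing or empty:", ", ".join(problems))
    else:
        print("Checkpoints and data directories look ready.")
```

Run it from the repository root. It only checks that the folders exist and are non-empty, not that the files inside are the right ones.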

EventRegistry API key

To retrieve the latest news for the entities, you will need to register on EventRegistry and obtain an API key. Save the API key in a text file named ER_API_KEY.
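As an illustration of how such a key file can be read, here is a small sketch. The function `load_api_key` is hypothetical and is not taken from the MLM-Geo code. It assumes the file sits in the current working directory and contains only the key, possibly followed by a newline.

```python
from pathlib import Path


def load_api_key(path="ER_API_KEY"):
    """Read the EventRegistry API key from a text file.

    The default path (current working directory) is an assumption;
    adjust it to wherever the key file is stored.
    """
    key = Path(path).read_text(encoding="utf-8").strip()
    if not key:
        raise ValueError(f"{path} is empty; paste your EventRegistry API key into it")
    return key
```

Stripping whitespace guards against a trailing newline added by a text editor, which would otherwise become part of the key.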

Run server

We serve MLM-Geo using Waitress, a pure-Python WSGI server. Once you have the checkpoints, the dataset files and the EventRegistry API key, you can simply run:

# run waitress server
python waitress_server.py

This should run MLM-Geo at the following address: 0.0.0.0:9000.

Live API

A live API is provided alongside the application. It can be accessed as follows:

curl \
-F 'file=@/path/to/image/image.jpg' \
-F 'lang=en'  \
http://cleopatra.ijs.si/mlm-demo/predict

Allowed image extensions: png, jpg, jpeg.

Currently, we support the following languages: en, de, fr, it, es, pl, ro, nl, hu, pt.
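The same request can be sent from Python. The sketch below mirrors the curl call using the third-party `requests` library (install it with `pip install requests`). The function names `validate_request` and `predict` are hypothetical, the client-side checks simply restate the extension and language lists above, and lowercasing the extension before checking it is an assumption about what the server accepts. The response format is not documented here, so the body is returned as raw text.

```python
from pathlib import Path

API_URL = "http://cleopatra.ijs.si/mlm-demo/predict"
ALLOWED_EXTENSIONS = {"png", "jpg", "jpeg"}
SUPPORTED_LANGS = {"en", "de", "fr", "it", "es", "pl", "ro", "nl", "hu", "pt"}


def validate_request(image_path, lang="en"):
    """Check extension and language locally and return the form fields."""
    ext = Path(image_path).suffix.lower().lstrip(".")
    if ext not in ALLOWED_EXTENSIONS:
        raise ValueError(f"unsupported image extension: {ext!r}")
    if lang not in SUPPORTED_LANGS:
        raise ValueError(f"unsupported language: {lang!r}")
    return {"lang": lang}


def predict(image_path, lang="en", timeout=60):
    """Upload an image to the live API and return the raw response body."""
    import requests  # third-party; imported here so validation works without it

    data = validate_request(image_path, lang)
    with open(image_path, "rb") as fh:
        # Field names "file" and "lang" match the curl example.
        response = requests.post(
            API_URL, files={"file": fh}, data=data, timeout=timeout
        )
    response.raise_for_status()
    return response.text
```

A call such as `predict("/path/to/image/image.jpg", lang="en")` corresponds to the curl command above.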

License

The repository is released under the MIT License.