MLM-Geo is a web application based on two main tasks: information retrieval and location estimation. Currently, the application receives as input an image and performs both tasks. Regarding location estimation, the top 10 predicted locations are displayed on the map. While for the information retrieval task, the top 10 visually similar human settlement entities from Wikidata are returned. MLM-Geo also provides the current news taking place in any of the retrieved entities using EventRegistry!
MLM-Geo is built on top of the MLM dataset. In particular an extension of it with 7 more languages. For more details about MLM dataset click here!
MLM-Geo is currently online and you can give it a try here! More details on how to use it are present by clicking the info button on the page.
Python version >= 3.7
PyTorch version = 1.5.1
# clone the repository
git clone https://github.com/GOALCLEOPATRA/MLM_Geo.git
cd MLM_Geo
pip install -r requirements.txt
For working with MLM-Geo locally you will need to download the model checkpoints and dataset files. Checkpoints should be placed under the models' directory in a folder named checkpoints. While all other files should be placed under a folder named data.
Link for the checkpoints and dataset files will be provided soon!
To be able to retrieve the latest news for the entities, you will need to register on EventRegistry and use the API key. Please save the API key in a text file and name it ER_API_KEY.
We serve MLM-Geo using Waitress pure-Python WSGI server. After having the checkpoints, the dataset files and the EventRegistry API key you can simply run:
# run waitress server
python waitress_server.py
This should run MLM-Geo in the following address 0.0.0.0:9000.
A live API is provided alongside the application. It may be accessed in the following manner:
curl \
-F 'file=@/path/to/image/image.jpg' \
-F 'lang=en' \
http://cleopatra.ijs.si/mlm-demo/predict
Allowed image extensions: png, jpg, jpeg.
Currently, we support the following languages: en, de, fr, it, es, pl, ro, nl, hu, pt.
The repository is under MIT License.