Backend of Cross-modal Retrieval System

Author: Zhiqiang Yuan

-------------------------------------------------------------------------------------

Feel free to 👍`Fork and Star`👍 the repository so that you are notified when we update it.

The back end of a cross-modal retrieval system, which will contain services such as semantic localization. The purpose of this project is to provide a practical retrieval framework for retrieval models. We use remote sensing (RS) image data as the baseline for development, and demonstrate the potential of the project through services such as semantic localization and cross-modal retrieval.

-------------------------------------------------------------------------------------

Requirements

numpy >= 1.7.1
six >= 1.1.0
PyTorch > 0.3
flask >= 1.1.1
h5py
nltk
yaml (provided by the PyYAML package)

-------------------------------------------------------------------------------------

Apis

------------------------------------------
#/api/image_encode/ [POST]
# FUNC: encode images

# the samples below reuse these two imports
import json
import requests

data = {
    # image_id: file_path
    11: "../data/test_data/images/00013.jpg",
    33: "../data/test_data/images/00013.jpg",
    32: "../data/test_data/images/00013.jpg",
}
url = 'http://192.168.43.216:49205/api/image_encode/'

r = requests.post(url, data=json.dumps(data))
print(r.json())
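
One detail of the request above is worth keeping in mind: JSON object keys are always strings, so the integer image ids in `data` are sent as `"11"`, `"33"` and `"32"`. The short sketch below shows that round trip. The helper name `build_image_encode_payload` is hypothetical and not part of this project, and whether the service converts the ids back to integers is not stated in this README.

```python
import json


def build_image_encode_payload(images):
    """Serialize an {image_id: file_path} mapping the same way the sample request does."""
    return json.dumps(images)


payload = build_image_encode_payload({11: "../data/test_data/images/00013.jpg"})

# Decoding the body shows what a JSON parser on the receiving side would see.
decoded = json.loads(payload)
print(decoded)
```

If your client code compares ids returned by the service with local integer ids, it may need to normalize both sides to the same type first.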

------------------------------------------
#/api/delete_encode/ [POST]  
# FUNC: delete the stored encodings of the given images

# ids of the images whose encodings should be deleted
data = [3, 4]
url = 'http://192.168.43.216:49205/api/delete_encode/'
r = requests.post(url, data=json.dumps(data))
print(r.json())

------------------------------------------
#/api/text_search/ [POST]  
# FUNC: cross-modal retrieval 
   
data = {
    'text': "One block has a cross shaped roof church.",  # query text
    'retrieved_ids': "*",  # pool of images to search
    'start': 0,    # start rank, counted from the top of the ranking
    'end': 100     # end rank
}
url = 'http://192.168.43.216:49205/api/text_search/'
r = requests.post(url, data=json.dumps(data))
print(r.json())

------------------------------------------
#/api/image_search/ [POST]  
# FUNC: image-image retrieval 
   
data = {
    'image_path': "../data/test_data/images/00013.jpg",  # query image
    'retrieved_ids': "*",  # pool of images to search: 1) "*" means all images, 2) a list such as [1, 2, 4] restricts the pool
    'start': 0,    # start rank, counted from the top of the ranking
    'end': 100     # end rank
}
url = 'http://192.168.43.216:49205/api/image_search/'
r = requests.post(url, data=json.dumps(data))
print(r.json())
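
To make the roles of `retrieved_ids`, `start` and `end` in the two search requests more concrete, here is a minimal sketch of how such a ranking could work. It is an illustration under assumptions, not the project's implementation: the function names `cosine` and `search`, the use of cosine similarity, the toy two-dimensional encodings, and the choice to treat `end` as exclusive (Python slice semantics) are all hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(query_vec, encodes, retrieved_ids="*", start=0, end=100):
    """Rank the image pool against the query and return the ranks in [start, end)."""
    # "*" selects every stored encoding; otherwise only the listed ids are searched.
    pool = list(encodes) if retrieved_ids == "*" else retrieved_ids
    scored = [(image_id, cosine(query_vec, encodes[image_id]))
              for image_id in pool if image_id in encodes]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[start:end]


# Toy encodings standing in for the stored image features.
encodes = {1: [1.0, 0.0], 2: [0.0, 1.0], 4: [1.0, 1.0]}
top_two = search([1.0, 0.1], encodes, retrieved_ids=[1, 2, 4], start=0, end=2)
print([image_id for image_id, _ in top_two])
```

In this sketch, paging through results only requires moving `start` and `end`, since the ranking itself does not change between calls.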

------------------------------------------
#/api/semantic_localization/ [POST]  
# FUNC: semantic localization
   
data = {
    'image_path': "../data/test_data/images/demo1.tif",
    'text': "there are two tennis courts beside the playground",
    'params': {
        'steps': [64, 128, 256, 512]
    },
}
url = 'http://192.168.43.216:49205/api/semantic_localization/'
r = requests.post(url, data=json.dumps(data))
print(r.json())
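
This README does not say how `steps` is used by the service. One plausible reading, shown here purely as an assumption, is that each value is the side length (and stride) of a square sliding window, so that the image is cut into patches at several scales and each patch can be compared with the text. The function name `sliding_windows` and the 512x512 image size are hypothetical.

```python
def sliding_windows(width, height, steps):
    """Yield (left, top, right, bottom) boxes, one non-overlapping grid per window size.

    Assumption: each entry of `steps` is both the window size and the stride.
    """
    for size in steps:
        for top in range(0, height - size + 1, size):
            for left in range(0, width - size + 1, size):
                yield (left, top, left + size, top + size)


boxes = list(sliding_windows(512, 512, [64, 128, 256, 512]))
print(len(boxes))
```

Under this reading, smaller values give finer localization at the cost of many more patches to encode, which is one reason a mix of scales might be requested.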

-------------------------------------------------------------------------------------

Architecture

-- code     # all source code
    -- api_controls     # control files
    -- common           # config file
    -- models           # put the retrieval model here
    -- globalvar.py     # global variable definitions
    -- main.py          # main file

-- data
    -- retrieval_system_data    # project data here
    -- test_data        # image database here

-- figure   # some figures about this project

-- test     # test function

-------------------------------------------------------------------------------------

Three Steps to Use This Framework

Step 1. Install the environment, download the code to your local machine, and change the path settings in the ./code/common/config file. You also need to change the paths in the yaml file under ./code/models/options/ .

Step 2. Enter the ./code directory and run main.py to start the Flask service.

Step 3. Send sample requests with a tool such as Postman or with Python's requests library. Some sample requests are shown in ./test/test_qpi.py .
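
If you prefer not to install an extra client library for Step 3, a request can also be built with the Python standard library. The sketch below only constructs the request and does not send it. The helper name `build_post`, the `127.0.0.1` host and the `Content-Type` header are assumptions; the port and the endpoint path follow the samples in the Apis section.

```python
import json
import urllib.request


def build_post(base_url, endpoint, payload):
    """Build (but do not send) a JSON POST request for one of the /api/ endpoints."""
    url = base_url.rstrip("/") + "/api/" + endpoint.strip("/") + "/"
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_post(
    "http://127.0.0.1:49205",  # assumed local address of the running service
    "text_search",
    {"text": "One block has a cross shaped roof church.",
     "retrieved_ids": "*", "start": 0, "end": 100},
)
print(req.get_method(), req.full_url)
```

Once the service from Step 2 is running, `urllib.request.urlopen(req)` would send the request and the response body can be decoded with `json.loads`.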

-------------------------------------------------------------------------------------

Customize Your Retrieval Model

To run your own retrieval model in the service, you only need to change the ./code/models folder. Your model should provide a model initialization interface and an encoding interface for each data modality. For more information, please see the README file under ./code/models/ .
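
As a rough picture of what "encoding interfaces and model initialization interfaces" could look like, here is a minimal sketch. Every name in it (`RetrievalModel`, `encode_image`, `encode_text`, `ToyModel`, `model_init`) is hypothetical; the actual interface expected by the service is described in the README under ./code/models/ . The toy encoder simply buckets character codes and stands in for a real network.

```python
from abc import ABC, abstractmethod


class RetrievalModel(ABC):
    """Hypothetical interface: one encoder per modality, both mapping into one vector space."""

    @abstractmethod
    def encode_image(self, image_path):
        """Return a feature vector for the image stored at image_path."""

    @abstractmethod
    def encode_text(self, text):
        """Return a feature vector for a text query."""


class ToyModel(RetrievalModel):
    """Placeholder model that derives a vector from the characters of its input."""

    def __init__(self, dim=4):
        self.dim = dim

    def _bucket_chars(self, value):
        # Deterministic stand-in for a real encoder: sum character codes per bucket.
        vec = [0.0] * self.dim
        for i, ch in enumerate(value):
            vec[i % self.dim] += ord(ch)
        return vec

    def encode_image(self, image_path):
        return self._bucket_chars(image_path)

    def encode_text(self, text):
        return self._bucket_chars(text)


def model_init():
    """Hypothetical initialization hook: build and return the model instance."""
    return ToyModel()


model = model_init()
print(model.encode_text("ab"))
```

Keeping both encoders behind one object makes it straightforward to swap in a different model without touching the request handlers, provided the vectors from both modalities remain comparable.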

Still being updated

Citation

If you find this code helpful, or use this code or dataset, please cite:

Z. Yuan et al., "Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval," in IEEE Transactions on Geoscience and Remote Sensing, doi: 10.1109/TGRS.2021.3078451.

Z. Yuan et al., "A Lightweight Multi-scale Crossmodal Text-Image Retrieval Method In Remote Sensing," in IEEE Transactions on Geoscience and Remote Sensing, doi: 10.1109/TGRS.2021.3124252.
