A Python module that lets users extract documents from a source, parse them, and ingest them into Elasticsearch.
To set up, install the dependencies with pip:
python3 -m pip install -r requirements.txt
usage: transformer [-h] [-c] [--source_ip SOURCE_IP]
                   [--source SOURCE] [--target_ip TARGET_IP]
                   --target TARGET
Parses documents, extracts required fields and inserts them to
the elasticsearch server.
optional arguments:
  -h, --help            show this help message and exit
  -c, --config          configure metadata for the indices
  --source_ip SOURCE_IP
                        ip of source
  --source SOURCE       name of the source index
required arguments:
  --target_ip TARGET_IP
                        ip of the target
  --target TARGET       name of the target index
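For readers who want to see how a command-line interface like this can be declared, here is a minimal argparse sketch that accepts the same flags. It is an illustration, not the module's actual source: the function name `build_parser` is hypothetical, and the choice to mark only `--target` as strictly required is an assumption inferred from the usage line, where `--target_ip` appears in brackets.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flags listed in the help text."""
    parser = argparse.ArgumentParser(
        prog="transformer",
        description="Parses documents, extracts required fields and "
                    "inserts them to the elasticsearch server.",
    )
    parser.add_argument("-c", "--config", action="store_true",
                        help="configure metadata for the indices")
    parser.add_argument("--source_ip", help="ip of source")
    parser.add_argument("--source", help="name of the source index")

    # A named group only changes how the help is displayed; an argument is
    # enforced only when required=True is passed explicitly.
    required = parser.add_argument_group("required arguments")
    # Assumption: --target_ip is listed in this group but not enforced.
    required.add_argument("--target_ip", help="ip of the target")
    required.add_argument("--target", required=True,
                          help="name of the target index")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(["--config", "--target=sample"])
    print(args.config, args.target, args.source)
```

With this sketch, omitting `--target` makes argparse exit with an error, while every other flag falls back to its default (`None`, or `False` for `--config`).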
Refer to the following examples for more details:
python3 src/transformer.py --config --target="sample"
python3 src/transformer.py --source="source" --target="sample"
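To make the "parse, extract required fields, insert" step more concrete, the sketch below shows one way documents could be reduced to a set of fields and wrapped as bulk actions for the target index. Everything here is assumed for illustration: the helper names `extract_fields` and `to_bulk_actions`, the field names, and the sample documents are hypothetical and are not taken from the module.

```python
from typing import Any, Dict, Iterable, Iterator, List


def extract_fields(doc: Dict[str, Any], fields: List[str]) -> Dict[str, Any]:
    """Keep only the requested fields that are present in the document."""
    return {name: doc[name] for name in fields if name in doc}


def to_bulk_actions(docs: Iterable[Dict[str, Any]], target: str,
                    fields: List[str]) -> Iterator[Dict[str, Any]]:
    """Yield one index action per document, addressed to the target index."""
    for doc in docs:
        yield {"_index": target, "_source": extract_fields(doc, fields)}


# Hypothetical sample input standing in for documents read from the source.
docs = [
    {"title": "a", "body": "x", "debug": 1},
    {"title": "b"},
]
actions = list(to_bulk_actions(docs, "sample", ["title", "body"]))
```

Dictionaries of this shape match the action format accepted by `elasticsearch.helpers.bulk(client, actions)` in the official Python client, so a generator like this could feed the ingest step directly.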