omop-concept-solr

Syncs the OMOP concept with a SolR instance

OMOP concept
1. load the OMOP concept table from ATHENA CSV
2. extends the model with external informations
SolR cloud
1. install and configure apache SolR
Spark
1. install and configure apache Spark
2. ETL postgres -> SolR

Configuration

In order to make zookeeper able to ingest large configurations such synonyms

add SOLR_OPTS="$SOLR_OPTS -Djute.maxbuffer=0x9fffff" to the '$SOLR_HOME/bin/solr.in.sh'

Livy configuration can be found: $LIVY_HOME/conf/livy.conf

You will need at least 65000 open files to make spark and solr work fine.

sudo bash ulimit -n 8192 sudo -

Edit the /etc/security/limits.conf file Add:

root hard nofile 65000 root soft nofile 65000

The spark library are loaded thought apache livy. They are specified into a yaml file and loaded from the pylivy library.

Clone the spark-postgres Compile it and move the shaded jar into some place.

Clone the spark-postgres Compile it and move the shaded jar into some place.