/sesame-loader

Loads data into an sesame repository using multiple threads per file without any server over head.

Primary LanguageJavaOtherNOASSERTION

This is a small utility that uses multiple threads to load data into a RDF triple store using
the sesame API. You can load it into a native store from sesame or with some work into OWLIM-se.
Unfortunatly to get an OWLIM-se jar you need to contact ontotext. 

# HOW TO USE

## To use with a Sesame Native repository

cd sesame-loader/
./load.sh -infile $1 -dataFile $2 -commitInterval $3 -pushThreads $4 -baseUri $5

## To use with an OWLIM repository

cd sesame-loader/

### Need to additionally copy the owlim jar files into sesame-loader-runtime/dist/conf/ so they will be included on the classpath
cp /path/to/owlim.jar sesame-loader-runtime/dist/conf/owlim.jar
cp /path/to/crypto.jar sesame-loader-runtime/dist/conf/crypto.jar

./load_owlim.sh -infile $1 -dataFile $2 -commitInterval $3 -pushThreads $4 -baseUri $5

Where $2 is the location on disk where OWLIM or Sesame Native should store their files. $3 is how often you want to commit to the backend. $4 is the number of threads you want to use. $5 is a baseUri i.e. http://purl.uniprot.org/uniprot/
$1 is the file you are loading. 

# Things to improve:

* Changes to java options and Owlim options may require changes to pom.xml to add or remove settings from the appassembler-maven-plugin configuration, unless they can be placed in a configuration file in src/main/config/ (which will then be copied to dist/conf/)
* Memory usage