/sparql-query-dynamics

A framework for predicting SPARQL Query Dynamics.

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Sparql Query Dynamics

A framework for predicting SPARQL Query Dynamics.

Table of Contents

  1. Installation
  2. Data
  3. Queries
  4. Indexes
  5. Results
  6. Predictions

Requirements

Gz-sort

Gz-sort sorts gzipped data files. http://kmkeen.com/gz-sort/

sudo apt-get install libz-dev make
git clone https://github.com/keenerd/gz-sort; cd gz-sort; make; ./gz-sort -h

virtuoso-opensource-7.2.5

Virtuoso Engine and its SPARQL endpoint http://vos.openlinksw.com/owiki/wiki/VOS

sparql-query-dynamics\virtuoso\setup\setup.sh
sparql-query-dynamics\virtuoso\setup\configure.sh

Data

Source

https://www.wikidata.org/wiki/Wikidata:Database_download#RDF_dumps https://dumps.wikimedia.org/wikidatawiki/entities/ https://www.mediawiki.org/wiki/Wikibase/Indexing/RDF_Dump_Format#Truthy_statements

pre-processing

flip (spo2pso, clean) sort (duplicate remove) flip (pso2spo)

processing

cardinality stats predicate dynamics predicate multiplicity


Queries

Source

pre-processing

truthy lang bgp

processing

card stats


Indexes

load

query


Results

cmp diff rdyn


Predictions

Training

Evaluation