difi/dcat-harvester

Conversion/import of data into Elasticsearch

Closed this issue · 9 comments

hoyum commented
Conversion/import of data into Elasticsearch

Will need to investigate first how to get the data from Fuseki into Elasticsearch, and then, depending on the kinds of queries we expect eventual users to run, I can define field types and so forth. What do Fuseki dumps look like?

Given that we will have an Atom/RSS feed, it's possible we could use it as the source for Elasticsearch. Looking a bit into this.

hoyum commented

It seems the Atom/RSS feed might be a bit too basic. It would be OK to get all of the relevant data from the datasets into Elasticsearch.

Håvard and I had a little discussion about this, and we like the idea of keeping the logic for retrieving, processing, and sending data to Elasticsearch within the harvest app. It would essentially be a Java process that executes SPARQL against Fuseki and then sends JSON documents to Elasticsearch.
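For illustration, a minimal sketch of such a process using only java.net, so it does not depend on any particular client library. The endpoint URLs, index, and document id are assumptions, and a real implementation would transform the SPARQL results into per-dataset documents rather than indexing the raw result JSON:

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

public class FusekiToElasticsearch {

    // Hypothetical endpoints -- adjust to the actual deployment.
    static final String FUSEKI = "http://localhost:3030/dcat/sparql";
    static final String ELASTIC = "http://localhost:9200/dcat/dataset/1";

    public static void main(String[] args) throws IOException {
        String query = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 10";

        // Ask Fuseki for the SPARQL results as JSON.
        URL url = new URL(FUSEKI + "?query=" + URLEncoder.encode(query, "UTF-8"));
        HttpURLConnection get = (HttpURLConnection) url.openConnection();
        get.setRequestProperty("Accept", "application/sparql-results+json");
        String json = readAll(get.getInputStream());

        // Send the JSON document to Elasticsearch.
        HttpURLConnection put = (HttpURLConnection) new URL(ELASTIC).openConnection();
        put.setRequestMethod("PUT");
        put.setDoOutput(true);
        put.setRequestProperty("Content-Type", "application/json");
        try (OutputStream out = put.getOutputStream()) {
            out.write(json.getBytes(StandardCharsets.UTF_8));
        }
        System.out.println("Elasticsearch responded: " + put.getResponseCode());
    }

    static String readAll(InputStream in) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        byte[] chunk = new byte[4096];
        for (int n; (n = in.read(chunk)) != -1; ) {
            buf.write(chunk, 0, n);
        }
        return new String(buf.toByteArray(), StandardCharsets.UTF_8);
    }
}
```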

We can use a SPARQL CONSTRUCT query to generate JSON-LD data, then apply a custom JSON-LD frame to produce the equivalent in plain JSON.
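A sketch of the CONSTRUCT step, assuming Apache Jena 3; the endpoint and query are made up for illustration. The constructed graph is written as JSON-LD, and a custom frame could then be applied to that output (e.g. with the jsonld-java library) to flatten it into the JSON shape Elasticsearch expects:

```java
import org.apache.jena.query.Query;
import org.apache.jena.query.QueryExecution;
import org.apache.jena.query.QueryExecutionFactory;
import org.apache.jena.query.QueryFactory;
import org.apache.jena.rdf.model.Model;
import org.apache.jena.riot.RDFDataMgr;
import org.apache.jena.riot.RDFFormat;

public class ConstructToJsonLd {

    public static void main(String[] args) {
        // Hypothetical Fuseki endpoint and query -- for illustration only.
        String endpoint = "http://localhost:3030/dcat/sparql";
        Query query = QueryFactory.create(
                "PREFIX dcat: <http://www.w3.org/ns/dcat#> "
                + "CONSTRUCT { ?ds ?p ?o } "
                + "WHERE { ?ds a dcat:Dataset ; ?p ?o }");

        try (QueryExecution qe = QueryExecutionFactory.sparqlService(endpoint, query)) {
            Model model = qe.execConstruct();
            // Serialize the constructed graph as JSON-LD; a custom frame
            // would then be applied to this output before indexing.
            RDFDataMgr.write(System.out, model, RDFFormat.JSONLD);
        }
    }
}
```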

Key tasks (a rough sketch of the index lifecycle calls follows the list):

  • Handle creation and update of the data source index (the data source metadata)
  • Handle deletion of the data source index
  • Handle update of the data source index contents, i.e. the documents
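These map onto plain calls against the Elasticsearch REST API. A minimal sketch, where the host and the "dataset" document type are assumptions:

```java
import java.io.IOException;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;

public class IndexLifecycle {

    static final String ELASTIC = "http://localhost:9200"; // assumed host

    /** Create the index for one data source. */
    static int createIndex(String index) throws IOException {
        return send("PUT", ELASTIC + "/" + index,
                "{\"settings\":{\"number_of_shards\":1}}");
    }

    /** Delete the index when the data source is removed. */
    static int deleteIndex(String index) throws IOException {
        return send("DELETE", ELASTIC + "/" + index, null);
    }

    /** Index (create or overwrite) a single document. */
    static int indexDocument(String index, String id, String json) throws IOException {
        return send("PUT", ELASTIC + "/" + index + "/dataset/" + id, json);
    }

    static int send(String method, String url, String body) throws IOException {
        HttpURLConnection con = (HttpURLConnection) new URL(url).openConnection();
        con.setRequestMethod(method);
        if (body != null) {
            con.setDoOutput(true);
            con.setRequestProperty("Content-Type", "application/json");
            try (OutputStream out = con.getOutputStream()) {
                out.write(body.getBytes(StandardCharsets.UTF_8));
            }
        }
        return con.getResponseCode();
    }
}
```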

Once this is handled, we can continue with #49.

Looking into the issues with connecting to Elasticsearch. We experience the same issues when trying to connect to Elasticsearch to create Kibana dashboards on the fly.

The connection to Elasticsearch now correctly remains open until indexing is complete, and indexing is carried out using the bulk API. Can probably close this issue.
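For reference, a minimal sketch of bulk indexing over a single open connection to the _bulk endpoint. The index and type names are made up, and header and action-metadata requirements vary between Elasticsearch versions:

```java
import java.io.IOException;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;
import java.util.List;

public class BulkIndexer {

    /** Send all documents in one bulk request over a single connection. */
    public static int bulkIndex(List<String> docs) throws IOException {
        // Index and type in the URL are assumptions; with them in the
        // URL, each action line only needs the document id.
        URL url = new URL("http://localhost:9200/dcat/dataset/_bulk");
        HttpURLConnection con = (HttpURLConnection) url.openConnection();
        con.setRequestMethod("POST");
        con.setDoOutput(true);
        con.setRequestProperty("Content-Type", "application/x-ndjson");
        try (OutputStream out = con.getOutputStream()) {
            int id = 0;
            for (String doc : docs) {
                // The bulk body is newline-delimited JSON: an action line
                // followed by the document source on the next line.
                String action = "{\"index\":{\"_id\":\"" + (++id) + "\"}}\n";
                out.write(action.getBytes(StandardCharsets.UTF_8));
                out.write(doc.getBytes(StandardCharsets.UTF_8));
                out.write('\n');
            }
        }
        return con.getResponseCode(); // connection held open until here
    }
}
```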