Have you ever thought of integrating OrientDB as a node level store for Apache Spark?

Question

Have you ever thought of integrating OrientDB as a node level store for Apache Spark?

MironAtHome opened this issue 9 months ago · 1 comments

I have pulled this project out of a thin air and thought that columnar datastore can be one upped by graph data store, and on top of it with indexes ( distributed ).
The cool part is it's just an optimizer plug in to have it to generate accessors for OrientDB instead of parquet.
I think we can push GPU's out the window, for majority of the projects and finally do most of computations in memory.
And I don't mean going delta architecture. Since JVM can use all the memory on the node ( and OrientDB is thread friendly, so, it can safely multi-task ).

Answer 1 · 2024-02-28T11:25:38.000Z

Hi,

This is probably possible, and quite interesting, on the other end is quite out of the scope of the OrientDB project itself, in the meaning that we would happily help someone else to implement it, but not do it ourselves.

Regards