gbif/pipelines

Clustering module upgrade

Opened this issue · 0 comments

The module clustering-gbif has been modified to compile against latests versions of Spark, Scala, HBase and Beam, however it has not been tested.

This task is about test it and adjust it accordingly to make it work using the proposed versions.

https://github.com/gbif/pipelines/tree/843_upgrade_beam_hadoop_spark contains the work-in-progress to migrate the entire pipelines project to use latest versions of frameworks and libraries mentioned before.

#843 contains related work to this task