/Simba-Carbon-Integration

Primary LanguageScalaApache License 2.0Apache-2.0

Simba-Carbon Integration Project

This is a collaberate project between InitialDLab and Huawei, proposed to integrate functionalities from both Simba and CarbonData so that they can work together as a unfied spatial-temporal analysis system. It is supposed to support efficient spatial analytics query on carbon formatted data and peristing Simba's index structure through CarbonData onto distributed file systems.

Features

  • Support native connection from Simba to CarbonData.
  • CarbonData formated persistent spatial index structure built by Simba.
  • Spatial predicates pushdown onto CarbonData formatted file.

Notes: For a detailed integration plan, please refer to the wiki page of this project.

Version Info

In particular, we are now using Simba standalone-2.1 version and CarbonData 1.1.1, which are both compatible with Apache Spark 2.1.x.

Contributors

  • Dong Xie: dongx [at] cs [dot] utah [dot] edu
  • Yanqing Peng: cosineufo [at] gmail [dot] com
  • Feifei Li: lifeifei [at] cs [dot] utah [dot] edu
  • Kun Li: jacky [dot] likun [at] huawei [dot] com
  • Jihong Ma: Jihong [dot] Ma [at] huawei [dot] com
  • Qingqing Zhou: Qingqing [dot] Zhou [at] huawei [dot] com