Click and play the interactive Sedona Python Jupyter Notebook immediately!
Apache Sedona™(incubating) is a cluster computing system for processing large-scale spatial data. Sedona equips cluster computing systems such as Apache Spark and Apache Flink with a set of out-of-the-box distributed Spatial Datasets and Spatial SQL that efficiently load, process, and analyze large-scale spatial data across machines.
Download statistics | Maven | PyPI | CRAN |
---|---|---|---|
Apache Sedona | 80k/month | ||
Archived GeoSpark releases | 300k/month |
Name | API | Introduction |
---|---|---|
Core | Scala/Java | Distributed Spatial Datasets and Query Operators |
SQL | Spark RDD/DataFrame in Scala/Java/SQL | Geospatial data processing on Apache Spark |
Flink | Flink DataStream/Table in Scala/Java/SQL | Geospatial data processing on Apache Flink |
Viz | Spark RDD/DataFrame in Scala/Java/SQL | Geospatial data visualization on Apache Spark |
Python | Spark RDD/DataFrame in Python | Python wrapper for Sedona |
R | Spark RDD/DataFrame in R | R wrapper for Sedona |
Zeppelin | Apache Zeppelin | Plugin for Apache Zeppelin 0.8.1+ |
Please refer to Sedona website
Feedback to improve Apache Sedona: Google Form
Twitter: Sedona@Twitter
Sedona JIRA: Bugs, Pull Requests, and other similar issues
- dev@sedona.apache.org: project development, general questions or tutorials