/geomesa

GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.

Primary LanguageScalaApache License 2.0Apache-2.0


GeoMesa is an open-source, distributed, spatio-temporal database built on top of the Apache Accumulo column family store. GeoMesa implements standard Geotools interfaces to provide geospatial functionality over very large data sets to application developers. GeoMesa provides plugins for exposing geospatial data stored in Accumulo via standards-based OGC HTTP services and cluster monitoring and management tools within the GeoServer administrative interface.
Join the chat at https://gitter.im/locationtech/geomesa

LocationTech GeoMesa is a member of the LocationTech working group of the Eclipse Foundation.

Versions and Downloads

Latest 1.3 milestone: 1.3.0-m2

Development version (source only): 1.3.0-m3-SNAPSHOT

Latest 1.2 release: 1.2.6

1.1.x release: geomesa-1.1.0-rc.7

NOTE: The current recommended version is 1.2.6. The most recent tar.gz assembly can be downloaded here which contains the Accumulo distributed runtime jar, Accumulo GeoServer plugin, and command line tools.

GeoMesa artifacts can be downloaded from the LocationTech Maven repository

Snapshots are available in the LocationTech Snapshots Repository

Upgrading

To upgrade between minor releases of GeoMesa, the versions of all GeoMesa components must match.

This means that the version of the geomesa-accumulo-distributed-runtime JAR installed on Accumulo tablet servers must match the version of the geomesa-accumulo-gs-plugin JAR installed in the WEB-INF/lib directory of GeoServer.

Maven Integration

GeoMesa artifacts are published to the LocationTech Maven repository. To include GeoMesa in your project, add the following repositories to your pom:

<repositories>
  <repository>
    <id>locationtech-releases</id>
    <url>https://repo.locationtech.org/content/groups/releases</url>
    <snapshots>
      <enabled>false</enabled>
    </snapshots>
  </repository>
  <repository>
    <id>geomesa-snapshots</id>
    <url>https://repo.locationtech.org/content/repositories/geomesa-snapshots</url>
    <releases>
      <enabled>false</enabled>
    </releases>
    <snapshots>
      <enabled>true</enabled>
    </snapshots>
  </repository>
  <repository>
    <id>boundlessgeo</id>
    <url>http://repo.boundlessgeo.com/main</url>
  </repository>
  <repository>
    <id>osgeo</id>
    <url>http://download.osgeo.org/webdav/geotools</url>
  </repository>
  <repository>
    <id>conjars.org</id>
    <url>http://conjars.org/repo</url>
  </repository>
</repositories>

Building from Source

Requirements:

Navigate to where you would like to download this project.

git clone git@github.com:locationtech/geomesa.git
cd geomesa

The project is managed by Maven. To build, run:

mvn clean install

Alternatively, the build/mvn script is a wrapper around Maven that builds the project using the Zinc incremental compiler:

build/mvn clean install

Scala 2.10 Support

GeoMesa uses Scala 2.11 by default. To build for Scala 2.10, run:

build/change-scala-version.sh 2.10

This will update the project poms to publish artifacts with a _2.10 suffix. Then build with the 2.10 profile, which is activated with a system property:

mvn clean install -Dscala-2.10

Documentation

GeoMesa Project Structure

  • geomesa-accumulo: the implementations of the core Accumulo indexing structures, Accumulo iterators, and the GeoTools interfaces for exposing the functionality as a DataStore to both application developers and GeoServer. Assembles a jar with dependencies that must be distributed to Accumulo tablet servers lib/ext directory or to an HDFS directory where Accumulo's VFSClassLoader can pick it up.
  • geomesa-accumulo-compute: utilities for working with distributed computing environments. Currently, there are methods for instantiating an Apache Spark Resilient Distributed Dataset from a CQL query against data stored in GeoMesa. Eventually, this project will contain bindings for traditional map-reduce processing and other environments.
  • geomesa-accumulo-jobs: map/reduce for maintaining GeoMesa.
  • geomesa-accumulo-raster: adds support for ingesting and working with geospatially-referenced raster data in GeoMesa.
  • geomesa-accumulo-gs-plugin: provides WFS and WMS support for GeoServer. Install by extracting into geoserver/WEB-INF/lib/.
  • geomesa-blobstore: an Accumulo-based store designed to store and retrieve files which have spatio-temporal data associated with them.
  • geomesa-cassandra: an implementation of GeoMesa on Apache Cassandra.
  • geomesa-cassandra-gs-plugin: provides WFS and WMS support for GeoServer. Install by extracting into geoserver/WEB-INF/lib/.
  • geomesa-convert: a configurable and extensible library for converting data into SimpleFeatures.
  • geomesa-features: includes code for serializing SimpleFeatures and custom SimpleFeature implementations designed for GeoMesa.
  • geomesa-filter: a library for manipulating and working with GeoTools Filters.
  • geomesa-hbase: an implementation of GeoMesa on Apache HBase and Google Cloud Bigtable.
  • geomesa-hbase-gs-plugin: provides WFS and WMS support for GeoServer. Install by extracting into geoserver/WEB-INF/lib/.
  • geomesa-index-api: common structure and methods for indexing and querying simple features.
  • geomesa-kafka: an implementation of GeoMesa in Kafka for maintaining near-real-time caches of streaming data.
  • geomesa-kafka-gs-plugin: provides WFS and WMS support for GeoServer. Install by extracting into geoserver/WEB-INF/lib/.
  • geomesa-logger: logging facade for scala version compatibility.
  • geomesa-metrics: extensions and configuration for dropwizard metrics integration.
  • geomesa-native-api: a non-GeoTools-based API for persisting and querying data in GeoMesa Accumulo.
  • geomesa-process: analytic processes optimized on GeoMesa data stores.
  • geomesa-security: adds support for managing security and authorization levels for data stored in GeoMesa.
  • geomesa-stream: a GeoMesa library that provides tools to process streams of SimpleFeatures.
  • geomesa-tools: a common set of command line tools for managing features, ingesting and exporting data, configuring tables, and explaining queries in GeoMesa.
  • geomesa-utils: stores our GeoHash implementation and other general library functions unrelated to Accumulo. This sub-project contains any helper tools for geomesa. Some of these tools such as the GeneralShapefileIngest have Map/Reduce components, so the geomesa-utils JAR lives on HDFS.
  • geomesa-web: web services for accessing GeoMesa.
  • geomesa-z3: the implementation of Z3, GeoMesa's space-filling Z-order curve.

Scala console via scala-maven-plugin

To test and interact with core functionality, the Scala console can be invoked in a couple of ways. From the root directory by specifying geomesa-accumulo-datastore

mvn -pl geomesa-accumulo/geomesa-accumulo-datastore scala:console

By default, all of the project packages in geomesa-accumulo-datastore are loaded along with JavaConversions, JavaConverters.