/OpenSF-Apache-Spark

Exploring the City of San Francisco public data with Apache Spark 2.0

Primary LanguageJupyter Notebook

OpenSF-Apache-Spark

Spark Logo + SF Open Data Logo

Exploring the City of San Francisco public data with Apache Spark 2.0

Fireworks

The SF OpenData project was launched in 2009 and contains hundreds of datasets from the city and county of San Francisco. Open government data has the potential to increase the quality of life for residents, create more efficient government services, better public decisions, and even new local businesses and services.

APACHE SPARK:

Spark is a unified processing engine that can analyze big data using SQL, machine learning, graph processing or real time stream analysis:

Spark Engines

Spark Goal