The SF OpenData project was launched in 2009 and contains hundreds of datasets from the city and county of San Francisco. Open government data has the potential to increase the quality of life for residents, create more efficient government services, better public decisions, and even new local businesses and services.
APACHE SPARK:
Spark is a unified processing engine that can analyze big data using SQL, machine learning, graph processing or real time stream analysis: