cheidger's Stars
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
wurstmeister/kafka-docker
Dockerfile for Apache Kafka
hjwp/book-example
Example code for my book on TDD with Python
graphframes/graphframes
hamvocke/dotfiles
A collection of my personal dotfiles
wsvincent/restapiswithdjango
Source code for Django for APIs
amplab/spark-ec2
Scripts used to set up a Spark cluster on EC2
planetlabs/planet-client-python
Python client for Planet APIs
mirkonasato/graphipedia
Creates a Neo4j graph of Wikipedia links.
joeyism/py-edgar
A small library to access files from the SEC's EDGAR
neo4j-graph-analytics/networkx-neo4j
NetworkX API for Neo4j Graph Algorithms.
plotly/dash-salesforce-crm
kevin-crook-ucb/ucb_w205_crook_supplement
UC Berkeley, W205 Data Engineering, 2018 Spring, Kevin Crook's supplement
DistrictDataLabs/baleen
An automated ingestion service for blogs to construct a corpus for NLP research.
jakevdp/jakevdp.github.io-source
Source for my Pythonic Perambulations blog
neo4j-graph-analytics/book
neo4j-graph-analytics/ml-models
Machine Learning Procedures and Functions for Neo4j
neo4j-graph-analytics/graph-algorithms-notebooks
Jupyter notebooks showing how to use Neo4j Graph Algorithms
UCB-INFO-PYTHON/Drills
Quick introductory exercises to help remember course content.
neo4j-graph-analytics/sklearn-neo4j
lucassa3/Enron-pyspark-analysis
A Spark application that retrieves content from Enron's mail dataset and builds a Neo4j graph with some social network measurements
UCB-INFO-PYTHON/W200CourseOverview
Materials for presentations describing the course
w203-summer-19/w203-summer-19.github.io
Class Website