Pinned Repositories
aws-data-analytics
aws data analytics study guide (DAS-C01) for boto3 python client.
convnet
Convolutional Neural Net to classify dog vs. cat pics
encore.ai
Generate new lyrics in the style of any artist using LSTMs and TensorFlow
extraction-framework
The software used to extract structured data from Wikipedia
iMRMC
iMRMC user manual and other resources
k-means
code to run k-means from scratch via command line.
la-tools-test
automation tools used as a load analyst at engie, na.
reddit-streaming
streaming eight subreddits from reddit api using kafka producer & spark structured streaming.
twitter.ai
code to scrape tweets from specified users, save results (bigquery & dynamodb tables), and run neural network to generate new tweets. (in progress)
yelp
yelp kaggle data
stevenhurwitt's Repositories
stevenhurwitt/reddit-streaming
streaming eight subreddits from reddit api using kafka producer & spark structured streaming.
stevenhurwitt/twitter.ai
code to scrape tweets from specified users, save results (bigquery & dynamodb tables), and run neural network to generate new tweets. (in progress)
stevenhurwitt/aws-data-analytics
aws data analytics study guide (DAS-C01) for boto3 python client.
stevenhurwitt/extraction-framework
The software used to extract structured data from Wikipedia
stevenhurwitt/la-tools-test
automation tools used as a load analyst at engie, na.
stevenhurwitt/reliant-scrape
simple webscrape of my energy usage data into an AWS database.
stevenhurwitt/yelp
yelp kaggle data
stevenhurwitt/alphavantage
stevenhurwitt/arxiv
stevenhurwitt/aws-glue-samples
AWS Glue code samples
stevenhurwitt/azure-cosmos-throughput-scheduler
Simple utility that can be used to scale Azure Cosmos DB resources up and down using Azure Functions Timer Triggers
stevenhurwitt/dbt-core
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
stevenhurwitt/dbt-xdb
Cross-database support for dbt
stevenhurwitt/dealersocket-speedtest
speedtest.dealersocket.com
stevenhurwitt/docker-pgredshift
Redshift docker image based on postgres
stevenhurwitt/docker-spark-cluster
A simple spark standalone cluster for your testing environment purposses
stevenhurwitt/DockerSpark245
Spark cluster in docker containers with sample training Jupyter notebooks
stevenhurwitt/jupyterhub
Multi-user server for Jupyter notebooks
stevenhurwitt/kafka-docker
Dockerfile for Apache Kafka
stevenhurwitt/kubernetes-intro
intro to working w/ a $200 kubernetes cluster (3 raspberry pi's). 8 gb RAM, 64-256 gb MicroSD, 4 cores, arm64, raspbian
stevenhurwitt/kubernetes-the-hard-way
Bootstrap Kubernetes the hard way on Google Cloud Platform. No scripts.
stevenhurwitt/lil-web3
Simple, intentionally-limited versions of web3 protocols & apps.
stevenhurwitt/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
stevenhurwitt/pyosolver
Python wrapper around piosolver
stevenhurwitt/pytest_dbconnect
Testing pyspark with pytest and Databricks Connect
stevenhurwitt/Selenium-Python-Example
small example project with selenium + python + pytest + allure report
stevenhurwitt/spark-scala-examples
This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language
stevenhurwitt/twitter-ingestion
stevenhurwitt/visualboyadvance-m
The continuing development of the legendary VBA gameboy advance emulator.
stevenhurwitt/yelp-3nf
3NF-normalize Yelp data on S3 with Spark and load it into Redshift - automate the whole thing with Apache Airflow