Pinned Repositories
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache-Cassandra-ETL
A music streaming startup, wanted to analyze the data they'd collected on songs and user activity on their new streaming app. They are especially interested in knowing what songs users are listening to. Currently, their data is stored in a directory of CSV files, and they would like a data engineer to create an Apache Cassandra database that will allow them to answer questions on play data. Given that we need to have the queries we want to run before we create our Cassandra tables in order to optimize our database, the Sparkify analytics team has provided us with the following analytical queries
Belly-Button-Biodiversity
confluent-kafka-python
Confluent's Kafka Python Client
dapp-base
Data-Engineering-Nanodegree
Udacity Data Engineering Nanodegree Program
Data-Warehouse-Redshift
Data warehouse utilizing S3 and Redshift of AWS.
ethers.js
Complete Ethereum library and wallet implementation in JavaScript.
explainx
Explainable AI framework for data scientists. Explain & debug any blackbox machine learning model with a single line of code.
spark
Apache Spark - A unified analytics engine for large-scale data processing
RandyPayano's Repositories
RandyPayano/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
RandyPayano/Apache-Cassandra-ETL
A music streaming startup, wanted to analyze the data they'd collected on songs and user activity on their new streaming app. They are especially interested in knowing what songs users are listening to. Currently, their data is stored in a directory of CSV files, and they would like a data engineer to create an Apache Cassandra database that will allow them to answer questions on play data. Given that we need to have the queries we want to run before we create our Cassandra tables in order to optimize our database, the Sparkify analytics team has provided us with the following analytical queries
RandyPayano/Belly-Button-Biodiversity
RandyPayano/confluent-kafka-python
Confluent's Kafka Python Client
RandyPayano/dapp-base
RandyPayano/Data-Engineering-Nanodegree
Udacity Data Engineering Nanodegree Program
RandyPayano/Data-Warehouse-Redshift
Data warehouse utilizing S3 and Redshift of AWS.
RandyPayano/ethers.js
Complete Ethereum library and wallet implementation in JavaScript.
RandyPayano/explainx
Explainable AI framework for data scientists. Explain & debug any blackbox machine learning model with a single line of code.
RandyPayano/spark
Apache Spark - A unified analytics engine for large-scale data processing
RandyPayano/Sparkify-Postgres-ETL
Sparkify, a startup, wants to analyze the data they've gathered on songs and user activity on their new music streaming app. The analytics team is particularly interested in learning about the songs that users are listening to. They currently lack an easy way to query their data, which is stored in a directory of JSON logs on user activity on the app, in addition to a directory containing JSON metadata on the songs in their app.
RandyPayano/SQL-Leetcode-Challenge
Contains all the 117 Leetcode questions with their solutions ranging from Easy to Hard in MySQL.
RandyPayano/SQLalchemy-Data-Storage-and-Retrieval
RandyPayano/Web-Scrapping-Mars
RandyPayano/web3.js
Ethereum JavaScript API
RandyPayano/web3.py
A python interface for interacting with the Ethereum blockchain and ecosystem.
RandyPayano/web3j
Lightweight Java and Android library for integration with Ethereum clients