Pinned Repositories
120-Data-Science-Interview-Questions
Answers to 120 commonly asked data science interview questions.
amazon-sagemaker-local-mode
Amazon SageMaker Local Mode Examples
amazon-sagemaker-safe-deployment-pipeline
Safe blue/green deployment of Amazon SageMaker endpoints using AWS CodePipeline, CodeBuild and CodeDeploy.
apache-spark-internals
The Internals of Apache Spark
awk_sed_cron
azure-databricks-streaming-analytics
Stream processing with Azure Databricks
binzhango
Config files for my GitHub profile.
binzhango.github.io
blockchain-1
A simple Blockchain in Python
firefox_multi_row
binzhango's Repositories
binzhango/firefox_multi_row
binzhango/amazon-sagemaker-local-mode
Amazon SageMaker Local Mode Examples
binzhango/amazon-sagemaker-safe-deployment-pipeline
Safe blue/green deployment of Amazon SageMaker endpoints using AWS CodePipeline, CodeBuild and CodeDeploy.
binzhango/apache-spark-internals
The Internals of Apache Spark
binzhango/azure-databricks-streaming-analytics
Stream processing with Azure Databricks
binzhango/binzhango
Config files for my GitHub profile.
binzhango/binzhango.github.io
binzhango/CoolplaySpark
CoolPlay Spark: Spark source code analysis, Spark libraries, and more
binzhango/Data-Science-Notes
Data science notes and collected resources
binzhango/databricks-nutter-repos-demo
Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline
binzhango/dbt-on-airflow
binzhango/frameless
Expressive types for Spark.
binzhango/graphsense-transformation
GraphSense Transformation Pipeline
binzhango/handson-ml3
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
binzhango/kafka_avro
binzhango/machine-learning
A machine learning journey, starting from scratch
binzhango/mlfrm
binzhango/OpenLineage
An Open Standard for lineage metadata collection
binzhango/pyspark-example-project
Example project implementing best practices for PySpark ETL jobs and applications.
binzhango/scala-style-guide
Databricks Scala Coding Style Guide
binzhango/spark-cassandra-connector
DataStax Spark Cassandra Connector
binzhango/spark-graphx
Apache Spark GraphX for distributed graph computation; also includes Spark SQL, Spark Streaming, and RDD operations
binzhango/spark-playground
Code snippets used in demos recorded for the blog.
binzhango/spark-rapids-examples
A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.
binzhango/spark-scala-examples
This project provides Apache Spark SQL, RDD, DataFrame, and Dataset examples in Scala
binzhango/SparkPlugins
Code and examples of how to deploy Apache Spark Plugins with Spark 3.x. This allows extending the Spark metrics systems with user-provided monitoring probes for OS, I/O, and custom libraries/applications.
binzhango/springboot-azure-log-analytics
binzhango/streamlit-dbt-metrics-explorer
Lightweight Streamlit app to test out metrics functionality in dbt
binzhango/SynapseML
Simple and Distributed Machine Learning
binzhango/vs
Visualization of Google's autocomplete