LeeByungwoo's Stars
codecrafters-io/build-your-own-x
Master programming by recreating your favorite technologies from scratch.
jhuangtw/xg2xg
by ex-googlers, for ex-googlers - a lookup table of similar tech & services
infracost/infracost
Cloud cost estimates for Terraform in pull requests💰📉 Shift FinOps Left!
trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
chiphuyen/machine-learning-systems-design
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"
runatlantis/atlantis
Terraform Pull Request Automation
tobymao/sqlglot
Python SQL Parser and Transpiler
elastic/go-elasticsearch
The official Go client for Elasticsearch
dear-github/dear-github
📨 An open letter to GitHub from the maintainers of open source projects
QasimWani/LeetHub
Automatically sync your LeetCode solutions to your GitHub account - a top 5 trending GitHub repository
jupyterlite/jupyterlite
Wasm powered Jupyter running in the browser 💡
jghoman/awesome-apache-airflow
Curated list of resources about Apache Airflow
gruns/furl
🌐 URL parsing and manipulation made easy.
scalapb/ScalaPB
Protocol buffer compiler for Scala.
OBenner/data-engineering-interview-questions
More than 2,000 data engineer interview questions.
Kamva/mgm
Mongo Go Models (mgm) is a fast and simple MongoDB ODM for Go (based on official Mongo Go Driver)
aws/aws-mwaa-local-runner
This repository provides a command line interface (CLI) utility that replicates an Amazon Managed Workflows for Apache Airflow (MWAA) environment locally.
scallop/scallop
a simple Scala CLI parsing library
AbsaOSS/spline
Data Lineage Tracking And Visualization Solution
YotpoLtd/metorikku
A simplified, lightweight ETL Framework based on Apache Spark
dacort/metabase-athena-driver
An Amazon Athena driver for Metabase 0.32 and later
memsql/singlestore-spark-connector
A connector for SingleStore and Spark
smart-data-lake/smart-data-lake
Smart Automation Tool for building modern Data Lakes and Data Pipelines
scalapb/sparksql-scalapb
SparkSQL utils for ScalaPB
dojinkimm/awesome-ab-testing
AB Testing 📈 related articles
JahstreetOrg/spark-on-kubernetes-docker
Spark on Kubernetes infrastructure Docker images repo
dojinkimm/go-grpc-example
crflynn/pbspark
protobuf pyspark conversion
dojinkimm/cryptoexchange-go
API wrapper for cryptocurrency exchanges implemented in golang
devsisters/aws-glue-data-catalog-client-for-apache-hive-metastore
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore-compatible metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-the-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms, such as other Hadoop and Apache Spark distributions.