jhryu1208's Stars
prestodb/presto
The official home of the Presto distributed SQL query engine for big data
pypa/pip
The Python package installer
jupyter/docker-stacks
Ready-to-run Docker images containing Jupyter applications
databricks/Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
sodadata/soda-core
:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
aws-samples/aws-glue-samples
AWS Glue code samples
utilForever/awesome-cafe
☕ 모각코하기 좋은 국내 카페 리스트
awslabs/aws-athena-query-federation
The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code.
blockchain-etl/ethereum-etl-airflow
Airflow DAGs for exporting, loading, and parsing the Ethereum blockchain data. How to get any Ethereum smart contract into BigQuery https://towardsdatascience.com/how-to-get-any-ethereum-smart-contract-into-bigquery-in-8-mins-bab5db1fdeee
dhkdn9192/data_engineer_should_know
데이터 엔지니어가 알아야 하는 것들
FVBros/Spark-The-Definitive-Guide
한빛미디어에서 출간한 스파크 완벽 가이드 1판의 소스코드 저장소
googleapis/python-bigquery-datatransfer
This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-bigquery-datatransfer
playinpap/awesome-data-and-analytics-governance
데이터 & 분석 거버넌스 제고를 위한 양질의 레퍼런스들을 수집하고 생각을 나눌 수 있습니다.
jordansinger/figma-slack-updates
Post updates to Slack from a Figma file's version history
embulk/embulk-filter-column
A filter plugin for Embulk to filter out columns
webysther/aws-glue-docker
🐋 Docker image for AWS Glue Spark/Python
embulk/embulk-output-s3
Embulk S3 output plugin
embulk/embulk-filter-expand_json
mgi166/embulk-filter-eval
Eval ruby code on filtering
googleapis/python-storage-transfer
This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-storage-transfer
sonots/embulk-filter-timestamp_format
A filter plugin for Embulk to change timestamp format
svajiraya/aws-glue-libs
AWS Glue Libraries are additions and enhancements to Spark for ETL operations.
civitaspo/embulk-filter-distinct
shinji19/embulk-input-athena
Athena input plugins for Embulk
toyama0919/embulk-filter-google_translate_api
Google Translate Api filter plugin for Embulk.
tkuchiki/docker-embulk
ariarijp/embulk-input-redash
Redash input plugin for Embulk
civitaspo/embulk-input-union
An input plugin for Embulk (https://github.com/embulk/embulk/) that unions all data loaded by your defined embulk input & filters plugin configuration.
rubik-ai/embulk-input-s3_parquet
Embulk s3 parquet reader
reflet/docker-embulk
公式コンテナ(Java8)をベースにEmbulkのコンテナを作ってみる