Pinned Repositories
aws-glue-python-kickstart
AWS Glue Python Job Kick Start
aws-ses-test
dbt-spark
Spark plugin for dbt
dbt_201
Bootstrap dbt project (with Apache Spark)
iceberg
Apache Iceberg
largefile
https://stackoverflow.com/questions/46373895/how-to-open-a-huge-excel-file-efficiently
spark-excel
A Spark plugin for reading Excel files via Apache POI
spark-xml
XML data source for Spark SQL and DataFrames
skadyan's Repositories
skadyan/aws-glue-python-kickstart
AWS Glue Python Job Kick Start
skadyan/aws-ses-test
skadyan/largefile
https://stackoverflow.com/questions/46373895/how-to-open-a-huge-excel-file-efficiently
skadyan/dbt-spark
Spark plugin for dbt
skadyan/dbt_201
Bootstrap dbt project (with Apache Spark)
skadyan/iceberg
Apache Iceberg
skadyan/spark-excel
A Spark plugin for reading Excel files via Apache POI
skadyan/spark-xml
XML data source for Spark SQL and DataFrames
skadyan/cassandra-migration
Database migration (evolution) tool for Apache Cassandra
skadyan/cobrix
A generic COBOL parser and COBOL data source for Apache Spark
skadyan/dbt-snowflake
dbt-snowflake contains all of the code enabling dbt to work with Snowflake
skadyan/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
skadyan/desiutils
Collection of general utility Java classes
skadyan/devops-maven
Demo Ops
skadyan/experiments
skadyan/flake8-html
Generate HTML reports of flake8 violations
skadyan/huginn
Build agents that monitor and act on your behalf. Your agents are standing by!
skadyan/incubator-superset
Apache Superset (incubating) is a modern, enterprise-ready business intelligence web application
skadyan/jupyterhub
Multi-user server for Jupyter notebooks
skadyan/mypythonsandobox
Sandbox - Python 3
skadyan/pyspark-test
Testing library for PySpark, inspired by the pandas testing module, to help users write unit tests.
skadyan/ronak
Parser for Resource Query Language (RQL)
skadyan/spark-glue-support
Spark 3.2.0 Support for Glue catalog
skadyan/sqltoolsservice
SQL Tools API service that provides SQL Server data management capabilities.
skadyan/swagger-maven-example
Example of using swagger-maven-plugin (https://github.com/kongchen/swagger-maven-plugin)
skadyan/universal_pathlib
pathlib API extended to use fsspec backends
skadyan/zeus
Zeus