Pinned Repositories
AzureDatabricksBestPractices
Version 1 of technical best practices for Azure Databricks, based on real-world customer and technical SME input
csv2avro
Command-line script to convert CSV/TSV files to Avro
dbx-charming-aurora
hive_metadata_utils
Find Hive Tables by Table or Column Names
increments
A gem to facilitate incrementing values
meta_func
Python decorator function to track metadata on function calls
pndb
Pseudo-Normalized Database Engine Proof of Concept
public-sandbox
runtime_stats
Python decorator function to track runtime stats on function calls
gstaubli's Repositories
gstaubli/csv2avro
Command-line script to convert CSV/TSV files to Avro
gstaubli/dbx-charming-aurora
gstaubli/increments
A gem to facilitate incrementing values
gstaubli/meta_func
Python decorator function to track metadata on function calls
gstaubli/runtime_stats
Python decorator function to track runtime stats on function calls
gstaubli/AzureDatabricksBestPractices
Version 1 of technical best practices for Azure Databricks, based on real-world customer and technical SME input
gstaubli/hive_metadata_utils
Find Hive Tables by Table or Column Names
gstaubli/pndb
Pseudo-Normalized Database Engine Proof of Concept
gstaubli/public-sandbox
gstaubli/pyspark-intro
Intro to PySpark codebase
gstaubli/pyspark-nlp
Using PySpark with Natural Language Processing (NLP) and Machine Learning (ML)
gstaubli/split_file_by_key
Given a *SORTED* file, a delimiter, and one or more keys, split the file into multiple output files based on the key(s).
gstaubli/testpy
gstaubli/Workshops
Training Workshop Code & Materials