ronanstokes-db
Senior SSA at Databricks focussed on IOT / manufacturing. Maintainer of Databricks Labs Data Generator (synthetic data generation open source project)
Databricks
ronanstokes-db's Stars
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
ydataai/ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
databrickslabs/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
tobymao/sqlglot
Python SQL Parser and Transpiler
pyparsing/pyparsing
Python library for creating PEG parsers
pyspark-ai/pyspark-ai
English SDK for Apache Spark
databrickslabs/dbx
🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
sdv-dev/TGAN
Generative adversarial training for generating synthetic tabular data.
databrickslabs/ucx
Automated migrations to Unity Catalog
databrickslabs/overwatch
Capture deep metrics on one or all assets within a Databricks workspace
sdv-dev/SDMetrics
Metrics to evaluate quality and efficacy of synthetic datasets.
databrickslabs/migrate
Old scripts for one-off ST-to-E2 migrations. Use "terraform exporter" linked in the readme.
databrickslabs/dlt-meta
Metadata driven Databricks Delta Live Tables framework for bronze/silver pipelines
FelixMohr/Deep-learning-with-Python
Example projects I completed to understand Deep Learning techniques with Tensorflow. Please note that I do no longer maintain this repository.
imsweb/x12-parser
A Java parser for ANSI ASC X12 documents.
walmartlabs/gozer
Open source library to parse various X12 file formats for retail/supply chain
ronanstokes-db/SDV
Synthetic Data Generation for tabular, relational and time series data.