This repo contains documentation, examples, benchmarks, and schemas for SDF.
SDF is a multi-dialect SQL compiler, transformation framework, and analytical database engine. It natively compiles SQL dialects, like Snowflake, and connects to their corresponding data warehouses to materialize models.
For more, check out our official documentation at docs.sdf.com.
SDF is open core built on Apache DataFusion`.
To install the SDF CLI, run the following command:
curl -LSfs https://cdn.sdf.com/releases/download/install.sh | sh -s
For more thorough installation docs, check out the installation guide.
SDF has a rich open-source ecosystem. Here are a few libraries that we recommend you check out:
- SQL Functions - This Rust Crate powers SDF function typing and execution for every supported dialect. This enables our local compute engine Apache DataFusion to run SQL functions across dialects. This Rust Crate is used in the CLI, and contributions are welcome.
- SDF Materialization - The library used to materialize SDF models. This is packaged up into the CLI and can be found in the
.sdfcache
. - SDF Tests - The library used to write data tests with SDF. This is packaged up into the CLI and can be found in the
.sdfcache
. - SDF Utils - A collection of utilities for working with SDF models. This is packaged up into the CLI and can be found in the
.sdfcache
. This library should act as a model for how to author your own SDF libraries. - SDF Workspace Evaluator - A library for evaluating SDF workspaces with SDF reports. This library contains some of the most used reports and can be used as a model for authoring your own reports. Some of the most popular are
dead_column
analysis (find columns that are never used and are wasting compute / storage) and ` - SDF GitHub Action - Our official GitHub action for running SDF commands in CI/CD.
Non-default libraries like the SDF Workspace Evaluator can be added to an SDF workspace like so:
workspace:
...
dependencies:
- name: evaluator
git: https://github.com/sdf-labs/workspace-evaluator/workspace-evaluator.git
Note that SDF Utils, SDF Materialization, and SDF Tests are already included in the SDF CLI and do not need to be added to the workspace.
This repo is organized as follows:
- Docs - Contains the official SDF documentation found at docs.sdf.com. All docs are open source, and contributions are welcome.
- Examples - Contains example SDF projects that demonstrate how to use SDF in practice. These examples are open source and are packaged into the SDF binary. Run
sdf new --sample <sample-name>
to create a workspace locally from one of these examples. - Schemas - Contains the JSON schemas for the SDF configuration YML files. These schemas are auto-generated every time the binary is released. They are used to validate the configuration files in the SDF CLI and can be used to power integrations that leverage lineage or metatadata from the SDF compiler artifacts.
All SDF releases are reflected in this repo as official releases. You can find the latest release here. And yes, those release notes are begrudgingly reflective of the release notes in our internal repo.
For in-depth release notes, check out the SDF changelog.
Contributions are welcome to our examples and docs. However, since the compiler is closed source, we cannot accept contributions to the core compiler. That being said, our execution capabilities are based on Apache DataFusion and function execution is enabled by our open source SQL Functions Crate. If you're ever missing support for a function, finding that a function is mistyped, or dealing with an execution error - feel free to contribute to either of those repos. If you have a feature request or bug report with the compiler, please open an issue on this repo. We'll track it and prioritize it ASAP.