Skip to content

This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples

Notifications You must be signed in to change notification settings

sdf-labs/sdf-cli

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SDF

This repo contains documentation, examples, benchmarks, and schemas for SDF.

SDF is a multi-dialect SQL compiler, transformation framework, and analytical database engine. It natively compiles SQL dialects, like Snowflake, and connects to their corresponding data warehouses to materialize models.

For more, check out our official documentation at docs.sdf.com.

SDF is open core built on Apache DataFusion`.

Installation

To install the SDF CLI, run the following command:

curl -LSfs https://cdn.sdf.com/releases/download/install.sh | sh -s

For more thorough installation docs, check out the installation guide.

Featured Libraries

SDF has a rich open-source ecosystem. Here are a few libraries that we recommend you check out:

  • SQL Functions - This Rust Crate powers SDF function typing and execution for every supported dialect. This enables our local compute engine Apache DataFusion to run SQL functions across dialects. This Rust Crate is used in the CLI, and contributions are welcome.
  • SDF Materialization - The library used to materialize SDF models. This is packaged up into the CLI and can be found in the .sdfcache.
  • SDF Tests - The library used to write data tests with SDF. This is packaged up into the CLI and can be found in the .sdfcache.
  • SDF Utils - A collection of utilities for working with SDF models. This is packaged up into the CLI and can be found in the .sdfcache. This library should act as a model for how to author your own SDF libraries.
  • SDF Workspace Evaluator - A library for evaluating SDF workspaces with SDF reports. This library contains some of the most used reports and can be used as a model for authoring your own reports. Some of the most popular are dead_column analysis (find columns that are never used and are wasting compute / storage) and column_description_coverage (find columns that are missing descriptions).
  • SDF GitHub Action - Our official GitHub action for running SDF commands in CI/CD.

Non-default libraries like the SDF Workspace Evaluator can be added to an SDF workspace like so:

workspace:
  ...
  dependencies:
    - name: evaluator
      git: https://github.com/sdf-labs/workspace-evaluator/workspace-evaluator.git

Note that SDF Utils, SDF Materialization, and SDF Tests are already included in the SDF CLI and do not need to be added to the workspace.

Structure

This repo is organized as follows:

  • Docs - Contains the official SDF documentation found at docs.sdf.com. All docs are open source, and contributions are welcome.
  • Examples - Contains example SDF projects that demonstrate how to use SDF in practice. These examples are open source and are packaged into the SDF binary. Run sdf new --sample <sample-name> to create a workspace locally from one of these examples.
  • Schemas - Contains the JSON schemas for the SDF configuration YML files. These schemas are auto-generated every time the binary is released. They are used to validate the configuration files in the SDF CLI and can be used to power integrations that leverage lineage or metatadata from the SDF compiler artifacts.

Releases

All SDF releases are reflected in this repo as official releases. You can find the latest release here. And yes, those release notes are begrudgingly reflective of the release notes in our internal repo.

For in-depth release notes, check out the SDF changelog.

SDF is updated frequently and adheres to a strict versioning system: ..* || Patch versions include patch fixes, and additive improvments which should not contain any breaking changes .*. || Minor versions may include breaking changes. These might be YML schema changes, information schema changes, or changes in SDF's internal logic *.. || Major versions

Contributing

Contributions are welcome to our examples and docs. Although SDF is still being incubated and is not yet open source, it is powered by an open source core: Apache DataFusion and function execution is enabled by our open source SQL Functions Crate. If you're ever missing support for a function, finding that a function is mistyped, or dealing with an execution error - feel free to contribute to either of those repos. If you have a feature request or bug report with the compiler, please open an issue on this repo. We'll track it and prioritize it ASAP.

About

This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages