Skip to content

David-Wobrock/sqlvalidator

Repository files navigation

sqlvalidator

Build Status PyPI codecov

SQL queries formatting, syntactic and semantic validation

Only supports SELECT statements

Command line usage

SQL Formatting

sql.py

def fun():
    return "select col1, column2 from table"

Command line:

$ sqlvalidator --format sql.py
reformatted sql.py (1 changed SQL)
1 file reformatted (1 changed SQL queries).

sql.py

def fun():
    return """
SELECT
 col1,
 column2
FROM table
"""

A nosqlformat comment can be appended to indicate to sqlvalidator that this string should not be formatted.

Check SQL format

One can verify also that the file would be reformatted or not:

$ sqlvalidator --check-format sql.py
would reformat sql.py (1 changed SQL)
1 file would be reformatted (1 changed SQL queries).


$ sqlvalidator --format sql.py
reformatted sql.py (1 changed SQL)
1 file reformatted (1 changed SQL queries).


$ sqlvalidator --check-format sql.py
No file would be reformatted.


$ sqlvalidator --format sql.py
No file reformatted.

--check-format won't write the file back and just return a status code:

  • Status code 0 when nothing would change.
  • Status code 1 when some files would be reformatted.

The option is meant to be used within the CI/CD pipeline and ensure that SQL statements are formatted.

SQL Validation

One can verify that the files SQL is valid:

$ sqlvalidator --validate sql.py
invalid queries in sql.py (1 invalid SQL)
1 file detected with invalid SQL (1 invalid SQL queries).

# ... do some manual fixes to the SQL ...

$ sqlvalidator --validate sql.py
No invalid queries found.

To get more details about the found invalid elements, use --verbose-validate

API / Python code usage

SQL Formatting

import sqlvalidator

formatted_sql = sqlvalidator.format_sql("SELECT * FROM table")

SQL Validation

import sqlvalidator

sql_query = sqlvalidator.parse("SELECT * from table")

if not sql_query.is_valid():
    print(sql_query.errors)

Warning: only a limited set of validation are implemented.

Details about SQL Validation

Validation contains:

  • not using a missing column
  • existing functions
  • correct aggregations
  • schemaless (not assume that table names and columns in those exist)
  • types correctness in functions

(only on SELECT-statements)

SQL Syntax

Use with pre-commit

Add this to your .pre-commit-config.yaml:

  - repo: https://github.com/David-Wobrock/sqlvalidator
    rev: <sha1 of the latest sqlvalidator commit>
    hooks:
      - id: sqlvalidator

Contributing

If you want to contribute to the sqlvalidator, first, thank you for the interest.

Don't hesitate to open an Issue with a snippet of the failing SQL query and what the expected output would be.

However, I don't guarantee that will accept any Pull Request made to the repository. This is not because I don't value the work and energy put into contribution, but more because the project is still early stage, and I want to keep full control of its direction for now.

Internals

Run tests

pytest

Publishing

  • python3 setup.py sdist bdist_wheel
  • twine upload dist/sqlvalidator-X.Y.Z-py3-none-any.whl dist/sqlvalidator-X.Y.Z.tar.gz

About

SQL queries formatting, syntactic and semantic validation

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages