Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
allows naming conventions to be changed (#998)
* allows to decorate async function with dlt.source * adds pytest-async and updates pytest to 7.x * fixes forked teardown issue 7.x * bumps deps for py 3.12 * adds py 12 common tests * fixes typings after deps bump * bumps airflow, yanks duckdb to 0.9.2 * fixes tests * fixes pandas version * adds 3.12 duckdb dep * adds right hand pipe operator * fixes docker ci build * adds docs on async sources and resources * normalizes default hints and preferred types in schema * defines pipeline state table in utils, column normalization in simple regex * normalizes all identifiers used by relational normalizer, fixes other modules * fixes sql job client to use normalized identifiers in queries * runs state sync tests for lower and upper case naming conventions * fixes weaviate to use normalized identifiers in queries * partially fixes qdrant incorrect state and version retrieval queries * initial sql uppercase naming convention * adds native df readers to databricks and bigquery * adds casing identifier capability to support different casing in naming conventions, fixes how identifiers are normalized in destinations * cleans typing for relational normalizer * renames escape functions * destination capabilities for case fold and case sensitivity * drops supports naming module and allows naming to be instance in config and schema * checks all tables in information schema in one go, observes case folding and sensitivity in sql destinations * moves schema verification to destination utils * adds method to remove processing hints from schema, helper functions for schema settings, refactor, tests * accepts naming convention instances when resolving configs * fixes the cloning of schema in decorator, removes processing hints * removes processing hints when saving imported schema * adds docs on naming conventions, removes technical docs * adds casing info to databrick caps, makes caps an instance attr * adjusts destination casing in caps from schema naming and config * raises detailed schema identifier clash exceptions * adds is_case_sensitive and name to NamingConvention * adds sanity check if _dlt prefix is preserved * finds genric types in non generic classes deriving from generic * uses casefold INSERT VALUES job column names * adds a method make_qualified_table_name_path that calculates components of fully qualified table name and uses it to query INFO SCHEMA * adds casing info to destinations, caps as instance attrs, custom table name paths * adds naming convention to restore state tests, make them essential * fixes table builder tests * removes processing hints when exporting schema to import folder, warns on schema import overriding local schema, warns on processing hints present * allows to subclass INFO SCHEMA query generation and uses specialized big query override * uses correct schema escaping function in sql jobs * passes pipeline state to package state via extract * fixes optional normalizers module * excludes version_hash from pipeline state SELECT * passes pipeline state to package state pt.2 * re-enables sentry tests * bumps qdrant client, makes test running for local version * makes weaviate running * uses schemata to find databases on athena * uses api get_table for hidden dataset on bigquery to reflect schemas, support case insensitive datasets * adds naming conventions to two restore state tests * fixes escape identifiers to column escape * fix conflicts in docs * adjusts capabilities in capabilities() method, uses config and naming optionally * allows to add props to classes without vectorizer in weaviate * moves caps function into factories, cleansup adapters and custom destination * sentry_dsn * adds basic destination reference tests * fixes table builder tests * fix deps and docs * fixes more tests * case sensitivity docs stubs * fixes drop_pipeline fixture * improves partial config generation for capabilities * adds snowflake csv support * creates separate csv tests * allows to import files into extract storage, adds import file writer and spec * handles ImportFileMeta in extractor * adds import file item normalizer and router to normalize * supports csv format config for snowflake * removes realpath wherever possible and adds fast make_full_path to FileStorage * adds additional methods to load_package storage to make listings faster * adds file_format to dlt.resource, uses preferred file format for dlt state table * docs for importing files, file_format * code improvements and tests * docs hard links note * moves loader parallelism test to pipeliens, solves duckdb ci test error issue * fixes tests * moves drop_pipeline fixture level up * drops default naming convention from caps so naming in saved schema persists, allows (section, <schema_name>, schema) config section for schema settings * unifies all representations of pipeline state * tries to decompress text file first in fs_client * tests get stored state in test_job_client * removes credentials from dlt.attach, addes destination and staging factories * cleans up env variables and pipeline dropping fixutere precedence * removes dev_mode from dlt.attach * adds missing arguments to filesystem factory * fixes tests * updates destination and naming convention docs * removes is_case_sensitive from naming convention initializer * simplifies with_file_import mark * adds case sensitivity tests * uses dev_mode everywhere * improves csv docs * fixes encodings in fsspec * improves naming convention docs * fixes tests and renames clash to collision * fixes getting original bases from instance
- Loading branch information