Added BigQuery Dialect and support format_date functionality #1296

Closed
wants to merge 616 commits into from
616 commits
e377a3f
Generate `CASE ... WHEN ... ELSE` clauses (#773)
vil1 Aug 12, 2024
1533415
Added logger statements in get_schema function (#756)
vijaypavann-db Aug 12, 2024
cab0bf2
Aggregates Reconcile documentation (#779)
vijaypavann-db Aug 12, 2024
4ad7a52
Generate IN (#754)
vil1 Aug 12, 2024
21b5657
Generate table alias (#774)
vil1 Aug 12, 2024
01ae6df
Generate Deduplicate (#780)
vil1 Aug 12, 2024
8c55bd5
TSQL: Improve transpilation coverage (#766)
jimidle Aug 12, 2024
6a99806
[internal] run Scala and ANTLR formatters in separate Maven profile (…
nfx Aug 12, 2024
a01a719
Corrections to Snowflake grammar for explicit precedence (#786)
jimidle Aug 12, 2024
bd18476
Improve correctness of snowflake transpilation by improving Snowflake…
vil1 Aug 13, 2024
645e4ab
Make non-standard default for Snowflake rank-related functions' windo…
vil1 Aug 13, 2024
8adc44e
TSQL: Implement grammar for CREATE DATABASE and CREATE DATABASE SCOPE…
jimidle Aug 13, 2024
d8d4b75
TSQL: Implement correct grammar for CREATE TABLE in all forms (#796)
jimidle Aug 13, 2024
48d8e92
Update developer documentation to provide extra ANTLR instructions (#…
jimidle Aug 13, 2024
743c643
Generate window functions (#772)
vil1 Aug 14, 2024
9d33a24
Fix CI coverage tests configuration (#798)
vil1 Aug 14, 2024
4a81860
Implement ANTLR4 grammar customized linter (#797)
jimidle Aug 15, 2024
061122b
Aggregates Reconcile Dashboard updates (#799)
vijaypavann-db Aug 15, 2024
dc8c596
Adding verification dataset (#800)
aman-db Aug 15, 2024
6f0987e
TSQL: Resolve orphaned grammar rules / grammar cleanup (#802)
jimidle Aug 16, 2024
e0cd57e
Snowflake: Remove redundant grammar rules for a clean lint (#804)
jimidle Aug 16, 2024
538b412
Greatly improve grammar for CREATE/ALTER (#805)
jimidle Aug 16, 2024
4968c61
[internal] run queries over real backends (#727)
nfx Aug 16, 2024
0cb54c4
[chore] fixed invalid SQL line (#806)
nfx Aug 16, 2024
bd033a2
Fix parsing of Snowflake LIKE expressions and generate SQL for them. …
vil1 Aug 19, 2024
e01e5c7
Fix translation of Snowflake's join types (#808)
vil1 Aug 19, 2024
060a0c3
[internal] Remove recursive dependency between logical plan and expre…
nfx Aug 19, 2024
ae9cf34
[snowflake] fix data type mapping and explain `UnparsedType` (#810)
nfx Aug 19, 2024
dc63fbf
[internal] always format transpiled queries (#814)
nfx Aug 19, 2024
52b2518
[internal] remove dead code (#815)
nfx Aug 19, 2024
38f0313
[internal] better errors for call mapper (#816)
nfx Aug 19, 2024
1f82e1c
TSQL: Bring coverage back to previous levels (#813)
jimidle Aug 20, 2024
fc76784
Sql formatter Patch (#819)
sundarshankar89 Aug 20, 2024
35cb8ce
[chore] cleanup snowflake functional tests (#821)
nfx Aug 20, 2024
0ba2bba
[snowflake] `UPDATE` as `MERGE INTO` (#817)
nfx Aug 20, 2024
b05807f
[sql] generate `INSERT INTO ...` (#823)
nfx Aug 20, 2024
ce7e483
[sql] generate `DELETE FROM ...` (#824)
nfx Aug 20, 2024
a989e00
[snowflake] map more functions to Databricks SQL (#826)
nfx Aug 20, 2024
b2dc8a9
[chore] increase coverage by 8% (#827)
nfx Aug 20, 2024
e62244f
Improve coverage by properly handling Snowflake's date/time related f…
vil1 Aug 21, 2024
ed38f01
Aggregates Reconcile Updates (#784)
vijaypavann-db Aug 21, 2024
b08f628
[sql] generate basic `CREATE TABLE` definition (#829)
nfx Aug 21, 2024
19d25c6
[sql] generate basic `TABLESAMPLE` (#830)
nfx Aug 21, 2024
2c98cd8
[snowflake] cleanup functional tests (#831)
nfx Aug 21, 2024
18b6f8d
[snowflake] more cleanup of functional tests (#834)
nfx Aug 21, 2024
a2a15b2
[sql] generate basic JSON access (#835)
nfx Aug 21, 2024
cf17f47
[chore] make expr generator tests more maintainable (#837)
nfx Aug 21, 2024
bdaa7ee
TSQL: Improve parser and lexer parsing level coverage ready for CREAT…
jimidle Aug 21, 2024
c333275
Improve coverage test success rate around snowflake's conversion func…
vil1 Aug 22, 2024
4779576
[chore] concrete implementation of Literal (#839)
nfx Aug 22, 2024
2ed4c03
[chore] speedup build process by not running unit tests twice (#842)
nfx Aug 22, 2024
9782fb3
[internal] added JVM command proxy in development mode (#843)
nfx Aug 22, 2024
1802a2e
Changing String data type to Varchar for validation dataset (#845)
aman-db Aug 23, 2024
8888a6a
Handling window frame of rank-related functions in snowflake (#833)
ganeshdogiparthi-db Aug 23, 2024
96c6764
Added Translation Support for `!` as `commands` and `&` for `Paramete…
sriram251-code Aug 23, 2024
ea60dbb
Add an optimizer rule for SF's WITHIN GROUP clause (#844)
vil1 Aug 23, 2024
6f4ab46
[chore] ensure `Alias(Expression, Id)` shape (#847)
nfx Aug 23, 2024
05d65c3
Release v0.5.0 (#849)
nfx Aug 23, 2024
573f905
Added support for `STRUCT` types and conversions (#852)
nfx Aug 23, 2024
8ccbfb8
[internal] added query plan debugger and dependency injection baselin…
nfx Aug 23, 2024
d4f90fc
Fix CommentBasedQueryExtractor so that it works as intended even when…
vil1 Aug 26, 2024
6de49da
Improve coverage around snowflake functions (#860)
vil1 Aug 28, 2024
031d31b
Reconcile Documentation updates (#862)
vijaypavann-db Aug 28, 2024
1ab2645
extra ";" generation has been taken care for Bang command (#858)
sriram251-code Aug 29, 2024
cdcb761
[snowflake] parse parameters (#855)
nfx Aug 29, 2024
e42c0c3
Added support for `ALTER TABLE`: `ADD COLUMNS`, `DROP COLUMNS`, `RENA…
sundarshankar89 Aug 29, 2024
acd70b5
Reconcile Table Recon JSON filename updates (#866)
vijaypavann-db Aug 29, 2024
f18b459
Minor upgrades to Snowflake parameter processing (#871)
jimidle Aug 30, 2024
287822e
Miscellaneous Snowflake coverage fixes (#872)
vil1 Aug 30, 2024
cb217c0
Added query history retrieval from Snowflake (#874)
nfx Aug 30, 2024
8402a18
[snowflake] initial support for lateral views (#838)
nfx Aug 30, 2024
6794006
Add basis of ANTLR utilities, starting with text extractor (#882)
jimidle Sep 4, 2024
1de7ede
Updated Spark setup script (#875)
vijaypavann-db Sep 6, 2024
c6f780a
unresolved commands `alter session | stream...` `create stream` `crea…
sundarshankar89 Sep 6, 2024
a9ec557
Fix Query Generation IR for Select Distinct (#887)
sundarshankar89 Sep 6, 2024
2aab91e
fixed current_date() generation (#890)
sundarshankar89 Sep 10, 2024
622af53
Make CallMapper a Rule[LogicalPlan] (#899)
vil1 Sep 13, 2024
97e73f8
Patch/reconcile support temp views (#901)
ganeshdogiparthi-db Sep 14, 2024
e9e70d8
Release v0.6.0 (#905)
nfx Sep 16, 2024
94c141e
Make coverage test fail CI in case of failure (#908)
vil1 Sep 18, 2024
1c6c890
Upgrade script Implementation (#777)
vijaypavann-db Sep 18, 2024
63519e1
TSQL: Implement CREATE TABLE (#911)
jimidle Sep 18, 2024
633e453
Chore/other command (#881)
sundarshankar89 Sep 18, 2024
7113b2b
TSQL: Clean up and correct TSQL CREATE TABLE (#914)
jimidle Sep 19, 2024
407cd38
Updated remorph reconcile workflow to use wheels instead of pypi (#884)
vijaypavann-db Sep 19, 2024
94076af
Create more robust toolchain control with error recording for repeata…
jimidle Sep 20, 2024
852f577
Fetch TableDefinitions from Snowflake (#904)
sriram251-code Sep 20, 2024
16fb43c
TSQL: Reorg of DML IR and generation production after CREATE TABLE (#…
jimidle Sep 20, 2024
b3ec274
Patch Pylint to fixed version 3.2.2 (#925)
sundarshankar89 Sep 23, 2024
a51ea01
feat: Implement all of TSQL predicates except for SOME ALL ANY (#922)
jimidle Sep 23, 2024
8b24f54
Added private key authentication for sf (#917)
ganeshdogiparthi-db Sep 24, 2024
e28ef0a
Ensure that successful parsing is recorded even if transpilation fail…
jimidle Sep 24, 2024
60d99d8
Added documentation for Reconcile authentication keys (#931)
ganeshdogiparthi-db Sep 24, 2024
9253912
Patch Function without Brackets (#907)
sundarshankar89 Sep 24, 2024
a86301b
Move coverage tests into the core module (#912)
vil1 Sep 24, 2024
511a74e
Guide user on missing configuration file (#930)
ludovicc Sep 24, 2024
458f788
Created a mapping dict for algo for each dialect at layer level (#934)
ganeshdogiparthi-db Sep 25, 2024
b0e33fe
Create repeatable estimator for Snowflake query history (#924)
jimidle Sep 26, 2024
83f2f72
Make debug-coverage a proper command (#940)
vil1 Sep 27, 2024
8405131
Translation Complexity Analyzer (#937)
jimidle Sep 27, 2024
dfc50e1
Dashboard As Code for Reconcile (#768)
sundarshankar89 Oct 2, 2024
8856a85
Establish rules infrastructure for complexity analysis (#941)
jimidle Oct 2, 2024
30dc687
Added support for `PARSE_JSON` and `VARIANT` datatype (#906)
aman-db Oct 3, 2024
715342d
Miscellaneous improvements to Snowflake parser (#952)
vil1 Oct 8, 2024
72f0f3a
Move intermediate package out of parsers (#972)
vil1 Oct 8, 2024
3b69654
Introduce typed errors (#981)
vil1 Oct 9, 2024
cacc1a0
Functional Test Ignore Folder (#982)
sundarshankar89 Oct 9, 2024
20ddb74
Update catalog operations logging (#910)
vijaypavann-db Oct 10, 2024
0a3090c
Root Table (#936)
sundarshankar89 Oct 10, 2024
6d5e808
bug fix for to_array function (#961)
aman-db Oct 10, 2024
5ec1944
Updated Installation to handle install errors (#962)
vijaypavann-db Oct 10, 2024
2c53bda
Transpile Snow MONTHS_BETWEEN function correctly (#963)
bishwajit-db Oct 10, 2024
798008e
Establish incomplete parser and aggregate functions for all Visitors …
jimidle Oct 10, 2024
98f6a6f
Snowflake Merge Implementation (#964)
sundarshankar89 Oct 10, 2024
1c843d3
Added _null_recon documentation in README (#967)
vijaypavann-db Oct 10, 2024
6a06bc4
Basic CTAS Implementation (#968)
sundarshankar89 Oct 10, 2024
d2b09ef
Transpile Snow ARRAY_SORT function (#973)
bishwajit-db Oct 10, 2024
66539ae
Implement more missing visitor functions for Snowflake and TSQL (#975)
jimidle Oct 10, 2024
59be9d7
Snowflake: Correct parsing of Array literals (#984)
jimidle Oct 10, 2024
72c81be
Implement more missing visitor functions for Snowflake and TSQL (#983)
jimidle Oct 10, 2024
c4c662d
Added upgrade script and modified metrics sql (#990)
ganeshdogiparthi-db Oct 10, 2024
5f27d15
Implement missing visitor functions for Snowflake and TSQL (#974)
jimidle Oct 10, 2024
9679215
Release v0.7.0 (#992)
nfx Oct 10, 2024
7457d08
SNOWFLAKE: Correctly specify search conditions in Snowflake (#985)
jimidle Oct 10, 2024
89f70a5
Don't draft automated releases (#995)
nfx Oct 11, 2024
67c3fe6
Add tests for Snow ARRAY_REMOVE function (#979)
bishwajit-db Oct 11, 2024
6b4731a
Enhance the Snow ARRAY_SORT function support (#994)
bishwajit-db Oct 11, 2024
1fe4103
Adorn all UnresolvedXXX Ir with ruleName and token name (#993)
jimidle Oct 15, 2024
d1e6ef1
Add rule name and token name to syntax error reporting (#1003)
jimidle Oct 15, 2024
cb75b48
Add Variant Support (#998)
sundarshankar89 Oct 15, 2024
3212612
Query History From Folder (#991)
sundarshankar89 Oct 16, 2024
7208dec
Return `Result` out of generators, instead of throwing exceptions (#1…
vil1 Oct 16, 2024
296a83e
Change the structure of Result to allow for accumulating errors (#1010)
vil1 Oct 18, 2024
c18f189
Rework error return system and use it to generate parsing error and c…
jimidle Oct 18, 2024
c484f6a
Ensure that unparsable text is not lost in the generated output (#1012)
jimidle Oct 21, 2024
2145b51
Improve error recovery code to preserve text in AST (#1014)
jimidle Oct 24, 2024
1ea2384
Adorn TSQL visitors/builders with error node checking (#1019)
jimidle Oct 24, 2024
34e4a54
Implement text gatherers for additional error types (#1020)
jimidle Oct 25, 2024
fd6febd
Added `PySpark` generator (#1026)
nfx Oct 29, 2024
0acb9a5
Improve Snowflake grammar to improve syntax error recovery (#1022)
jimidle Oct 29, 2024
4d9cf63
TSQL: prevent selectListElem from ignoring valid columns when errorNo…
jimidle Oct 29, 2024
fdc454a
Added IR baseline for workflow definitions (#1035)
nfx Oct 29, 2024
77496af
Correctly generate `F.fn_name` for builtin PySpark functions (#1037)
nfx Oct 30, 2024
da083f8
Fetch table definitions for TSQL (#986)
sriram251-code Oct 30, 2024
448ea6a
Some window functions does not support window frame conditions (#999)
sriram251-code Oct 30, 2024
0fcac7b
Table comments captured as part of Snowflake Table Definition (#989)
sriram251-code Oct 30, 2024
5574efe
Split workflow definitions into sensible packages (#1039)
nfx Oct 30, 2024
d69ffdd
Added `ruff` Python formatter proxy (#1038)
nfx Oct 30, 2024
50a3f23
Introduce an abstraction for handling stateful traversal of the tree …
vil1 Oct 30, 2024
10568b0
Use Transformation to get rid of the ctx parameter in generators (#1040)
vil1 Oct 31, 2024
c70df5e
Improve IR for job nodes (#1041)
nfx Oct 31, 2024
032a880
Added baseline for translating workflows (#1042)
nfx Oct 31, 2024
bcd2025
Add documentation around Transformation (#1043)
vil1 Nov 1, 2024
2c083d5
Added correct generation of `databricks.yml` for `QueryHistory` (#1044)
nfx Nov 1, 2024
1a64c5d
Added `debug-bundle` command for folder-to-folder translation (#1045)
nfx Nov 1, 2024
3bde3be
Collect errors in `Phase` (#1046)
vil1 Nov 4, 2024
835e9ab
Handling any_keys_match from presto (#1048)
ganeshdogiparthi-db Nov 4, 2024
b96aa6a
Create Command Extended (#1033)
sundarshankar89 Nov 4, 2024
2df8105
[chore] Improved `ApplicationContext` to generically parse `--dialect…
nfx Nov 4, 2024
e6defea
Avoid non local return (#1052)
vil1 Nov 4, 2024
cbea0c0
TO_NUMBER/TO_DECIMAL/TO_NUMERIC without precision and scale (#1053)
sriram251-code Nov 4, 2024
acfbb4f
Fixed handling of projected expressions in `TreeNode` (#1159)
nfx Nov 4, 2024
498f09c
Added IR for stored procedures (#1161)
nfx Nov 4, 2024
11f6983
[chore] fixed query coverage report (#1160)
nfx Nov 4, 2024
205595c
With Recursive (#1000)
sundarshankar89 Nov 5, 2024
a92b05c
Rework serialization using circe+jackson (#1163)
vil1 Nov 6, 2024
6ad95e6
Release v0.8.0 (#1170)
nfx Nov 7, 2024
ef23a91
update setup docs (#1172)
ericvergnaud Nov 8, 2024
6df3b89
make hatch a pre-requisite (#1171)
ericvergnaud Nov 8, 2024
4486e58
Avoid processing escapes in interpolator arguments (#1167)
vil1 Nov 8, 2024
c6baa47
Generic stored procedure parsing (#1047)
jimidle Nov 8, 2024
27b143b
Fix serialization of MultipleErrors (#1177)
vil1 Nov 8, 2024
50771bf
[Snowflake] implement GROUP BY ALL (#1180)
vil1 Nov 12, 2024
8e619a0
[Snowflake] implement AST for positional column identifiers (#1181)
vil1 Nov 13, 2024
583188a
[Snowflake] Do not confuse single row VALUES with a function call (#1…
vil1 Nov 13, 2024
7601ced
[Snowflake] Add support for inline columns in CTEs (#1184)
vil1 Nov 13, 2024
cf04ded
patches optimizer rule Failure for NameOrPosition (#1206)
sundarshankar89 Nov 14, 2024
d4ad0cd
Bump sqlglot from 25.8.1 to 25.35.0 (#1205)
ganeshdogiparthi-db Nov 14, 2024
f567573
Fixed `CODEOWNERS` (#1216)
nfx Nov 18, 2024
d2d2a4b
Limit LSQL to releases before 0.14.0 (#1217)
asnare Nov 19, 2024
2b3d5f6
TSQL parser support for `(LHS) UNION RHS` queries (#1211)
asnare Nov 19, 2024
ea50af8
Fixing presto strpos and array_average functions (#1196)
ganeshdogiparthi-db Nov 21, 2024
3163132
Handling presto Unnest cross join to Databricks lateral view (#1209)
ganeshdogiparthi-db Nov 21, 2024
0f06d16
Support multiple columns in order by clause in for ARRAYAGG (#1228)
bishwajit-db Nov 22, 2024
9dcc986
Fix implementation of Snowflake's TO_TIME/TO_TIMESTAMP functions (#1218)
vil1 Nov 22, 2024
c1b4afd
bug fix for alter table add multiple columns (#1179)
aman-db Nov 25, 2024
0303dc0
Added support for json_size function in presto (#1236)
ganeshdogiparthi-db Nov 26, 2024
94cefa5
Add tests for partially successful parsing (#1237)
ericvergnaud Nov 26, 2024
49ed265
document setup for GPG signing (#1243)
ericvergnaud Nov 26, 2024
adba055
Tech debt: replace empty case classes with case objects (#1248)
asnare Nov 27, 2024
c61b377
patching memory leak file (#1247)
sundarshankar89 Nov 28, 2024
7875249
Enclose subqueries in parenthesis (#1232)
ericvergnaud Nov 28, 2024
a08cd61
Refactor `x.tail.foldleft(x.head)` chains as `x.reduceLeft` (#1249)
asnare Nov 28, 2024
a5bbdb6
Implement remaining TSQL set operations. (#1227)
asnare Nov 28, 2024
d50712f
Added support for format_datetime function in presto to Databricks (…
ganeshdogiparthi-db Nov 28, 2024
9fb429d
Drop unnecessary unit test (#1257)
asnare Nov 28, 2024
f192757
Bump codecov/codecov-action from 4 to 5 (#1210)
dependabot[bot] Nov 28, 2024
220b303
Add support for SnowFlake `SUBSTR` (#1238)
ericvergnaud Nov 29, 2024
6ec9c55
Bump sqlglot from 25.30.0 to 25.32.1 (#1254)
dependabot[bot] Nov 29, 2024
0b2596c
Allow easier debugging of test failures (#1258)
asnare Nov 29, 2024
c3df7ce
Release v0.9.0 (#1262)
sundarshankar89 Dec 2, 2024
a82a394
patch upgrade script (#1261)
sundarshankar89 Dec 3, 2024
9ffc6a0
EditorConfig setup for project (#1246)
asnare Dec 3, 2024
502e46f
Chore: remove some redundant (default) arguments on case classes in a…
asnare Dec 4, 2024
470c386
Update TSQL grammar so that INTERSECT precedence is handled there (#1…
asnare Dec 4, 2024
2ed569d
Style: avoid unscoped `private` fields (#1263)
asnare Dec 4, 2024
be81409
Fix precedence of CTEs with respect to set operations in SnowFlake (#…
asnare Dec 4, 2024
15a23f5
Create preprocessor and workflow for Jinja template processing (#1168)
jimidle Dec 4, 2024
5f87798
Mark incorrect functional test fixture for Snowflake's `RANDOM()` (#1…
asnare Dec 5, 2024
6aed1bb
Add documentation for Oracle JDBC reader options (#1264)
bishwajit-db Dec 5, 2024
67f3d47
Fix dropped set operations within a CTE in Snowflake (#1277)
asnare Dec 5, 2024
2602bf4
Added support sha2 function in SQLGLOT engine (#1279)
ganeshdogiparthi-db Dec 5, 2024
2dd5642
Add basic support for SnowFlake LCAs (#1242)
ericvergnaud Dec 5, 2024
0d998f8
Upgrade jdbc dependencies (#1289)
sundarshankar89 Dec 5, 2024
87fe90e
Added BigQuery Dialect and support format_date functionality
ganeshdogiparthi-db Dec 5, 2024
bb79df0
Bump antlr.version from 4.13.1 to 4.13.2 (#1309)
dependabot[bot] Dec 6, 2024
1e067e2
Fix TSQL precedence of `INTERSECT` with respect to `UNION`/`EXCEPT` (…
asnare Dec 6, 2024
98d6e09
Bump org.junit:junit-bom from 5.10.0 to 5.11.3 (#1310)
dependabot[bot] Dec 9, 2024
bda8336
Fix Snowflake precedence of `INTERSECT` with respect to other set ope…
asnare Dec 9, 2024
416fd3f
Avoid using one stack frame per non-jinja character in the input quer…
vil1 Dec 9, 2024
160adcd
Bump com.databricks:databricks-sdk-java from 0.21.0 to 0.36.0 (#1292)
dependabot[bot] Dec 9, 2024
6ad0b47
Bump org.scoverage:scoverage-maven-plugin from 2.0.5 to 2.0.6 (#1308)
dependabot[bot] Dec 9, 2024
7aa1d44
Bump com.lihaoyi:ujson_2.12 from 3.0.0-M2 to 4.0.2 (#1311)
dependabot[bot] Dec 9, 2024
07dc39d
Bump org.apache.maven.plugins:maven-jar-plugin from 2.4 to 3.4.2 (#1318)
dependabot[bot] Dec 9, 2024
669312a
Prevent Dependabot from upgrading Databricks Connect. (#1322)
asnare Dec 9, 2024
0281186
Properly translate Snowflake REGEXP_SUBSTR (#1312)
vil1 Dec 9, 2024
5b815fb
Refactor python transpiler code (#1314)
ericvergnaud Dec 9, 2024
ccbb359
Fix incorrect script for no-pylint-disable (#1329)
ericvergnaud Dec 9, 2024
906c2a6
Bump com.lihaoyi:pprint_2.12 from 0.8.1 to 0.9.0 (#1326)
dependabot[bot] Dec 9, 2024
4278f0a
Bump org.apache.maven.plugins:maven-compiler-plugin from 3.11.0 to 3.…
dependabot[bot] Dec 9, 2024
3544cab
Bump net.alchim31.maven:scala-maven-plugin from 4.8.1 to 4.9.2 (#1323)
dependabot[bot] Dec 9, 2024
287e110
Merge remote-tracking branch 'origin/main' into feature/transpile_int…
ganeshdogiparthi-db Dec 9, 2024
4598772
Mark scala as a dependency for Dependabot to ignore. (#1331)
asnare Dec 10, 2024
335d1ae
Bump sqlglot from 25.32.1 to 25.33.0 (#1285)
dependabot[bot] Dec 10, 2024
aa5eccd
Bump com.fasterxml.jackson.module:jackson-module-scala_2.12 from 2.15…
dependabot[bot] Dec 10, 2024
d962587
Bump org.apache.maven.plugins:maven-source-plugin from 3.3.0 to 3.3.1…
dependabot[bot] Dec 10, 2024
d66df51
Adjust Dependabot's maven update policies (#1338)
asnare Dec 10, 2024
4150125
Fix dependabot configuration (#1339)
asnare Dec 10, 2024
5b57a47
rename ci jobs for clarity (#1330)
ericvergnaud Dec 11, 2024
d8c75ea
Configure Dependabot to ignore Mockito from 5.x onwards. (#1342)
asnare Dec 11, 2024
8123fee
Allow ignoring tests using a function (#1337)
vil1 Dec 11, 2024
69d91fd
Bump com.databricks:databricks-sdk-java from 0.36.0 to 0.37.0 (#1340)
dependabot[bot] Dec 11, 2024
6f58449
Bump org.apache.maven.plugins:maven-assembly-plugin from 3.6.0 to 3.7…
dependabot[bot] Dec 11, 2024
4818287
Bump org.scala-lang:scala-library from 2.12.19 to 2.12.20 (#1344)
dependabot[bot] Dec 11, 2024
806b760
Bump org.codehaus.mojo:build-helper-maven-plugin from 3.4.0 to 3.6.0 …
dependabot[bot] Dec 11, 2024
5106fb7
Bump com.github.eirslett:frontend-maven-plugin from 1.15.0 to 1.15.1 …
dependabot[bot] Dec 11, 2024
188ca98
rename fields in TranspileConfig (#1321)
ericvergnaud Dec 11, 2024
da80c9f
Bump org.scalatest:scalatest_2.12 from 3.3.0-SNAP3 to 3.3.0-SNAP4 (#1…
dependabot[bot] Dec 11, 2024
a885406
Bump com.fasterxml.jackson.dataformat:jackson-dataformat-yaml from 2.…
dependabot[bot] Dec 11, 2024
942f0f2
Merge branch 'main' into feature/transpile_introduce_bigquery
ganeshdogiparthi-db Dec 12, 2024
10 changes: 10 additions & 0 deletions .codegen.json
@@ -0,0 +1,10 @@
{
  "version": {
    "src/databricks/labs/remorph/__about__.py": "__version__ = \"$VERSION\""
  },
  "toolchain": {
    "required": ["hatch"],
    "pre_setup": ["hatch env create"],
    "prepend_path": ".venv/bin"
  }
}
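The `version` entry above maps a file path to a line template containing a `$VERSION` placeholder. How the release tooling consumes this file is not shown in the diff, so the following is only a minimal sketch of the substitution idea, assuming plain `$NAME`-style templating; `render_version_line` is a hypothetical helper, not part of Remorph.

```python
from string import Template

# Line template copied from .codegen.json after JSON unescaping.
VERSION_TEMPLATE = Template('__version__ = "$VERSION"')


def render_version_line(version: str) -> str:
    """Hypothetical helper: fill the $VERSION placeholder for __about__.py."""
    return VERSION_TEMPLATE.substitute(VERSION=version)


if __name__ == "__main__":
    # v0.9.0 is one of the releases in the commit list above.
    print(render_version_line("0.9.0"))
```

Keeping the template in configuration means the version string lives in one file (`__about__.py`) and a release only has to rewrite that single line.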
22 changes: 22 additions & 0 deletions .editorconfig
@@ -0,0 +1,22 @@
# Top-most EditorConfig file.
root = true

# Universal settings.
[*]
indent_style = space
indent_size = 4
tab_width = 8
end_of_line = lf
charset = utf-8
trim_trailing_whitespace = true
insert_final_newline = true

[Makefile]
indent_style = tab
indent_size = tab

[*.{json,yml}]
indent_size = 2

[*.scala]
indent_size = 2
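EditorConfig applies sections from top to bottom, so a later matching section overrides properties set by an earlier one. The sketch below illustrates that resolution order for the sections in this file. It is an approximation under stated assumptions: the brace pattern `[*.{json,yml}]` is expanded by hand, matching is done on the file name only with `fnmatchcase`, and `effective_settings` is a hypothetical helper rather than a real EditorConfig API.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# Sections in file order; [*.{json,yml}] is expanded by hand into two patterns.
SECTIONS = [
    (["*"], {
        "indent_style": "space",
        "indent_size": "4",
        "tab_width": "8",
        "end_of_line": "lf",
        "charset": "utf-8",
        "trim_trailing_whitespace": "true",
        "insert_final_newline": "true",
    }),
    (["Makefile"], {"indent_style": "tab", "indent_size": "tab"}),
    (["*.json", "*.yml"], {"indent_size": "2"}),
    (["*.scala"], {"indent_size": "2"}),
]


def effective_settings(path: str) -> dict[str, str]:
    """Hypothetical helper: merge all matching sections, later ones winning."""
    name = PurePosixPath(path).name
    settings: dict[str, str] = {}
    for patterns, props in SECTIONS:
        if any(fnmatchcase(name, pattern) for pattern in patterns):
            settings.update(props)
    return settings


if __name__ == "__main__":
    for sample in ("src/app.py", "core/Parser.scala", "Makefile", "labs.yml"):
        resolved = effective_settings(sample)
        print(sample, resolved["indent_style"], resolved["indent_size"])
```

Under these assumptions a Scala file inherits every universal setting except `indent_size`, which the `[*.scala]` section narrows to 2.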
92 changes: 92 additions & 0 deletions .github/ISSUE_TEMPLATE/bug.yml
@@ -0,0 +1,92 @@
# See https://docs.github.com/en/communities/using-templates-to-encourage-useful-issues-and-pull-requests/syntax-for-issue-forms
# and https://docs.github.com/en/communities/using-templates-to-encourage-useful-issues-and-pull-requests/syntax-for-githubs-form-schema
name: Bug Report
description: Something is not working in Remorph
title: "[BUG]: "
labels: ["bug", "needs-triage"]
# assignees:
#   - remorph-write
body:
  - type: checkboxes
    attributes:
      label: Is there an existing issue for this?
      description: Please search to see if an issue already exists for the bug you encountered.
      options:
        - label: I have searched the existing issues
          required: true
  - type: dropdown
    id: category
    attributes:
      label: Category of Bug / Issue
      description: Please select the category that best describes the bug / issue you are reporting.
      options:
        - TranspileParserError
        - TranspileValidationError
        - TranspileLateralColumnAliasError
        - ReconcileError
        - Other
    validations:
      required: true
  - type: textarea
    attributes:
      label: Current Behavior
      description: |
        A concise description of what you're experiencing.
        **Do not paste links to attachments with logs and/or images, as all issues with attachments will get deleted.**
        Use the `Relevant log output` field to paste redacted log output without personal identifying information (PII).
        You can Ctrl/Cmd+V the screenshot, which would appear as a rendered image if it doesn't contain any PII.
    validations:
      required: false
  - type: textarea
    attributes:
      label: Expected Behavior
      description: A concise description of what you expected to happen.
    validations:
      required: false
  - type: textarea
    attributes:
      label: Steps To Reproduce
      description: Steps to reproduce the behavior.
      placeholder: |
        1. In this environment...
        1. With this config...
        1. Run '...'
        1. See error...
    validations:
      required: false
  - type: textarea
    id: logs
    attributes:
      label: Relevant log output or Exception details
      description: Please copy and paste any relevant log output. This will be automatically formatted into code, so no need for backticks.
      render: shell
  - type: textarea
    id: query
    attributes:
      label: Sample Query
      description: Please copy and paste anonymized Query. This will be automatically formatted into code, so no need for backticks.
      render: shell
  - type: dropdown
    id: os
    attributes:
      label: Operating System
      description: Which operating system do you have Remorph installed on?
      options:
        - macOS
        - Linux
        - Windows
    validations:
      required: true
  - type: dropdown
    id: version
    attributes:
      label: Version
      description: What version of our software are you running?
      options:
        - latest via Databricks CLI
        - v0.1.5
        - v0.1.4
        - other
      default: 0
    validations:
      required: true
9 changes: 9 additions & 0 deletions .github/ISSUE_TEMPLATE/config.yml
@@ -0,0 +1,9 @@
blank_issues_enabled: false
contact_links:
  - name: General Databricks questions
    url: https://help.databricks.com/
    about: Issues related to Databricks and not related to Remorph

  - name: Remorph Documentation
    url: https://github.com/databrickslabs/remorph/tree/main/docs
    about: Documentation about Remorph
45 changes: 45 additions & 0 deletions .github/ISSUE_TEMPLATE/feature.yml
@@ -0,0 +1,45 @@
# See https://docs.github.com/en/communities/using-templates-to-encourage-useful-issues-and-pull-requests/syntax-for-issue-forms
# and https://docs.github.com/en/communities/using-templates-to-encourage-useful-issues-and-pull-requests/syntax-for-githubs-form-schema
name: Feature Request
description: Something new needs to happen with Remorph
title: "[FEATURE]: "
labels: ["enhancement", "needs-triage"]
# assignees:
#   - remorph-write
body:
  - type: checkboxes
    attributes:
      label: Is there an existing issue for this?
      description: Please search to see if an issue already exists for the feature request you're willing to submit
      options:
        - label: I have searched the existing issues
          required: true
  - type: dropdown
    id: category
    attributes:
      label: Category of feature request
      description: Please select the category that best describes the feature you are requesting for.
      options:
        - Transpile
        - Reconcile
        - Other
    validations:
      required: true
  - type: textarea
    attributes:
      label: Problem statement
      description: A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]
    validations:
      required: true
  - type: textarea
    attributes:
      label: Proposed Solution
      description: A clear and concise description of what you want to happen.
    validations:
      required: true
  - type: textarea
    attributes:
      label: Additional Context
      description: Add any other context, references or screenshots about the feature request here.
    validations:
      required: false
23 changes: 22 additions & 1 deletion .github/dependabot.yml
@@ -3,4 +3,25 @@ updates:
  - package-ecosystem: "pip"
    directory: "/"
    schedule:
      interval: "daily"
  - package-ecosystem: "github-actions"
    directory: "/"
    schedule:
      interval: "daily"
  - package-ecosystem: "maven"
    directory: "/"
    schedule:
      interval: "daily"
    ignore:
      # Ignore updates for Databricks Connect: the version in use needs to match the testing infrastructure.
      - dependency-name: "com.databricks:databricks-connect"
      # Ignore non-patch updates for Scala: we manually manage the Scala version.
      - dependency-name: "org.scala-lang:scala-library"
        update-types:
          # (Scala 2 patch releases are binary compatible, so they're the only type allowed.)
          - "version-update:semver-minor"
          - "version-update:semver-major"
      # Mockito from 5.x requires JDK 11, but we are using JDK 8.
      - dependency-name: "org.mockito:mockito-core"
        versions:
          - ">=5.0.0"
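The `ignore` rules above filter proposed updates by semantic-version change type (Scala) and by version range (Mockito). The sketch below only illustrates that intent for simple `MAJOR.MINOR.PATCH` versions; it is not Dependabot's implementation, and the function names are hypothetical.

```python
def parse(version: str) -> tuple[int, int, int]:
    """Split a plain MAJOR.MINOR.PATCH string into integers."""
    major, minor, patch = (int(part) for part in version.split("."))
    return major, minor, patch


def update_type(current: str, candidate: str) -> str:
    """Classify an upgrade using the labels from dependabot.yml."""
    cur, new = parse(current), parse(candidate)
    if new[0] != cur[0]:
        return "version-update:semver-major"
    if new[1] != cur[1]:
        return "version-update:semver-minor"
    return "version-update:semver-patch"


# Update types listed under `update-types` for org.scala-lang:scala-library.
SCALA_IGNORED = {"version-update:semver-minor", "version-update:semver-major"}


def scala_update_allowed(current: str, candidate: str) -> bool:
    """Only patch-level Scala updates pass the ignore rule."""
    return update_type(current, candidate) not in SCALA_IGNORED


def mockito_update_allowed(candidate: str) -> bool:
    """Versions matching ">=5.0.0" are ignored."""
    return parse(candidate) < (5, 0, 0)


if __name__ == "__main__":
    print(scala_update_allowed("2.12.19", "2.12.20"))
    print(scala_update_allowed("2.12.20", "2.13.0"))
    print(mockito_update_allowed("5.0.0"))
```

The Scala bump from 2.12.19 to 2.12.20 that appears in the commit list is a patch-level change, which is the only kind this rule lets through.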
60 changes: 60 additions & 0 deletions .github/scripts/setup_spark_remote.sh
@@ -0,0 +1,60 @@
#!/usr/bin/env bash

set -xve

mkdir -p "$HOME"/spark
cd "$HOME"/spark || exit 1

version=$(wget -O - https://dlcdn.apache.org/spark/ | grep 'href="spark' | grep -v 'preview' | sed 's:</a>:\n:g' | sed -n 's/.*>//p' | tr -d spark- | tr -d / | sort -r --version-sort | head -1)
if [ -z "$version" ]; then
  echo "Failed to extract Spark version"
  exit 1
fi

spark=spark-${version}-bin-hadoop3
spark_connect="spark-connect_2.12"

mkdir -p "${spark}"


SERVER_SCRIPT=$HOME/spark/${spark}/sbin/start-connect-server.sh

## Check whether this Spark version is already unpacked; if not, download it.
if [ -f "${SERVER_SCRIPT}" ];then
  echo "Spark Version already exists"
else
  if [ -f "${spark}.tgz" ];then
    echo "${spark}.tgz already exists"
  else
    wget "https://dlcdn.apache.org/spark/spark-${version}/${spark}.tgz"
  fi
  tar -xvf "${spark}.tgz"
fi

cd "${spark}" || exit 1
## Check whether the Spark Connect server is running; if not, start it.
result=$(${SERVER_SCRIPT} --packages org.apache.spark:${spark_connect}:"${version}" > "$HOME"/spark/log.out; echo $?)

if [ "$result" -ne 0 ]; then
  count=$(tail "${HOME}"/spark/log.out | grep -c "SparkConnectServer running as process")
  if [ "${count}" == "0" ]; then
    echo "Failed to start the server"
    exit 1
  fi
  # Wait for the server to start by pinging localhost:4040
  echo "Waiting for the server to start..."
  for i in {1..30}; do
    if nc -z localhost 4040; then
      echo "Server is up and running"
      break
    fi
    echo "Server not yet available, retrying in 5 seconds..."
    sleep 5
  done

  if ! nc -z localhost 4040; then
    echo "Failed to start the server within the expected time"
    exit 1
  fi
fi
echo "Started the Server"
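The `version=` pipeline in the script picks the newest non-preview Spark release from a directory listing. One detail worth knowing: `tr -d spark-` deletes every occurrence of the characters `s`, `p`, `a`, `r`, `k` and `-` rather than the literal prefix, which is harmless only while version strings are purely numeric. The sketch below restates the same selection in Python as an illustration. It assumes the listing contains links of the form `href="spark-<version>/"`; `latest_spark_version` and the sample HTML are hypothetical.

```python
import re
from typing import Optional


def latest_spark_version(listing_html: str) -> Optional[str]:
    """Return the highest purely numeric, non-preview version in the listing."""
    versions = []
    for name in re.findall(r'href="spark-([^"/]+)/?"', listing_html):
        if "preview" in name:
            continue  # mirrors `grep -v 'preview'`
        if re.fullmatch(r"\d+(\.\d+)*", name):
            versions.append(name)
    if not versions:
        return None  # mirrors the "Failed to extract Spark version" branch
    # Compare numerically per component, like `sort --version-sort`.
    return max(versions, key=lambda v: tuple(int(part) for part in v.split(".")))


if __name__ == "__main__":
    # Made-up listing for illustration only.
    sample = (
        '<a href="spark-3.4.4/">spark-3.4.4/</a>\n'
        '<a href="spark-3.5.3/">spark-3.5.3/</a>\n'
        '<a href="spark-4.0.0-preview2/">spark-4.0.0-preview2/</a>\n'
    )
    print(latest_spark_version(sample))
```

Comparing integer tuples avoids the lexical-sort pitfall in which `3.9.1` would rank above `3.10.0`.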