Ensure "nessie.commit.id table" property is set when updating the table #19524

ajantha-bhat · 2023-10-25T09:08:47Z

Description

Spark sets the table property NESSIE_COMMIT_ID_PROPERTY in NessieTableOperations#loadTableMetadata. Then NessieIcebergClient.commitTable uses this property.

In Trino, this property is never set but used in NessieIcebergClient.commitTable as it is a common code. Hence, the commit id is old and doesn't allow new commits.

Use the common code (available From Iceberg 1.4.0) NessieUtil.updateTableMetadataWithNessieSpecificProperties in Trino, which handles setting the property like "nessie.commit.id".

Additional context and related issues

Fixes #17813

Release notes

(x) This is not user-visible or is docs only, and no release notes are required.
( ) Release notes are required. Please propose a release note for me.
( ) Release notes are required, with the following suggested text:

# Section
* Fix some things. ({issue}`issuenumber`)

...roduct-tests/src/main/java/io/trino/tests/product/iceberg/TestIcebergSparkCompatibility.java

ajantha-bhat · 2023-10-30T04:59:35Z

...roduct-tests/src/main/java/io/trino/tests/product/iceberg/TestIcebergSparkCompatibility.java

+        onSpark().executeQuery("CREATE TABLE " + sparkTableName + " (a INT, b INT, c INT) USING ICEBERG");
+        onSpark().executeQuery("INSERT INTO " + sparkTableName + " VALUES (1, 2, 3)");
+
+        onTrino().executeQuery("INSERT INTO " + trinoTableName + " VALUES (4, 5, 6)");


This used to fail for Nessie and there was no test case to cover this case.

ajantha-bhat · 2023-10-30T05:00:07Z

cc: @findepi, @ebyhr, @findinpath

ajantha-bhat · 2023-10-31T12:48:22Z

also cc: @dimas-b, @nastra

dimas-b

nice catch, @ajantha-bhat !

ajantha-bhat · 2023-11-13T09:58:08Z

@findinpath and @findepi: Can I please get a review on this? This is a small PR.

...no-iceberg/src/main/java/io/trino/plugin/iceberg/catalog/AbstractIcebergTableOperations.java

findinpath · 2023-11-14T15:36:15Z

...roduct-tests/src/main/java/io/trino/tests/product/iceberg/TestIcebergSparkCompatibility.java

+        String sparkTableName = sparkTableName(baseTableName);
+        String trinoTableName = trinoTableName(baseTableName);
+
+        onSpark().executeQuery("CREATE TABLE " + sparkTableName + " (a INT, b INT, c INT) USING ICEBERG");


i think we can stick to one column - there's no need to use multiple columns to test this change.

findinpath · 2023-11-14T15:52:41Z

...roduct-tests/src/main/java/io/trino/tests/product/iceberg/TestIcebergSparkCompatibility.java

+        String trinoTableName = trinoTableName(baseTableName);
+
+        onSpark().executeQuery("CREATE TABLE " + sparkTableName + " (a INT, b INT, c INT) USING ICEBERG");
+        onSpark().executeQuery("INSERT INTO " + sparkTableName + " VALUES (1, 2, 3)");


Do you want to ensure that the nessie.commit.id gets updated ?

public static String getNessieCommitId(String tableName) { String propertiesTableName = "\"" + baseTableName + "$properties\""; String trinoPropertiesTableName = trinoTableName(propertiesTableName); return (String) onTrino() .executeQuery("SELECT value FROM " + trinoPropertiesTableName + " WHERE key = 'nessie.commit.id'") .getOnlyValue(); }

This test is common for other catalogs (groups) too. So, I didn't want to have a catalog (group) specific checks.

findinpath · 2023-11-20T15:44:25Z

...roduct-tests/src/main/java/io/trino/tests/product/iceberg/TestIcebergSparkCompatibility.java

+        onSpark().executeQuery("INSERT INTO " + sparkTableName + " VALUES (1)");
+
+        onTrino().executeQuery("INSERT INTO " + trinoTableName + " VALUES (2)");


Suggested change

onSpark().executeQuery("INSERT INTO " + sparkTableName + " VALUES (1)");

onTrino().executeQuery("INSERT INTO " + trinoTableName + " VALUES (2)");

onSpark().executeQuery("INSERT INTO " + sparkTableName + " VALUES 1");

onTrino().executeQuery("INSERT INTO " + trinoTableName + " VALUES 2");

Spark sets the table property NESSIE_COMMIT_ID_PROPERTY in NessieTableOperations#loadTableMetadata. Then NessieIcebergClient.commitTable uses this property. In Trino, this property is never set but used in NessieIcebergClient.commitTable as it is a common code. Hence, the commit id is old and doesn't allow new commits. Use the common code (available From Iceberg 1.4.0) NessieUtil.updateTableMetadataWithNessieSpecificProperties in Trino, which handles setting the property like "nessie.commit.id".

ajantha-bhat · 2023-11-23T06:39:27Z

The failed test io.trino.tests.product.hive.TestHiveTransactionalTable.testReadFullAcidPartitioned is unrelated to the change.

cla-bot bot added the cla-signed label Oct 25, 2023

github-actions bot added tests:hive iceberg Iceberg connector labels Oct 25, 2023

ajantha-bhat marked this pull request as draft October 25, 2023 11:49

ajantha-bhat force-pushed the issue branch from aa4c6db to d764894 Compare October 30, 2023 02:08

ajantha-bhat marked this pull request as ready for review October 30, 2023 04:48

ajantha-bhat requested review from ebyhr and findinpath October 30, 2023 04:49

ajantha-bhat commented Oct 30, 2023

View reviewed changes

...roduct-tests/src/main/java/io/trino/tests/product/iceberg/TestIcebergSparkCompatibility.java Show resolved Hide resolved

ajantha-bhat commented Oct 30, 2023

View reviewed changes

ebyhr removed their request for review October 31, 2023 13:03

dimas-b approved these changes Oct 31, 2023

View reviewed changes

ajantha-bhat requested review from findepi and mosabua November 2, 2023 13:20

mosabua removed their request for review November 6, 2023 20:09

nastra approved these changes Nov 13, 2023

View reviewed changes

findinpath reviewed Nov 14, 2023

View reviewed changes

ajantha-bhat changed the title ~~Fix Trino cannot write to Nessie managed Iceberg table after spark~~ Fix Trino cannot write to Iceberg table after spark with Nessie catalog Nov 16, 2023

ajantha-bhat force-pushed the issue branch from d764894 to 1be14dc Compare November 16, 2023 14:08

findinpath reviewed Nov 20, 2023

View reviewed changes

ajantha-bhat force-pushed the issue branch from 1be14dc to 2303374 Compare November 21, 2023 09:41

ajantha-bhat changed the title ~~Fix Trino cannot write to Iceberg table after spark with Nessie catalog~~ Ensure "nessie.commit.id table" property is set when updating the table Nov 21, 2023

findinpath approved these changes Nov 21, 2023

View reviewed changes

ajantha-bhat mentioned this pull request Nov 27, 2023

iceberg connector cannot perform write operations when use Nessie catalog #17813

Closed

ajantha-bhat mentioned this pull request Dec 5, 2023

Remove deprecated HttpClientBuilder projectnessie/nessie#7803

Merged

electrum approved these changes Dec 15, 2023

View reviewed changes

electrum merged commit face6d8 into trinodb:master Dec 15, 2023
47 of 49 checks passed

github-actions bot added this to the 436 milestone Dec 15, 2023

colebow mentioned this pull request Dec 19, 2023

Add Trino 436 release notes #20166

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ensure "nessie.commit.id table" property is set when updating the table #19524

Ensure "nessie.commit.id table" property is set when updating the table #19524

ajantha-bhat commented Oct 25, 2023 •

edited

Loading

ajantha-bhat Oct 30, 2023

ajantha-bhat commented Oct 30, 2023

ajantha-bhat commented Oct 31, 2023

dimas-b left a comment

ajantha-bhat commented Nov 13, 2023

findinpath Nov 14, 2023

ajantha-bhat Nov 16, 2023

findinpath Nov 14, 2023

ajantha-bhat Nov 16, 2023

findinpath Nov 20, 2023

ajantha-bhat commented Nov 23, 2023

		onSpark().executeQuery("INSERT INTO " + sparkTableName + " VALUES (1)");

		onTrino().executeQuery("INSERT INTO " + trinoTableName + " VALUES (2)");

Ensure "nessie.commit.id table" property is set when updating the table #19524

Ensure "nessie.commit.id table" property is set when updating the table #19524

Conversation

ajantha-bhat commented Oct 25, 2023 • edited Loading

Description

Additional context and related issues

Release notes

ajantha-bhat Oct 30, 2023

Choose a reason for hiding this comment

ajantha-bhat commented Oct 30, 2023

ajantha-bhat commented Oct 31, 2023

dimas-b left a comment

Choose a reason for hiding this comment

ajantha-bhat commented Nov 13, 2023

findinpath Nov 14, 2023

Choose a reason for hiding this comment

ajantha-bhat Nov 16, 2023

Choose a reason for hiding this comment

findinpath Nov 14, 2023

Choose a reason for hiding this comment

ajantha-bhat Nov 16, 2023

Choose a reason for hiding this comment

findinpath Nov 20, 2023

Choose a reason for hiding this comment

ajantha-bhat commented Nov 23, 2023

ajantha-bhat commented Oct 25, 2023 •

edited

Loading