The spec is being developed alongside the implementation work, but at a high level the agreed-upon APIs look like (sketched in code after the list):
void append(Tables...) or appendDataFiles/appendTables: writes tables to data files 1:1, then commits a transaction that adds the new data files
void overwrite(Tables...): writes tables to data files 1:1, then commits a transaction that removes all existing data files and adds the new ones
List<URI> write(Tables...): writes tables to data files 1:1, does not commit anything
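To make the shape concrete, here is a minimal Java sketch of that surface. The interface name, the Table placeholder type, and the exact signatures are illustrative only; the spec will settle the real types.

```java
import java.net.URI;
import java.util.List;

// Sketch of the API surface described above; method names follow the issue text
// but nothing here is final. "Table" is a placeholder for whatever in-memory
// table representation the writer ends up accepting (for example an Arrow table).
public interface TableWriter {

  interface Table {}  // placeholder input type

  // Writes each table to exactly one data file, then commits a transaction
  // that adds the new data files to the Iceberg table.
  void append(Table... tables);

  // Writes each table to exactly one data file, then commits a transaction
  // that removes all existing data files and adds the new ones.
  void overwrite(Table... tables);

  // Writes each table to exactly one data file but does not commit anything;
  // returns the locations of the files that were written.
  List<URI> write(Table... tables);
}
```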
An important requirement is that we persist the Iceberg schema field-ids into the parquet schema's Type field_id field, so that Iceberg columns can be mapped to parquet columns.
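As a hedged illustration of that mapping (not the final implementation), a parquet schema built with parquet-java's Types builder can carry the Iceberg field-ids via .id(...); the column names and ids below are made up:

```java
import org.apache.parquet.schema.LogicalTypeAnnotation;
import org.apache.parquet.schema.MessageType;
import org.apache.parquet.schema.PrimitiveType.PrimitiveTypeName;
import org.apache.parquet.schema.Types;

// Sketch: each parquet column carries the field_id of the Iceberg column it maps to.
public class FieldIdSketch {
  public static void main(String[] args) {
    MessageType parquetSchema = Types.buildMessage()
        .required(PrimitiveTypeName.INT64).id(1).named("id")   // Iceberg field-id 1
        .optional(PrimitiveTypeName.BINARY)
            .as(LogicalTypeAnnotation.stringType()).id(2).named("data")  // Iceberg field-id 2
        .named("table");

    System.out.println(parquetSchema);
  }
}
```

With the ids persisted, readers can resolve columns by field_id rather than by name, which is what keeps Iceberg column renames and drops safe.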
We should also check whether there is any specific guidance on the metadata we should be writing. When writing a pyarrow table using pyiceberg, we've noticed that the metadata key ARROW:schema contains the Arrow schema; pyspark, by contrast, wrote a metadata key iceberg.schema that contains the Iceberg schema.
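For the iceberg.schema key specifically, here is a minimal sketch (assuming the Java libraries and Iceberg's SchemaParser) of what a writer could stage as parquet footer key-value metadata; how the map actually reaches the footer depends on the parquet writer in use:

```java
import java.util.Map;

import org.apache.iceberg.Schema;
import org.apache.iceberg.SchemaParser;
import org.apache.iceberg.types.Types;

// Sketch: build the footer key-value metadata a writer could attach, mirroring
// the "iceberg.schema" key observed in files written by pyspark.
public class FooterMetadataSketch {
  public static void main(String[] args) {
    Schema icebergSchema = new Schema(
        Types.NestedField.required(1, "id", Types.LongType.get()),
        Types.NestedField.optional(2, "data", Types.StringType.get()));

    // Serialize the Iceberg schema to its JSON form and stage it as footer metadata.
    Map<String, String> footerMetadata =
        Map.of("iceberg.schema", SchemaParser.toJson(icebergSchema));

    footerMetadata.forEach((k, v) -> System.out.println(k + " = " + v));
  }
}
```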