Allow uploading ZARR files #6205

philippotto · 2022-05-12T13:14:28Z

One of the following would probably make sense to have for better zarr support:

Support arbitrary zarr files (probably should be converted with wkcuber so that a mag hierarchy and meta data exist)
Support zarr files which already follow our format

jstriebel · 2022-06-10T09:34:18Z

see also #6120

fm3 · 2023-10-16T14:32:07Z

I’d say it makes sense to run a conversion job for zarr uploads to do re-chunking, sharding, etc.

We could also skip that in case there is already a datasource-properties.json (assuming that the user used the libs to create the zarr dataset already with optimal parameters). In this case, the backend also does not have to infer anything, but can just put the dataset on disk as it comes.

@normanrz @philippotto Do you think the existence of a datasource-properties.json (maybe together with the format-identifying zarr.json) is a good enough heuristic here? My guess is that we would always want to do re-chunking for zarr2 because it does not support sharding?

normanrz · 2023-10-16T16:05:53Z

We don't have a rechunking job yet. So, maybe just ingest the zarr as is and write a datasource-properties.json?

fm3 · 2023-10-16T16:19:27Z

Fair enough. @frcroth I guess a good spot for this would be postProcessUploadedDataSource in UploadService.scala.

It would be nice if you could reuse some of the Explorer code to create the json from the files. I’m not sure how to do that in the datastore. Maybe you can figure that out. The Explorer classes will probably need to be moved. Also have a look at #7389 for recent changes of the FileSystemDataVault.

The frontend should also be adapted to not set needsConversion=true in this case. You can find a heuristic in the frontend (I think it checks for wkw files being present?).

Please let us know if you need further information!

philippotto · 2023-10-17T08:26:51Z

You can find a heuristic in the frontend (I think it checks for wkw files being present?).

Yes, if the uploaded files contain a WKW file (or a ZIP which contains a WKW file), it is assumed that no conversion is needed. This is implemented in this method:

webknossos/frontend/javascripts/admin/dataset/dataset_upload_view.tsx

Line 475 in 9cecaab

validateFiles = async (files: FileWithPath[]) => {

philippotto added the zarr label May 12, 2022

philippotto assigned normanrz Jun 9, 2022

normanrz changed the title ~~Allow importing ZARR files~~ Allow uploading ZARR files Jun 10, 2022

philippotto assigned frcroth and unassigned normanrz Sep 18, 2023

fm3 added the discussion label Sep 18, 2023

frcroth mentioned this issue Oct 17, 2023

Allow uploading zarr datasets #7397

Merged

5 tasks

frcroth closed this as completed in #7397 Nov 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow uploading ZARR files #6205

Allow uploading ZARR files #6205

philippotto commented May 12, 2022 •

edited by normanrz

Loading

jstriebel commented Jun 10, 2022

fm3 commented Oct 16, 2023

normanrz commented Oct 16, 2023

fm3 commented Oct 16, 2023

philippotto commented Oct 17, 2023

Allow uploading ZARR files #6205

Allow uploading ZARR files #6205

Comments

philippotto commented May 12, 2022 • edited by normanrz Loading

jstriebel commented Jun 10, 2022

fm3 commented Oct 16, 2023

normanrz commented Oct 16, 2023

fm3 commented Oct 16, 2023

philippotto commented Oct 17, 2023

philippotto commented May 12, 2022 •

edited by normanrz

Loading