Update for zarr 2.18.0 and zarr 2.18.1 #195

mavaylon1 · 2024-05-16T21:45:44Z

Motivation

What was the reasoning behind this change? Please explain the changes briefly.
This PR updates hdmf-zarr for the changes that came in zarr 2.18.0 and the new changes in zarr 2.18.1.

2.18.0
This is in response to the failing workflows using the latest version of zarr, specifically the failing test test_fsspec_streaming.

The issue is how we read zarr scalar arrays. We currently try to index into the array directly to access the scalar. According to the documentation, zarr accesses a scalar array as z[:], which seems to solve the problem. I also saw that z[()] is another syntax that works.

Why?
Not completely clear.
When we access a scalar array in numpy, we use the notation [()]. Supposedly, Zarr used to support indexing scalar arrays as z[0], but that was changed to follow the numpy standard in one of the earlier zarr 2.# versions. Maybe this is functionality that has only now been fully fleshed out.
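For reference, here is a minimal sketch of the numpy convention mentioned above. It uses numpy only, not zarr, so it illustrates the indexing rule the PR description says Zarr adopted rather than Zarr's own behavior. The variable names are illustrative.

```python
import numpy as np

# A zero-dimensional ("scalar") array has an empty shape.
scalar = np.array(42)

# Indexing with an empty tuple returns the scalar value itself.
value = scalar[()]

# Positional indexing is not valid on a zero-dimensional array:
# numpy raises IndexError because there is no axis to index.
try:
    scalar[0]
    positional_ok = True
except IndexError:
    positional_ok = False
```

Reading with `[()]` therefore works for scalar arrays without assuming that a first axis exists.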

2.18.1
To ensure that the data being assigned is in a format that the Zarr array can handle efficiently, Zarr arrays seem to require that we set with numpy arrays, i.e., dset[:] = np.array(data).

Another odd behavior shows up when setting a dataset of references: the current method puts the entire dataset in each index instead of matching each individual reference dictionary to its index (which is what we expect).

The solution to both is to make sure the data is in an array first.
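A minimal sketch of the "convert to an array first" step, using numpy only. The helper name `to_assignable_array` and the reference dictionaries are hypothetical and are not taken from the hdmf-zarr code. The point is that a list of reference dictionaries becomes a one-dimensional object array with one dictionary per index, which can then be assigned with something like `dset[:] = arr`.

```python
import numpy as np


def to_assignable_array(data):
    """Return data as a numpy array (hypothetical helper for illustration)."""
    if isinstance(data, np.ndarray):
        # Already an array: hand it back unchanged.
        return data
    return np.asarray(data)


# Illustrative reference dictionaries; the keys are assumptions.
refs = [
    {"source": ".", "path": "/a"},
    {"source": ".", "path": "/b"},
]

# Dictionaries are not sequences, so numpy stores each one as a
# single object element: the result has shape (2,) and dtype object.
ref_array = to_assignable_array(refs)
```

Each element of `ref_array` is the corresponding dictionary from `refs`, so assigning the array keeps references aligned with their indices.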

How to test the behavior?

Show how to reproduce the new behavior (can be a bug fix or a new feature)

Checklist

Did you update CHANGELOG.md with your changes?
Have you checked our Contributing document?
Have you ensured the PR clearly describes the problem and the solution?
Is your contribution compliant with our coding style? This can be checked running ruff from the source directory.
Have you checked to ensure that there aren't other open Pull Requests for the same change?
Have you included the relevant issue number using "Fix #XXX" notation where XXX is the issue number? By including "Fix #XXX" you allow GitHub to close issue #XXX when the PR is merged.

codecov-commenter · 2024-05-16T21:49:02Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 86.05%. Comparing base (72ff80f) to head (e9cfb84).

Additional details and impacted files

@@           Coverage Diff           @@
##              dev     #195   +/-   ##
=======================================
  Coverage   86.05%   86.05%           
=======================================
  Files           5        5           
  Lines        1162     1162           
  Branches      287      287           
=======================================
  Hits         1000     1000           
  Misses        107      107           
  Partials       55       55


mavaylon1 · 2024-05-16T23:09:31Z

Hey @oruebel, can you give me your thoughts? I believe this is related to #192.

oruebel · 2024-05-16T23:14:41Z

Using the [()] syntax to access scalars makes sense to me

mavaylon1 · 2024-05-16T23:17:40Z

@oruebel There are also some deprecation warnings for zarr 3.0. I will add filters for them in this PR, but I'll also open a ticket so those deprecations get resolved in line with whatever zarr pivots to.
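As a rough illustration of what such a filter can look like, here is a sketch using Python's standard `warnings` module. The helper name `call_quietly`, the message pattern, and the sample warning text are all assumptions for illustration; the filters actually added in this PR may live elsewhere (for example in the test configuration) and match different messages.

```python
import warnings


def call_quietly(func, *args, **kwargs):
    """Call func while ignoring DeprecationWarnings that mention zarr.

    Hypothetical helper; the message pattern below is an assumption.
    """
    with warnings.catch_warnings():
        warnings.filterwarnings(
            "ignore",
            message=r".*zarr.*",
            category=DeprecationWarning,
        )
        return func(*args, **kwargs)


def legacy_call():
    # Stand-in for a call that emits a zarr deprecation warning.
    warnings.warn("hypothetical: this changes in zarr 3.0", DeprecationWarning)
    return 7


result = call_quietly(legacy_call)
```

The filter is scoped to the `with` block, so warnings outside the wrapped call are still reported as usual.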

mavaylon1 · 2024-05-22T19:46:10Z

Review Notes:
I need to ping #192 to see whether it is now resolved by this update, or to see what changes.

rly

LGTM. Thanks for taking care of this.

Update pyproject.toml

daa4037

mavaylon1 added 2 commits May 16, 2024 15:41

Update pyproject.toml

78c31ef

Scalar Arrays read

68aa541

mavaylon1 changed the title ~~Update pyproject.toml~~ Fix reading scalar arrays May 16, 2024

Update CHANGELOG.md

ed6805b

mavaylon1 mentioned this pull request May 16, 2024

[Bug]: Zarr 2.18.0 with Blosc #192

Closed

3 tasks

Update test_gallery.py

3804b76

mavaylon1 mentioned this pull request May 18, 2024

Don't open with consolidated metadata in mode r+ #193

Merged

6 tasks

mavaylon1 added 4 commits May 17, 2024 17:35

Update test_gallery.py

59f8285

Update CHANGELOG.md

313d7f0

array setting

80c3fef

array setting

e793785

mavaylon1 changed the title ~~Fix reading scalar arrays~~ Update for zarr 2.18.0 and zarr 2.18.1 May 22, 2024

mavaylon1 added 2 commits May 22, 2024 12:31

possible fixes

26afd08

Update CHANGELOG.md

e9cfb84

mavaylon1 requested a review from rly May 22, 2024 19:42

mavaylon1 marked this pull request as ready for review May 22, 2024 19:44

rly approved these changes May 22, 2024


mavaylon1 merged commit 07a5bb2 into dev May 22, 2024
24 checks passed

mavaylon1 deleted the schedule branch May 22, 2024 20:39
