New data store preload API #1097

forman · 2024-12-18T13:28:24Z

Added a new preload API to xcube data stores:

Enhanced the xcube.core.store.DataStore class to optionally support
preloading of datasets via an API represented by the
new xcube.core.store.DataPreloader interface.
Added handy default implementations NullPreloadHandle and ExecutorPreloadHandle
to be returned by implementations of the prepare_data() method of a
given data store.

Checklist:

Add unit tests and/or doctests in docstrings
Add docstrings and API docs for any new/modified user-facing classes and functions
~~New/modified features documented in docs/source/*~~
Changes documented in CHANGES.md
GitHub CI passes
AppVeyor CI passes
Test coverage remains or increases (target 100%)

konstntokas

I like very much, that we already have the multi-threading in there.I think this makes it fairly easy to do the implementation in the data store plugins.

I tested the notebook locally which works fine!

xcube/core/store/accessor.py

xcube/core/store/preload.py

konstntokas · 2024-12-18T14:45:49Z

xcube/core/store/preload.py

+        for data_id, future in self._futures.items():
+            if f is future:
+                break
+        if data_id is None:


Why is this needed?

I search for the data_id that belongs to the passed future f.

I'll add a comment.

Co-authored-by: Konstantin Ntokas <[email protected]>

konstntokas

I just saw that the failed test are not related to preload API. So I approve :)

b-yogesh

It looks good overall, thanks. I have added some comments below, please have a look.

xcube/core/store/preload.py

b-yogesh · 2024-12-18T19:50:58Z

examples/notebooks/datastores/preload.ipynb

@@ -0,0 +1,368 @@
+{


We can also add the context manager example in the notebook.

konstntokas · 2024-12-19T07:07:58Z

xcube/core/store/preload.py

+    def __init__(
+        self,
+        data_ids: tuple[str, ...],
+        preload_data: Callable[[PreloadHandle, str], None] | None = None,


Maybe it is useful to allow preload_params in the preload_data function.

E.g. I had the case, where I had multiple tiff files within a zip file which could be merged if the user requests it by a keyword boolean.

I don't understand the use case. Why don't you just add a dedicated opener parameter then?
You could also use functools.partial().

xcube/core/store/preload.py

b-yogesh

Approved! Added a tiny change suggestion. Please have a look before merging.
Btw, the tests are failing.

xcube/core/store/preload.py

Co-authored-by: b-yogesh <[email protected]>

forman and others added 3 commits December 16, 2024 19:04

1st draft of the preload API

810b380

black formatting

7fb5a00

simpler draft version of the preload API

a9d969f

forman requested review from b-yogesh and konstntokas December 18, 2024 13:28

update

9c42d8e

konstntokas reviewed Dec 18, 2024

View reviewed changes

forman and others added 3 commits December 18, 2024 16:34

Update xcube/core/store/accessor.py

9f5b30c

Co-authored-by: Konstantin Ntokas <[email protected]>

Update xcube/core/store/accessor.py

0572dbb

Co-authored-by: Konstantin Ntokas <[email protected]>

Some more doc

ea8c6e5

forman requested a review from konstntokas December 18, 2024 16:22

konstntokas approved these changes Dec 18, 2024

View reviewed changes

b-yogesh requested changes Dec 18, 2024

View reviewed changes

konstntokas reviewed Dec 19, 2024

View reviewed changes

b-yogesh reviewed Dec 19, 2024

View reviewed changes

xcube/core/store/preload.py Show resolved Hide resolved

forman requested a review from b-yogesh December 27, 2024 07:04

Some more doc and added option to suppress state update outputs.

3954cf7

b-yogesh approved these changes Dec 27, 2024

View reviewed changes

xcube/core/store/preload.py Outdated Show resolved Hide resolved

Update xcube/core/store/preload.py

647e107

Co-authored-by: b-yogesh <[email protected]>

forman marked this pull request as ready for review December 27, 2024 14:40

forman merged commit 3473d04 into main Dec 27, 2024
0 of 2 checks passed

forman deleted the forman-1093-preload_api branch December 27, 2024 14:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New data store preload API #1097

New data store preload API #1097

forman commented Dec 18, 2024 •

edited

Loading

konstntokas left a comment •

edited

Loading

konstntokas Dec 18, 2024

forman Dec 18, 2024

forman Dec 18, 2024

konstntokas left a comment

b-yogesh left a comment

b-yogesh Dec 18, 2024

konstntokas Dec 19, 2024

forman Dec 27, 2024

b-yogesh left a comment •

edited

Loading

New data store preload API #1097

New data store preload API #1097

Conversation

forman commented Dec 18, 2024 • edited Loading

konstntokas left a comment • edited Loading

Choose a reason for hiding this comment

konstntokas Dec 18, 2024

Choose a reason for hiding this comment

forman Dec 18, 2024

Choose a reason for hiding this comment

forman Dec 18, 2024

Choose a reason for hiding this comment

konstntokas left a comment

Choose a reason for hiding this comment

b-yogesh left a comment

Choose a reason for hiding this comment

b-yogesh Dec 18, 2024

Choose a reason for hiding this comment

konstntokas Dec 19, 2024

Choose a reason for hiding this comment

forman Dec 27, 2024

Choose a reason for hiding this comment

b-yogesh left a comment • edited Loading

Choose a reason for hiding this comment

forman commented Dec 18, 2024 •

edited

Loading

konstntokas left a comment •

edited

Loading

b-yogesh left a comment •

edited

Loading