Make it easy to get the correct file path of a dataset #3611

merelcht · 2024-02-09T15:44:01Z

Description

To get the correct (versioned) file path for a dataset is quite hard. There doesn't seem to be one way that works for both non-versioned and versioned datasets and/or local and remote datasets.

The main question here is why users need to access the file path. Not all datasets have a file path, e.g. APIDataset and so it's important to understand the true user need, before diving into solution.

Context

dataset._filepath() for non-versioned, local dataset
dataset_get_load_path() for versioned, local datasets
Remote datasets: get_filepath_str(self._get_load_path(), self._protocol)

This might not even be the full list of ways to get the file path.

Important

This idea is based on observations from several Kedro engineers see e.g. #1778. However, we need a clear view on what user needs are when it comes to why they need the file path and what their use cases are. Any implementation should be preceded by user research: #1978

The text was updated successfully, but these errors were encountered:

merelcht added this to Kedro Framework Feb 9, 2024

merelcht converted this from a draft issue Feb 9, 2024

merelcht added this to the Redesign the API for IO (catalog) milestone Feb 9, 2024

github-actions bot mentioned this issue Mar 1, 2024

Monthly issue metrics report #3671

Open

kedro-org locked and limited conversation to collaborators Mar 28, 2024

merelcht converted this issue into discussion #3753 Mar 28, 2024

github-project-automation bot moved this to Done in Kedro Framework Mar 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

This issue was moved to a discussion.

Make it easy to get the correct file path of a dataset #3611

Make it easy to get the correct file path of a dataset #3611

merelcht commented Feb 9, 2024 •

edited

Loading

This issue was moved to a discussion.

This issue was moved to a discussion.

Make it easy to get the correct file path of a dataset #3611

Make it easy to get the correct file path of a dataset #3611

Comments

merelcht commented Feb 9, 2024 • edited Loading

Description

Context

This issue was moved to a discussion.

merelcht commented Feb 9, 2024 •

edited

Loading