Large amount of queries when getting a dataset with API #9683
Labels
Feature: API
Feature: Performance & Stability
Type: Bug
a defect
User Role: API User
Makes use of APIs
Milestone
What steps does it take to reproduce the issue?
Retrieve a large dataset (containing thousands of files) through the API
When does this issue occur?
It is the same for all datasets, but is more problematic for larger datasets.
Which page(s) does it occur on?
API calls
What happens?
When we check the query log, we see multiple queries for each file of the dataset.
To whom does it occur (all users, curators, superusers)?
All users.
What did you expect to happen?
One larger query that retrieves all the necessary data at once.
Which version of Dataverse are you using?
Develop.
Any related open or closed issues to this bug report?
A better solution would be to have an API that retrieves only the dataset and a separate API calls for retrieving the file metadata in a paginated way, as proposed by @Kris-LIBIS. However, many existing applications already use the retrieve dataset API call, including all file metadata in one call, therefore, making it more efficient should be beneficial too.
The text was updated successfully, but these errors were encountered: