Write out chunks #203

Open
umertens opened this issue Oct 29, 2024 · 1 comment

umertens commented Oct 29, 2024

Hi,

This is not an issue but rather a basic question; I hope you can still help me.

I would like to store my data locally in chunks because I do not know the final size beforehand. Eventually, I want to store n training examples, each a 2D tensor of shape (m, l). Since the first dimension n is not known up front, I would like to write chunks of, say, 512 training examples at a time and resize the first dimension accordingly.

Once the data is stored, I want to load a random batch for further use in Torch.

Thanks in advance for your help! :)

@BrianMichell

It sounds like you're looking for the resize method. You'd initialize the store to some arbitrarily large dimensions and then resize it with resize_tied_bounds once you know the final extent of your store.

Here's a snippet of our C++ code that handles just that. We use the implicit dims as the lower bound because we expect everything to have an origin at zero.
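
For reference, here is a minimal sketch of what that pattern can look like with TensorStore's C++ API. This is not the snippet referenced above; the zarr driver, store path, dtype, and sizes are placeholder choices for illustration.

```cpp
#include <iostream>

#include "tensorstore/array.h"
#include "tensorstore/index.h"
#include "tensorstore/index_space/dim_expression.h"
#include "tensorstore/open.h"
#include "tensorstore/resize_options.h"
#include "tensorstore/tensorstore.h"

int main() {
  using tensorstore::Index;
  using tensorstore::kImplicit;

  // Hypothetical sizes: each example is an (m, l) tensor, and the first
  // dimension is reserved arbitrarily large because n is unknown up front.
  constexpr Index kMaxExamples = 1'000'000;
  constexpr Index m = 64;
  constexpr Index l = 128;

  // Create a local zarr store whose chunking matches the 512-example batches.
  auto store =
      tensorstore::Open(
          {{"driver", "zarr"},
           {"kvstore", {{"driver", "file"}, {"path", "training_data.zarr"}}},
           {"metadata",
            {{"dtype", "<f4"},
             {"shape", {kMaxExamples, m, l}},
             {"chunks", {512, m, l}}}}},
          tensorstore::OpenMode::create, tensorstore::ReadWriteMode::read_write)
          .result()
          .value();

  // Write one 512-example chunk into [0, 512) along the first dimension.
  auto batch = tensorstore::AllocateArray<float>({512, m, l});
  // ... fill `batch` with training examples ...
  auto region = (store | tensorstore::Dims(0).HalfOpenInterval(0, 512)).value();
  tensorstore::Write(batch, region).commit_future.Wait();

  // Once the final number of examples is known, shrink the first dimension.
  // The lower bounds are left implicit (kImplicit), matching a zero origin,
  // and resize_tied_bounds is the mode mentioned above.
  const Index n_final = 512;  // pretend this many examples were written
  const Index new_inclusive_min[] = {kImplicit, kImplicit, kImplicit};
  const Index new_exclusive_max[] = {n_final, kImplicit, kImplicit};
  auto resized =
      tensorstore::Resize(store, new_inclusive_min, new_exclusive_max,
                          tensorstore::resize_tied_bounds)
          .result()
          .value();

  std::cout << "final domain: " << resized.domain() << std::endl;
}
```

Leaving every bound other than the first dimension's upper bound as kImplicit means only that extent changes, which matches the zero-origin assumption described above.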
