Cache processed activities #42

hugovk · 2024-01-14T12:02:39Z

This speeds up the "Processing data..." step by caching the generated Pandas dataframe as a pickle file on disk.

For example, with an 8-core Mac, processing all my 3,699 GPX files takes 34s on first pass and creates a 305 MB cache file on disk (the GPX files are 822 MB). For the second run, it takes less than 2s to load the cache file.

For 580 GPX files from 2023, it takes 4s on first pass to create a 50 MB file.

Also add some type hints.

marcusvolz · 2024-02-11T01:25:47Z

Fantastic!

hugovk added the enhancement New feature or request label Jan 14, 2024

hugovk added 2 commits February 9, 2024 18:54

Cache processed activities

2cc2364

Allow modern type hints for older Python

0e705cd

hugovk force-pushed the cache-processed-activities branch from e99c44c to 0e705cd Compare February 9, 2024 16:54

Remove unused import

fe894f9

hugovk merged commit 157d0de into marcusvolz:main Feb 11, 2024
19 checks passed

hugovk deleted the cache-processed-activities branch February 11, 2024 11:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache processed activities #42

Cache processed activities #42

hugovk commented Jan 14, 2024 •

edited

Loading

marcusvolz commented Feb 11, 2024

Cache processed activities #42

Cache processed activities #42

Conversation

hugovk commented Jan 14, 2024 • edited Loading

marcusvolz commented Feb 11, 2024

hugovk commented Jan 14, 2024 •

edited

Loading