-
Notifications
You must be signed in to change notification settings - Fork 308
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dat file compression #57
Comments
One relevant question: is the order of on the fly decompression consistent? Would be interesting if for a |
I ran some basic benchmarks on the dat files of the first 50 patients in the mimic3 matched waveform database, on my computer. This is in memory compression/decompression, so these times are slower than compressing to disc.
All libraries have python bindings, with bzip2 and gzip being built into the cpython core distribution. |
Benchmark results of 5980 files, with total size 21.01 G. Total time is shown in HH:MM:SS, indicating the sum of times by all cores, effectively a single core benchmark.
|
Leaning towards flac. There may be some challenges: https://xiph.org/flac/format.html
|
I found that first converting the float array to int16 using the |
FLAC formats have been implemented. |
This may be a feature for way down the line.
Incorporate some form of file compression for dat files. One level is to use some generic compression to save space, which will increase load times. The next level would be to come up with wfdb's own compression system to load files even faster.
The text was updated successfully, but these errors were encountered: