0.9.1
Internal improvements 🧹
Source data is now stored on disk in CSV format rather than in memory as Pandas DataFrames, resulting in a reduced overall memory footprint.
Synthetic composite keys now more accurately reflect characteristics of the source data.
JSON handling has been refactored and an edge case where too-long and/or deeply-nested JSON table names caused failures in Gretel Cloud has been fixed.
All PRs
- Core refactors by @mikeknep in #126
- Source on disk by @mikeknep in #130
- fix: unit test errors on new mac m2 by @benmccown in #134
- Chunked independent synthetics preprocessing by @mikeknep in #133
- fix: nested json table string length by @benmccown in #135
- Expose ExtractorConfig from root relational module by @mikeknep in #137
- Fix RTD by @matthewgrossman in #138
- New impl for synthesizing composite keys by @mikeknep in #136
- Re-add source_ prefix to archived source files by @mikeknep in #140
New Contributors
- @benmccown made their first contribution in #134
- @matthewgrossman made their first contribution in #138
Full Changelog: v0.9.0...v0.9.1