Release 0.9.1 · gretelai/trainer

Internal improvements 🧹

Source data is now stored on disk in CSV format rather than in memory as Pandas DataFrames, resulting in a reduced overall memory footprint.

Synthetic composite keys now more accurately reflect characteristics of the source data.

JSON handling has been refactored, and an edge case has been fixed in which overly long or deeply nested JSON table names caused failures in Gretel Cloud.
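
As a rough illustration of the first item above (source data kept on disk as CSV rather than in memory), the sketch below writes a small table to a CSV file and reads it back in bounded chunks. It is not Trainer's actual implementation: it uses only the Python standard library, and the helper names `write_source_csv` and `iter_chunks` and the file name `source_users.csv` are hypothetical.

```python
import csv
import tempfile
from itertools import islice
from pathlib import Path
from typing import Iterator


def write_source_csv(path: Path, header: list[str], rows: list[list[str]]) -> None:
    """Persist a table to disk so it does not need to stay resident in memory."""
    with path.open("w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(header)
        writer.writerows(rows)


def iter_chunks(path: Path, chunk_size: int) -> Iterator[list[dict[str, str]]]:
    """Yield the table back in chunks of at most chunk_size rows."""
    with path.open(newline="") as f:
        reader = csv.DictReader(f)
        while True:
            chunk = list(islice(reader, chunk_size))
            if not chunk:
                break
            yield chunk


with tempfile.TemporaryDirectory() as tmp:
    # Hypothetical file name, for illustration only.
    source_path = Path(tmp) / "source_users.csv"
    write_source_csv(
        source_path,
        ["id", "name"],
        [[str(i), f"user{i}"] for i in range(5)],
    )
    # Only one chunk of rows is held in memory at a time.
    chunk_sizes = [len(chunk) for chunk in iter_chunks(source_path, chunk_size=2)]
```

Reading in chunks keeps peak memory proportional to the chunk size rather than to the full table, which is the general trade-off behind moving source data to disk.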

All PRs

Core refactors by @mikeknep in #126
Source on disk by @mikeknep in #130
fix: unit test errors on new mac m2 by @benmccown in #134
Chunked independent synthetics preprocessing by @mikeknep in #133
fix: nested json table string length by @benmccown in #135
Expose ExtractorConfig from root relational module by @mikeknep in #137
Fix RTD by @matthewgrossman in #138
New impl for synthesizing composite keys by @mikeknep in #136
Re-add source_ prefix to archived source files by @mikeknep in #140

New Contributors

@benmccown made their first contribution in #134
@matthewgrossman made their first contribution in #138

Full Changelog: v0.9.0...v0.9.1
