Skip to content

0.10.0

Compare
Choose a tag to compare
@mikeknep mikeknep released this 02 Aug 18:57
· 60 commits to main since this release
11ab6c2

Benchmark v2 🎉

Benchmark has received several internal improvements. While general usage mostly stays the same, there are a few user-facing changes from previous versions:

Breaking changes

  • Datatype.TABULAR_NUMERIC and Datatype.TABULAR_MIXED have been replaced by a single enum variant, Datatype.TABULAR.
  • If you're passing a list of multiple sources to make_dataset an exception will be raised.

Deprecations

  • make_dataset is being replaced by create_dataset
  • The freestanding functions for Gretel datasets (get_gretel_dataset, list_gretel_datasets, list_gretel_dataset_tags) are being replaced by methods on a new object:
    repo = GretelDatasetRepo()
    repo.get_dataset(...)
    repo.list_datasets(...)
    repo.list_gretel_dataset_tags(...)

Trainer column partitioning removed 👋

Trainer no longer partitions datasets by column. The max_header_clusters argument to the Gretel model classes in gretel_trainer.models is deprecated, and will be removed in a future release.

Smaller notes 🧹

  • A bug when downloading record handler data in Relational Trainer in hybrid deployments has been fixed
  • Relational Trainer uses Pandas features that were added in 1.5, so the dependency version has been corrected.
  • Trainer no longer depends on gretel-synthetics

All PRs

Full Changelog: v0.9.1...v0.10.0