feat: rework TaggedTable
#680
Remove all overridden `Table` methods and no longer inherit from `Table`
It was also used for metrics, so the previous name did not cover all use cases.
🦙 MegaLinter status: ✅ SUCCESS
Codecov Report

All modified and coverable lines are covered by tests ✅

Coverage diff:

| | main | #680 | +/- |
|---|---|---|---|
| Coverage | 100.00% | 100.00% | |
| Files | 66 | 67 | +1 |
| Lines | 4869 | 4824 | -45 |
| Hits | 4869 | 4824 | -45 |
## [0.22.0](v0.21.0...v0.22.0) (2024-05-01)

### Features

* `is_fitted` is now always a property ([#662](#662)) ([b1db881](b1db881)), closes [#586](#586)
* add `Column.missing_value_count` ([#682](#682)) ([f084916](f084916)), closes [#642](#642)
* Add `InputConversion` & `OutputConversion` for nn interface ([#625](#625)) ([fd723f7](fd723f7)), closes [#621](#621)
* Add hash,eq and sizeof in ForwardLayer ([#634](#634)) ([72f7fde](72f7fde)), closes [#633](#633)
* allow using tables that already contain target for prediction ([#687](#687)) ([e9f1cfb](e9f1cfb)), closes [#636](#636)
* callback `Row.sort_columns` takes four parameters instead of two tuples ([#683](#683)) ([9c3e3de](9c3e3de)), closes [#584](#584)
* rename `group_rows_by` in `Table` to `group_rows` ([#661](#661)) ([c1644b7](c1644b7)), closes [#611](#611)
* rename `number_of_column` in `Row` to `number_of_columns` ([#660](#660)) ([0a08296](0a08296)), closes [#646](#646)
* rework `TaggedTable` ([#680](#680)) ([db2b613](db2b613)), closes [#647](#647)
* show missing value count/ratio in summarized statistics ([#684](#684)) ([74b8a35](74b8a35)), closes [#619](#619)
* specify `extras` instead of `features` in `to_tabular_dataset` ([#685](#685)) ([841657f](841657f)), closes [#623](#623)

### Bug Fixes

* actually use `kernel` of support vector machines for training ([#681](#681)) ([09c5082](09c5082)), closes [#602](#602)

### Performance Improvements

* Faster plot_histograms and more reliable plots ([#659](#659)) ([b5f0a12](b5f0a12))
🎉 This PR is included in version 0.22.0 🎉
Closes #647
Summary of Changes
* `TaggedTable` is now called `TabularDataset` and has moved from `safeds.data.tabular.containers` to `safeds.data.labeled.containers`. That's where all dataset classes for supervised learning will go, like the upcoming `ImageDataset`.
* `TabularDataset` no longer inherits from `Table`.
* `TabularDataset` now has a very small interface. It's only meant to be used as input for supervised ML models. Table manipulation is now done solely with the `Table` class.
* `tag_columns` on `Table` is now called `to_tabular_dataset`. This makes it consistent with other conversion methods and emphasizes that it is a terminal operation that should only be used once one is done manipulating the table.
* `TabularDataset` now has a public `to_table` method to get a `Table` again.
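
To illustrate the design described in this summary, here is a minimal, self-contained sketch of a dataset class that wraps a table instead of inheriting from it. It is not the library's implementation: `MiniTable` and `MiniTabularDataset` are hypothetical toy classes, and the `target_name` parameter, the `features` and `target` properties, and the helper methods on `MiniTable` are assumptions made for this example. Only the method names `to_tabular_dataset` and `to_table` are taken from the PR.

```python
from __future__ import annotations


class MiniTable:
    """Toy stand-in for a table: maps column names to lists of values."""

    def __init__(self, columns: dict[str, list]) -> None:
        # Copy the input so later changes to the caller's lists do not leak in.
        self._columns = {name: list(values) for name, values in columns.items()}

    @property
    def column_names(self) -> list[str]:
        return list(self._columns)

    def get_column(self, name: str) -> list:
        return list(self._columns[name])

    def remove_columns(self, names: list[str]) -> MiniTable:
        # Table manipulation lives here and always returns another table.
        kept = {n: v for n, v in self._columns.items() if n not in names}
        return MiniTable(kept)

    def to_tabular_dataset(self, target_name: str) -> MiniTabularDataset:
        # Conversion step: call this once the table needs no further changes.
        # `target_name` is an assumed parameter for this sketch.
        if target_name not in self._columns:
            raise KeyError(f"Unknown target column: {target_name!r}")
        return MiniTabularDataset(self, target_name)


class MiniTabularDataset:
    """Wraps a table (composition) and exposes only a small interface."""

    def __init__(self, table: MiniTable, target_name: str) -> None:
        self._table = table
        self._target_name = target_name

    @property
    def features(self) -> MiniTable:
        return self._table.remove_columns([self._target_name])

    @property
    def target(self) -> list:
        return self._table.get_column(self._target_name)

    def to_table(self) -> MiniTable:
        # Way back to a table for further manipulation.
        return self._table


table = MiniTable({"age": [22, 35], "fare": [7.25, 53.1], "survived": [0, 1]})
dataset = table.to_tabular_dataset("survived")
round_trip = dataset.to_table()
```

Because the dataset class does not inherit from the table class in this sketch, there are no table methods to override, and any further manipulation has to go through `to_table()` first.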