
Enhance validation in Learners and Task #597

Closed
RaphaelS1 opened this issue Jan 21, 2021 · 1 comment

Comments

RaphaelS1 (Contributor) commented Jan 21, 2021

Now that I've looked at the documentation in more detail, I'd like to suggest two minor enhancements for validation in Task and Learner:

  1. There are many cases where a validation dataset may be required by a learner in order to make use of internal processes such as early stopping (e.g. GBMs, NNs). As validation is a common use-case, it may be worthwhile adding a simple public method to Task that creates arbitrary train/validation splits that can be passed to learner hyper-parameters, e.g.
split_validation = function(prob) {
  # assign a random prob-fraction of the task's rows the "validation" role
  self$row_roles$validation = sample(seq(self$nrow), self$nrow * prob)
}

Whilst this is clearly a thin wrapper around a basic function that a user could write themselves, it still requires knowledge of row_roles and of where to find the validation splits (which aren't well documented). A standalone sketch of such a helper is given after this list.

Alternatively, if you don't think this is worthwhile, I'd suggest adding an example to mlr-org/mlr3book#201, e.g.

> set.seed(1)
> t = tsk("mtcars")
> t$row_roles$validation = sample(seq(t$nrow), t$nrow * 0.3)
> t$row_roles$validation
[1] 25  4  7  1  2 23 11 14 18
  2. Add 'validation' to learner and task properties (similar to 'weights'). This would allow validation datasets to be implemented more efficiently in learners that can handle them. Currently we assume the user will pass in the validation data in the correct format, which is inconsistent with the general task interface: the user passes a Task for the training data but a raw data object for the validation data. It would also make clear to users when they can and cannot set the validation role. A short sketch of the properties analogy follows below.
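As a minimal standalone sketch of point 1 (the helper name split_validation, the floor() call, and taking the task as an argument rather than defining a Task method are illustrative choices, not existing mlr3 API):

library(mlr3)

split_validation = function(task, prob) {
  # give a random prob-fraction of the task's rows the "validation" role
  task$row_roles$validation = sample(seq(task$nrow), floor(task$nrow * prob))
  invisible(task)
}

set.seed(1)
task = split_validation(tsk("mtcars"), 0.3)
task$row_roles$validation  # should reproduce the nine rows shown in the transcript above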
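And to illustrate the 'weights' analogy in point 2: learners already advertise optional capabilities via $properties, so code can check for support before relying on it; under this proposal a learner that can consume a validation set would advertise a (currently hypothetical) 'validation' property in the same way:

library(mlr3)

learner = lrn("classif.rpart")
# rpart can use observation weights, so "weights" appears in its properties
"weights" %in% learner$properties  # TRUE

# hypothetical check under this proposal ("validation" is not a real property yet)
# "validation" %in% learner$properties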

EDIT: Fixed examples.
