[python-package] support customizing Dataset creation in Booster.refit() (fixes #3038) #4894

TremaMiguel · 2021-12-19T02:18:39Z

Context:
This PR aims to close #3038

Changes:

The refit method accepts a kwargs_for_dataset parameter, which passes arguments such as weight to the Dataset created inside refit().
The refit method accepts a kwargs_for_predict parameter, which passes the original params to the predict method.

Tests:
Based on @jtilly's example in issue #3038, test that the refit method returns a valid prediction when kwargs_for_dataset or kwargs_for_predict is passed.
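The keyword-forwarding idea described above can be sketched with plain Python stand-ins. This is a minimal illustration under stated assumptions, not LightGBM's implementation: `ToyDataset` and `ToyBooster` are hypothetical classes, the "refit" is just a weighted mean of the labels, and the `scale` argument of `predict` is invented for the example. Only the parameter names `kwargs_for_dataset` and `kwargs_for_predict` come from the description.

```python
class ToyDataset:
    """Hypothetical stand-in for a dataset object; NOT lightgbm.Dataset."""

    def __init__(self, data, label, weight=None):
        if weight is not None and len(weight) != len(data):
            raise ValueError("weight must have one entry per row")
        self.data = data
        self.label = label
        # Default to equal weights when none are supplied.
        self.weight = list(weight) if weight is not None else [1.0] * len(data)


class ToyBooster:
    """Hypothetical stand-in for a model; NOT lightgbm.Booster."""

    def __init__(self, value=0.0):
        self.value = value

    def predict(self, data, scale=1.0):
        # 'scale' is an invented option, used only to show forwarding.
        return [self.value * scale for _ in data]

    def refit(self, data, label, kwargs_for_dataset=None, kwargs_for_predict=None):
        # Copy the dicts so the caller's objects are never mutated.
        kwargs_for_dataset = dict(kwargs_for_dataset or {})
        kwargs_for_predict = dict(kwargs_for_predict or {})

        # Forward prediction options to predict().
        self.predict(data, **kwargs_for_predict)

        # Forward dataset options (for example, weight) to the dataset constructor.
        train = ToyDataset(data, label, **kwargs_for_dataset)

        # Toy "refit": the new model value is the weighted mean of the labels.
        total = sum(train.weight)
        new_value = sum(w * y for w, y in zip(train.weight, train.label)) / total
        return ToyBooster(new_value)


booster = ToyBooster(1.0)
X = [[0.0], [1.0], [2.0]]
y = [0.0, 0.0, 3.0]

unweighted = booster.refit(X, y)
weighted = booster.refit(X, y, kwargs_for_dataset={"weight": [1.0, 1.0, 2.0]})
```

With equal weights the toy model settles on the plain mean of the labels; doubling the weight of the last row pulls the refitted value toward that row's label, which is the kind of effect the tests are meant to observe.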

ghost · 2021-12-19T02:18:51Z

All CLA requirements met.

jameslamb

Thanks for picking this up! I left some initial comments based on the format of what you're proposing, but I'm not the best person to comment on whether or not #3038 should be adopted at all.

For example, I don't know whether the comment at #3038 (comment) about init_score, weight, and group is still true.

Hopefully @guolinke @shiyu1994 or @StrikerRUS can comment on whether we should proceed with this feature.

python-package/lightgbm/basic.py

tests/python_package_test/test_engine.py

shiyu1994 · 2021-12-23T06:59:36Z

> For example, I don't know whether the comment at #3038 (comment) about init_score, weight, and group is still true.

I think it is useful to be able to change the weights of data points when refitting. If we treat weight, init_score, and group as properties of a dataset, then it makes sense to allow new values for these properties when refitting on a new dataset.

For example, a new ranking dataset can have a totally different group structure.
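To make the ranking case concrete: in LightGBM, group lists the number of rows in each query, so the sizes have to add up to the number of rows in the dataset. The sketch below uses a hypothetical helper, `validate_group` (not a LightGBM function), and made-up row counts to show why group sizes from the original training data generally cannot be reused for a new dataset.

```python
from typing import Sequence


def validate_group(num_rows: int, group: Sequence[int]) -> None:
    """Hypothetical check: query sizes must be positive and sum to the row count."""
    if any(size <= 0 for size in group):
        raise ValueError("every query must contain at least one row")
    if sum(group) != num_rows:
        raise ValueError(
            f"group sizes sum to {sum(group)}, but the dataset has {num_rows} rows"
        )


# Made-up example: the original data had 6 rows in two queries of 3 rows each.
original_group = [3, 3]
validate_group(6, original_group)

# The new data has 5 rows split into three queries, so it needs its own group.
new_group = [2, 2, 1]
validate_group(5, new_group)

# Reusing the original group on the new data fails the check.
try:
    validate_group(5, original_group)
    reuse_ok = True
except ValueError:
    reuse_ok = False
```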

tests/python_package_test/test_engine.py

shiyu1994 · 2021-12-23T07:53:08Z

@TremaMiguel Thanks for working on this!

jameslamb

Thanks for the testing changes. Please see a few more suggestions I've provided.

python-package/lightgbm/basic.py

tests/python_package_test/test_engine.py

jameslamb

Thanks very much. Please see some additional suggestions to make the tests a bit stronger.

tests/python_package_test/test_engine.py

jameslamb

Looks good to me, thanks very much!

I've edited the PR description so it will be a bit more informative when it's used as a bullet point in the release notes.

Before this is merged, I'd like another Python maintainer like @jmoralez or @StrikerRUS to review.

StrikerRUS

Thanks for working on this! Generally LGTM, just a few minor fixes for the docstring.

python-package/lightgbm/basic.py

tests/python_package_test/test_engine.py

python-package/lightgbm/basic.py

TremaMiguel · 2022-01-13T22:26:43Z

@StrikerRUS @jameslamb @jmoralez are there any pending changes needed to close this PR?

StrikerRUS

LGTM! Thank you so much for your contribution!

github-actions · 2023-08-23T14:22:19Z

This pull request has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

TremaMiguel added 3 commits December 18, 2021 12:56

feat: refit additional kwargs for dataset and predict

39ecd82

test: kwargs for refit method

5f01c83

fix: __init__ got multiple values for argument

bf7af5f

TremaMiguel requested review from chivee, henry0312, hzy46, jameslamb, shiyu1994, StrikerRUS and tongwu-sh as code owners December 19, 2021 02:18

fix: pycodestyle E302 error

f14d522

jameslamb added the feature label Dec 21, 2021

jameslamb requested changes Dec 21, 2021

TremaMiguel added 2 commits December 22, 2021 21:04

refactor: dataset_params to avoid breaking change

5f6245e

refactor: expose all Dataset params in refit

3e860df

shiyu1994 reviewed Dec 23, 2021

tests/python_package_test/test_engine.py

feat: dataset_params updates new_params

11d75ec

jameslamb requested changes Dec 27, 2021

fix: remove unnecessary params to test

dad3278

jameslamb requested changes Dec 28, 2021

test: parameters input are the same

4bbd86a

jameslamb approved these changes Dec 29, 2021

jameslamb changed the title from "[python-package] refit support weights (fixes #3038)" to "[python-package] support customizing Dataset creation in Booster.refit() (fixes #3038)" Dec 29, 2021

StrikerRUS requested changes Dec 30, 2021

jmoralez requested changes Dec 30, 2021

tests/python_package_test/test_engine.py

TremaMiguel mentioned this pull request Dec 30, 2021

[python-package] [docs]: weights params description should read weights should be non-negative #4921

Closed

docs: address StrikeRUS changes

0198d3f

jameslamb added the in progress label Jan 1, 2022

StrikerRUS reviewed Jan 4, 2022

python-package/lightgbm/basic.py

test: refit test changes in train dataset

935fdde

test: set init_score and decay_rate to zero

cbc5e00

jmoralez requested review from jmoralez and removed request for chivee January 15, 2022 18:16

jmoralez approved these changes Jan 15, 2022

jameslamb mentioned this pull request Jan 15, 2022

[python-package] refit sets init_score=0 #4951

Open

StrikerRUS removed the in progress label Jan 16, 2022

StrikerRUS approved these changes Jan 16, 2022

StrikerRUS merged commit e6a2f71 into microsoft:master Jan 22, 2022

jameslamb mentioned this pull request Oct 7, 2022

[DO NOT MERGE] Release v3.3.3 #5525

Closed

40 tasks

jameslamb mentioned this pull request Jun 27, 2023

[docs] add versionadded notes for v4.0.0 features #5948

Merged

github-actions bot locked as resolved and limited conversation to collaborators Aug 23, 2023
