[python] add type hints on train() in engine.py #4544

jameslamb · 2021-08-22T02:11:33Z

Proposes adding type hints on train(), as part of #3756.

StrikerRUS

Oh, that was a hard work, thanks!

sklearn-wrapper and hence Dask-package use callables with another signature for custom objective and metric functions:

LightGBM/python-package/lightgbm/sklearn.py

Lines 23 to 49 in 4e18c60

    
                   This class transforms objective function to match objective function with signature ``new_func(preds, dataset)`` 
        
                   as expected by ``lightgbm.engine.train``. 
        
                   Parameters 
        
                   ---------- 
        
                   func : callable 
        
                       Expects a callable with signature ``func(y_true, y_pred)`` or ``func(y_true, y_pred, group) 
        
                       and returns (grad, hess): 
        
                           y_true : array-like of shape = [n_samples] 
        
                               The target values. 
        
                           y_pred : array-like of shape = [n_samples] or shape = [n_samples * n_classes] (for multi-class task) 
        
                               The predicted values. 
        
                               Predicted values are returned before any transformation, 
        
                               e.g. they are raw margin instead of probability of positive class for binary task. 
        
                           group : array-like 
        
                               Group/query data. 
        
                               Only used in the learning-to-rank task. 
        
                               sum(group) = n_samples. 
        
                               For example, if you have a 100-document dataset with ``group = [10, 20, 40, 10, 10, 10]``, that means that you have 6 groups, 
        
                               where the first 10 records are in the first group, records 11-30 are in the second group, records 31-70 are in the third group, etc. 
        
                           grad : array-like of shape = [n_samples] or shape = [n_samples * n_classes] (for multi-class task) 
        
                               The value of the first order derivative (gradient) of the loss 
        
                               with respect to the elements of y_pred for each sample point. 
        
                           hess : array-like of shape = [n_samples] or shape = [n_samples * n_classes] (for multi-class task) 
        
                               The value of the second order derivative (Hessian) of the loss 
        
                               with respect to the elements of y_pred for each sample point.

LightGBM/python-package/lightgbm/sklearn.py

Lines 112 to 144 in 4e18c60

    
                   This class transforms evaluation function to match evaluation function with signature ``new_func(preds, dataset)`` 
        
                   as expected by ``lightgbm.engine.train``. 
        
                   Parameters 
        
                   ---------- 
        
                   func : callable 
        
                       Expects a callable with following signatures: 
        
                       ``func(y_true, y_pred)``, 
        
                       ``func(y_true, y_pred, weight)`` 
        
                       or ``func(y_true, y_pred, weight, group)`` 
        
                       and returns (eval_name, eval_result, is_higher_better) or 
        
                       list of (eval_name, eval_result, is_higher_better): 
        
                           y_true : array-like of shape = [n_samples] 
        
                               The target values. 
        
                           y_pred : array-like of shape = [n_samples] or shape = [n_samples * n_classes] (for multi-class task) 
        
                               The predicted values. 
        
                               In case of custom ``objective``, predicted values are returned before any transformation, 
        
                               e.g. they are raw margin instead of probability of positive class for binary task in this case. 
        
                           weight : array-like of shape = [n_samples] 
        
                               The weight of samples. 
        
                           group : array-like 
        
                               Group/query data. 
        
                               Only used in the learning-to-rank task. 
        
                               sum(group) = n_samples. 
        
                               For example, if you have a 100-document dataset with ``group = [10, 20, 40, 10, 10, 10]``, that means that you have 6 groups, 
        
                               where the first 10 records are in the first group, records 11-30 are in the second group, records 31-70 are in the third group, etc. 
        
                           eval_name : string 
        
                               The name of evaluation function (without whitespace). 
        
                           eval_result : float 
        
                               The eval result. 
        
                           is_higher_better : bool 
        
                               Is eval result higher better, e.g. AUC is ``is_higher_better``.

jameslamb · 2021-08-23T03:28:24Z

sklearn-wrapper and hence Dask-package use callables with another signature for custom objective and metric functions

oh interesting, all this time I didn't know that the interfaces were different!

I'll remove those changes from this PR and make them in a separate one.

StrikerRUS · 2021-08-23T22:53:12Z

I'll remove those changes from this PR and make them in a separate one.

Nice plan, thanks!

python-package/lightgbm/engine.py

Co-authored-by: Nikita Titov <[email protected]>

StrikerRUS

Great job, many thanks!

python-package/lightgbm/engine.py

github-actions · 2023-08-23T16:27:27Z

This pull request has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

[python] add type hints on train() in engine.py

a6ca273

jameslamb added the maintenance label Aug 22, 2021

jameslamb requested a review from StrikerRUS August 22, 2021 02:11

jameslamb requested review from chivee, henry0312 and shiyu1994 as code owners August 22, 2021 02:11

StrikerRUS requested changes Aug 22, 2021

View reviewed changes

Merge branch 'master' into train-hints

b998ae0

revert dask.py and sklearn.py changes

38b9743

jameslamb mentioned this pull request Aug 23, 2021

[python] add type hints for custom objective and metric functions in scikit-learn interface #4547

Merged

jameslamb requested a review from StrikerRUS August 23, 2021 03:53

StrikerRUS requested changes Aug 23, 2021

View reviewed changes

python-package/lightgbm/engine.py Outdated Show resolved Hide resolved

python-package/lightgbm/engine.py Outdated Show resolved Hide resolved

python-package/lightgbm/engine.py Show resolved Hide resolved

python-package/lightgbm/engine.py Outdated Show resolved Hide resolved

jameslamb and others added 2 commits August 24, 2021 23:37

Apply suggestions from code review

0ccd9fe

Co-authored-by: Nikita Titov <[email protected]>

update docs on evals_result contents

2120d71

StrikerRUS approved these changes Aug 25, 2021

View reviewed changes

jameslamb commented Aug 25, 2021

View reviewed changes

python-package/lightgbm/engine.py Outdated Show resolved Hide resolved

Update python-package/lightgbm/engine.py

42a9c06

StrikerRUS merged commit 13fa6d9 into master Aug 25, 2021

StrikerRUS deleted the train-hints branch August 25, 2021 23:22

StrikerRUS mentioned this pull request Aug 25, 2021

[docs][python] Improve description of eval_result argument in record_evaluation() #4559

Merged

jameslamb mentioned this pull request Oct 9, 2021

Custom objective function doesn't produce expected results #4659

Closed

github-actions bot locked as resolved and limited conversation to collaborators Aug 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[python] add type hints on train() in engine.py #4544

[python] add type hints on train() in engine.py #4544

jameslamb commented Aug 22, 2021

StrikerRUS left a comment •

edited

Loading

jameslamb commented Aug 23, 2021

StrikerRUS commented Aug 23, 2021

StrikerRUS left a comment

github-actions bot commented Aug 23, 2023

	This class transforms objective function to match objective function with signature ``new_func(preds, dataset)``
	as expected by ``lightgbm.engine.train``.

	Parameters
	----------
	func : callable
	Expects a callable with signature ``func(y_true, y_pred)`` or ``func(y_true, y_pred, group)
	and returns (grad, hess):

	y_true : array-like of shape = [n_samples]
	The target values.
	y_pred : array-like of shape = [n_samples] or shape = [n_samples * n_classes] (for multi-class task)
	The predicted values.
	Predicted values are returned before any transformation,
	e.g. they are raw margin instead of probability of positive class for binary task.
	group : array-like
	Group/query data.
	Only used in the learning-to-rank task.
	sum(group) = n_samples.
	For example, if you have a 100-document dataset with ``group = [10, 20, 40, 10, 10, 10]``, that means that you have 6 groups,
	where the first 10 records are in the first group, records 11-30 are in the second group, records 31-70 are in the third group, etc.
	grad : array-like of shape = [n_samples] or shape = [n_samples * n_classes] (for multi-class task)
	The value of the first order derivative (gradient) of the loss
	with respect to the elements of y_pred for each sample point.
	hess : array-like of shape = [n_samples] or shape = [n_samples * n_classes] (for multi-class task)
	The value of the second order derivative (Hessian) of the loss
	with respect to the elements of y_pred for each sample point.

	This class transforms evaluation function to match evaluation function with signature ``new_func(preds, dataset)``
	as expected by ``lightgbm.engine.train``.

	Parameters
	----------
	func : callable
	Expects a callable with following signatures:
	``func(y_true, y_pred)``,
	``func(y_true, y_pred, weight)``
	or ``func(y_true, y_pred, weight, group)``
	and returns (eval_name, eval_result, is_higher_better) or
	list of (eval_name, eval_result, is_higher_better):

	y_true : array-like of shape = [n_samples]
	The target values.
	y_pred : array-like of shape = [n_samples] or shape = [n_samples * n_classes] (for multi-class task)
	The predicted values.
	In case of custom ``objective``, predicted values are returned before any transformation,
	e.g. they are raw margin instead of probability of positive class for binary task in this case.
	weight : array-like of shape = [n_samples]
	The weight of samples.
	group : array-like
	Group/query data.
	Only used in the learning-to-rank task.
	sum(group) = n_samples.
	For example, if you have a 100-document dataset with ``group = [10, 20, 40, 10, 10, 10]``, that means that you have 6 groups,
	where the first 10 records are in the first group, records 11-30 are in the second group, records 31-70 are in the third group, etc.
	eval_name : string
	The name of evaluation function (without whitespace).
	eval_result : float
	The eval result.
	is_higher_better : bool
	Is eval result higher better, e.g. AUC is ``is_higher_better``.

[python] add type hints on train() in engine.py #4544

[python] add type hints on train() in engine.py #4544

Conversation

jameslamb commented Aug 22, 2021

StrikerRUS left a comment • edited Loading

Choose a reason for hiding this comment

jameslamb commented Aug 23, 2021

StrikerRUS commented Aug 23, 2021

StrikerRUS left a comment

Choose a reason for hiding this comment

github-actions bot commented Aug 23, 2023

StrikerRUS left a comment •

edited

Loading