Releases · jinlow/forust
v0.2.0
This release is a major refactor of how splitting is handled internally; the external API and Python API remain the same. These changes will make it easier to treat missing values explicitly while training. Future releases will implement the ability to split missing out into its own separate branch.
v0.1.7
This release adds the following changes to the package:
- Support for monotonic constraints. Features can now be supplied with a constraint so that they are forced to have a monotonically increasing, monotonically decreasing, or unconstrained relationship with the target variable. This can be adjusted using the `monotone_constraints` parameter (see the usage sketch after this list).
- Experimental support for dealing with missing values in different ways. This includes the ability to disallow splits that send only missing (or only non-missing) values down a branch, as well as to skip automatic imputation of missing values and instead always send them down a default branch rather than learning the best direction to send them. See the documentation on the `allow_missing_splits` parameter.
- The default value of the `min_leaf_weight` parameter was changed from 0.0 to 1.0.
- Additional refactoring of the code to better align with modern Python type hints, along with pre-commit support for development and clearer naming of some modules.
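As a rough illustration, here is a minimal sketch of how the new parameters might be passed through the Python API. The parameter names (`monotone_constraints`, `allow_missing_splits`, `min_leaf_weight`) come from the notes above; the `GradientBooster` class name, constructor defaults, and `fit`/`predict` signatures are assumptions and may differ from the actual API.

```python
# Usage sketch -- class name and fit()/predict() signatures are assumed;
# only the parameter names come from these release notes.
import numpy as np
import pandas as pd
from forust import GradientBooster  # assumed import path

rng = np.random.default_rng(0)
X = pd.DataFrame(
    {
        "age": rng.uniform(18, 80, 1_000),
        "income": rng.uniform(10_000, 200_000, 1_000),
    }
)
# Knock out ~10% of one feature so missing-value handling comes into play.
X.loc[X.sample(frac=0.1, random_state=0).index, "income"] = np.nan
y = (X["age"] + rng.normal(0, 10, 1_000) > 50).astype(float)

model = GradientBooster(
    # Force a monotonically increasing relationship with the target for
    # "age" (1); leave "income" unconstrained (0); -1 would force decreasing.
    monotone_constraints={"age": 1, "income": 0},
    # Experimental missing handling: disallow splits that would send only
    # missing (or only non-missing) values down a branch.
    allow_missing_splits=False,
    # New default is 1.0 (was 0.0); set explicitly here for clarity.
    min_leaf_weight=1.0,
)
model.fit(X, y)
preds = model.predict(X)
```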
v0.1.6
v0.1.5
v0.1.4
v0.1.3
This release introduces many additional optimizations, leading to a speedup of more than 7X on data with more than 300K rows.
- All internal statistics (histograms, gradient/hessian sums) have been converted to the `f32` data type. However, for any summing aggregations these values are cast to `f64` and then summed, ensuring that higher precision is maintained (see the sketch after this list).
- All gradients are aligned in memory before calculating feature histograms. This accounted for about half of the performance improvement.
- The data is realigned in memory prior to each tree being constructed, which accounted for most of the remaining speed gain.
- The histograms, which were originally a hashmap of vectors, have been converted to a jagged matrix, a data structure with faster access.

By aligning the data in memory, the overall number of cache misses is reduced, which drastically increases performance.
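To make the first and last points concrete, here is a short numpy sketch. This is a toy demonstration of the two techniques named above, not the library's actual Rust internals: casting `f32` statistics to `f64` before summing to preserve precision, and replacing a hashmap of vectors with a jagged matrix stored as one contiguous buffer plus row offsets.

```python
import numpy as np

# --- f32 storage, f64 accumulation -------------------------------------
# Statistics are stored as f32 to halve memory traffic, but summed in f64
# so that accumulated rounding error stays small.
rng = np.random.default_rng(0)
grads = rng.normal(0, 1, 300_000).astype(np.float32)

low = grads.sum(dtype=np.float32)            # accumulate in f32
high = grads.astype(np.float64).sum()        # cast up, then sum in f64
print(low, high)                             # the f64 sum is more precise

# --- jagged matrix as flat buffer + offsets ----------------------------
# Instead of {feature: vec_of_bins}, store every feature's bins in one
# contiguous allocation and keep an offsets array marking row boundaries.
bins_per_feature = [4, 2, 3]                 # uneven row lengths
offsets = np.concatenate(([0], np.cumsum(bins_per_feature)))
flat = np.zeros(offsets[-1], dtype=np.float32)

def feature_bins(f: int) -> np.ndarray:
    """View of one feature's histogram bins: no hashing or pointer
    chasing, and neighboring rows share cache lines."""
    return flat[offsets[f]:offsets[f + 1]]

feature_bins(1)[:] = [1.5, 2.5]              # write feature 1's bins in place
print(feature_bins(1))                       # [1.5 2.5]
```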