You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This undesirably leaks information from the (unseen) test dataset, biasing the reported metrics. Instead fit_transform should be applied on the train data and transform on the test data.
The text was updated successfully, but these errors were encountered:
Thanks for filing this @dionman. I'll check back in with the team in case there's any reason why it was implemented the way it is. Will keep this issue open to report any progress or updates.
Environment Details
Error Description
At current implementation of MLEfficacyMetric base class, the transformer is fit on the test data
SDMetrics/sdmetrics/single_table/efficacy/base.py
Line 50 in 6983bc0
fit_transform
should be applied on the train data andtransform
on the test data.The text was updated successfully, but these errors were encountered: