353 create imputation markers #14

Jday7879 · 2024-05-16T10:25:13Z

Summary

Added a function that creates logical columns flagging whether the target variable is a return, or can be imputed by forward imputation, backward imputation, or construction.
Updated the original unit tests so they can later be expanded into a single test dataset. That consolidation should be added to the backlog and picked up further down the line.
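To make the flag logic described above concrete, here is a minimal sketch, not the code in this PR. The function name `sketch_impute_flags`, the flag column names, and the example columns `turnover` and `employees` are all hypothetical, and the rule used for construction (an auxiliary value on the same row is enough) is an assumption.

```python
import pandas as pd


def sketch_impute_flags(df, target, period, reference, auxiliary):
    """Illustrative only: add boolean r/fi/bi/c columns for `target`."""
    # Work on a sorted copy so the caller's frame is never modified in place.
    out = df.sort_values([reference, period]).copy()
    returned = out[target].notna()
    by_ref = out.groupby(reference)[target]

    out["r_flag"] = returned
    # ffill carries the last returned value forward within each reference, so a
    # non-null result on a missing row means forward imputation has a source.
    out["fi_flag"] = ~returned & by_ref.ffill().notna()
    # bfill does the same from later periods, for backward imputation.
    out["bi_flag"] = ~returned & by_ref.bfill().notna()
    # Assumption: construction only needs an auxiliary value on the same row.
    out["c_flag"] = ~returned & out[auxiliary].notna()
    return out


# Hypothetical example data: three references over up to three periods.
example = pd.DataFrame(
    {
        "reference": [1, 1, 1, 2, 2, 2, 3],
        "period": [1, 2, 3, 1, 2, 3, 1],
        "turnover": [10.0, None, None, None, None, 5.0, None],
        "employees": [4.0, 4.0, None, 2.0, None, 2.0, 7.0],
    }
)
flags = sketch_impute_flags(example, "turnover", "period", "reference", "employees")
```

In this sketch the flags are independent booleans, so a row can be eligible for more than one method; choosing between them is left to a later step.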

Checklists

This pull request meets the following requirements:

If you feel some of these conditions do not apply for this pull request, please
add a comment to explain why.


src/imputation_flags.py

AntonZogk

@Jday7879 Lovely work on a complicated task. I love what you did with ffill and bfill, smart solution :)

I have left some comments and am happy to chat about them in more detail.

… (can be extracted to yaml file if needed)


AntonZogk · 2024-05-22T12:32:46Z

I updated the function.

Instead of calling flag_matched_pair_merge inside the function to create predictive_auxiliary, predictive_auxiliary is now a function argument. Hence flag_matched_pair_merge must be called before create_impute_flags. This makes flag_matched_pair_merge a low-level function that uses only pandas.

I added predictive_auxiliary to the test_data CSV file and updated the tests to match the expected columns.
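The call order this implies can be sketched as below. Both functions are hypothetical stand-ins, not the real flag_matched_pair_merge or create_impute_flags: their signatures, the `predictive_employees` column name, and the way the upstream column is derived (previous period's auxiliary value) are assumptions made purely for illustration.

```python
import pandas as pd


def add_predictive_auxiliary_stub(df, auxiliary, period, reference):
    """Hypothetical upstream step standing in for flag_matched_pair_merge."""
    out = df.sort_values([reference, period]).copy()
    # Assumption: the derived column is the previous period's auxiliary value.
    out["predictive_" + auxiliary] = out.groupby(reference)[auxiliary].shift(1)
    return out


def impute_flags_stub(df, target, predictive_auxiliary):
    """Hypothetical downstream step standing in for create_impute_flags."""
    missing = [col for col in (target, predictive_auxiliary) if col not in df.columns]
    if missing:
        # Fail early with a clear message if the upstream step was skipped.
        raise ValueError(f"Columns missing from dataframe: {missing}")
    out = df.copy()
    out["c_flag"] = out[target].isna() & out[predictive_auxiliary].notna()
    return out


# Hypothetical example data.
raw = pd.DataFrame(
    {
        "reference": [1, 1, 2],
        "period": [1, 2, 1],
        "turnover": [10.0, None, None],
        "employees": [4.0, None, 3.0],
    }
)

# The upstream column must exist before the flag function is called.
prepared = add_predictive_auxiliary_stub(raw, "employees", "period", "reference")
flagged = impute_flags_stub(prepared, "turnover", "predictive_employees")
```

Passing the column name in, rather than computing it inside, keeps each function testable on its own; the cost is that callers must run the steps in the right order, which is why the stub checks for the column up front.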

This ticket needs a fresh review from a different developer.

robertswh

Looks good to me, tests pass.

Jday7879 added 5 commits May 15, 2024 14:28

Change unit tests from dropping to selecting, ready for adding more c…

9ddd6af

…ols into test data

Adding module to calculate imputation flag columns

1fbbd83

Creating unit test and test data for imputation flag

70dfad4

Copying input data to fix pandas copy warnings

9bd4c2a

Adding docstrings

f334147

Jday7879 requested a review from AntonZogk May 16, 2024 10:26

Jday7879 marked this pull request as ready for review May 16, 2024 10:26

Jday7879 closed this May 16, 2024

Jday7879 added 3 commits May 16, 2024 15:18

Refactoring matched_pair column to include target column in name

2bd4b04

Update impute flags to include impute from construction

122610b

Create function to convert impute flags into single column with strings

f1372f0

Jday7879 reopened this May 17, 2024

Jday7879 marked this pull request as draft May 17, 2024 08:24

Jday7879 added 4 commits May 17, 2024 10:47

Fixing pandas copy on slice warning

f1abca8

Updating docstring and handle case where needed columns are not included

77855a5

Update error message

0607562

Adding unit test for string flag column

e24f451

Jday7879 marked this pull request as ready for review May 17, 2024 12:15

AntonZogk reviewed May 21, 2024
