Multiply robust #35

sami6mz · 2023-07-07T08:49:46Z

Multiply_robust estimator is fully implemented according Tchetgen Tchetgen 2012 and Huber 2016.
For linear generated data (even with x multidimensional), relative error is around 5% even for indirect effects, which is a noteworthy performance.
Docstring was changed, code was made readable, and .ravel() input was rectified (solving #34 and making #14 to progress).

Test code :

from med_bench.src.get_simulated_data import simulate_data
from med_bench.src.benchmark_mediation import *
data = simulate_data(1000, default_rng(4), False, False, 5, 1, 7, "binary", 0.5, 0.5, 0.5, 0.5)
x = data[0]
t = data[1]
m = data[2]
y = data[3]
effects = np.array(data[4:9])

effects_chap = multiply_robust_efficient(y, t, m, x)[0:5]
error = abs((effects_chap - effects) / effects)
print(effects)
print(error)

print(multiply_robust_efficient(y.ravel(), t.ravel(), m, x) == multiply_robust_efficient(y, t, m, x))

sami6mz · 2023-07-07T08:52:43Z

my bad, beb5f26 should be named solving #34

src/benchmark_mediation.py

judithabk6 · 2023-07-07T15:09:17Z

src/benchmark_mediation.py

+        direct effect on the unexposed,
+        indirect effect on the exposed,
+        indirect effect on the unexposed,
+        number of clipped samples]


not sure this is the wanted behavior, the last returned value should have the same meaning in all methods, so the number of discarded observations by trimming, not an alternance of clipped and trimmed examples. WDYT @sami6mz @bthirion?

+1 for consistency

More importantly, there should be 6 output arguments, not one list.

not sure this is the wanted behavior, the last returned value should have the same meaning in all methods, so the number of discarded observations by trimming, not an alternance of clipped and trimmed examples. WDYT @sami6mz @bthirion?

Then should we keep clipping? Or replace clipping by trimming?

keep clipping but not return the number of clipped examples. And potentially open an issue to implement trimming in this method. And another issue to implement clipping in all methods and return 6 results instead of 5 in that case

I ended up removing clipping count with df600a9 + putting the clipping code in issue #39

keep clipping but not return the number of clipped examples. And potentially open an issue to implement trimming in this method. And another issue to implement clipping in all methods and return 6 results instead of 5 in that case

Let's just implement trimming for now, and remove clipping

src/benchmark_mediation.py

bthirion

Thx for the great job !

bthirion · 2023-07-07T20:31:28Z

src/benchmark_mediation.py

@@ -66,20 +66,22 @@ def get_interactions(interaction, *args):
           [ 2.,  3.,  1.,  2.,  2.,  3.,  4.,  6.,  2.],
           [ 4.,  5.,  1.,  2.,  4.,  5.,  8., 10.,  2.]])
    """
-    variables = args
+    variables = list(args)


is there any reason to sue this patterns. Arguments should be passed explicitly.

you mean to use args directly rather than renaming it variables?

actually you can specify a list of variables between which you want to compute interaction terms. Do you think this should be done differently?

src/benchmark_mediation.py

bthirion · 2023-07-07T20:34:02Z

src/benchmark_mediation.py

+        direct effect on the unexposed,
+        indirect effect on the exposed,
+        indirect effect on the unexposed,
+        number of clipped samples]


+1 for consistency

bthirion · 2023-07-07T20:34:23Z

src/benchmark_mediation.py

+        direct effect on the unexposed,
+        indirect effect on the exposed,
+        indirect effect on the unexposed,
+        number of clipped samples]


More importantly, there should be 6 output arguments, not one list.

src/benchmark_mediation.py

Sami Boumaiza added 3 commits July 7, 2023 10:27

solving judithabk6#33

beb5f26

docstring changed

28f6989

implemented all effects for multiply_robust

cea9eee

This was referenced Jul 7, 2023

Implemented test in respond to #15 #36

Merged

need to correct IPW code for y and t input format #14

Open

judithabk6 reviewed Jul 7, 2023

View reviewed changes

bthirion reviewed Jul 7, 2023

View reviewed changes

Sami Boumaiza added 8 commits July 10, 2023 11:50

formatting get_interactions

5eb4d2b

changed docstring

017e5b9

variable name modified

2308342

clipping removed

df600a9

predict_proba run once

af8fff6

updated TOLERANCE_DICT

38129d9

docstring changed

d5cd409

trimming added, clipping removed

4cc528d

sami6mz mentioned this pull request Jul 25, 2023

m.astype(int) for multiply_robust #41

Open

judithabk6 merged commit a2d8e02 into judithabk6:main Jul 30, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multiply robust #35

Multiply robust #35

sami6mz commented Jul 7, 2023

sami6mz commented Jul 7, 2023

judithabk6 Jul 7, 2023

bthirion Jul 7, 2023

bthirion Jul 7, 2023

sami6mz Jul 10, 2023

judithabk6 Jul 10, 2023

sami6mz Jul 10, 2023

sami6mz Jul 11, 2023

bthirion left a comment

bthirion Jul 7, 2023

judithabk6 Jul 10, 2023

judithabk6 Jul 10, 2023

bthirion Jul 7, 2023

bthirion Jul 7, 2023

Multiply robust #35

Multiply robust #35

Conversation

sami6mz commented Jul 7, 2023

sami6mz commented Jul 7, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bthirion left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment