Felix/regression #3
Conversation
-p = model.predict(data.X_test)  # N_test, K
+p = model.predict(data.X_test)  # [N_test x K] or [samples x N_test x K]

+assert len(p.shape) in {2, 3}  # 3-dim in case of approximate predictions (multiple samples per each X)
Suggested change:
-assert len(p.shape) in {2, 3}  # 3-dim in case of approximate predictions (multiple samples per each X)
+assert p.ndim in {2, 3}  # 3-dim in case of approximate predictions (multiple samples per each X)
res['test_acc'] = np.average(np.array(pred == data.Y_test.flatten()).astype(float))

res['Y_test'] = data.Y_test
why do we need to store res["Y_test"] in the results?
you add Y_test and p to res in both the 2-dim and 3-dim branches. Better to take it out of the if/else.
I just followed Hugh's implementation. I don't know why he added it.
# evaluation metrics
res = {}

logp = multinomial.logpmf(Y_oh, n=1, p=p)
if len(p.shape) == 2:  # keep analysis as in the original code in case of 2-dim predictions
using p.ndim is more concise here
logp = multinomial.logpmf(Y_oh, n=1, p=p)
if len(p.shape) == 2:  # keep analysis as in the original code in case of 2-dim predictions

logp = multinomial.logpmf(Y_oh, n=1, p=p)
adding shapes here would be useful. I believe logp is [N] here?
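For illustration, a quick shape check of the 2-dim case (toy values, not the benchmark's data):

import numpy as np
from scipy.stats import multinomial

N, K = 4, 3
p = np.full((N, K), 1 / K)                # [N x K] predicted class probabilities
Y_oh = np.eye(K)[np.zeros(N, dtype=int)]  # [N x K] one-hot test labels
logp = multinomial.logpmf(Y_oh, n=1, p=p)
print(logp.shape)  # (4,) -- one log-likelihood per test point, i.e. [N]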
pred = np.argmax(p, axis=-1)
# Mixture test likelihood (mean over per data point evaluations)
logp = logsumexp(res['test_loglik'], axis=0) - np.log(p.shape[0])
Personally I would create a helper function meansumexp; you can use it in the other file as well.
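For concreteness, with such a helper (meansumexp is only a proposed name here; a sketch follows further down under utils.py), the line above would collapse to:

logp = meansumexp(res['test_loglik'], axis=0)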
pred = np.argmax(p, axis=-1)
p = np.mean(p, axis=0)
pred = np.argmax(p, axis=-1)

res['test_acc'] = np.average(np.array(pred == data.Y_test.flatten()).astype(float))

res['Y_test'] = data.Y_test
I think it is worth figuring out, and deleting them if we don't use them anywhere. Storing 2 arrays of length N in our database is not a good idea anyway.
bayesian_benchmarks/tasks/utils.py
Outdated
import numpy as np
from scipy.special import logsumexp

def meansumexp(logps: List[np.ndarray]) -> np.ndarray:
Given that this is a helper function, I would make it more general by letting it accept an array (instead of a List of arrays) and an axis along which to reduce. Namely: meansumexp(a: np.array, axis)
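A minimal sketch of that more general form (this is the suggestion, not the code currently in the PR):

import numpy as np
from scipy.special import logsumexp

def meansumexp(a: np.ndarray, axis: int = 0) -> np.ndarray:
    # log-mean-exp: logsumexp over `axis` minus the log of the number of
    # elements reduced, i.e. log((1/S) * sum_s exp(a_s)) with S = a.shape[axis].
    return logsumexp(a, axis=axis) - np.log(a.shape[axis])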
log_eps = np.log(1e-12)  # log probability threshold
log_1_minus_eps = np.log(1.0 - 1e-12)

if len(m.shape) == 2:  # keep analysis as in the original code in case of 2-dim predictions
same with ndim
m, v = model.predict(data.X_test)  # both [data points x output dim] or [samples x data points x output dim]

assert len(m.shape) == len(v.shape)
assert len(m.shape) in {2, 3}  # 3-dim in case of approximate predictions (multiple samples per each X)
ndim is cleaner here
tests/test_tasks.py
Outdated
def test_regression(tuple):
    data, model, correct_result = tuple
    result = run_regression(None, data=data, model=model, is_test=True)
    assert correct_result['test_loglik'] == pytest.approx(result['test_loglik'], 1e-3)
I think most of us use np.testing.assert_almost_equal instead of pytest.approx. np.testing gives nice and clear errors when the assert isn't fulfilled.
tests/test_tasks.py
Outdated
def test_regression(tuple):
    data, model, correct_result = tuple
    result = run_regression(None, data=data, model=model, is_test=True)
    assert correct_result['test_loglik'] == pytest.approx(result['test_loglik'], 1e-3)
Instead of the copying here I would write this as a for loop:

for (ref_key, ref_value), (actual_key, actual_value) in zip(correct_result.items(), result.items()):
    np.testing.assert_equal(ref_key, actual_key)
    np.testing.assert_almost_equal(ref_value, actual_value)
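One caveat with zipping two .items() views: it silently assumes both dicts use the same key order. Iterating the keys of one dict avoids that (a sketch, not the reviewer's exact proposal):

for key in correct_result:
    np.testing.assert_almost_equal(result[key], correct_result[key])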
tests/test_tasks.py
Outdated
    assert correct_result['test_loglik'] == pytest.approx(result['test_loglik'], 1e-3)
    assert correct_result['test_acc'] == pytest.approx(result['test_acc'], 1e-3)
    assert np.allclose(correct_result['Y_test'], result['Y_test'], rtol=0.0, atol=1e-3)
    assert np.allclose(correct_result['p_test'], result['p_test'], rtol=0.0, atol=1e-3)
nitpick: you want a newline at the end of the file.
tests/test_tasks.py
Outdated
    Approximate regression mock.
    """
    def predict(self, X: np.ndarray) -> (np.ndarray, np.ndarray):
        mu = np.array([[[1., 2., 3.], [4., 5., 6.]], [[1.5, 2.5, 3.5], [4.5, 5.5, 6.5]]])
can you explain in the code why these classes return these specific shapes, please?
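Something along these lines, perhaps (the class name and the variance values are made up for illustration; only mu is from the PR):

import numpy as np

class ApproxRegressionMock:
    def predict(self, X: np.ndarray) -> (np.ndarray, np.ndarray):
        # [samples x data points x output dim] = [2 x 2 x 3]: two posterior
        # samples per test point, so run_regression takes the 3-dim
        # (approximate predictions) branch instead of the exact 2-dim one.
        mu = np.array([[[1., 2., 3.], [4., 5., 6.]],
                       [[1.5, 2.5, 3.5], [4.5, 5.5, 6.5]]])
        var = np.ones_like(mu)  # matching [2 x 2 x 3] predictive variances
        return mu, var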
bayesian_benchmarks/tasks/utils.py
Outdated
    :param axis: determines reduction
    :return: avg probability value [1]
    """
    return np.mean(logsumexp(logps, axis=axis) - np.log(len(logps)))
np.log(len(logps)) doesn't work; it should be np.log(logps.shape[axis]).
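i.e. the last line would become something like this (assuming logps is already an ndarray, per the generalization suggested above):

return np.mean(logsumexp(logps, axis=axis) - np.log(logps.shape[axis]))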