features/110-split_output_randn: Adding the new code for random #114
Conversation
Codecov Report
@@            Coverage Diff             @@
##           master     #114      +/-   ##
==========================================
- Coverage   90.34%   90.23%   -0.11%
==========================================
  Files          20       20
  Lines        2735     2735
==========================================
- Hits         2471     2468       -3
- Misses        264      267       +3
Continue to review full report at Codecov.
Looks good to me.
heat/core/tests/test_operations.py
Outdated
self.assertTrue((ht.argmin(random_data,axis=0)._tensor__array == random_data_split.argmin(axis=0)._tensor__array).all())
self.assertTrue((ht.argmin(random_data,axis=1)._tensor__array == random_data_split.argmin(axis=1)._tensor__array).all())
self.assertIsInstance(ht.argmin(random_data_split,axis=1),ht.tensor)
# todo: these tests now fail if random_data is split.
Remove or fix test
heat/core/tests/test_operations.py
Outdated
self.assertTrue((ht.max(random_data,axis=0)._tensor__array[0] == random_data_split.max(axis=0)._tensor__array[0]).all())
self.assertTrue((ht.max(random_data,axis=1)._tensor__array[0] == random_data_split.max(axis=1)._tensor__array[0]).all())
# todo: these tests now fail if random_data is split.
Remove or fix test
@@ -30,7 +30,7 @@ def randn(*args, split=None, comm=MPI_WORLD):

     Parameters
     ----------
-    d0, d1, …, dn : int, optional
+    d0, d1, …, dn : ints, optional
The implementation of this function is closely related to issue #54. What is the intended behaviour of this function? To output random numbers where each split part is random and not identical to the split chunks on the other nodes? What will happen if I use the very same seed but a different node count? Will the random tensor differ?
It looks like torch's randn will not generate the same values for arrays of different sizes. This means that even with the same seed, the split tensor will not be the same as a tensor generated on only one process, i.e.
torch.randn((3, 3, 3))[0] != torch.randn((1, 3, 3))
One solution would be to generate the whole dataset and then split it, but that will not scale. I do not see another way to do this.
It must be noted that ht.randn with a fixed seed (torch.manual_seed) and a fixed size produces a reproducible result.
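For example, a minimal check of this behaviour (a sketch only; whether the slices match can vary with the torch version, since it depends on torch's internal drawing order):

import torch

# draw a 3x3x3 tensor and a 1x3x3 tensor from the same seed
torch.manual_seed(42)
full = torch.randn((3, 3, 3))

torch.manual_seed(42)
chunk = torch.randn((1, 3, 3))

# despite the identical seed, the first slice of the larger tensor generally
# does not match the smaller tensor, because torch draws the values
# differently depending on the requested size
print(torch.equal(full[0:1], chunk))  # typically False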
In my opinion, we can leave it as proposed for now; however, for the future, I would want a different behaviour. Consider the following example:
ht.set_seed(1)
ht.randn(100, 5, 3, split=0)
This should always produce the same set of random numbers independent of the number of utilized nodes (for the reproducibility reasons you mentioned). This is exactly what is requested in issue #54. Obviously, this means we would have to come up with a pseudo-random generator that allows skipping to arbitrary (or some fixed) positions in the random sequence.
With the proposed fix, we would have a simplified randn() call that only provides reproducible results for the exact same node count.
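As a rough illustration of the direction this could take (a hypothetical sketch, not the heat API; the function name reproducible_randn_rows and the use of numpy's SeedSequence are assumptions for illustration): instead of skipping within one stream, one could derive an independent stream per global row, so each process draws exactly the rows it owns and the assembled tensor never depends on the process count.

import numpy as np

def reproducible_randn_rows(global_shape, seed, rank, nprocs):
    # hypothetical sketch: one independent stream per global row index,
    # keyed by (seed, row), so the result is independent of nprocs;
    # assumes a naive even split along axis 0 for simplicity
    rows = global_shape[0]
    local_rows = range(rank * (rows // nprocs), (rank + 1) * (rows // nprocs))
    row_shape = global_shape[1:]
    chunks = [
        np.random.default_rng(np.random.SeedSequence([seed, i])).standard_normal(row_shape)
        for i in local_rows
    ]
    return np.stack(chunks)

# every rank draws only its slice, yet stacking all slices yields the same
# global tensor for any process count
local = reproducible_randn_rows((100, 5, 3), seed=1, rank=0, nprocs=4)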
100% agree. I am trying to figure out a way to do this. I will move this discussion to the issue and close this request.
heat/core/tests/test_operations.py
Outdated
self.assertTrue((ht.min(random_data,axis=0)._tensor__array[0] == random_data_split.min(axis=0)._tensor__array[0]).all())
self.assertTrue((ht.min(random_data,axis=1)._tensor__array[0] == random_data_split.min(axis=1)._tensor__array[0]).all())
# todo: these tests now fail if random_data is split.
Remove or fix test
See issue #110.
Heads up @ClaudiaComito, I mentioned this in the issue as well, but this will change the tests for argmin, min, and max. This is because the random numbers generated on separate processes are not equal to those generated on a single process, but they are the same on every run.
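To make the impact on the tests concrete (a sketch only, reusing the ht.randn signature and the torch.manual_seed usage discussed above; the shapes are arbitrary):

import torch
import heat as ht

# fix the seed and draw the same logical tensor twice:
# once unsplit, once split along axis 0
torch.manual_seed(1)
random_data = ht.randn(4, 5)

torch.manual_seed(1)
random_data_split = ht.randn(4, 5, split=0)

# re-running the split generation with the same seed reproduces the same
# local chunks on every process, so run-to-run comparisons stay stable
torch.manual_seed(1)
random_data_split_again = ht.randn(4, 5, split=0)

# however, the split tensor is generally not element-wise equal to the unsplit
# one, so assertions like
#   ht.argmin(random_data, axis=0) == random_data_split.argmin(axis=0)
# can no longer be expected to hold once random_data is split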