
make learners picklable #264

Merged: 23 commits merged into master from pickle on Apr 24, 2020
Conversation

@basnijholt (Member) commented Apr 9, 2020

Description

I realized that in some cases it's very useful to pickle learners, for example to send one over the network when parallelizing code.

With these changes, most learners become picklable.
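
For illustration, a minimal sketch of the intended usage (the function, bounds, and sample point below are just an example):

import pickle

import adaptive

def f(x):
    return x ** 2

learner = adaptive.Learner1D(f, bounds=(-1, 1))
learner.tell(0.5, f(0.5))

# Serialize, e.g. to ship the learner to another process or machine...
blob = pickle.dumps(learner)

# ...and restore it with its data intact.
learner2 = pickle.loads(blob)
assert learner2.data == learner.data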

Checklist

  • Fixed style issues using pre-commit run --all (first install using pip install pre-commit)
  • pytest passed

Type of change

Check relevant option(s).

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

@basnijholt force-pushed the pickle branch 3 times, most recently from b2965d8 to cb2cb25, on April 9, 2020 21:32
@basnijholt changed the title from "WIP: make learners picklable" to "make learners picklable" on Apr 9, 2020
@basnijholt requested a review from jbweston on April 10, 2020 11:48
@basnijholt requested a review from akhmerov on April 10, 2020 12:25
@basnijholt (Member, Author) commented Apr 10, 2020

@akhmerov and @jbweston, I wonder if we should also pickle adaptive.__version__ and warn the user that, if they load a learner pickled with a different version, there is no guarantee that everything is correct?

Let's do that in a future PR.
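
A hypothetical sketch of what that could look like (the class and warning text are made up for illustration):

import warnings

import adaptive

class VersionCheckingMixin:
    # Hypothetical: store the adaptive version alongside the learner state.
    def __getstate__(self):
        return adaptive.__version__, self.__dict__

    def __setstate__(self, state):
        version, attributes = state
        if version != adaptive.__version__:
            warnings.warn(
                f"Learner was pickled with adaptive {version}, "
                f"but adaptive {adaptive.__version__} is running; "
                "there is no guarantee that it behaves correctly."
            )
        self.__dict__.update(attributes)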

@codecov-io commented Apr 10, 2020

Codecov Report

Merging #264 into master will increase coverage by 0.66%.
The diff coverage is 94.23%.


@@            Coverage Diff             @@
##           master     #264      +/-   ##
==========================================
+ Coverage   79.52%   80.18%   +0.66%     
==========================================
  Files          32       33       +1     
  Lines        4425     4522      +97     
  Branches      815      819       +4     
==========================================
+ Hits         3519     3626     +107     
+ Misses        779      773       -6     
+ Partials      127      123       -4     
Impacted Files Coverage Δ
adaptive/tests/test_pickling.py 88.67% <88.67%> (ø)
adaptive/learner/average_learner.py 83.72% <100.00%> (+4.97%) ⬆️
adaptive/learner/balancing_learner.py 75.00% <100.00%> (+1.22%) ⬆️
adaptive/learner/data_saver.py 90.00% <100.00%> (+1.76%) ⬆️
adaptive/learner/integrator_learner.py 91.37% <100.00%> (+3.42%) ⬆️
adaptive/learner/learner1D.py 92.81% <100.00%> (+0.48%) ⬆️
adaptive/learner/learner2D.py 79.28% <100.00%> (+1.26%) ⬆️
adaptive/learner/sequence_learner.py 85.52% <100.00%> (-0.19%) ⬇️
... and 4 more

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update a0b22ff...acc5400. Read the comment docs.

@basnijholt force-pushed the pickle branch 2 times, most recently from 817cda9 to 303efaa, on April 10, 2020 17:34
@akhmerov (Contributor) left a comment

I think the tests should be stricter about idempotency. For example, the loss reported by the new learner should be exactly the same, the learner's response to .ask should be exactly the same, etc.
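
Something along these lines (a sketch; the actual tests in test_pickling.py may differ):

import pickle

import adaptive

def f(x):
    return x ** 2

learner = adaptive.Learner1D(f, bounds=(-1, 1))
for x in (-0.5, 0.0, 0.5):
    learner.tell(x, f(x))

learner2 = pickle.loads(pickle.dumps(learner))

# The unpickled learner should behave identically to the original.
assert learner2.loss() == learner.loss()
assert learner2.ask(3) == learner.ask(3)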

@basnijholt (Member, Author)
@akhmerov, good idea, I've added that in 54017f3.

@basnijholt requested a review from akhmerov on April 13, 2020 20:40
@akhmerov (Contributor)
@basnijholt any idea why tests fail?

@basnijholt (Member, Author)

The failure is intermittent and unrelated to these changes; it also happens on master sometimes.

It seems like the distributed test keeps getting stuck on Windows and macOS.

@akhmerov (Contributor)

But it seems like serialization tests are failing?

[screenshot of the failing serialization tests]

@basnijholt (Member, Author)

Oh, you are right! I assumed they were the same failures I had seen before.

I think that is fixed by bc65213.

@basnijholt requested a review from akhmerov on April 14, 2020 16:00
@jbweston (Contributor) left a comment

Good work, and thanks for taking feedback on board.

The refactors I proposed are nice-to-haves, but I understand if we just want to get this merged.

OTOH we use an unseeded random number generator in such a way that I cannot see why the tests pass.

For me, explaining/fixing this is a prerequisite for merging.
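
For instance, the test could pin the seed up front (a sketch of the general idea, not the actual fix in this PR):

import random

import numpy as np

def test_pickle_round_trip():
    # Hypothetical: seed both RNGs so repeated runs are deterministic.
    random.seed(0)
    np.random.seed(0)
    # ... exercise the learner and compare it against the unpickled copy ...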

(Review comments on adaptive/tests/test_pickling.py and adaptive/learner/learnerND.py; all outdated and resolved.)
@tlaeven commented Apr 23, 2020

Using a cached function doesn't work in this branch; is this intentional?

Minimal example:

import adaptive
from functools import lru_cache
adaptive.notebook_extension()

@lru_cache()
def g(x):
    return x**2

def f(x):
    return x-g(0)

learner = adaptive.Learner1D(f, [-1, 1])

runner = adaptive.Runner(learner)
runner.live_info()

Raises an error:
loky.process_executor.BrokenProcessPool: A task has failed to un-serialize. Please ensure that the arguments of the function are all picklable.

Note

It does work when the cached function is imported from a module:
[screenshot: the same example succeeds when the cached function is imported from a module]
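
That is, something like this (module name hypothetical):

# cached.py (a hypothetical module)
from functools import lru_cache

@lru_cache()
def g(x):
    return x ** 2

# in the notebook
import adaptive
from cached import g

def f(x):
    return x - g(0)

learner = adaptive.Learner1D(f, [-1, 1])
runner = adaptive.Runner(learner)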

@basnijholt (Member, Author) commented Apr 23, 2020

@tlaeven, this is actually unrelated to these changes. The code you posted never worked, and has nothing to do with Adaptive:

from functools import lru_cache
import loky
from concurrent.futures import ProcessPoolExecutor

@lru_cache()
def g(x):
    return x ** 2

def f(x):
    return x - g(0)

# With the standard library's ProcessPoolExecutor
with ProcessPoolExecutor() as ex:
    fut = ex.submit(f, 0)
    try:
        fut.result()
    except Exception as e:
        print(f"ProcessPoolExecutor failed: {e}")

# With loky's reusable executor
ex = loky.get_reusable_executor()
fut = ex.submit(f, 0)
try:
    fut.result()
except Exception as e:
    print(f"loky failed: {e}")

which prints:

ProcessPoolExecutor failed: A process in the process pool was terminated abruptly while the future was running or pending.
loky failed: A task has failed to un-serialize. Please ensure that the arguments of the function are all picklable.

I guess the difference is because of basnijholt/adaptive-scheduler#39.

@basnijholt (Member, Author)

@jbweston, I've fixed all that was brought up.

The LearnerND takes some more work, so I moved those commits to #272.

I think it's ready to merge. Would you take a final look?

@jbweston (Contributor) left a comment
LGTM! Nice work.

@basnijholt merged commit de0cc0c into master on Apr 24, 2020
@basnijholt deleted the pickle branch on April 24, 2020 16:04
@basnijholt mentioned this pull request on May 19, 2020