Ensure adjoint allocates memory for max concurrent observables only #221

mlxd · 2022-02-02T15:15:06Z

All new features must include a unit test.
If you've fixed a bug or added code that should be tested, add a test to the
tests directory!
All new functions and code must be clearly commented and documented.
If you do make documentation changes, make sure that the docs build and
render correctly by running make docs.
Ensure that the test suite passes, by running make test.
Add a new entry to the .github/CHANGELOG.md file, summarizing the
change, and including a link back to the PR.
Ensure that code is properly formatted by running make format.

When all the above are checked, delete everything above the dashed
line and fill in the pull request template.

Context: The current implementation of the adjoint Jacobian method in Lightning parallelizes over observables, with each observable handled by a separate OpenMP thread. This works fine in practice unless a large number of observables are required, wherein a new statevector memory block is allocated upfront for the total number of observables. This causes OOM errors for large numbers of observables, even for modest qubit counts. The solution is to restrict the requested number of observables into batches of OMP_NUM_THREADS concurrent executions, where the number of statevectors allocated is limited by the number of executing threads. This imposes a small overhead in requiring additional computation across batches, but ensures the user can reach much higher qubit counts for large numbers of concurrent executions.

Description of the Change: The adjoint Jacobian calculation is now batched at the maximum number of OpenMP threads, and allocates enough memory for a given batch only.

Benefits: Reduces overall memory footprint within a given series of concurrent calculations, allowing larger numbers of qubits in a given calculation, prevents OOM errors encountered for large workflows.

Possible Drawbacks: Imposes additional overheads due to repetition between batches.

Related GitHub Issues:

github-actions · 2022-02-02T15:15:40Z

Hello. You may have forgotten to update the changelog!
Please edit .github/CHANGELOG.md with:

A one-to-two sentence description of the change. You may include a small working example for new features.
A link back to this PR.
Your name (or GitHub username) in the contributors section.

codecov · 2022-02-02T15:16:57Z

Codecov Report

Merging #221 (a80fa50) into master (602352b) will increase coverage by 0.01%.
The diff coverage is 100.00%.

❗ Current head a80fa50 differs from pull request most recent head 81064a8. Consider uploading reports for the commit 81064a8 to get more accurate results

@@            Coverage Diff             @@
##           master     #221      +/-   ##
==========================================
+ Coverage   99.70%   99.72%   +0.01%     
==========================================
  Files           4        4              
  Lines         344      358      +14     
==========================================
+ Hits          343      357      +14     
  Misses          1        1

Impacted Files	Coverage Δ
pennylane_lightning/_version.py	`100.00% <100.00%> (ø)`
pennylane_lightning/lightning_qubit.py	`99.63% <100.00%> (+0.02%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 602352b...81064a8. Read the comment docs.

github-actions · 2022-02-02T15:20:16Z

Test Report (C++) on Ubuntu

      1 files ±0       1 suites ±0 0s ⏱️ ±0s
  555 tests ±0   555 ✔️ ±0 0 💤 ±0 0 ❌ ±0
2 289 runs ±0 2 289 ✔️ ±0 0 💤 ±0 0 ❌ ±0

Results for commit 81064a8. ± Comparison against base commit 602352b.

♻️ This comment has been updated with latest results.

This reverts commit 2f48486. Favour chunking at Python layer due to configurability.

AmintorDusko · 2022-02-03T18:46:05Z

pennylane_lightning/lightning_qubit.py

@@ -65,6 +67,12 @@
 )


+def _chunk_iterable(it, num_chunks):
+    "Lazy-evaluted chunking of given iterable from https://stackoverflow.com/a/22045226"


Suggested change

"Lazy-evaluted chunking of given iterable from https://stackoverflow.com/a/22045226"

"Lazy-evaluated chunking of given iterable from https://stackoverflow.com/a/22045226"

tests/test_comparison.py

AmintorDusko

Nice work @mlxd! I'm happy to approve!

maliasadi

Awesome work! 🚀 I've had only a few comments...

doc/devices.rst

pennylane_lightning/lightning_qubit.py

Co-authored-by: Ali Asadi <[email protected]>

trevor-vincent

Nice work! Don't see any problem beyond what Ali pointed out.

mlxd added 2 commits February 1, 2022 14:56

Add data chunking utilities

29f3fb1

Add support for batched adjoint with max OMP threads

2f48486

mlxd and others added 12 commits February 2, 2022 13:00

Add chunking at Python layer

74fd6b2

Revert "Add support for batched adjoint with max OMP threads"

0fcdff1

This reverts commit 2f48486. Favour chunking at Python layer due to configurability.

Add tests for chunking obs

da664f7

Fix formatting and skip batching tests if unsupported

636147b

Fix clang-tidy errors

1818530

Update testing coverage

6773e27

Black

1857c45

Update docs

8818977

Skip tests without binary

18b6f6b

Mark skip on tests

17dc33c

Update docs and changelog

ac46dc5

Merge branch 'master' into openmp_memory_conc

e4fecc5

mlxd requested review from maliasadi, AmintorDusko and trevor-vincent February 3, 2022 14:43

mlxd and others added 2 commits February 3, 2022 10:00

Update changelog and version

4ee032d

Update CHANGELOG

3b128f5

AmintorDusko reviewed Feb 3, 2022

View reviewed changes

mlxd changed the base branch from master to v0.21.0-rc0 February 3, 2022 21:10

mlxd and others added 2 commits February 3, 2022 16:13

Fix testing

b17737a

Merge branch 'v0.21.0-rc0' into openmp_memory_conc

fdd2cd1

AmintorDusko approved these changes Feb 3, 2022

View reviewed changes

maliasadi requested changes Feb 3, 2022

View reviewed changes

mlxd and others added 2 commits February 3, 2022 17:51

Update doc/devices.rst

ccc0da0

Co-authored-by: Ali Asadi <[email protected]>

Update doc/devices.rst

64dfd93

Co-authored-by: Ali Asadi <[email protected]>

trevor-vincent approved these changes Feb 4, 2022

View reviewed changes

mlxd added 2 commits February 4, 2022 09:21

Add conditional check for batching of observables with OMP thread number

e451a4f

Import only essential modules

f827897

mlxd requested a review from maliasadi February 4, 2022 14:22

mlxd changed the base branch from v0.21.0-rc0 to master February 4, 2022 14:22

maliasadi approved these changes Feb 4, 2022

View reviewed changes

mlxd and others added 11 commits February 4, 2022 10:07

Add coverage for threaded tests

613fc8e

Merge branch 'master' into openmp_memory_conc

02d5d6d

Merge branch 'master' into openmp_memory_conc

c04966e

Auto update version

cb45441

Merge branch 'master' into openmp_memory_conc

e0470ac

Auto update version

6c29c9a

Black format

c3395c5

Merge branch 'master' into openmp_memory_conc

afae72f

Auto update version

73e12bb

Fix matrix accessor

3336653

Ensure testing always runs with multiple threads

81064a8

mlxd merged commit dbe6e19 into master Feb 25, 2022

mlxd deleted the openmp_memory_conc branch February 25, 2022 21:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ensure adjoint allocates memory for max concurrent observables only #221

Ensure adjoint allocates memory for max concurrent observables only #221

mlxd commented Feb 2, 2022 •

edited

Loading

github-actions bot commented Feb 2, 2022

codecov bot commented Feb 2, 2022 •

edited

Loading

github-actions bot commented Feb 2, 2022 •

edited

Loading

AmintorDusko Feb 3, 2022

AmintorDusko left a comment

maliasadi left a comment

trevor-vincent left a comment

	"Lazy-evaluted chunking of given iterable from https://stackoverflow.com/a/22045226"
	"Lazy-evaluated chunking of given iterable from https://stackoverflow.com/a/22045226"

Ensure adjoint allocates memory for max concurrent observables only #221

Ensure adjoint allocates memory for max concurrent observables only #221

Conversation

mlxd commented Feb 2, 2022 • edited Loading

github-actions bot commented Feb 2, 2022

codecov bot commented Feb 2, 2022 • edited Loading

Codecov Report

github-actions bot commented Feb 2, 2022 • edited Loading

Test Report (C++) on Ubuntu

AmintorDusko Feb 3, 2022

Choose a reason for hiding this comment

AmintorDusko left a comment

Choose a reason for hiding this comment

maliasadi left a comment

Choose a reason for hiding this comment

trevor-vincent left a comment

Choose a reason for hiding this comment

mlxd commented Feb 2, 2022 •

edited

Loading

codecov bot commented Feb 2, 2022 •

edited

Loading

github-actions bot commented Feb 2, 2022 •

edited

Loading