Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Autodifferentiation support for PytorchBackend #1276

Merged
merged 77 commits into from
Jul 16, 2024
Merged

Conversation

Simone-Bordoni
Copy link
Contributor

@Simone-Bordoni Simone-Bordoni commented Mar 19, 2024

Complete pytorch backend, close issue #1266

Checklist:

  • Reviewers confirm new code works as expected.
  • Tests are passing.
  • Coverage does not decrease.
  • Documentation is updated.

@Simone-Bordoni Simone-Bordoni added the enhancement New feature or request label Mar 19, 2024
@Simone-Bordoni Simone-Bordoni self-assigned this Mar 19, 2024
@Simone-Bordoni Simone-Bordoni linked an issue Mar 19, 2024 that may be closed by this pull request
@renatomello renatomello added this to the Qibo 0.2.7 milestone Mar 19, 2024
@renatomello renatomello changed the title Autodifferentiation support for pytorch backend Autodifferentiation support for pytorch backend Mar 19, 2024
@renatomello renatomello changed the title Autodifferentiation support for pytorch backend Autodifferentiation support for PytorchBackend Mar 19, 2024
@Simone-Bordoni Simone-Bordoni marked this pull request as ready for review March 28, 2024 13:38
@Simone-Bordoni
Copy link
Contributor Author

The only differentiation method that was not working for the pytorch backend was the parameter shift rule.
However now i have unmuted the test and removed the check on the backend and everything is working correctly for the pytorch backend. Probably the problem was linked to some error in the initial implementation of the pytorch backend and was fixed by correcting all the other tests.

Copy link

codecov bot commented Mar 28, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (0be8f40) to head (85ae474).

Additional details and impacted files
@@            Coverage Diff            @@
##            master     #1276   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           76        76           
  Lines        10833     10923   +90     
=========================================
+ Hits         10833     10923   +90     
Flag Coverage Δ
unittests 100.00% <100.00%> (ø)

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@renatomello
Copy link
Contributor

renatomello commented Mar 28, 2024

The only differentiation method that was not working for the pytorch backend was the parameter shift rule. However now i have unmuted the test and removed the check on the backend and everything is working correctly for the pytorch backend. Probably the problem was linked to some error in the initial implementation of the pytorch backend and was fixed by correcting all the other tests.

There's also all the classes inside qibo.models.variational. They need to work properly with the optmizers. RIght now the tests are not passing, they are being skipped.

@Simone-Bordoni
Copy link
Contributor Author

The only differentiation method that was not working for the pytorch backend was the parameter shift rule. However now i have unmuted the test and removed the check on the backend and everything is working correctly for the pytorch backend. Probably the problem was linked to some error in the initial implementation of the pytorch backend and was fixed by correcting all the other tests.

There's also all the classes inside qibo.models.variational. They need to work properly with the optmizers. RIght now the tests are not passing, they are being skipped.

ok, I forgot these tests, I will try to fix them tomorrow

@scarrazza scarrazza modified the milestones: Qibo 0.2.7, Qibo 0.2.8 Apr 5, 2024
@Simone-Bordoni Simone-Bordoni marked this pull request as draft April 16, 2024 09:44
@Simone-Bordoni
Copy link
Contributor Author

I have encountered a problem while coding the SGD training procedure of the VQE for the pytorch backend.
The problem is that when executing a circuit with parametrized gates that require gradients the parameters of the gates are initially casted as numpy and this is not compatible with the gradients (see failed test).
Fixing the issue requires modifying numpy_matrices.py, as this module is of fundamental importance @renatomello @BrunoLiegiBastonLiegi @MatteoRobbiati maybe we can discuss how to modify it in order to make it work correctly with torch.

@BrunoLiegiBastonLiegi
Copy link
Contributor

Do you mean this cast?
https://github.com/qiboteam/qibo/blob/7b4489e6c95e5844796e8c08b6dbaacc9e28984e/src/qibo/backends/npmatrices.py#L76C1-L76C72

Because this is overloaded by the TorchMatrices object I believe.

@Simone-Bordoni
Copy link
Contributor Author

Simone-Bordoni commented Apr 17, 2024

Do you mean this cast? https://github.com/qiboteam/qibo/blob/7b4489e6c95e5844796e8c08b6dbaacc9e28984e/src/qibo/backends/npmatrices.py#L76C1-L76C72

Because this is overloaded by the TorchMatrices object I believe.

That cast is fine. The problem is in the two lines before:

cos = self.np.cos(theta / 2.0) + 0j
isin = -1j * self.np.sin(theta / 2.0)

when theta is a torch tensor with requires_grad = True you can't compute cos/sin using numpy.

@BrunoLiegiBastonLiegi
Copy link
Contributor

Do you mean this cast? https://github.com/qiboteam/qibo/blob/7b4489e6c95e5844796e8c08b6dbaacc9e28984e/src/qibo/backends/npmatrices.py#L76C1-L76C72
Because this is overloaded by the TorchMatrices object I believe.

That cast is fine. The problem is in the two lines before: cos = self.np.cos(theta / 2.0) + 0j isin = -1j * self.np.sin(theta / 2.0) when theta is a torch tensor with requires_grad = True you can't compute cos/sin using numpy.

but then why isn't self.np = torch for TorchMatrices as it is for the TorchBackend?

@Simone-Bordoni
Copy link
Contributor Author

Do you mean this cast? https://github.com/qiboteam/qibo/blob/7b4489e6c95e5844796e8c08b6dbaacc9e28984e/src/qibo/backends/npmatrices.py#L76C1-L76C72
Because this is overloaded by the TorchMatrices object I believe.

That cast is fine. The problem is in the two lines before: cos = self.np.cos(theta / 2.0) + 0j isin = -1j * self.np.sin(theta / 2.0) when theta is a torch tensor with requires_grad = True you can't compute cos/sin using numpy.

but then why isn't self.np = torch for TorchMatrices as it is for the TorchBackend?

I can try to fix things this way. With Renato we just wanted to have the matrices defined with numpy and then casted to the correct backend.

@renatomello
Copy link
Contributor

Do you mean this cast? https://github.com/qiboteam/qibo/blob/7b4489e6c95e5844796e8c08b6dbaacc9e28984e/src/qibo/backends/npmatrices.py#L76C1-L76C72
Because this is overloaded by the TorchMatrices object I believe.

That cast is fine. The problem is in the two lines before: cos = self.np.cos(theta / 2.0) + 0j isin = -1j * self.np.sin(theta / 2.0) when theta is a torch tensor with requires_grad = True you can't compute cos/sin using numpy.

but then why isn't self.np = torch for TorchMatrices as it is for the TorchBackend?

This solution should be fine

@Simone-Bordoni
Copy link
Contributor Author

@renatomello Fortunately I found a way to let gradients pass...
lets't take the Rx gate for example:

    def RX(self, theta):
        theta = self._cast_parameter(theta)
        cos = self.np.cos(theta / 2.0) + 0j
        isin = -1j * self.np.sin(theta / 2.0)
        return self._cast([[cos, isin], [isin, cos]], dtype=self.dtype)

the cast function is practically a torch.as_tensor and here we loose track of the gradients.
to keep track of gradients it is necessary to use torch.stack (see latest commit as reference):

    def RX(self, theta):
        theta = self._cast_parameter(theta)
        cos = self.np.cos(theta / 2.0) + 0j
        isin = -1j * self.np.sin(theta / 2.0)
        return self.np.stack([cos, isin, isin, cos]).reshape(2, 2)

this way we don't even need to use the _cast function.
If you agree on this solution tomorrow I will update the np.matrices file

Copy link
Contributor

@BrunoLiegiBastonLiegi BrunoLiegiBastonLiegi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @Simone-Bordoni I went through all the source files, I have just some minor suggestions. I'll move to the tests next.

src/qibo/backends/npmatrices.py Show resolved Hide resolved
src/qibo/backends/numpy.py Outdated Show resolved Hide resolved
src/qibo/backends/pytorch.py Outdated Show resolved Hide resolved
src/qibo/backends/pytorch.py Show resolved Hide resolved
src/qibo/models/dbi/utils.py Outdated Show resolved Hide resolved
src/qibo/gates/gates.py Outdated Show resolved Hide resolved
src/qibo/models/encodings.py Outdated Show resolved Hide resolved
src/qibo/quantum_info/basis.py Outdated Show resolved Hide resolved
src/qibo/quantum_info/basis.py Outdated Show resolved Hide resolved
src/qibo/quantum_info/utils.py Outdated Show resolved Hide resolved
tests/test_backends.py Outdated Show resolved Hide resolved
tests/test_derivative.py Outdated Show resolved Hide resolved
tests/test_derivative.py Outdated Show resolved Hide resolved
tests/test_models_variational.py Show resolved Hide resolved
tests/test_models_variational.py Outdated Show resolved Hide resolved
tests/test_models_variational.py Show resolved Hide resolved
tests/test_quantum_info_random.py Outdated Show resolved Hide resolved
@Simone-Bordoni
Copy link
Contributor Author

@renatomello @BrunoLiegiBastonLiegi If you agree on non dup;licating the matrices and you have reviewed the code please accept the PR so that tomorrow we can merge

@renatomello renatomello added this pull request to the merge queue Jul 16, 2024
Merged via the queue into master with commit 59af33c Jul 16, 2024
25 checks passed
@renatomello renatomello deleted the pytorch_autodiff branch July 16, 2024 17:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

DBI tests are not correctly using the backend Gradients for Pytorch backend
4 participants