
DEBUG #299

Closed · wants to merge 42 commits
Conversation

h-vetinari (Member) commented on Dec 4, 2024

Demonstrate that the fix in conda-forge/libcufile-feedstock#24 was not sufficient, and that conda-forge/libcufile-feedstock@733742d needs to be reverted.

Debugging run for #298 (which contains the commits for the original goal of this PR) + #304

mgorny and others added 4 commits December 4, 2024 19:13
Upstream keeps all magma-related routines in a separate
libtorch_cuda_linalg library that is loaded dynamically whenever linalg
functions are used.  Since the library is relatively small, splitting it
out makes it possible to provide "magma" and "nomagma" variants that
users can switch between.

Fixes conda-forge#275

Co-authored-by: Isuru Fernando <[email protected]>
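
For context, a minimal sketch of the dynamic-loading behavior this split relies on (a hedged illustration, assuming a CUDA-enabled build with a visible GPU; the snippet is not from this PR):

```bash
# Hedged illustration: libtorch_cuda_linalg is only loaded on first use of a
# linalg routine, which is what makes swapping magma/nomagma variants viable.
python - <<'EOF'
import torch

a = torch.randn(4, 4, device="cuda")
torch.linalg.inv(a)  # the first linalg call triggers loading libtorch_cuda_linalg
print("linalg backend loaded and working")
EOF
```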
Try to speed up magma/nomagma builds a bit.  Rather than rebuilding
the package three times (possibly switching magma → nomagma → magma again),
build it twice at the very beginning and store the built files for later
reuse in the subpackage builds.

While at it, replace the `pip wheel` calls with `setup.py build` to
avoid unnecessarily zipping up and then unpacking the whole thing.
In the end, we only grab a handful of files for the `libtorch*`
packages, and they are in a predictable location in the build directory.
`pip install` is still used for the final `pytorch` builds.
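
For illustration, a rough sketch of the two-pass scheme described above (hedged; the file layout is hypothetical and the actual build scripts in this PR may differ):

```bash
# Hedged sketch of building both variants once, up front, and stashing the
# results so the libtorch* subpackage builds can reuse them instead of
# rebuilding.  ${SRC_DIR} is conda-build's source directory; USE_MAGMA is
# upstream PyTorch's build switch; other paths are illustrative.
for use_magma in 1 0; do
  # `setup.py build` skips the wheel's zip/unzip round-trip that `pip wheel`
  # would do; only a handful of files from the build tree are needed anyway.
  USE_MAGMA=${use_magma} python setup.py build
  cp -a build "${SRC_DIR}/build-magma-${use_magma}"  # stash for subpackage reuse
done

# A later libtorch* subpackage build restores the matching tree instead of
# rebuilding; the final `pytorch` outputs still run `pip install` as before.
cp -a "${SRC_DIR}/build-magma-1" build
```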
conda-forge-admin (Contributor) commented on Dec 4, 2024

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe/meta.yaml) and found it to be in excellent condition.

I do have some suggestions for making it better though...

For recipe/meta.yaml:

  • ℹ️ The recipe is not parsable by parser conda-souschef (grayskull). Your recipe may not receive automatic updates and/or may not be compatible with conda-forge's infrastructure. Please check the logs for more information and ensure your recipe can be parsed.
  • ℹ️ The recipe is not parsable by parser conda-recipe-manager. Your recipe may not receive automatic updates and/or may not be compatible with conda-forge's infrastructure. Please check the logs for more information and ensure your recipe can be parsed.

This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/12387955132. Examine the logs at this URL for more detail.

isuruf (Member) commented on Dec 6, 2024

You need conda-forge/conda-forge-ci-setup-feedstock#368

mgorny added 17 commits December 7, 2024 20:49
Put all the rules in a single file.  build_common.sh has
pytorch-conditional code at the very end anyway, and keeping
the code split like this only makes mistakes harder to notice.
While upstream technically uses 2024.2.0, that version causes some of
the calls to fail with an error:

    RuntimeError: MKL FFT error: Intel oneMKL DFTI ERROR: Inconsistent configuration parameters

Force mkl <2024, which seems to work better.

Fixes conda-forge#301
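
For reference, a minimal way such a failure could be triggered (a hypothetical repro, not taken from the PR; any FFT call routed through oneMKL's DFTI interface could hit it):

```bash
# Hedged repro sketch: with mkl 2024.x installed, an FFT call raised
#   RuntimeError: MKL FFT error: Intel oneMKL DFTI ERROR: Inconsistent configuration parameters
# while the same call succeeds with mkl <2024.
python -c "import torch; print(torch.fft.rfft(torch.randn(16)))"
```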
Enable actually running a fixed random subset (1/5) of the core tests
to check for packaging-related regressions.  We are not running
the complete test suite because it takes too long.
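One way such a fixed subset could be selected (an illustrative sketch only; the selection mechanism actually used in this PR may differ):

```bash
# Hedged sketch: run a deterministic fifth of the core test files.
# A fixed stride (rather than a per-run random choice) keeps the subset
# stable across builds, so packaging regressions stay reproducible.
cd test
selected=$(ls test_*.py | awk 'NR % 5 == 0')  # every fifth file
python -m pytest -v ${selected}
```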
Per `RuntimeError: Ninja is required to load C++ extensions`.
While there still doesn't seem to be clear agreement on which builds
should be preferred, let's prefer "magma" to keep the current behavior
unchanged for end users.
Replace the build number hacks with `track_features` to deprioritize
generic BLAS relative to mkl, and CPU relative to CUDA.  This is mostly
intended to simplify the recipe before trying to port to rattler-build.
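
As background on the mechanism (a hedged note, not a claim about the exact recipe changes): each `track_features` entry penalizes a package during dependency resolution, so the un-penalized variants win by default:

```bash
# Hedged sanity check (not from the PR): since only the generic-BLAS and CPU
# variants carry track_features, a plain solve should now pick the
# un-penalized mkl + CUDA variant without any build-number tricks.
conda create -n solver-check --dry-run pytorch
```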
Remove a leftover `skip` that prevented the CUDA + generic BLAS build
from providing all packages, notably `pytorch`.  While at it, remove
a redundant `[win]` skip.
Tobias-Fischer added commits to baszalmstra/pytorch-cpu-feedstock that referenced this pull request on Dec 17, 2024
h-vetinari changed the title from "WIP: remove workarounds for wrong libcufile metadata" to "DEBUG" on Dec 18, 2024
Tobias-Fischer (Contributor) commented:

Can we close here @h-vetinari?

h-vetinari closed this on Dec 26, 2024
h-vetinari deleted the cufile2 branch on December 26, 2024 at 09:24