Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pml_ucx: add ompi datatype attribute to release ucp_datatype #5878

Merged
merged 2 commits into from
Oct 10, 2018

Conversation

yosefe
Copy link
Contributor

@yosefe yosefe commented Oct 9, 2018

Fixes openucx/ucx#2921
(checked that the leak is gone with IMB Allgatherv 512b, and passed mpi_test_suite)

@yosefe
Copy link
Contributor Author

yosefe commented Oct 9, 2018

adding @ca-taylor

@yosefe
Copy link
Contributor Author

yosefe commented Oct 9, 2018

@vspetrov @hoopoepg @brminich can you pls take a look?

ucp_worker_destroy(ompi_pml_ucx.ucp_worker);
ompi_pml_ucx.ucp_worker = NULL;
err:
return OMPI_ERROR;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

return rc

@AdamSimpson
Copy link

Will this make it into the 4.0 release?

PML_UCX_ASSERT(datatype->id < OMPI_DATATYPE_MAX_PREDEFINED);
ompi_pml_ucx.predefined_types[datatype->id] = ucp_datatype;
} else {
ret = ompi_attr_set_c(TYPE_ATTR, datatype, &datatype->d_keyhash,

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why do u need to set the ucp_datatype on the datatype->d_keyhash ? You have it stored in the pml_data anyways, right? The delete callback will be launched and you can get the ucp_type from pml_data. No, need to grow the hash, if i understand correctly.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

would the delete callback be called for an mpi datatype if i don't set my attribute?
according to the code of omi_attr_delete_impl - no:
https://github.com/open-mpi/ompi/blob/master/ompi/attribute/attribute.c#L1055

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

right, i see

@yosefe yosefe force-pushed the topic/pml-ucx-fix-datatype-leak branch from d3da92c to 40ac9e4 Compare October 10, 2018 11:41
@yosefe
Copy link
Contributor Author

yosefe commented Oct 10, 2018

@AdamSimpson yes the plan is to backport to v4.0.x as well

@akesandgren
Copy link
Contributor

I see no trace of this fix in the 3.1.x or the 3.0.x branches, would probably be good to have it there too.

@jsquyres
Copy link
Member

jsquyres commented Oct 7, 2019

@yosefe @brminich @vspetrov We are aiming for the end of the month for v3.0.x and v3.1.x releases. If you have small bug fixes that need to get to these release branches, you should do so ASAP.

@brminich
Copy link
Member

brminich commented Oct 7, 2019

@akesandgren, is it critical for you? Can you use OMPI 4.0x?

@akesandgren
Copy link
Contributor

We use everything from 1.6 to 3.1.4 at the moment, 4.x is in the pipeline (but at least ScaLAPACK need to get some fixes).
We in this case is the community that uses EasyBuild to install HPC software.
We already have this PR as a patch to our builds but it would be better to not have to do that.

So no, it's not critical, just highly preferable.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Memory leak with UCX and OpenMPI 3.1.x
6 participants