Add support for NBI put-with-signal operation #244

naveen-rn · 2018-09-10T18:49:46Z

This PR extends the #218 proposal adding non-blocking support for put-with-signal operations. The nonblocking variant of the put-with-signal operation is semantically equivalent to:

shmem_put_nbi();
shmem_fence();
shmem_put_nbi();

Following the trend of no-example for any NBI operations, there is no example added for this proposed routine in this PR. And the placement of the routine, will also be modified based on Issue #216.

Expected DOC changes:

Since the change log entry framework and the keyword highlighting changes are added as part of Add support for put with signal operation #218 - there is no need to do it again in this PR. We will clean it up after reading.

naveen-rn · 2018-11-01T16:21:40Z

Changes made after the October 2018 Meeting:
https://github.com/openshmem-org/specification/compare/8e386..1ce6d5

~~The following changes were made similar to #218 (comment):~~

~~removed restrict qualifier~~
~~updated the description section to contain the same explanation from summary~~
~~updated context argument explanation~~
~~added better explanation for sig_addr and dest overlap~~

~~October 2018 is when we had the official reading of this ticket. These changes can be used for updates in the November 2018 meeting and used later for special ballot.~~

This comment is no more valid - we didn't have any special ballot voting or updates during the November specification meeting. Updates available in next comment.

NBI put-with-signal is an extension to its blocking variant.

We have incorporated common review comments from put-with-signal blocking routines: 1. duplicated explanation from summary to description 2. removed restrict qualifier and also overlapping explanation 3. modified ctx arg explanation

Based on recent review comments, it looks like it would be more clear if we state that the signal update is an atomic operation We have added this as part of the Notes to Implementers section.

Previously, we had the information about the signal updates atomicity guarantees in the notes to implementors section for NBI put-with-signal We are not now moving this into main notes section. We have also clarifies the atomicity guarantees by refering to atomicty section.

naveen-rn · 2019-01-11T18:36:16Z

Changes made after the October 2018 Meeting:
https://github.com/openshmem-org/specification/compare/2b39c09..48f2ec1

The following changes were made similar to #218 (comment):

removed restrict qualifier
updated the description section to contain the same explanation from summary
updated context argument explanation
added better explanation for sig_addr and dest overlap
clarified NBI signal update as an atomic operation

October 2018 is when we had the official reading of this ticket. These changes can be used for updates in the January 2019 meeting and used for special ballot.
[EDIT]: Updated after rebasing with master

Previously, we used restrict qualifier and defined in the def.tex for syntax highlighting in the function definitions. As the usage of restrict qualifier is removed, this change is no longer nedeed.

We were incorrectly using variable and macros incorrectly for \dest and \source. Fixing it in put-with-signal-nbi.

content/shmem_put_signal_nbi.tex

jdinan · 2019-01-14T21:53:38Z

content/shmem_put_signal_nbi.tex

+    point-to-point synchronization interfaces. The delivery of \VAR{signal} flag
+    on the remote \ac{PE} must not cause partial updates. This requires the
+    update on \VAR{signal} flag to be an atomic operation, with atomicity
+    guarantees described in Section~\ref{subsec:amo_guarantees}.


This text is too strong. We don't want the signal update to be a SHMEM-level atomic. If a network can implement this with a simple write, it should be allowed. Suggest something like:

"The put-with-signal interfaces must be implemented such that a synchronization operation on the remote \ac{PE} does not observe partial updates to \VAR{signal}. On some platforms, this may require the update of the \VAR{signal} flag to be performed using an atomic operation."

@jdinan @manjugv @shamisp Though the comment is shown outdated - this one is not fixed. Need some feedback.

To me the core of both @jdinan suggested changes (prefer - atomic-like semantics) and @shamisp review comments (prefer - strong atomic guarantees) almost looks same. But, the issue is - we are trying to determine an implementation detail.

If some implementation can implement shmem_atomic_set as simple shmem_p operation. It doesn't matter if this statement is too strong. Internally it is going to still go through the same path.

IMHO, if it is fine, we should completely remove the implementation detail. We should just say:

The put-with-signal interfaces must be implemented such that a synchronization operation on the remote \ac{PE} does not observe partial updates to \VAR{signal}.

I still prefer the minimally specified "is compatible with remote wait/test" text. It's crucial for open standards to keep the implementation space open to support future and alternative architectures (e.g. Gen-Z, CCIX, NVLink, etc.) where the signal can be non-atomic. SHMEM on shared memory (e.g. POSH) is another example, depending on the underlying architecture.

My understanding is that the weaker text is compatible with current architectures, while keeping the implementation space open. I appreciate that the atomic text is clearer to present day implementors and, as a result, is more likely to be implemented correctly. Apart from clarity, is there something else I'm missing? If the preference for the atomic text is being driven by clarity, we can put more work into the wait-compatible version of the text to address this.

On the last RMA call we had in depth discussion about definition signal semantics, notes posted here http://www.openshmem.org/pipermail/openshmem-list/2019-January/001021.html

@shamisp Sorry I wasn't able to join that meeting. I re-read the notes, but don't see the outcome from the discussion. Can someone please clarify for me?

@bcernohous That's my understanding as well. There is an additional concern we are working through that on some platforms the signal update needs to be performed using a PCIe atomic operation (and by extension a NIC atomic) to ensure that shmem_wait at the target doesn't see a partial update to the variable it is waiting on. Since atomics have lower throughput than puts (at least on the networks we are discussing), there is interest in identifying a path through the API where such a network can still use puts to implement the signal.

@jdinan , understood. And there was some discussion of doubling the APIs to support both"no partial updates" and "partial update ok" versions of wait.

IMO, before I've seen the upcoming "shmem signal" (not shmem_put_signal) proposal, I think SHMEM takes the hard path. No partial updates.

An app that wants performance and tolerates partial updates could use shmem_put/do while(flag!=0). It's not clear to me that we need API's for that. My 2 cents.

@bcernohous The question @naveen-rn asked earlier in the thread was whether the signal update performed by put-with-signal should be specified as a SHMEM AMO or a "no partial updates" update. What's your preference?

@jdinan , I'm waiting to see the new shmem_signal proposal. I think we define them similarly and they both satisfy shmem_wait. So it partly depends on what we decide for shmem_wait.

This signal API sounds a bit like a wrapper around shmem_p and shmem_atomic_set (i.e. choose which ever one is fastest and safe to use with wait). We could also add a preprocessor macro SHMEM_P_IS_WAIT_SAFE and let the user code for this if they want to.

manjugv · 2019-01-28T19:33:24Z

January 2019 Meeting: Voting postponed

jdinan · 2019-07-09T19:38:40Z

@naveen-rn Can this PR be closed now that it is merged with #275?

naveen-rn · 2019-07-09T19:50:53Z

Closing this PR as we have opened a new composed PR with both blocking and non-blocking put-with-signal: #275

naveen-rn mentioned this pull request Sep 10, 2018

Adding support for NBI put with signal operation #238

Closed

jdinan mentioned this pull request Sep 17, 2018

RMA Notification mpi-forum/mpi-issues#59

Open

naveen-rn added the HadReading label Sep 24, 2018

naveen-rn changed the title ~~Add support for NBI put-with-signal~~ Add support for NBI put-with-signal operation Jan 8, 2019

jdinan added PendingBallot SpecialBallot labels Jan 11, 2019

jdinan mentioned this pull request Jan 11, 2019

DOC-edit: RM unnecessary white spaces in backmatter and defs #261

Merged

naveen-rn and others added 6 commits January 11, 2019 12:33

Add support for NBI put-with-signal

2b39c09

NBI put-with-signal is an extension to its blocking variant.

Adding overlapping semantics in put-with-signal-nbi

27eb675

Explicitly state the NBI signal update is AMO

3361c3e

Based on recent review comments, it looks like it would be more clear if we state that the signal update is an atomic operation We have added this as part of the Notes to Implementers section.

Fix variable usage in NBI notes section

086a01c

naveen-rn force-pushed the feature/put-signal-nbi branch from a5b02ae to 3b48d2b Compare January 11, 2019 18:33

naveen-rn added 3 commits January 12, 2019 12:45

Add backmatter for NBI put-with-signal

68cd22c

RM restrict qualifier from def.tex

2d4a35f

Previously, we used restrict qualifier and defined in the def.tex for syntax highlighting in the function definitions. As the usage of restrict qualifier is removed, this change is no longer nedeed.

Fix \VAR and macro usage correctly

48f2ec1

We were incorrectly using variable and macros incorrectly for \dest and \source. Fixing it in put-with-signal-nbi.

jdinan reviewed Jan 14, 2019

View reviewed changes

Reframe NBI signal-put compatibility with p2p syncs

d361527

Update NBI put-with-signal atomicity description

6315ecc

naveen-rn mentioned this pull request May 4, 2019

Add support for Blocking and Non-blocking put-with-signal #275

Merged

jdinan added this to the OpenSHMEM 1.5 milestone May 21, 2019

naveen-rn closed this Jul 9, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for NBI put-with-signal operation #244

Add support for NBI put-with-signal operation #244

naveen-rn commented Sep 10, 2018

naveen-rn commented Nov 1, 2018 •

edited

Loading

naveen-rn commented Jan 11, 2019 •

edited

Loading

jdinan Jan 14, 2019

naveen-rn Jan 15, 2019 •

edited

Loading

jdinan Jan 17, 2019

shamisp Jan 18, 2019

jdinan Jan 22, 2019

jdinan Jan 23, 2019

bcernohous Jan 23, 2019

jdinan Jan 23, 2019

bcernohous Jan 23, 2019

jdinan Jan 24, 2019

manjugv commented Jan 28, 2019

jdinan commented Jul 9, 2019

naveen-rn commented Jul 9, 2019

Add support for NBI put-with-signal operation #244

Add support for NBI put-with-signal operation #244

Conversation

naveen-rn commented Sep 10, 2018

naveen-rn commented Nov 1, 2018 • edited Loading

naveen-rn commented Jan 11, 2019 • edited Loading

Choose a reason for hiding this comment

naveen-rn Jan 15, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

manjugv commented Jan 28, 2019

jdinan commented Jul 9, 2019

naveen-rn commented Jul 9, 2019

naveen-rn commented Nov 1, 2018 •

edited

Loading

naveen-rn commented Jan 11, 2019 •

edited

Loading

naveen-rn Jan 15, 2019 •

edited

Loading