Add differential privacy as a threat mitigation #91

csharrison · 2021-07-28T22:49:41Z

Differential privacy can protect against attacks where a portion of the input data is known a priori, or where the batch size is small. This PR adds DP as an optional mitigation for PDA deployments and lists the threats it helps mitigate.

Additionally:

- Removes a false statement that revealing user input does not compromise other users' privacy. This is not necessarily the case (e.g., imagine the batch size is 2, or many users reveal their inputs).
- Removes a statement that clients revealing their inputs is outside the threat model (is this still relevant?).
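
To make the batch-size-2 case concrete, here is a minimal sketch. It assumes the aggregate is a plain sum of 0/1 inputs and uses the Laplace mechanism as one possible way to achieve DP; the function names, `epsilon`, and `sensitivity` values are illustrative and do not come from the draft.

```python
import random


def laplace_noise(scale: float, rng: random.Random) -> float:
    # The difference of two i.i.d. exponentials with mean `scale`
    # is distributed as Laplace(0, scale).
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def aggregate_sum(inputs):
    # Exact aggregate, as it would be released without any DP noise.
    return sum(inputs)


def noisy_sum(inputs, epsilon: float, sensitivity: float, rng: random.Random) -> float:
    # Laplace mechanism: noise scale is sensitivity / epsilon (assumed parameters).
    return aggregate_sum(inputs) + laplace_noise(sensitivity / epsilon, rng)


# With a batch of two, a client who knows its own input recovers the other one.
batch = [1, 0]
exact = aggregate_sum(batch)
recovered_other_input = exact - batch[0]
assert recovered_other_input == batch[1]

# With noise added, the same subtraction only yields a noisy estimate.
rng = random.Random(0)
released = noisy_sum(batch, epsilon=1.0, sensitivity=1.0, rng=rng)
noisy_estimate = released - batch[0]
```

The same subtraction attack applies whenever all but one of the inputs in a batch are known, which is why the noise scale depends on the per-client sensitivity rather than on the batch size.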

cjpatton

Excellent! Requested changes are editorial.


cjpatton · 2021-07-29T17:32:27Z

draft-pda-protocol.md

+## Differential privacy {#dp}
+
+Optionally, PDA deployments can choose to ensure their output F achieves
+[differential privacy](https://en.wikipedia.org/wiki/Differential_privacy).


EDITED: Given that the likely trajectory of this document is an IETF WG draft, it would be better to replace this markdown with a reference to a paper about DP. The most useful reference would be something that describes the distinction between client-side and server-side noise addition.

Edited this comment.

Done, I added a general DP reference in place of the Wikipedia link. It discusses central, local, and multi-party DP.


tgeoghegan

I have one question about whether we need more explicit protocol support for DP, but if so, that can land in a subsequent PR.

tgeoghegan · 2021-07-29T17:40:01Z

draft-pda-protocol.md

@@ -1253,6 +1257,18 @@ but server implementations may also opt out of participating in a PDA task if
 the minimum batch size is too small. This document does not specify how to
 choose minimum batch sizes.

+## Differential privacy {#dp}


Besides this informal recommendation, do we need explicit protocol support for differential privacy so that collectors can de-noise outputs? We can leave it up to aggregators to decide how they're going to implement DP, but I wonder if PDAOutputShare should have a field for the epsilon value that was used by the aggregator. Forgive me if I'm talking nonsense about DP; I am speaking in the terms that we used in Prio v2.

Hm. I am nervous about being too prescriptive here. In the simplest protocol design nothing is needed, since the epsilon is hardcoded into the specific protocol instantiation and won't change.

In practice, some specific instantiations may want to reveal even more information about how noise was applied, e.g.:

- The distribution the noise is sampled from (Laplace, Gaussian, etc.)
- Parameters of the noise distribution
- Any kind of threshold used (for example, if you are using approximate DP)

I think we should make this as opaque to the protocol as possible vs. prescribing some single "epsilon" field which might be too constraining. What do you think?
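
One way to picture "opaque to the protocol" is a descriptor that the instantiation serializes to bytes and the protocol carries without interpreting. The sketch below is purely hypothetical: `NoiseDescriptor`, its fields, and the JSON encoding are assumptions made for illustration and are not part of the PDA draft or of PDAOutputShare.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Dict, Optional


@dataclass(frozen=True)
class NoiseDescriptor:
    """Hypothetical description of how an aggregator applied noise."""

    distribution: str  # e.g. "laplace" or "gaussian"
    parameters: Dict[str, float] = field(default_factory=dict)
    threshold: Optional[float] = None  # e.g. for approximate DP

    def encode(self) -> bytes:
        # The protocol would treat these bytes as an opaque blob.
        return json.dumps(asdict(self), sort_keys=True).encode("utf-8")

    @staticmethod
    def decode(blob: bytes) -> "NoiseDescriptor":
        # Only a collector that knows the instantiation interprets the blob.
        return NoiseDescriptor(**json.loads(blob.decode("utf-8")))


descriptor = NoiseDescriptor("gaussian", {"sigma": 4.0}, threshold=10.0)
blob = descriptor.encode()
round_tripped = NoiseDescriptor.decode(blob)
```

Compared with a single "epsilon" field, a blob like this leaves room for different distributions and parameters without changing the protocol messages.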

However, I still think this current PR is land-able given that a basic instantiation can hardcode everything without requiring any communication.
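
As a rough sketch of the "hardcode everything" case, assume the instantiation fixes Laplace noise with a known epsilon and sensitivity. The collector cannot remove the noise from a single output, but it can compute how large the noise is likely to be without any extra communication. The constants and function names below are illustrative assumptions.

```python
import math

# Hypothetical values fixed by the protocol instantiation, known to all parties.
EPSILON = 0.5
SENSITIVITY = 1.0  # assumed maximum change one client can cause in the output


def laplace_scale(epsilon: float = EPSILON, sensitivity: float = SENSITIVITY) -> float:
    # Scale b of the Laplace mechanism.
    return sensitivity / epsilon


def noise_stddev(scale: float) -> float:
    # Laplace(0, b) has variance 2 * b**2.
    return math.sqrt(2.0) * scale


def error_bound(scale: float, confidence: float) -> float:
    # For Laplace(0, b), P(|X| > t) = exp(-t / b); solve for t.
    return -scale * math.log(1.0 - confidence)


scale = laplace_scale()
stddev = noise_stddev(scale)
bound_95 = error_bound(scale, 0.95)
```

Under these assumptions the collector can report the released value together with `bound_95` as an error bar on the added noise.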

The way to go here, I think, is to document the open question by adding an [OPEN ISSUE: blah blah blah].

I put a note about this in #19, which I think is an appropriate issue to track this discussion.

cjpatton · 2021-07-30T18:44:43Z

The way to go here, I think, is to document the open question by adding an [OPEN ISSUE: blah blah blah].

csharrison added 2 commits July 28, 2021 17:27

Add differential privacy as a threat mitigation

2bc28c6

Fix links

5ffc351

cjpatton requested review from cjpatton and tgeoghegan July 29, 2021 17:20

cjpatton requested changes Jul 29, 2021


tgeoghegan approved these changes Jul 29, 2021


csharrison and others added 2 commits July 29, 2021 15:46

Apply suggestions from code review

20fd3cb

Support more than one helper Co-authored-by: Christopher Patton <[email protected]>

Add references

1cb3234

cjpatton approved these changes Jul 30, 2021


Add open issue about protocol support

fc636b1

tgeoghegan mentioned this pull request Jul 30, 2021

Accommodating randomized inputs (a la DP) #19

Closed

cjpatton merged commit 2590598 into ietf-wg-ppm:main Aug 2, 2021
