How to control the privacy budget #2

TeDiou · 2023-12-24T05:18:19Z

As we set the private = True, in your source code it only calculates the privacy budget. How can we control the privacy budget? By adding a if statement?

zhao-zilong · 2023-12-26T06:08:10Z

Hi @TeDiou

If you set private = True, then you enable the training with DP. And for calculate privacy budget, the code block is starting from here:

CTAB-GAN-Plus-DP/model/synthesizer/ctabgan_synthesizer.py

Line 581 in 6507b8a

# if self.private:

And from this line of code:

rdp = compute_rdp(self.micro_batch_size / train_data.shape[0], self.sigma, steps, lmbds)

You can see that to calculate RDP, the batch_size, dataset size, sigma and training steps are four features influencing the privacy budget.

then in the following line:

epsilon, _, _ = get_privacy_spent(lmbds, rdp, target_delta=1e-5)

Epsilon is the privacy budget, can you add an if in the beginning of the loop to control the training only if the epsilon is less than a certain value.

Hope that solves your question.

TeDiou · 2023-12-26T09:04:39Z

Thanks for your answer!

TeDiou · 2023-12-27T03:17:09Z

Sorry to bother u, why this dp-synthesizer.sample method is different from the ctabganplus.sample 。The two models differ only in a privacy module. However, in ctabganplusdp, the generation part requires multiple loops for generation.

zhao-zilong · 2023-12-27T13:27:29Z

Hi @TeDiou
Yeah, we need a loop to generate enough synthetic data, the reason is because we implemented a filter to filter out the invalid generation, so it takes more sampling than the required data number. Check this issue answer:
Team-TUD/CTAB-GAN-Plus#7 (comment)

TeDiou · 2023-12-28T02:40:29Z

I got that. Thanks a lot!_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to control the privacy budget #2

How to control the privacy budget #2

TeDiou commented Dec 24, 2023

zhao-zilong commented Dec 26, 2023

TeDiou commented Dec 26, 2023

TeDiou commented Dec 27, 2023

zhao-zilong commented Dec 27, 2023

TeDiou commented Dec 28, 2023

How to control the privacy budget #2

How to control the privacy budget #2

Comments

TeDiou commented Dec 24, 2023

zhao-zilong commented Dec 26, 2023

TeDiou commented Dec 26, 2023

TeDiou commented Dec 27, 2023

zhao-zilong commented Dec 27, 2023

TeDiou commented Dec 28, 2023