The Viability Boundary of Differential Privacy
When does a differentially-private synthetic dataset stop being useful? Across six tabular datasets we map a sharp viability boundary for DP-SGD with MLP variational autoencoders, set by the ratio of training samples to encoded dimensions. Marginal-based DP methods can be viable two orders of magnitude lower.