HAL Id: hal-02367948
https://hal.archives-ouvertes.fr/hal-02367948v2
Submitted on 21 Nov 2019
HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Generating Private Data Surrogates for Vision Related Tasks
Ryan Webster, Julien Rabin, Loïc Simon, Frédéric Jurie
To cite this version:
Ryan Webster, Julien Rabin, Loïc Simon, Frédéric Jurie. Generating Private Data Surrogates for
Vision Related Tasks. ICPR 2020, International Association of Pattern Recognition, IAPR, Jan 2021,
Milan, Italy. �hal-02367948v2�
Ryan Webster 1 Julien Rabin 1 Lo¨ıc Simon 1 Fr´ed´eric Jurie 1
Abstract
With the widespread application of deep networks in industry, membership inference attacks, i.e. the ability to discern training data from a model, be- come more and more problematic for data privacy.
Recent work suggests that generative networks may be robust against membership attacks. In this work, we build on this observation, offering a general-purpose solution to the membership pri- vacy problem. As the primary contribution, we demonstrate how to construct surrogate datasets, using images from GAN generators, labelled with a classifier trained on the private dataset. Next, we show this surrogate data can further be used for a variety of downstream tasks (here classifi- cation and regression), while being resistant to membership attacks. We study a variety of differ- ent GANs proposed in the literature, concluding that higher quality GANs result in better surrogate data with respect to the task at hand.
1. Context and Motivation
The fantastic recent advances in deep learning are strongly and inextricably related to the existence of public datasets.
Such datasets not only allow researchers to learn from and experiments with the data but also to measure progress and challenge themselves with other researchers in competitions.
If Pascal VOC (Everingham et al., 2010) was among the first, many followed such as ImageNet (Deng et al., 2009) for object classification or recently CelebA-HQ (Karras et al., 2018a) as a benchmark for generative models. The contemporary machine learning research landscape would undoubtedly be very different without such datasets.
However, several important application areas do not fully benefit from the progress of machine learning because of the lack of massive public datasets. Indeed, building such
1
Normandie Univ, UNICAEN, ENSICAEN, CNRS, GREYC.
Correspondence to: Ryan Webster <Ryan.Webster@ensicaen.fr>.
SPML’19, ICML Workshop on Security and Privacy of Machine Learning, Long Beach, California, 14th June 2019. Copyright 2019 by the author(s).
C
C’
D
TPrivate Dataset
Test set D
VTrain set D
TTest Train
G
Surrogate Public Dataset D’
D’
D
VC(D
V)
C’(D
V) D = { (x
i, `
i) }
i2I<latexit sha1_base64="YuA4WnkLsU0oE1fWkMX7q5cnwAY=">AAADWnicjVJdS9xAFD276Yeu/Vht33wZXAQLZclmF1YKgtg+tD5Z6KpgJCRxVqdOPkgmoiz77q/pa/0rQn9A+y9655qF7cNiZ0hy59xzzsy9mSjXqjSue99oOk+ePnu+tNxaefHy1ev26tphmVVFLEdxprPiOApLqVUqR0YZLY/zQoZJpOVRdPnR5o+uZFGqLP1mbnJ5moTnqRqrODQEBe2NT2JH+BOxdR2o98KXWgdKvBP+NJgo4atUfJkG7U6v6/IQbtcb9Le9h8AbDl0xS3VQj4NstdGGjzNkiFEhgUQKQ7FGiJLmCXpwkRN2iglhBUWK8xJTtEhbEUsSIyT0kt7ntDqp0ZTW1rNkdUy7aHoKUgpskiYjXkGx3U1wvmJniy7ynrCnPdsNfaPaKyHU4ILQx3Qz5v/qbC0GY2xzDYpqyhmx1cW1S8VdsScXc1UZcsgJs/EZ5QuKY1bO+ixYU3Lttrch538z06J2HdfcCn/4lJuMf+ddrMM+90wxS+DDo/WPSWn9Sj77YnZEM7H/eP5OLQ4OvW6v3/W+Djq7e/XtWsI6NrBFN2iIXXzGAUa0wy1+4Cfumr+cprPsrDxQm41a8wb/DOftXxM6rbQ=</latexit>
D
<latexit sha1_base64="t1XPNqA+zkzhfJfeIPVA5nBWVow=">AAADqnicjVLbbtNAEB3XQNNwaVoeeVmRVk2kyLJdTJtKlaoWqcBTqUhbFFeW7WzKEt9krxGp5f+EP4CfQMwOjkQforKW17Nnzjm7M94gi0QhTfOHtqI/ePhotbXWfvzk6bP1zsbmRZGWechHYRql+VXgFzwSCR9JISN+leXcj4OIXwazE5W//MrzQqTJRznP+HXs3yRiKkJfIuR1cnfCp6glpyqeZyKZ1dX56XFd2Y4zYJb9Wk1W3d56s8MOmVux3jdP7BwcnvZuPdEfsMq9o2UujyIinBCxX7M+c2uvEswVCXtXb3mdrmmYztCxTGYajmkNd1UwHO6/chxmGSaNLjTjLN3QOuDCBFIIoYQYOCQgMY7AhwKfMVhgQobYNVSI5RgJynOooY3aElkcGT6iM5xvcDVu0ATXyrMgdYi7RPjmqGSwjZoUeTnGajdG+ZKcFbrMuyJPdbY5foPGK0ZUwmdE79MtmP+rU7VImMI+1SCwpowQVV3YuJTUFXVy9k9VEh0yxFQ8wXyOcUjKRZ8ZaQqqXfXWp/xPYipUrcOGW8IvOuU24V9oF+XwnnomiMXg4N76p6hUfgWdfTk7wCdW/xjv1OLisOXBhW1Yu4b9we4eHTe3qwUv4CX08AbtwRG8hTMY4Q7f4be2qrX0gX6uf9LHf6krWqN5DneGPvkDHXjISA==</latexit> 0= { (x
0i:= G(z
i), `
0i:= C(x
0i)) }
i2I(x
<latexit sha1_base64="v1409fM3vpHXOBN75bcctkPVZWI=">AAADhHicjVJNT9tAEJ1gSiH9IJQjlxUBlUqRtXZqID1UCA6tqh5o1QASQZHtbOg26w/Z64rI8p/qrylHOMG/6OzUkcohomt5Pfvmvbc74w1SJXPN+XVjwVp8svR0eaX57PmLl6uttVcneVJkoeiHiUqys8DPhZKx6GuplThLM+FHgRKnweTI5E9/iiyXSfxNT1NxEfmXsRzL0NcIDVufByMxRi05ldE0lfGkKr9+OKxK1/M6zHF3zeRUza2dq6F83WHl4AGXDYRSmKjYm61hq81t7vU8hzNue9zpdU3Q6+2/9Tzm2JxGG+pxnKw1WjCAESQQQgERCIhBY6zAhxyfc3CAQ4rYBZSIZRhJyguooInaAlkCGT6iE5wvcXVeozGujWdO6hB3UfhmqGSwjZoEeRnGZjdG+YKcDTrPuyRPc7YpfoPaK0JUw3dEH9PNmP+rM7VoGMM+1SCxppQQU11YuxTUFXNy9k9VGh1SxEw8wnyGcUjKWZ8ZaXKq3fTWp/wdMQ1q1mHNLeCeTrlN+A/axTh8op5JYjF492j9Y1Qav5zOPp8d4BOZf4x3anZx2PzgxLWdru1+cdsHh/XtWoYN2IQdvEF7cAAf4Rj6uMMv+A03cGstWR2ra3l/qQuNWrMOD4b1/g/s+L2z</latexit> 0i, `
0i) (x
<latexit sha1_base64="79v/LAoTrEiJBLnDqZ4NXf8uNJI=">AAADg3icjVJNb9NAEJ3UFEr4SuHIZUVaqYjKWrt12yAhVeUAgktBpK3UVJHtbMqS9YfsNSKy/KP4NYgb3OBfMDM4Ej1EZS2vZ9+893ZnvFFudGml/N5ZcW6s3ry1drt75+69+w966w9PyqwqYjWMM5MVZ1FYKqNTNbTaGnWWFypMIqNOo9lLyp9+VkWps/SDnefqIgkvUz3VcWgRGvfejiZqilp2qpN5rtNZU79/ddTUfhBsC8/fo8lruhujKxyx9WWst0UtRsqYsW7E041xry9dGQwCTwrpBtIb7FAwGBzsBoHwXMmjD+04ztY7PRjBBDKIoYIEFKRgMTYQQonPOXggIUfsAmrECow05xU00EVthSyFjBDRGc6XuDpv0RTX5FmyOsZdDL4FKgVsoiZDXoEx7SY4X7Ezocu8a/aks83xG7VeCaIWPiJ6nW7B/F8d1WJhCgdcg8aackaourh1qbgrdHLxT1UWHXLEKJ5gvsA4ZuWiz4I1JddOvQ05/4uZhNI6brkV/OZTbjL+iXchhzfcM80sAc+vrX+KSvIr+ezL2RE+Cf1jvFOLiyOWBye+6+24/ju/f3jU3q41eAxPYAtv0D4cwms4hiHu8BW+wQ/46aw6zxzf2f1LXem0mkdwZTgv/gA5NL17</latexit> i, `
i)
x
i<latexit sha1_base64="RQEfddr+WP/m2qyUYq1BaIgk4ks=">AAADdXicjVLLbtNAFD2JebThlcKyQhqRFLFA7tjF0CAhVWUBYlUQaSs1VWQ7kzLEL/mBiKx8DV/DFlb8Af2CbrlzcSS6iMpYHt8595wzc68nyCJdlFL+arWta9dv3Fxb79y6fefuve7G/cMirfJQDcM0SvPjwC9UpBM1LHUZqeMsV34cROoomL02+aMvKi90mnws55k6jf2zRE916JcEjbuvRhM1JS071fE808lsUX94s7+oXc97Khz3uZmcRac/usQRX8e6P+72pC29gedIIW1POoMdEwwGu888Tzi25NFDMw7SjVYXI0yQIkSFGAoJSooj+CjoOYEDiYywU9SE5RRpziss0CFtRSxFDJ/QGc1ntDpp0ITWxrNgdUi7RPTmpBTYIk1KvJxis5vgfMXOBl3lXbOnOducvkHjFRNa4hOhV+mWzP/VmVpKTLHLNWiqKWPEVBc2LhV3xZxc/FNVSQ4ZYSaeUD6nOGTlss+CNQXXbnrrc/43Mw1q1mHDrXDOp9xi/DPvYhzecc80swReXln/lJTGr+Czr2YH9MTmH9OdWl4csTo4dG1nx3bfu729/eZ2rWETj/CEbtAL7OEtDjCkHb7hO37gZ/vCemj1rcd/qe1Wo3mAS8Pa/gMUwbm4</latexit>