Please use this identifier to cite or link to this item: https://doi.org/10.1109/SPW.2019.00021
Title: Membership Inference Attacks Against Adversarially Robust Deep Learning Models
Authors: Liwei Song, Reza Shokri, Prateek Mittal
Issue Date: 19-May-2019
Publisher: IEEE
Citation: Liwei Song, Reza Shokri, Prateek Mittal (2019-05-19). Membership Inference Attacks Against Adversarially Robust Deep Learning Models. ScholarBank@NUS Repository. https://doi.org/10.1109/SPW.2019.00021
Abstract: In recent years, the research community has increasingly focused on understanding the security and privacy challenges posed by deep learning models. However, the security domain and the privacy domain have typically been considered separately. It is thus unclear whether the defense methods in one domain will have any unexpected impact on the other domain. In this paper, we take a step towards enhancing our understanding of deep learning models when the two domains are combined. We do this by measuring the success of membership inference attacks against two state-of-the-art adversarial defense methods that mitigate evasion attacks: adversarial training and provable defense. On the one hand, membership inference attacks aim to infer an individual's participation in the target model's training dataset and are known to be correlated with the target model's overfitting. On the other hand, adversarial defense methods aim to enhance the robustness of target models by ensuring that model predictions are unchanged for a small area around each sample in the training dataset. Intuitively, adversarial defenses may rely more on the training dataset and be more vulnerable to membership inference attacks. By performing empirical membership inference attacks on both adversarially robust models and corresponding undefended models, we find that the adversarial training method is indeed more susceptible to membership inference attacks, and the privacy leakage is directly correlated with model robustness. We also find that the provable defense approach does not lead to enhanced success of membership inference attacks. However, this is achieved by significantly sacrificing the accuracy of the model on benign data points, indicating that privacy, security, and prediction accuracy are not jointly achieved in these two approaches.
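Illustrative example: the abstract's link between membership inference and overfitting can be made concrete with a small sketch. The code below assumes one simple attack style, thresholding the model's confidence on the true label, and runs it on synthetic confidence values rather than on a real model. The function names, the Beta-distributed confidences, and the threshold search are assumptions made for illustration; they are not taken from this record and should not be read as the paper's exact procedure.

```python
import random

def infer_membership(confidences, threshold):
    """Guess 'member' when confidence on the true label meets the threshold."""
    return [c >= threshold for c in confidences]

def attack_accuracy(member_conf, nonmember_conf, threshold):
    """Balanced accuracy of the threshold attack (0.5 means random guessing)."""
    true_pos = sum(infer_membership(member_conf, threshold))
    true_neg = len(nonmember_conf) - sum(infer_membership(nonmember_conf, threshold))
    return 0.5 * (true_pos / len(member_conf) + true_neg / len(nonmember_conf))

def best_threshold(member_conf, nonmember_conf):
    """Pick the observed confidence value that maximizes attack accuracy."""
    candidates = sorted(set(member_conf) | set(nonmember_conf))
    return max(candidates,
               key=lambda t: attack_accuracy(member_conf, nonmember_conf, t))

# Synthetic stand-in for an overfitted model (assumption): training points
# tend to receive higher confidence than held-out points.
rng = random.Random(0)
member_conf = [rng.betavariate(8, 1) for _ in range(500)]
nonmember_conf = [rng.betavariate(3, 2) for _ in range(500)]

threshold = best_threshold(member_conf, nonmember_conf)
accuracy = attack_accuracy(member_conf, nonmember_conf, threshold)
print(f"threshold={threshold:.3f} attack accuracy={accuracy:.3f}")
```

The wider the gap between the two confidence distributions, the further the attack accuracy rises above 0.5. Under this sketch's assumptions, that gap is how stronger reliance on the training set would show up as greater privacy leakage.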
URI: https://scholarbank.nus.edu.sg/handle/10635/168422
ISBN: 978-1-7281-3508-3
DOI: 10.1109/SPW.2019.00021
Appears in Collections: Staff Publications, Elements

Files in This Item:
File: liwei-dls19.pdf
Size: 532.22 kB
Format: Adobe PDF
Access Settings: OPEN
Version: Published



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.