Generating adversarial data from cleverhans attack models

642 Views Asked by Jeredriq Demas At 18 December 2018 at 05:52

I want a code example to how to generate train data from clever hans' adversarial attacks.

adv_x = fgsm.generate_np(X_test, **fgsm_params)

This generates adversarial x data but how can I get y?

adv_pred = model.predict_classes(adv_x)

And this will give the "fooled" results right?

What I want is to correctly show generated x, y, fooled y (by which I mean results of models predictions that may be false because of the attack). I'm using Mnist btw, if it helps.

Original Q&A

There are 1 best solutions below

Nicolas Papernot On 18 December 2018 at 16:23 BEST ANSWER

Based on the code snippets you shared, I would make two suggestions:

It is generally not a good idea to train the model on test data (if you are going to use that test data to evaluate its performance afterwards) so I would replace X_test by X_train in your first line.
To get the label for your adversarial examples, you can use the original labels of the training data or the predictions of the model on the original training data model.predict_classes(X_train) (this assumes that the adversarial example is not perturbed enough to change the label of the input).

Generating adversarial data from cleverhans attack models

There are 1 best solutions below

Related Questions in PYTHON

Related Questions in TENSORFLOW

Related Questions in MACHINE-LEARNING

Related Questions in DEEP-LEARNING

Related Questions in CLEVERHANS

Trending Questions

Popular # Hahtags

Popular Questions