Support for non-probability outputs

Question

priamai opened this issue 3 years ago · 5 comments

Hi there,
Very nice project. Is there a plan to also implement attacks that work on label-only output (no probabilities) or in a limited API query setting?
There was an article here a while ago which would be nice to have.
Is this something that should be implemented in the ART component?

Answer 1 · 2021-07-19T21:31:17.000Z

Okay, maybe I am not interpreting the documentation correctly. For example, the HopSkipJump attack works on predicted labels, not probabilities, but in the creditfraud example:
https://github.com/Azure/counterfit/blob/main/demo/WEBINAR-DEMO-2.md
the target is designed with output probabilities.
It would be nice to get some clarity there.

Answer 2 · 2021-08-12T01:13:04.000Z

You effectively translate probabilities to labels, depending on what the model gives you back, in outputs_to_labels.

There is a good example in the wiki. TextAttack requires a numerical value in model_output_classes, and ART will work with either a text label or a numerical label.
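
The translation from probabilities to labels described above can be sketched roughly as follows. This is only an illustration of the idea, not Counterfit's actual outputs_to_labels implementation: the function name `probabilities_to_labels` and the class names "benign" and "fraud" are assumptions made up for the example.

```python
import numpy as np

def probabilities_to_labels(probabilities, class_names):
    """Map each row of class probabilities to the name of its most likely class.

    Hypothetical helper for illustration; not part of Counterfit or ART.
    """
    # Accept a single row or a batch of rows.
    probabilities = np.atleast_2d(np.asarray(probabilities, dtype=float))
    # The index of the largest probability in each row selects the label.
    indices = probabilities.argmax(axis=1)
    return [class_names[i] for i in indices]

# Two samples from an assumed binary model: columns are [benign, fraud].
labels = probabilities_to_labels([[0.9, 0.1], [0.2, 0.8]], ["benign", "fraud"])
print(labels)
```

The class names can be text or numbers; only their order has to match the order of the model's output columns.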

Answer 3 · 2021-08-14T17:12:51.000Z

Hi there,
That I understand, but is there an example where the target outputs a label (not a probability)? The creditcard example provides output probabilities.
HopSkipJump should work directly with binary labels, not probabilities.

Answer 4 · 2021-10-28T05:36:43.000Z

Set your outputs to [0, 1] where 1 is the positive class.
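
One possible reading of this suggestion, as a minimal sketch: if the model only returns a hard label, turn it into a score vector such as [0, 1] so that the positive class sits at index 1. The helper name `labels_to_scores` is hypothetical and is not part of Counterfit or ART; how the result is returned from a target depends on the framework version in use.

```python
import numpy as np

def labels_to_scores(labels, num_classes=2):
    """Convert hard integer labels into one-hot score vectors.

    Hypothetical helper for illustration; assumes labels are integers
    in the range [0, num_classes) and that 1 is the positive class.
    """
    labels = np.asarray(labels, dtype=int).reshape(-1)
    scores = np.zeros((labels.size, num_classes), dtype=float)
    # Put a 1.0 in the column of the predicted class for each sample.
    scores[np.arange(labels.size), labels] = 1.0
    return scores.tolist()

# A label-only model that predicted negative, positive, positive.
scores = labels_to_scores([0, 1, 1])
print(scores)
```

With this shape, a decision-based attack that only looks at the winning class sees the same information as the raw label.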

Answer 5 · 2022-02-18T17:51:50.000Z

Will try that, thanks.