/neighborhood-mia

Implementation of "Membership Inference Attacks against Language Models via Neighbourhood Comparison" by Justus Mattern, Fatemehsadat Mireshghallah, Zhijing Jin, Bernhard Schölkopf, Mrinmaya Sachan, Taylor Berg-Kirkpatrick

Primary LanguagePython

Membership Inference Attacks against Language Models via Neighbourhood Comparison

Implementation of "Membership Inference Attacks against Language Models via Neighbourhood Comparison" by Justus Mattern, Fatemehsadat Mireshghallah, Zhijing Jin, Bernhard Schölkopf, Mrinmaya Sachan, Taylor Berg-Kirkpatrick

References

[1] Justus Mattern, Fatemehsadat Mireshghallah, Zhijing Jin, Bernhard Schölkopf, Mrinmaya Sachan, Taylor Berg-Kirkpatrick. Membership Inference Attacks against Language Models via Neighbourhood Comparison. arXiv:2305.18462 [cs.CL]