Code for implementation of "Data Shapley: Equitable Valuation of Data for Machine Learning".
Please cite the following work if you use this benchmark or the provided tools or implementations:
@inproceedings{ghorbani2019data,
title={Data Shapley: Equitable Valuation of Data for Machine Learning},
author={Ghorbani, Amirata and Zou, James},
booktitle={International Conference on Machine Learning},
pages={2242--2251},
year={2019}
}
- original package: Python, NumPy, Tensorflow 1.12, Scikit-learn, Matplotlib
To divide value fairly between individual train data points/sources given the learning algorithm and a meausre of performance for the trained model (test accuracy, etc)
This project is licensed under the MIT License - see the LICENSE.md file for details