Jinsung Yoon, Sercan O. Arik, Tomas Pfister
Google Cloud AI
Data Valuation using Reinforcement Learning
1
2020 International Conference on Machine Learning (ICML 2020)
Data Valuation using Reinforcement Learning Jinsung Yoon, Sercan O. - - PowerPoint PPT Presentation
Data Valuation using Reinforcement Learning Jinsung Yoon, Sercan O. Arik, Tomas Pfister Google Cloud AI 2020 International Conference on Machine Learning (ICML 2020) 1 Problem Defjnition What is data valuation? How much does each
1
2020 International Conference on Machine Learning (ICML 2020)
2
Amirata Ghorbani, James Y. Zou, Data Shapley: Equitable Valuation of Data for Machine Learning, ICML, 2019
3
Ruoxi Jia et al., Towards Efficient Data Valuation Based on the Shapley Value, AISTATS, 2019
4
High-value samples Low-value samples
5
High valued samples Cheaply-acquired samples
Amirata Ghorbani, James Y. Zou, Data Shapley: Equitable Valuation of Data for Machine Learning, ICML, 2019
6
Type B
Training Set
Type C Type A
Target Set
Type D Type D
High valued samples
Amirata Ghorbani, James Y. Zou, Data Shapley: Equitable Valuation of Data for Machine Learning, ICML, 2019
7
Amirata Ghorbani, James Y. Zou, Data Shapley: Equitable Valuation of Data for Machine Learning, ICML, 2019
8
9
10
○ Training set: ○ Validation set: ○ Predictor model: ○ Data valuation model: To minimize the validation loss Weighted optimization for predictor
11
12
13
come from the same distribution)
14
15
(WideResNet-28-10 and ResNet-32) and large datasets (CIFAR)
Mengye Ren et al., Learning to Reweight Examples for Robust Deep Learning, ICML, 2018 16
Type B
Training Set
Type C Type A
Testing Set
Type D Type C
Training Set
Type B Type A Type D
Training Set
Type D
Train-on-All Train-on-Rest Train-on-Specific
17
18
○ DVRL jointly optimizes the data valuator and corresponding predictor model
Amirata Ghorbani, James Y. Zou, Data Shapley: Equitable Valuation of Data for Machine Learning, ICML, 2019
19
20
DVRL - Github: https://github.com/google-research/google-research/tree/master/dvrl DVRL- AI-Hub: https://aihub.cloud.google.com/u/0/p/products%2Fcb6b588c-1582-4868-a944-dc70ebe61a36
21