Data pricing for vertical federated learning: an approach based on data contribution
1 June 2023
Zhixian Zhang, Xinchao Li, Shiyou Yang
Proceedings Volume 12718, International Conference on Cyber Security, Artificial Intelligence, and Digital Economy (CSAIDE 2023); 127181Y (2023) https://doi.org/10.1117/12.2681630
Event: International Conference on Cyber Security, Artificial Intelligence, and Digital Economy (CSAIDE 2023), 2023, Nanjing, China
Abstract
Federated learning (FedL) has emerged as a privacy-aware approach that lets multiple data providers collaborate on training models without accessing one another's original data. Vertical federated learning (VFedL), an important category of FedL, is primarily used to train a machine learning model on non-uniform data from different providers. Although VFedL facilitates collaborative model training while safeguarding data privacy, it remains difficult to incentivize more valuable data providers to participate, because practical deployments lack both a principled data pricing scheme and a precise measurement of each participant's data contribution. In this paper, we construct a data pricing method based on each participant's data contribution score to the federated model, so that all data providers can be compensated fairly. First, we construct a method for accurately measuring the data contribution score of each federated participant to the global model, based on Shapley values with Monte Carlo optimization. Then, taking the data contribution score as the input variable, we formulate a Stackelberg data pricing game for VFedL, with the hosts as the leaders and the guest as the follower. We solve the model and analyze the guest's optimal data usage strategy, given the data contribution scores, and the hosts' optimal data pricing strategy. Numerical experiments with a Federated Logistic Regression model show that our method precisely assesses the data contribution scores of participants. These findings can also offer managerial guidance for FedL service providers.
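The abstract does not give the details of the contribution measurement, so the following is only a minimal sketch of the general technique it names: estimating Shapley values by Monte Carlo sampling of participant orderings. The function names, the participant names, and the toy utility are all hypothetical. In a real VFedL setting the utility would be something like the validation performance of a federated model trained with only the given participants' features, which the abstract does not specify.

```python
import random
from typing import Callable, Dict, FrozenSet, Sequence


def monte_carlo_shapley(
    players: Sequence[str],
    utility: Callable[[FrozenSet[str]], float],
    num_permutations: int = 2000,
    seed: int = 0,
) -> Dict[str, float]:
    """Estimate Shapley values by averaging marginal contributions
    over randomly sampled orderings of the players."""
    rng = random.Random(seed)
    totals = {p: 0.0 for p in players}
    order = list(players)
    for _ in range(num_permutations):
        rng.shuffle(order)
        coalition: FrozenSet[str] = frozenset()
        previous = utility(coalition)
        for p in order:
            # Marginal gain from adding player p to the current coalition.
            coalition = coalition | {p}
            current = utility(coalition)
            totals[p] += current - previous
            previous = current
    return {p: total / num_permutations for p, total in totals.items()}


# Hypothetical stand-in for model quality: each participant's data closes a
# fixed fraction of the remaining error, so gains show diminishing returns.
TOY_QUALITY = {"guest": 0.5, "host_a": 0.25, "host_b": 0.125}


def toy_utility(coalition: FrozenSet[str]) -> float:
    remaining_error = 1.0
    for p in coalition:
        remaining_error *= 1.0 - TOY_QUALITY[p]
    return 1.0 - remaining_error


def contribution_scores() -> Dict[str, float]:
    """Convenience wrapper: estimated scores for the toy participants."""
    return monte_carlo_shapley(list(TOY_QUALITY), toy_utility)
```

Calling `contribution_scores()` returns one estimated score per participant. Because the marginal gains telescope within each sampled ordering, the estimates always sum to the utility of the full coalition minus that of the empty one, which is the property that makes Shapley-style scores a natural input to a pricing model. Sampling orderings avoids enumerating all coalitions, whose number grows exponentially with the number of participants.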
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Zhixian Zhang, Xinchao Li, and Shiyou Yang "Data pricing for vertical federated learning: an approach based on data contribution", Proc. SPIE 12718, International Conference on Cyber Security, Artificial Intelligence, and Digital Economy (CSAIDE 2023), 127181Y (1 June 2023); https://doi.org/10.1117/12.2681630
KEYWORDS
Data modeling
Education and training
Machine learning
Process modeling
Computer programming
Data privacy
Design and modelling