How data-sharing nudges influence people's privacy preferences: A machine learning-based analysis


Lu, Yang ORCID: https://orcid.org/0000-0002-0583-2688, Li, Shujun, Freitas, Alex and Ioannou, Athina (2021) How data-sharing nudges influence people's privacy preferences: A machine learning-based analysis. EAI Endorsed Transactions on Security and Safety.

eai.21-12-2021.172440.pdf - Published Version
Available under License Creative Commons Attribution.

Official URL: http://dx.doi.org/10.4108/eai.21-12-2021.172440

Abstract

INTRODUCTION: Many online services use data-sharing nudges to solicit personal data from their customers for personalized services.

OBJECTIVES: This study examines people’s privacy preferences when sharing different types of personal data under different nudging conditions, how digital nudging can change their willingness to share data, and whether their data-sharing preferences can be predicted from their responses to a questionnaire.

METHODS: This paper reports a machine learning-based analysis of people’s privacy preference patterns under four different data-sharing nudging conditions (without nudging, monetary incentives, non-monetary incentives, and privacy assurance). The analysis is based on data collected from 685 UK residents who participated in a panel survey. Their self-reported levels of willingness to share 23 different types of personal data were analyzed using both unsupervised (clustering) and supervised (classification) machine learning algorithms.
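As a rough illustration of the unsupervised step, the sketch below clusters made-up willingness ratings with a small hand-written k-means. The abstract does not name the clustering algorithm, the rating scale, or the number of clusters, so the 1 to 5 scale, `k=2`, the toy `ratings` and the function name `kmeans` are all assumptions for illustration, not the authors' method.

```python
import math

def kmeans(points, k, iterations=20):
    """Plain k-means; initial centroids are the first k points (deterministic)."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid (Euclidean).
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids

# Hypothetical ratings: one row per participant, one column per data type,
# on an assumed scale from 1 (unwilling) to 5 (willing).
ratings = [
    [1, 2, 1],
    [5, 4, 5],
    [2, 1, 1],
    [4, 5, 5],
    [1, 1, 2],
    [5, 5, 4],
]
labels, centroids = kmeans(ratings, k=2)
print(labels)
```

In this toy data the participants separate into a mostly unwilling group and a mostly willing group. The real survey covers 23 data types and four nudging conditions, so each participant would be described by a much longer vector.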

RESULTS: The results led to a better understanding of people’s privacy preference patterns across different data-sharing nudging conditions. For example, our participants’ preferences were distributed more sparsely than we expected across a space of 48 possible profiles. Unexpectedly, all three data-sharing nudging strategies had an overall negative effect: they led to reduced self-reported willingness for more participants, compared with the case of no nudging at all. Our experiments with supervised machine learning models also showed that people’s privacy (data-sharing) preference profiles can be automatically predicted with good accuracy, even when a small questionnaire with just seven questions is used.
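One plausible reading of the "overall negative effect" is that, for a given nudge, more participants reported lower willingness than reported higher willingness relative to the no-nudging baseline. The sketch below tallies that comparison on made-up numbers. The function name `net_effect`, the toy ratings, and this particular way of aggregating are assumptions, since the abstract does not state how the authors computed the effect.

```python
def net_effect(baseline, nudged):
    """Count participants whose willingness fell, rose, or stayed the same."""
    counts = {"decreased": 0, "increased": 0, "unchanged": 0}
    for before, after in zip(baseline, nudged):
        if after < before:
            counts["decreased"] += 1
        elif after > before:
            counts["increased"] += 1
        else:
            counts["unchanged"] += 1
    return counts

# Hypothetical per-participant willingness for one data type,
# without nudging and under one nudging condition (assumed 1-5 scale).
baseline = [3, 4, 2, 5, 3, 1]
nudged = [2, 4, 1, 4, 4, 1]

result = net_effect(baseline, nudged)
# The nudge counts as "overall negative" here if decreases outnumber increases.
overall_negative = result["decreased"] > result["increased"]
print(result, overall_negative)
```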

CONCLUSION: Our work revealed a more complicated structure of people’s privacy preference profiles, which depends to some extent on the type of data-sharing nudge and the type of personal data shared. Such complicated privacy preference profiles can be effectively analyzed using machine learning methods, including automatic prediction based on a small questionnaire. The negative results on the overall effect of different data-sharing nudges imply that service providers should consider whether and how to use such mechanisms to incentivise their consumers to share personal data. We believe that more consumer-centric and transparent methods and tools should be used to help improve trust between consumers and service providers.

Item Type:	Article
Status:	Published
DOI:	10.4108/eai.21-12-2021.172440
Subjects:	A General Works > AS Academies and learned societies (General); B Philosophy. Psychology. Religion > BJ Ethics; H Social Sciences > HA Statistics; H Social Sciences > HN Social history and conditions. Social problems. Social reform; Q Science > Q Science (General) > Q325 Machine learning; Q Science > QA Mathematics > QA75 Electronic computers. Computer science; Q Science > QA Mathematics > QA76 Computer software; Z Bibliography. Library Science. Information Resources > ZA Information resources > ZA4450 Databases
School/Department:	School of Science, Technology and Health
URI:	https://ray.yorksj.ac.uk/id/eprint/5792

Deposit and Record Details

ID Code:	5792
Depositing User:	Lu, Dr Yang
Deposited On:	05 Jan 2022 12:53
Last Modified:	09 May 2025 20:15