Privacy-preserving ridge regression on distributed data

Yi Ruei Chen*, Amir Rezapour, Wen-Guey Tzeng

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

17 Scopus citations


Ridge regression is a statistical method for modeling a linear relationship between a dependent variable and some explanatory values. It is a building-block that plays a major role in many learning algorithms such as recommendation systems. However, in many applications such as e-health, explanatory values contains private information owned by different patients that are not willing to share them, unless data privacy is guaranteed. In this paper, we propose a protocol for conducting privacy-preserving ridge regression (PPRR) over high-dimensional data. In our protocol, each user submits its data in an encrypted form to an evaluator and the evaluator computes a linear model of all users’ data without learning their contents. The core encryption method is equipped with homomorphic properties to enable the evaluator to perform ridge regression over encrypted data. We implement our protocol and demonstrate that it is suitable for dealing with high-dimensional data distributed among millions of users. We also compare our protocol with the state-of-the-art solutions in terms of both computation and communication costs. The results show that our protocol outperforms most existing approaches based on secure multi-party computation, garbled circuit, fully homomorphic encryption, secret-sharing, and hybrid methods.

Original languageEnglish
Pages (from-to)34-49
Number of pages16
JournalInformation Sciences
StatePublished - 1 Jul 2018


  • Data privacy
  • Privacy-preserving regression
  • Recommendation system
  • Ridge regression

Fingerprint Dive into the research topics of 'Privacy-preserving ridge regression on distributed data'. Together they form a unique fingerprint.

Cite this