Privacy-preserving multikeyword similarity search over outsourced cloud data

Chia Mu Yu*, Chi Yuan Chen, Han Chieh Chao

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

13 Scopus citations


The amount of data generated by individuals and enterprises is rapidly increasing. With the emerging cloud computing paradigm, the data and corresponding complex management tasks can be outsourced to the cloud for the management flexibility and cost savings. Unfortunately, as the data could be sensitive, the direct data outsourcing would have the problem of privacy leakage. The encryption can be used, before the data outsourcing, with the concern that the operations can still be accomplished by the cloud. We consider the multikeyword similarity search over outsourced cloud data. In particular, with the consideration of the text data only, multiple keywords are specified by the user. The cloud returns the files containing more than a threshold number of input keywords or similar keywords, where the similarity here is defined according to the edit distance metric. We propose three solutions, where blind signature provides the user access privacy, and a novel use of Bloom filter's bit pattern provides the speedup of search task at the cloud side. Our final design to achieve the search is secure against insider threats and efficient in terms of the search time at the cloud side. Performance evaluation and analysis are used to demonstrate the practicality of our proposed solutions.

Original languageEnglish
Pages (from-to)385-394
Number of pages10
JournalIEEE Systems Journal
Issue number2
StatePublished - Jun 2017


  • Cloud computing
  • counterintelligence
  • outsourced data
  • privacy
  • similarity search

Fingerprint Dive into the research topics of 'Privacy-preserving multikeyword similarity search over outsourced cloud data'. Together they form a unique fingerprint.

Cite this