HEaaN-STAT: A Privacy-Preserving Statistical Analysis Toolkit for Large-Scale Numerical, Ordinal, and Categorical Data

  • Younho Lee
  • , Jinyeong Seo
  • , Yujin Nam
  • , Jiseok Chae
  • , Jung Hee Cheon

Research output: Contribution to journalArticlepeer-review

11 Scopus citations

Abstract

Statistical analysis of largescale data is useful as it enables the extraction of a large amount of information, despite its simplicity. Therefore, fusing and analyzing data from different security domains is an attractive and promising approach, unless it jeopardizes the privacy of the data in any security domain. In this study, we proposed the HEaaN-STAT toolkit that can efficiently fuse data from different domains to enable largescale statistical analysis while protecting data privacy. Moreover, we proposed an efficient inverse operation and a table lookup function for Cheon-Kim-Kim-Song (CKKS) encrypted data, as well as a data encoding method for counting encrypted data. Based on this, we proposed a method for generating a contingency table with a large number of cases and k-percentile for largescale data that is hundreds to thousands of times faster than the method proposed by Lu et al. in NDSS'17. The validity of the proposed toolkit was verified through practical use for business applications using real-world data.

Original languageEnglish
Pages (from-to)1224-1241
Number of pages18
JournalIEEE Transactions on Dependable and Secure Computing
Volume21
Issue number3
DOIs
StatePublished - 1 May 2024

Keywords

  • applied cryptography
  • Homomorphic encryption
  • information security
  • privacy preserving statistical data analysis

Fingerprint

Dive into the research topics of 'HEaaN-STAT: A Privacy-Preserving Statistical Analysis Toolkit for Large-Scale Numerical, Ordinal, and Categorical Data'. Together they form a unique fingerprint.

Cite this