IDEA:A Utility-Enhanced Approach to Incomplete Data Stream Anonymization
想法: 不完全的数据流 anonymization 的一条提高用途的途径作者机构:the College of Computer ScienceSichuan UniversityChengdu 610065China the School of Cyber Science and EngineeringSichuan UniversityChengdu 610065China the Cyber Science Research InstituteSichuan UniversityChengdu 610065China
出 版 物:《Tsinghua Science and Technology》 (清华大学学报(自然科学版(英文版))
年 卷 期:2022年第27卷第1期
页 面:127-140页
核心收录:
学科分类:0839[工学-网络空间安全] 08[工学] 081201[工学-计算机系统结构] 0812[工学-计算机科学与技术(可授工学、理学学位)]
基 金:supported by the National Natural Science Foundation of China (Nos. U19A2081 and 61802270) the Fundamental Research Funds for the Central Universities (No. 2020SCUNG129)
主 题:anonymization generalization incomplete data streams privacy preservation utility
摘 要:The prevalence of missing values in the data streams collected in real environments makes them impossible to ignore in the privacy preservation of data ***,the development of most privacy preservation methods does not consider missing values.A few researches allow them to participate in data anonymization but introduce extra considerable information *** balance the utility and privacy preservation of incomplete data streams,we present a utility-enhanced approach for Incomplete Data strEam Anonymization(IDEA).In this approach,a slide-window-based processing framework is introduced to anonymize data streams continuously,in which each tuple can be output with clustering or anonymized *** consider the dimensions of attribute and tuple as the similarity measurement,which enables the clustering between incomplete records and complete records and generates the cluster with minimal information *** avoid the missing value pollution,we propose a generalization method that is based on maybe match for generalizing incomplete *** experiments conducted on real datasets show that the proposed approach can efficiently anonymize incomplete data streams while effectively preserving utility.