A Review of Data Cleaning Methods for Web Information System
Author affiliation: School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150006, China
Publication: Computers, Materials & Continua (in English)
Year, volume, issue: 2020, Vol. 62, No. 3
Pages: 1053-1075
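Subject classification: 08 [Engineering]; 0812 [Engineering - Computer Science and Technology (degrees awardable in engineering or science)]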
Keywords: data cleaning; web information system; data quality rule; crowdsourcing; privacy preservation
Abstract: Web information system (WIS) is frequently used and indispensable in daily social *** provides information services in many scenarios, such as electronic commerce, communities, and *** cleaning plays an essential role in various WIS scenarios to improve the quality of data *** this paper, we present a review of the state-of-the-art methods for data cleaning in *** to the characteristics of data cleaning, we extract the critical elements of WIS, such as interactive objects, application scenarios, and core technology, to classify the existing ***, after elaborating and analyzing each category, we summarize the descriptions and challenges of data cleaning methods with sub-elements such as data & user interaction, data quality rule, model, crowdsourcing, and privacy ***, we analyze various types of problems and provide suggestions for future research on data cleaning in WIS from the technology and interactive perspectives.