An open dataset of data lineage graphs for data governance research
Author affiliations: Central South University, Changsha, Hunan, China; Huawei Cloud Computing Technology Co., Ltd., Hangzhou, Zhejiang, China
Publication: Visual Informatics (English-language journal)
Year/Volume/Issue: 2024, Vol. 8, No. 1
Pages: 1-5
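Subject classification: 12 [Management]; 1201 [Management: Management Science and Engineering (degrees may be conferred in Management or Engineering)]
Funding: the National Natural Science Foundation of China (Nos. 62272480 and 62072470)
Topics: Data asset; Data governance; Data lineage; Graph; Open dataset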
Abstract: Data have become valuable assets for *** governance aims to manage and reuse data assets, facilitating enterprise management and enabling product innovations. A data lineage graph (DLG) is an abstracted collection of data assets and their data lineages in data *** DLGs can provide rich data insights for data ***, the progress of data governance technologies is hindered by the shortage of available open datasets for *** paper introduces an open dataset of DLGs, including the DLG model, the dataset construction process, and applied *** real-world dataset is sourced from Huawei Cloud Computing Technology Company Limited, which contains 18 DLGs with three types of data assets and two types of *** the best of our knowledge, this dataset is the first open dataset of DLGs for data *** dataset can also support the development of other application areas, such as graph analytics and visualization.