An Effective RAID Data Layout for Object-Based De-duplication Backup System
Author affiliations: School of Computer Science and Technology, Beijing Institute of Technology; School of Information, Beijing Wuzi University; Beijing Engineering Research Center of Massive Language Information Processing and Cloud Computing Application; Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology; Space Star Technology Co., Ltd., China Academy of Space Technology
Publication: Chinese Journal of Electronics (Acta Electronica Sinica, English edition)
Year, volume, issue: 2016, Vol. 25, No. 5
Pages: 832-840
Subject classification: 0839 [Engineering: Cyberspace Security]; 08 [Engineering]; 081201 [Engineering: Computer System Architecture]; 0812 [Engineering: Computer Science and Technology (engineering or science degree may be awarded)]
Funding: Supported by the National High Technology Research and Development Program of China (No. 2013AA01A212); the National Natural Science Foundation of China (No. 61272511, No. 61370063, No. 61300047, No. 91438117, No. 91538202); SRF for ROC, SEM; the Open Fund of the China State Key Laboratory of Virtual Reality Technology and Systems (No. BUAA-VR-16KF-20); and the Beijing Intelligent Logistics System Collaborative Innovation Center.
Keywords: Data layout; Object-based deduplication; Backup; Energy aware; Redundant array of independent disks (RAID)
Abstract: We propose an alternative Redundant array of independent disks (RAID) data layout, Asymmetrical grouping data organization (AGDO), for object-based de-duplication backup systems. Object-based data de-duplication is an effective way to detect duplicate data in compound files. We designed an asymmetrical grouping strategy: the disks in the array are partitioned into groups, and a parallel data access scheme is adopted within each group. Different types of objects are stored in different groups, whose sizes may differ, and group sizes are set and managed dynamically by dynamic disk group adjustment algorithms. The performance of AGDO is evaluated and shown to be sufficient for continuous storage applications. As a result, disk accesses are concentrated on a subset of the disks over long time periods, which reduces power consumption to 25% in a 10-disk configuration. Moreover, object-based de-duplication combined with AGDO has great potential for increasing data restoration speed for compound files: this combination improves average restoration speed by 11%.
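The grouping idea in the abstract can be illustrated with a small toy model. The sketch below is not the paper's AGDO implementation: the class names, the object types ("text", "image"), the round-robin placement, and the `grow` method are all hypothetical, and parity or redundancy handling is left out entirely. It only shows how pinning each object type to its own disk group confines accesses to a subset of the array, and how a group could be resized from a pool of unassigned disks.

```python
from dataclasses import dataclass, field


@dataclass
class DiskGroup:
    """A set of disks accessed in parallel (hypothetical model)."""
    name: str
    disks: list = field(default_factory=list)


class AsymmetricLayout:
    """Toy model: each object type is pinned to one disk group."""

    def __init__(self, total_disks, group_sizes):
        if sum(group_sizes.values()) > total_disks:
            raise ValueError("group sizes exceed the number of disks")
        self.groups = {}
        next_disk = 0
        # Assign consecutive disk indices to each group; sizes may differ.
        for obj_type, size in group_sizes.items():
            members = list(range(next_disk, next_disk + size))
            self.groups[obj_type] = DiskGroup(obj_type, members)
            next_disk += size
        # Disks not yet assigned to any group (assumed available for growth).
        self.spare = list(range(next_disk, total_disks))

    def active_disks(self, obj_type):
        """Disks that must be spinning to serve this object type."""
        return list(self.groups[obj_type].disks)

    def place(self, obj_type, stripe_index):
        """Pick a disk for one stripe unit, round-robin inside the group."""
        disks = self.groups[obj_type].disks
        return disks[stripe_index % len(disks)]

    def grow(self, obj_type, extra):
        """Move `extra` spare disks into a group (stand-in for dynamic adjustment)."""
        if extra > len(self.spare):
            raise ValueError("not enough spare disks")
        for _ in range(extra):
            self.groups[obj_type].disks.append(self.spare.pop(0))


layout = AsymmetricLayout(total_disks=10, group_sizes={"text": 2, "image": 3})
text_disks = layout.active_disks("text")
image_target = layout.place("image", stripe_index=4)
layout.grow("text", 1)
text_disks_after = layout.active_disks("text")
```

In this toy configuration, a workload that writes only "text" objects touches two of the ten disks, which is the kind of access concentration the abstract credits for the power savings. The actual group sizes and adjustment policy used by the authors are not given in the abstract.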