A globally shared resource paradigm for encoded storage systems in the public cloud
作者机构:Department of Computer Science and TechnologyTsinghua UniversityBeijing 100084China Beijing National Research Center for Information Science and TechnologyTsinghua UniversityBeijing 100084China
出 版 物:《Fundamental Research》 (自然科学基础研究(英文版))
年 卷 期:2024年第4卷第3期
页 面:642-650页
核心收录:
学科分类:08[工学] 081201[工学-计算机系统结构] 0812[工学-计算机科学与技术(可授工学、理学学位)]
基 金:supported by the National Natural Science Foundation of China(62025203)
主 题:Public cloud Encoded storage Load balancing RAID reconstruction Tail latency
摘 要:Public clouds favor sharing of storage resources,in which many tenants acquire bandwidth and storage capacity from a shared storage *** provide high availability,data are often encoded to provide fault tolerance with low storage *** this,efficiently organizing an encoded storage system for shared I/Os is critical for application *** is usually hard to achieve as different applications have different stripe configurations and fault tolerance *** this paper,we first study the block trace from the Alibaba cloud,and find that I/O patterns of modern applications prefer the resource sharing *** on this,we propose a globally shared resource paradigm for encoded storage system in the public *** globally shared resource paradigm can provide balanced load and fault tolerance for numerous disk pool sizes and arbitrary application stripe ***,we demonstrate with two case studies that our theory can help address the device-specific problems of HDD and SSD RAID arrays with slight modifications:comparing the existing resource partition and resource sharing methods,our theory can promote the rebuild speed of the HDD RAID arrays by 2.5,and reduce the P99 tail latency of the SSD arrays by up to two orders of magnitude.