A Framework for Data Management in the Grid
作者单位:Applied Mathematics and Systems Laboratory Ecole Centrale Paris Grande Voie des Vignes
会议名称:《2007年国际电子商务、工程及科学领域的分布式计算和应用学术研讨会》
会议日期:2007年
学科分类:12[管理学] 1201[管理学-管理科学与工程(可授管理学、工学学位)] 08[工学] 081201[工学-计算机系统结构] 0812[工学-计算机科学与技术(可授工学、理学学位)]
关 键 词:Distributed Computing Data Management Data Intensive Applications Grid Computing Virtual File System.
摘 要:In Grid Computing, a job could be executed on a node that is geographically far away from its data files. These files are stored in heterogeneous storage systems located at geographically distributed virtual organizations. The current approach includes explicit data file transfers to execution nodes, which forces users to deal with different administrative policies at each site and various data access mechanisms on each storage system. This implies a lot of human interventions in order to develop dedicated programs and scripts for data transfers for job execution. This paper presents GRAVY, a framework which enables the data management between distributed file systems irrespective of their heterogeneity. This feature enables high-level schedulers integrated with GRAVY to control data placements like computational jobs (i.e., they can be queued, scheduled and monitored). GRAVY supports multiple data transport protocols and can be extended easily.