Impact of Admission and Cache Replacement Policies on Response Times of Jobs on Data Grids |
| |
Authors: | Email author" target="_blank">Ekow?OtooEmail author Doron?Rotem Arie?Shoshani |
| |
Institution: | (1) Lawrence Berkeley National Laboratory, University of California, 1 Cyclotron Road, MS: 50B-3238, Berkeley, CA, 94720 |
| |
Abstract: | Caching techniques have been used widely to improve the performance gaps of storage hierarchies in computing systems. Little
is known about the impact of policies on the response times of jobs that access and process very large files in data grids,
particularly when data and computations on the data have to be co-located on the same host. In data intensive applications
that access large data files over wide area network environment, such as data-grids, the combination of policies for job servicing
(or scheduling), caching and cache replacement can significantly impact the performance of grid jobs. We present preliminary
results of a simulation study that combines an admission policy with a cache replacement policy when servicing jobs submitted
to a storage resource manager.The results show that, in comparison to a first come first serve policy, the response times
of jobs are significantly improved, for practical limits of disk cache sizes, when the jobs that are back-logged to access
the same files are taken into consideration in scheduling the next file to be retrieved into the disk cache. Not only are
the response times of jobs improved, but also the metric measures for caching policies, such as the hit ratio and the average
cost per retrieval, are improved irrespective of the cache replacement policy used.
Ekow Otoo is research staff scientist with the scientific data management group at Lawrence Berkeley National Laboratory, University
of California, Berkeley. He received his B.Sc. degree in Electrical Engineering from the University of Science and Technology,
Kumasi, Ghana and a post graduate diploma in Computer Science from the University of Ghana, Legon. In 1977, he received his
M.Sc. degree in Computer Science from the University of Newcastle Upon Tyne in Britain and his Ph.D. degree in Computer Science
from McGill University, Montreal, Canada in 1983. He joined the faculty of the School of Computer Science, Carleton University,
in 1983 and from 1987 to 1999, he was a tenured faculty member of the School of Computer Science, Carleton University, Ottawa,
Canada. He has served as research consultant to Bell Northern Research, Ottawa, Canada, and as a research project consultant
to the GIS Division, Geomatics Canada, Natural Resources Canada, from 1990 to 1998. Ekow Otoo is a member of the ACM and IEEE.
His research interests include database management systems, data structures and algorithms, parallel I/O for high performance
computing, parallel and distributed computing.
Doron Rotem is currently a senior staff scientist and a member of the Data Management group at the Lawrence Berkeley National Lab. His
research interests include Grid Computing, Workflow, Scientific Data Management and Paralled and Distributed Computing and
Algorithms. He has published over 80 papers in international journals and conferences in these areas. Prior to that, Dr Rotem
co-founded and served as a CTO of a startup company, called CommerceRoute, that made software products in the area of workflow
and data integration and before that, he was an Associate Professor in the Department of Computer Science, University of Waterloo,
Canada. Dr. Rotem holds a B.Sc degree in Mathematics and Statistics from the Hebrew University, Jerusalem, Israel and a Ph.D.
in Computer Science from the University of the Witwatersrand, Johannesburg, South Africa.
Arie Shoshani is a senior staff scientist at Lawrence Berkeley National Laboratory. He joined LBNL in 1976. He heads the Scientific Data
Management Group. He received his Ph.D. from Princeton University in 1969. From 1969 to 1976, he was a researcher at System
Development Corporation, where he worked on the Network Control Program for the ARPAnet, distributed databases, database conversion,
and natural language interfaces to data management systems. His current areas of work include data models, query languages,
temporal data, statistical and scientific database management, storage management on tertiary storage, and grid storage middleware.
Arie is also the director of a Scientific Data Management (SDM) Integrated Software Infrastructure Center (ISIC), one of seven
centers selected by the SciDAC program at DOE in 2001. In this capacity, he is coordinating the work of collaborators from
4 DOE laboratories and 4 universities (see: http://sdmcenter.lbl.gov). Dr. Shoshani has published over 65 technical papers
in refereed journals and conferences, chaired several workshops, conferences, and panels in database management; and served
on numerous program committees for various database conferences. He also served as an associate editor for the ACM Transactions
on Database Systems. He was elected a member of the VLDB Endowment Board, served as the Publication Board Chairperson for
the VLDB Journal, and as the Vice-President of the VLDB Endowment. His home page is http://www.lbl.gov/arie. |
| |
Keywords: | caching data grid job scheduling storage resource manager |
本文献已被 SpringerLink 等数据库收录! |
|