首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Impact of Admission and Cache Replacement Policies on Response Times of Jobs on Data Grids
Authors:Email author" target="_blank">Ekow?OtooEmail author  Doron?Rotem  Arie?Shoshani
Institution:(1) Lawrence Berkeley National Laboratory, University of California, 1 Cyclotron Road, MS: 50B-3238, Berkeley, CA, 94720
Abstract:Caching techniques have been used widely to improve the performance gaps of storage hierarchies in computing systems. Little is known about the impact of policies on the response times of jobs that access and process very large files in data grids, particularly when data and computations on the data have to be co-located on the same host. In data intensive applications that access large data files over wide area network environment, such as data-grids, the combination of policies for job servicing (or scheduling), caching and cache replacement can significantly impact the performance of grid jobs. We present preliminary results of a simulation study that combines an admission policy with a cache replacement policy when servicing jobs submitted to a storage resource manager.The results show that, in comparison to a first come first serve policy, the response times of jobs are significantly improved, for practical limits of disk cache sizes, when the jobs that are back-logged to access the same files are taken into consideration in scheduling the next file to be retrieved into the disk cache. Not only are the response times of jobs improved, but also the metric measures for caching policies, such as the hit ratio and the average cost per retrieval, are improved irrespective of the cache replacement policy used. Ekow Otoo is research staff scientist with the scientific data management group at Lawrence Berkeley National Laboratory, University of California, Berkeley. He received his B.Sc. degree in Electrical Engineering from the University of Science and Technology, Kumasi, Ghana and a post graduate diploma in Computer Science from the University of Ghana, Legon. In 1977, he received his M.Sc. degree in Computer Science from the University of Newcastle Upon Tyne in Britain and his Ph.D. degree in Computer Science from McGill University, Montreal, Canada in 1983. He joined the faculty of the School of Computer Science, Carleton University, in 1983 and from 1987 to 1999, he was a tenured faculty member of the School of Computer Science, Carleton University, Ottawa, Canada. He has served as research consultant to Bell Northern Research, Ottawa, Canada, and as a research project consultant to the GIS Division, Geomatics Canada, Natural Resources Canada, from 1990 to 1998. Ekow Otoo is a member of the ACM and IEEE. His research interests include database management systems, data structures and algorithms, parallel I/O for high performance computing, parallel and distributed computing. Doron Rotem is currently a senior staff scientist and a member of the Data Management group at the Lawrence Berkeley National Lab. His research interests include Grid Computing, Workflow, Scientific Data Management and Paralled and Distributed Computing and Algorithms. He has published over 80 papers in international journals and conferences in these areas. Prior to that, Dr Rotem co-founded and served as a CTO of a startup company, called CommerceRoute, that made software products in the area of workflow and data integration and before that, he was an Associate Professor in the Department of Computer Science, University of Waterloo, Canada. Dr. Rotem holds a B.Sc degree in Mathematics and Statistics from the Hebrew University, Jerusalem, Israel and a Ph.D. in Computer Science from the University of the Witwatersrand, Johannesburg, South Africa. Arie Shoshani is a senior staff scientist at Lawrence Berkeley National Laboratory. He joined LBNL in 1976. He heads the Scientific Data Management Group. He received his Ph.D. from Princeton University in 1969. From 1969 to 1976, he was a researcher at System Development Corporation, where he worked on the Network Control Program for the ARPAnet, distributed databases, database conversion, and natural language interfaces to data management systems. His current areas of work include data models, query languages, temporal data, statistical and scientific database management, storage management on tertiary storage, and grid storage middleware. Arie is also the director of a Scientific Data Management (SDM) Integrated Software Infrastructure Center (ISIC), one of seven centers selected by the SciDAC program at DOE in 2001. In this capacity, he is coordinating the work of collaborators from 4 DOE laboratories and 4 universities (see: http://sdmcenter.lbl.gov). Dr. Shoshani has published over 65 technical papers in refereed journals and conferences, chaired several workshops, conferences, and panels in database management; and served on numerous program committees for various database conferences. He also served as an associate editor for the ACM Transactions on Database Systems. He was elected a member of the VLDB Endowment Board, served as the Publication Board Chairperson for the VLDB Journal, and as the Vice-President of the VLDB Endowment. His home page is http://www.lbl.gov/arie.
Keywords:caching  data grid  job scheduling  storage resource manager
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号