Carns00 - In Proc. of the Extreme Linux Track: 4th Annual...

Info iconThis preview shows pages 1–2. Sign up to view the full content.

View Full Document Right Arrow Icon
In Proc. of the Extreme Linux Track: 4th Annual Linux Showcase and Conference , October 2000. PVFS: A Parallel File System for Linux Clusters Philip H. Carns Walter B. Ligon III Parallel Architecture Research Laboratory Clemson University, Clemson, SC 29634, USA pcarns, walt @parl.clemson.edu Robert B. Ross Rajeev Thakur Mathematics and Computer Science Division Argonne National Laboratory, Argonne, IL 60439, USA rross, thakur @mcs.anl.gov Abstract As Linux clusters have matured as platforms for low- cost, high-performance parallel computing, software packages to provide many key services have emerged, especially in areas such as message passing and net- working. One area devoid of support, however, has been parallel file systems, which are critical for high- performance I/O on such clusters. We have developed a parallel file system for Linux clusters, called the Parallel Virtual File System (PVFS). PVFS is intended both as a high-performance parallel file system that anyone can download and use and as a tool for pursuing further re- search in parallel I/O and parallel file systems for Linux clusters. In this paper, we describe the design and implementa- tion of PVFS and present performance results on the Chiba City cluster at Argonne. We provide performance results for a workload of concurrent reads and writes for various numbers of compute nodes, I/O nodes, and I/O request sizes. We also present performance results for MPI-IO on PVFS, both for a concurrent read/write workload and for the BTIO benchmark. We compare the I/O performance when using a Myrinet network versus a fast-ethernet network for I/O-related communication in PVFS. We obtained read and write bandwidthsas highas 700 Mbytes/sec with Myrinet and 225 Mbytes/sec with fast ethernet. This work was supported in part by the Mathematical, Informa- tion, and Computational Sciences Division subprogram of the Office of Advanced Scientific Computing Research, U.S. Department of En- ergy, under Contract W-31-109-Eng-38, and in part by the National Aeronautics and Space Administration, under Research Grant NAG-5- 3835. 1 Introduction Cluster computing has recently emerged as a main- stream method for parallel computing in many applica- tion domains, with Linux leading the pack as the most popular operating system for clusters. As researchers continue to push the limits of the capabilities of clus- ters, new hardware and software have been developed to meet cluster computing’s needs. In particular, hardware and software for message passing have matured a great deal since the early days of Linux cluster computing; in- deed, in many cases, cluster networks rival the networks of commercial parallel machines. These advances have broadened the range of problems that can be effectively solved on clusters. One area in which commercial parallel machines have
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Image of page 2
This is the end of the preview. Sign up to access the rest of the document.

This note was uploaded on 11/12/2011 for the course CE 726 taught by Professor Staf during the Spring '11 term at SUNY Buffalo.

Page1 / 11

Carns00 - In Proc. of the Extreme Linux Track: 4th Annual...

This preview shows document pages 1 - 2. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online