PNUTS - PNUTS: Yahoo!s Hosted Data Serving Platform Brian...

Info iconThis preview shows pages 1–2. Sign up to view the full content.

View Full Document Right Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: PNUTS: Yahoo!s Hosted Data Serving Platform Brian F. Cooper, Raghu Ramakrishnan, Utkarsh Srivastava, Adam Silberstein, Philip Bohannon, Hans-Arno Jacobsen, Nick Puz, Daniel Weaver and Ramana Yerneni Yahoo! Research ABSTRACT We describe PNUTS, a massively parallel and geographi- cally distributed database system for Yahoo!s web applica- tions. PNUTS provides data storage organized as hashed or ordered tables, low latency for large numbers of con- current requests including updates and queries, and novel per-record consistency guarantees. It is a hosted, centrally managed, and geographically distributed service, and uti- lizes automated load-balancing and failover to reduce oper- ational complexity. The first version of the system is cur- rently serving in production. We describe the motivation for PNUTS and the design and implementation of its table storage and replication layers, and then present experimen- tal results. 1. INTRODUCTION Modern web applications present unprecedented data man- agement challenges, even for relatively simple tasks like managing session state, content meta-data, and user-generated content such as tags and comments. The foremost require- ments of a web application are scalability , consistently good response time for geographically dispersed users, and high availability . At the same time, web applications can fre- quently tolerate relaxed consistency guarantees . We now ex- amine these requirements in more detail. Scalability. For popular applications such as Flickr and del.icio.us, the need for a scalable data engine is obvious [4]. We want not only architectural scalability, but the ability to scale during periods of rapid growth by adding resources with minimal operational effort and minimal impact on sys- tem performance. Response Time and Geographic Scope. A fundamen- tal requirement is that applications must consistently meet 1 Author emails: { cooperb, ramakris, utkarsh, silberst, plb, nickpuz, dweaver, yerneni } @yahoo-inc.com Hans-Arno Jacobsens current affiliation: University of Toronto, jacobsen@eecg.toronto.edu Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, to post on servers or to redistribute to lists, requires a fee and/or special permission from the publisher, ACM. VLDB 08, August 24-30, 2008, Auckland, New Zealand Copyright 2008 VLDB Endowment, ACM 000-0-00000-000-0/00/00. Yahoo!s internal SLAs for page load time, placing stringent response time requirements on the data management plat- form. Given that web users are scattered across the globe, it is critical to have data replicas on multiple continents for low-latency access. Consider social network applications alumni of a university in India may reside in North America...
View Full Document

Page1 / 12

PNUTS - PNUTS: Yahoo!s Hosted Data Serving Platform Brian...

This preview shows document pages 1 - 2. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online