Lect08 - 1 EECS 594 Spring 2010 Lecture 8: Dense Linear...

Info iconThis preview shows pages 1–9. Sign up to view the full content.

View Full Document Right Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: 1 EECS 594 Spring 2010 Lecture 8: Dense Linear Algebra 2 Size Rate TPP performance 0.1 1 10 100 1000 10000 100000 1000000 10000000 100000000 1 Gflop/s 1 Tflop/s 100 Mflop/s 100 Gflop/s 100 Tflop/s 10 Gflop/s 10 Tflop/s 1 Pflop/s 100 Pflop/s 10 Pflop/s 59.7 GFlop/s 400 MFlop/s 1.17 TFlop/s 1.76 PFlop/s 20.0 TFlop/s 27.68 PFlop/s SUM N=1 N=500 6-8 years My Laptop 1993 1995 1997 1999 2001 2003 2005 2007 2009 Looking at the Gordon Bell Prize (Recognize outstanding achievement in high-performance computing applications and encourage development of parallel processing ) 1 GFlop/s; 1988; Cray Y-MP; 8 Processors Static finite element analysis 1 TFlop/s; 1998; Cray T3E; 1024 Processors Modeling of metallic magnet atoms, using a variation of the locally self-consistent multiple scattering method. 1 PFlop/s; 2008; Cray XT5; 1.5x10 5 Processors Superconductive materials 1 EFlop/s; ~2018; ?; 1x10 7 Processors (10 9 threads) Performance Development in Top500 0.1 1 10 100 1000 10000 100000 1000000 0000000 0000000 1E+09 1E+10 1E+11 1994 1996 1998 2000 2002 2004 2006 2008 2010 2012 2014 2016 2018 2020 1 Eflop/s 1 Gflop/s 1 Tflop/s 100 Mflop/s 100 Gflop/s 100 Tflop/s 10 Gflop/s 10 Tflop/s 1 Pflop/s 100 Pflop/s 10 Pflop/s SUM N=1 N=500 Gordon Bell Winners My laptop 6 Intel 80% IBM 10% AMD 8.4% Rank Site Computer Country Procs Rmax [Pflops] % of Peak Power [MW] Flops/ Watt 1 DOE / OS Oak Ridge Nat Lab Jaguar / Cray Cray XT5 sixCore 2.6 GHz USA 224,162 1.76 75 6.95 151 2 DOE / NNSA Los Alamos Nat Lab Roadrunner / IBM BladeCenter QS22/LS21 USA 122,400 1.04 76 2.48 446 3 NSF / NICS / U of Tennessee Jaguar / Cray Cray XT5 sixCore 2.6 GHz USA 98,928 .831 81 4 Forschungszentrum Juelich (FZJ) Jugene / IBM Blue Gene/P Solution Germany 294,912 .825 82 2.26 365 5 National SC Center in Tianjin / NUDT Tianhe-1 / NUDT TH-1 / IntelQC + AMD ATI Radeon 4870 China 71,680 .563 46 6 NASA / Ames Research Center/NAS Pleiades / SGI SGI Altix ICE 8200EX USA 56,320 .544 82 2.09 230 7 DOE / NNSA Lawrence Livermore NL BlueGene/L IBM eServer Blue Gene Solution USA 212,992 .478 80 2.32 206 8 DOE / OS Argonne Nat Lab Intrepid / IBM Blue Gene/P Solution USA 163,840 .458 82 1.26 363 9 NSF TACC/U. of Texas Ranger / Sun SunBlade x6420 USA 62,976 .433 75 2.0 217 10 DOE / NNSA Sandia Nat Lab Sun / SunBlade 6275 USA 41,616 .424 87 Rank Site Computer Country Procs Rmax [Pflops] % of Peak Power [MW] MFlops /Watt 1 DOE / OS Oak Ridge Nat Lab Jaguar / Cray Cray XT5 sixCore 2.6 GHz USA 224,162 1.76 75 7.0 251 2 DOE / NNSA Los Alamos Nat Lab Roadrunner / IBM BladeCenter QS22/LS21 USA 122,400 1.04 76 2.48 446 3 NSF / NICS / U of Tennessee Jaguar / Cray Cray XT5 sixCore 2.6 GHz USA 98,928 .831 81 3.09 269 4 Forschungszentrum Juelich (FZJ) Jugene / IBM Blue Gene/P Solution Germany 294,912 .825 82 2.26 365 5 National SC Center in Tianjin / NUDT Tianhe-1 / NUDT TH-1 / IntelQC + AMD ATI Radeon 4870 China 71,680 .563 46 1.48 380 6 NASA / Ames Research Center/NAS Pleiades / SGI SGI Altix ICE 8200EX...
View Full Document

Page1 / 57

Lect08 - 1 EECS 594 Spring 2010 Lecture 8: Dense Linear...

This preview shows document pages 1 - 9. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online