Aydin_CSC11_Indexing

Forming r from i in parallel on a 3x3 processor grid

Info iconThis preview shows page 1. Sign up to view the full content.

View Full Document Right Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: e(J,J,1,na,na);! C = A + R*B*Q - S*A*T;! 8 !0 0 0$ !0 # &# A +# 0 B 0 &'# 0 #0 0 0& #0 " %" Tspasgn = O(nnz( A)) 0 0$ & A( I , J ) 0 & 0 0& % Parallel algorithm for SpRef 1. Forming R from I in parallel, on a 3x3 processor grid P(0,0) P(1,1) 7 SCATTER 0 1 2 3 4 5 6 7 8 2 5 8 1 P(2,2) 3 I R •  Vector distributed only on diagonal processors; for illustra4on. •  Full (2D) vector distribu4on: SCATTER  ALLTOALLV •  Forming QT from J is iden4cal, followed by Q=QT.Transpose() 9 Parallel algorithm for SpRef 2. SpGEMM using memory...
View Full Document

This note was uploaded on 12/27/2011 for the course CMPSC 240A taught by Professor Gilbert during the Fall '09 term at UCSB.

Ask a homework question - tutors are online