sse-assign - CSE 721 Programming Assignment 1 Due 3:00PM...

Info iconThis preview shows page 1. Sign up to view the full content.

View Full Document Right Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: CSE 721 Programming Assignment 1 Due 2/28/2011, 3:00PM For this assignment, you are to create versions of the following two codes using SSE intrinsics. Submit via Carmen; be sure to include source code listings and report on performance. A single file must be uploaded to Carmen for the assignment, which includes the written report as well as code listings and output from execution of the programs. 1. (35 points) The following code implements a “hardwired” matrix-matrix product for 4x4 single-precision floating-point matrices. In some computational domains such as QCD (Quantum Chromo Dynamics), a large number of such matrix-products of small fixed-size matrices is required. The performance of math library matrix multiplication routines is generally not optimized for such small matrix sizes. Hence SSE-based specialized codes are used. void mul4x4(float *A,float *B, float *C) { int i,j,k; for(i=0;i<4;i++) { for(j=0;j<4;j++) C[4*i+j] = 0.0; for(k=0;k<4;k++) for(j=0;j<4;j++) C[4*i+j] += A[4*i+k]*B[4*k+j];...
View Full Document

{[ snackBarMessage ]}

Ask a homework question - tutors are online