The next three supercells are served directly from

Info iconThis preview shows page 1. Sign up to view the full content.

View Full Document Right Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: rmed by the general integer functional unit and that this unit is pipelined. This means that one cycle after a multiplication has started, a new integer operation (multiplication or otherwise) can begin. Assume also that the Integer/Branch function unit can perform simple integer operations. A. Show a translation of these lines of assembly code into a sequence of operations. The movl instruction translates into a single load operation. Register %eax gets updated twice in the loop. Label the different versions %eax.1a and %eax.1b. B. Explain how the function can go faster than the number of cycles required for integer multiplication. C. Explain what factor limits the performance of this code to at best a CPE of 2.5. D. For floating-point data, we get a CPE of 3.5. Without needing to examine the assembly code, describe a factor that will limit the performance to at best 3 cycles per iteration. Homework Problem 5.9 [Category 1]: Write a version of the inner product procedure described in Problem 5.8 that uses four-way loop...
View Full Document

Ask a homework question - tutors are online