4 Pages

sampleHW

Course: CDA 5155, Spring 2012
School: University of Florida
Rating:
 
 
 
 
 

Word Count: 1375

Document Preview

Questions Similar and Answers Solutions are based on the fourth edition of the book 1. You are the product manager for Intel core i5-650 processor. The chip has an area of 81 mm2, with a defect rate of 0.67 defects per cm2 and =4. The die of each chip is occupied by two identical cores (70% total area) and a shared L3 cache (30% total area). For simplicity, we assumed here that each chip has only two cores and an...

Register Now

Unformatted Document Excerpt

Coursehero >> Florida >> University of Florida >> CDA 5155

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
Questions Similar and Answers Solutions are based on the fourth edition of the book 1. You are the product manager for Intel core i5-650 processor. The chip has an area of 81 mm2, with a defect rate of 0.67 defects per cm2 and =4. The die of each chip is occupied by two identical cores (70% total area) and a shared L3 cache (30% total area). For simplicity, we assumed here that each chip has only two cores and an L3 cache (no other components). a. [2 Points] What is the yield of the die? b. [5 Points] Some researchers proposed that the number of defects in a die can be modeled by Poisson distribution. Suppose we can use the yield as the probability that there is no defect on a die, what is the value of parameter in Poisson distribution here? Note: Poisson distribution means the probability that there are exactly k defects (k being a non-negative integer, k = 0, 1, 2, ...) is equal to where, e is the base of the natural logarithm (e = 2.71828...) k is the number of occurrences of an event - the probability of which is given by the function k! is the factorial of k is a positive real number, c. [5 Points] In a defected chip, assume defects are independent and uniformly distributed within the die area. What is the probability that a DEFECTED chip has no defected cores? d. [3 Points] If there is no defected core in a chip, but L3 is defective, we can still sell it by shutting down the defected L3 cache. Suppose you can sell the perfectly working (defect-free) chip for $289.99 each, you need $180 to manufacture and test each chip, what is the minimum sale price for your chips without L3 cache (defective L3 cache is shutdown) to make any profit? Solution a) The equation used here is from the fourth edition of the book. Yield of a single die = (1 + (0.670.81)/4 ) -4= 0.601 b) The probability that there is no defect on the die is f(0;) = e- =0.601 Therefore = 0.509 c) Suppose there are k defects on the die, the probability that all of them are in the cache area is (0.3)^k We also have to compute the probability that there are k defects on the die is f(k;) Therefore the probability that a DEFECTED chip has no defected cores is ((0.3)*f(1;) + (0.3)^2*f(2;) + + (0.3)^k*f(k;) + ) / (1-f(0;)) The nominator can be computed as Sum_(k=1)^(inf) (0.3^k* ^k * e^(-) / k! ) = e^(-0.7) Sum_(k=1)^(inf) ((0.3)^k * e^(-0.3) / k! ) = e^(-0.7) (1-f(0;0.3)) The final result is e^(-0.7) (1-f(0;0.3))/ (1-f(0;)) = 0.248 It is okay to compute above summation by keeping only several large terms. d) (180-(289.99*0.601))/ (0.248*(1-0.601)) =57.8 (dollars) 2. One day you got tired with architecture division. So you accepted an offer from a software division. Your job is to develop a numerical simulation program. The software will be used on a workstation with a single CPU and Nvidia Tesla C1060 GPU with 240 streaming processors. The CPU can achieve 1GFlops, while each streaming processors in the GPU can deliver 3.9GFlops (peak). For simplicity, we only consider floating- point operations in this problem. The CPU and all the streaming processors in GPU can perform calculation simultaneously. a. [5 Points] If all of the dynamic instructions in your main application are parallelizable, what is the maximum performance (Flops) you can get from your hardware in the optimal situation? What is the speedup compared with CPU only execution? State your assumptions, if any. b. [5 Points] In reality, the computation cannot be performed without data. Suppose all input data is stored within the memory prior to execution. Before each round of computation, you have to load the required input data into the cache within CPU or GPU. Similarly, the computation results, i.e., output data, will be stored in the corresponding cache immediately after the computation. The output data must be written back to the memory before next round of computation. Assume that the memory and caches in both CPU and GPU have infinite capacity, and there is no overlap between data transfer and computation. The bandwidth between CPU and memory is 5GB/s, while the bandwidth between GPU and is memory 20GB/s. Each byte input data requires 3 Flops to produce 0.5 byte output data on average. There is no dependency among different parts of data. What is the maximum performance (Flops) you can get from your hardware in the optimal situation if we take the data transfer time into consideration? What is the speedup in this case compared to CPU-only execution? What happens to Flops and speedup computation if every input byte requires 30000 Flops to produce 0.5 byte output data? CPU Memory GPU c. [5 Points] What conclusions can we make based on our results in b)? Even if the program can be completely parallelized, can we have linear performance improvement with the number of processing units i.e., if we double the number of processing units, can we expect two-times performance? Solution a) If all dynamic instructions can be executed on 240 cores simultaneously, the maximum average performance is around 3.9 * 240 = 936GFlops The speedup is 936/1 = 936 This an incredibly large number. b) If each byte input data requires 3 Flops to produce 0.5 byte output data on average, in order to process 100 GB data, GPU needs 5+300/936 + 2.5 = 1.82 (s). Its actual performance is 300GFlops/1.82 = 38GFlops CPU needs 20+ 300/1 +10 = 330 (s). Its actual performance is 300GFlops/330 = 0.91GFlops The speed up is only 42 If each byte input data requires 30000 Flops to produce 0.5 byte output data on average, the computation time will be much more than the data transmission time. Therefore, the speed up around 936 as in a) It is OK if your computation is based on the fact that we can use CPU and GPU simultaneously. We consider GPU only because the results are close.. c) No. For memory bound programs, improvement in bandwidth may be more important than more number of cores. 3. [10 points] Assume that values of variables A, B, C and D reside in memory. Write the code sequence for D = A*B + C * (A B) for four instruction-set architectures: i) Stack, ii) Accumulator, iii) Register-memory and iv) Register-register (Load-Store). (These four architectures are shown in Figure B.1 on page B -4 of the Appendix B). Do not perform any scheduling or other optimizations! Solution i) Stack Push A Push B Sub Push C Mul Push A Push B Mul Add Pop D ii) Accumulator Load A Mul B Store A*B Load A Sub B Mul C Add A*B Store D iii) Register-memory Load R1 A Mul R2,R1,B Sub R4,R1,B Mul R1,R1,C Add R1,R1,R2 Store D,R1 iv) Register-register Load R1,A Load R2,B Load R3,C Mul R4,R1,R2 Sub R5,R1,R2 Mul R1,R3,R5 Add R1,R1,R4 Store R1,D 4. There are some addressing modes that are not supported in MIPS. For example, LDMIA R1,{R2,R3,R4} will perform the following three operations: R2 = memory[R1], R3 = memory[R1+4]; R4 = memory[R1+8] In other words, we can replace LOAD R2,0(R1) LOAD R3,4(R1) LOAD R4,8(R1) ADD R5, R3, R2 ADD R5, R4, R5 STORE R5, 4(R6) by LDMIA R1,{R2,R3,R4} ADD R5, R3, R2 ADD R5, R4, R5 STORE R5, 4(R6) a. [5 Points] Suppose all instructions are still 32 bits. Since this is a R-Type instruction, there are 6 bits reserved for opcode. Can you design one possible binary format (encoding) for this LDMIA instruction if we want to add it to MIPS? Why the increment is a multiple of 4? b. [5 Points] Assume that the new instruction will cause the clock cycle to increase by 2%. Assume that 25% of dynamic instructions are loads. The new instruction affects only the clock speed and not the CPI. If only 30% of load instruction can be eliminated by the new instructions, will the overall performance improve? Solution a) There are many possible binary formats for this instruction. For example, we can use 20 bits to address four registers (5 for each). The increment is 4 indicates that this instruction operates on 32 bits arrays. Each instruction will load 3 32-bits values into 3 registers. b) In the optimal situation, we will remove 25%*30% = 7.5% The overall execution time becomes (1-7.5%)*1.02 = 0.94 Therefore, the performance does increase. It is also fine if you interpret the question as 30% of load instructions can be REPLACE by the new instruction.
Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

University of Florida - STA - 3024
AdministriviaReview (Basic Math)Looking AheadSTA 3024 (Section 7459): First Dayof ClassDouglas WhitakerStatistics Department9 January 2012AdministriviaReview (Basic Math)Looking AheadFocal PointsEach class Ill try to post a few key questions t
University of Florida - STA - 3024
STA 3024: Review of STA 2023(Part 1)Douglas WhitakerStatistics Department11 January 2012Douglas Whitaker (Statistics Department)STA 3024: Review of STA 2023 (Part 1)11 January 20121 / 105Focal PointsWhat are measures of center?What are measures
University of Florida - STA - 3024
STA 3024: Review of STA 2023(Part 2, Statistical Inference)Douglas WhitakerStatistics Department11 January 2012Douglas Whitaker (Statistics Department) STA 3024: Review of STA 2023 (Part 2, Statistical Inference) January 2012111 / 57Focal PointsW
University of Florida - STA - 3024
STA 3024: ANOVADouglas WhitakerStatistics Department3 February 2012Douglas Whitaker (Statistics Department)STA 3024: ANOVA3 February 20121 / 34Analysis of VarianceWe spent the last few weeks reviewing old material,and we introduced some new mate
University of Florida - STA - 3024
STA 3024: ANOVADouglas WhitakerStatistics Department10 February 2012Douglas Whitaker (Statistics Department)STA 3024: ANOVA10 February 20121/8Analysis of VarianceIn an experiment to investigate the eect of color ofpaper (blue, green, orange) on
University of Florida - STA - 3024
Useful Formulasnexpected value =xi P (xi )s=i=11n1n(xi x)2i=1ProbabilityP (A B ) = P (A) + P (B ) P (A B )P (X = x) =nxp (1 p)nxxmean = npP (A|B ) =P (A B )P (B )standard deviation = np(1 p)Statistical Inferencetest statistic =esti
University of Florida - STA - 3024
F Distribution Tabledf2 1234567891011121314151617181920212223242526272829304060120df1 1161.4518.5110.137.716.615.995.595.325.124.964.844.754.674.604.544.494.454.414.384.354.324.304.284.264.244.
University of Florida - STA - 3024
STA 3024Section 7459Homework 1Form CUFID:Name:This form is only for students whose UFID number begins with 6 or 7.Answer all problems neatly on separate sheets of paper. All pages must be stapled. Work that isnt neator stapled may lose points.32
University of Florida - STA - 3024
STA 3024Section 7459Homework 1Form CUFID:Name:This form is only for students whose UFID number begins with 6 or 7.Answer all problems neatly on separate sheets of paper. All pages must be stapled. Work that isnt neator stapled may lose points.Bas
University of Florida - STA - 3024
STA 3024Section 7459Homework 1Form BUFID:Name:This form is only for students whose UFID number begins with 0, 4, 5, or 9.Answer all problems neatly on separate sheets of paper. All pages must be stapled. Work that isnt neator stapled may lose poin
University of Florida - STA - 3024
STA 3024Section 7459Homework 1Form BUFID:Name:This form is only for students whose UFID number begins with 0, 4, 5, or 9.Answer all problems neatly on separate sheets of paper. All pages must be stapled. Work that isnt neator stapled may lose poin
University of Florida - STA - 3024
STA 3024Section 7459Homework 1Form AUFID:Name:This form is only for students whose UFID number begins with 1, 2, 3, or 8.Answer all problems neatly on separate sheets of paper. All pages must be stapled. Work that isnt neator stapled may lose poin
University of Florida - STA - 3024
STA 3024Section 7459Homework 1Form AUFID:Name:This form is only for students whose UFID number begins with 1, 2, 3, or 8.Answer all problems neatly on separate sheets of paper. All pages must be stapled. Work that isnt neator stapled may lose poin
University of Florida - STA - 3024
STA 3024Section 7459Homework 2Form CUFID:Name:This form is only for students whose UFID number begins with 6 or 7.Answer all problems neatly. If you turn in more than one piece of paper, they must be stapled. Workthat isnt neat or stapled may lose
University of Florida - STA - 3024
STA 3024Section 7459Homework 2Form CUFID:Name:This form is only for students whose UFID number begins with 6 or 7.Answer all problems neatly. If you turn in more than one piece of paper, they must be stapled. Workthat isnt neat or stapled may lose
University of Florida - STA - 3024
STA 3024Section 7459Homework 2Form BUFID:Name:This form is only for students whose UFID number begins with 0, 4, 5, or 9.Answer all problems neatly. If you turn in more than one piece of paper, they must be stapled. Workthat isnt neat or stapled m
University of Florida - STA - 3024
STA 3024Section 7459Homework 2Form BUFID:Name:This form is only for students whose UFID number begins with 0, 4, 5, or 9.Answer all problems neatly. If you turn in more than one piece of paper, they must be stapled. Workthat isnt neat or stapled m
University of Florida - STA - 3024
STA 3024Section 7459Homework 2Form AUFID:Name:This form is only for students whose UFID number begins with 1, 2, 3, or 8.Answer all problems neatly. If you turn in more than one piece of paper, they must be stapled. Workthat isnt neat or stapled m
University of Florida - STA - 3024
STA 3024Section 7459Homework 2Form AUFID:Name:This form is only for students whose UFID number begins with 1, 2, 3, or 8.Answer all problems neatly. If you turn in more than one piece of paper, they must be stapled. Workthat isnt neat or stapled m
University of Florida - STA - 3024
STA 3024Section 7459Homework 3Form CUFID:Name:This form is only for students whose UFID number begins with 6 or 7.Answer all problems neatly. If you turn in more than one piece of paper, they must be stapled. Workthat isnt neat or stapled may lose
University of Florida - MAP - 2302
c ?.)S/ticfw_/,>r / t' 4 /r*E x o,d & ry4y^,s @nztf x pras / eg<r',nnc ,brn&ql s ,-r1aD 4 *nU*2.n n( re- r ) e .n ) an*'yr= l63 - t . 4 3 z r - + 4 . s a of r t . - .o * a'/.%r* *14+r= - le. & .,^ = 4.-zc*rtm / n - , ) Q ,n a cn*Llh:f
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
University of Florida - MAP - 2302
-r? y*pg' (# /r, /.*a),fi/r. rL rvfw"-C0Q./ +w:6t-2-)w1-t1 _),w(n) =j,)m'fr,.*/^72,- ?/-,.fu:,J4l'vc-'1- 2 = 7(4/l=",(t iD saftlu tilJ'k): w/(t;D,*ad rtu)=a':7;i";h= w,h)0d)+>,u"(t) -2v'/t)+*ft)+)w"(t-)-?v(t-r1_cfw_*-2,w k"lt
University of Florida - MAP - 2302
e)t.':,:':&"f-:4y*,&t 7 10._ it r l 4t^ $ a*-"/;t cfw_f t a * 1 l = ,f ut lL<./ra/tdcfw_/t'jr-JT(a)=r/t,L t - 7)lL(=o (t| i tuJ.J'rIl<7ofunntoIazt -t.t +1tit t1 / ^( t)LfIII= cfw_- otl&-t:rrY'- .L7p":cj' A" -d,a"'al7
University of Florida - MAP - 2302
,+."\lant o LU-c ae'f' ).uo6^ o 'tvl- 0'nif/42-zs5-\'4":tcer rrr,J !- ; 4".Ur 4oA^Sv ppc'* ',/(t),-fft),7/t) a\- "onr(h,nt o- ,*. -F., ,>-trro(j,4n/or 4) ,*Atc+ c.zQ;ulf -7t to/tE, tid2tansAtn* .h-7L Zr/r/* , b(+)Jt*JJ"J ,t+)=f r
University of Florida - MAP - 2302
q' Y)'6arPofneev'i1t t /:r ' ," + , L t l ,we- EJ.,ft-"yf7*^t_,J =,of.ory,/ = t t',-/',' ,== r (f-,) L n-',J'.)s/c .th) , +-,*.t J ' = . cfw_,r L'-) ,y ':y- fu\N-" tJO'r/r (r*t ) t ',/a t L r ' , , / t , , 1+ r / =z e r ( r *
University of Florida - MAP - 2302
(i v)0t(7*tO 6 o*tzr ncZ-,<rrcfw_S7nC u n"iL-u t -l(-c ur' v .-!.+J "*r\wt= -af tx,.1)I.e^e<-fqpbtr-F- t r,4r.a5it/-a^/.&'/J<4- -l*-t/qJ?r<C r f o(eS.&p0"- Frr,J \cfw_,-Lv /Ltr<4,)"-kL'-l'*-c1rte.d*EOt=*dvc '"-"
University of Florida - MAP - 2302
/l:,"/- pVta-o./non.f.,/4,a/n' /cfw_,cfw_.-4".L sazv'n,i/sL , 4 . . / -l-*a -&tc -/z z/'5a'(r*,r*t),p(, j)/t4^ ,t:cfw_* cfw_ir,t7,'r,/) f,P/aecfw_, ,cfw_ ,r zol7s 'y,acfw_b-cfw_n*n24.r.*<L ,-f &-r.ZL2-/ .,L Z,&-,rcf
University of Florida - MAP - 2302
( 2*)-, J ,6.o,t, /) ' " dtcfw_,f'th. P /una 6 E f uEaus/.2-za a & / -a.,sn,^Jz>Jg 4r.k7 i a ltC's 6o/b"(4 ,frn,y'u ,f*fl/?'ftf"^a ,. / ,nh * , cfw_ J L" a /r-./ o^ k u,E, f t,E x a*,f.e. 4f , (t) = t , : *" "f" -;,=*A''|'3+ +T'
University of Florida - MAP - 2302
University of Florida - MAP - 2302
MAP 2302 - SPRING 2012 - TEAM HOMEWORK 1FRIDAY, FEBRUARY 10TEAM NUMBER:TEAM NUMBER:TEAM CODENAME:TEAM MEMBERS PRESENT:NAMESIGNATURE1.2.3.4.5.6.Instructions This is a team assigment. Include team number and signatures of each contributing t
University of Florida - MAP - 2302
MAP 2302 - SPRING 2012 - TEAM HOMEWORK 2MONDAY, FEBRUARY 13TEAM NUMBER:TEAM CODENAME:TEAM MEMBERS:NAMESIGNATURE1.2.3.4.5.6.Instructions This is a team assigment. Include team number and signatures of each contributing team member on this co
University of Florida - MAP - 2302
MAP 2302 - SPRING 2012 - TEAM HOMEWORK 3WEDNESDAY, FEBRUARY 15TEAM NUMBER:TEAM CODENAME:TEAM MEMBERS:NAMESIGNATURE1.2.3.4.5.6.ACKNOWLEDGEMENT OF ANY OUTSIDE HELP:Instructions This is a team assigment. Include team number and signatures of
University of Florida - MAP - 2302
York University - POLS - 3136
PUBLIC LAW I AS/POLS 3135 3.0 (F) SECTION A(Crosslisted to: AK/POLS 3135 3.00, AK/SOCI 3135 3.00, GL/POLS 3135 3.00, AK/PPAS 3135 3.00)THE CONSTITUTION AND THE COURTS IN CANADA FALL TERM 2007Course Director: Ray Bazowski Office: 131 McLaughlin College
York University - POLS - 3136
PUBLIC LAW IAS/POLS 3135 3.0 (F) SECTION A(Crosslisted to: AK/POLS 3135 3.00, AK/SOCI 3135 3.00,GL/POLS 3135 3.00, AK/PPAS 3135 3.00)THE CONSTITUTION AND THE COURTS IN CANADAFALL TERM 2009Course Director: Ray BazowskiOffice: 131 McLaughlin College
York University - POLS - 3136
A.Be prepared to provide brief definitions of each term (approximately 2-3sentences for each)Judicial positivismJudicial Committee ofthe Privy Councilmischief and remedyMens reaNatural lawpure provincial courtsStare decisisJudicial realismpure
York University - POLS - 3136
YORK UNIVERSITYPOLITICAL SCIENCE 3135.03(A)PUBLIC LAW ITHE CONSTITUTION AND THE COURTS IN CANADAFALL TERM 2009Sample Final ExaminationGeneral InstructionsAnswer one (1) of the following questions:There is no hard and fast rule on how long your ans
York University - POLS - 3136
A.Be prepared to provide brief definitions of each term (approximately 2-3 sentences for each)Judicial positivism Natural lawJudicial Committee of the Privy Council pure provincial courtsmischief and remedy Mens rea Stare decisisJudicial realism pure
York University - POLS - 3136
YORK UNIVERSITY POLITICAL SCIENCE AS 4900 3.0J FALL 2007 LAW, JUSTICE AND POLITICS FINAL EXAM Answer two of the following questions (preferably in typewritten or computergenerated form). Your answers should integrate in a feasible and intelligent fashion
York University - POLS - 3136
Grade-Raising Assignment for Public Law I Midterm TestFor students who received a grade lower than 60% in their midterm exam, the following grade-raising assignment is available. Only students who earned less than 60% can do this assignment. The nominal
York University - POLS - 3136
PUBLIC LAW I AS/POLS 3135 3.0 (F) SECTION A Midterm Study Guide The midterm test in Public Law will consist of three parts. The first part will contain a number of terms drawn from the list provided below. You will be asked to define or explain the terms
York University - POLS - 3136
Public Law I September 7, 2007 Important Announcements n Course kits can be purchased from the Keele Copy Centre, 4699 Keele St., 4166659675. Main text book: Gerald Gall, The Canadian Legal System (in bookstore). n Recommended: Hogg's Constitutional
York University - POLS - 3136
Public Law I September 5, 2008 Important AnnouncementsMain text book: Russell, Knopff, Bateman & Hiebert, The Courts and the Constitution: Leading Cases (in bookstore).Recommended: Hogg's Constitutional Law of Canada (quite expensive, but useful), and a
York University - POLS - 3136
Public Law I: September 21, 2007 Basic concepts regarding the Canadian Legal System n The Canadian court structuren Sources of Law n n Main sources of law: n Written constitution (s. 52(1) of CA, 1982 n statute law (laws created by legislatures) n c