NVIDIA_CUDA_BestPracticesGuide_2.3

NVIDIA_CUDA_BestPracticesGuide_2.3 - Optimization NVIDIA...

Info iconThis preview shows pages 1–4. Sign up to view the full content.

View Full Document Right Arrow Icon
July 2009 Optimization NVIDIA CUDA C Programming Best Practices Guide CUDA Toolkit 2.3
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
CUDA Best Practices Guide ii July 2009
Background image of page 2
July 2009 iii Preface Chapter 1. Introduction to Parallel Computing with CUDA 1.1 1.2 Under 1.3 CUDA API Chapter 2. Performance Metrics 2.1 Timing 2.2 Bandwidth Chapter 3. Memory Optimizations 3.1 Dat .......................................................................................................................... vii What Is This Document? . ................................................................................................. vii Who Should Read This Guide? . ......................................................................................... vii Recommendations and Best Practices . .............................................................................. viii Contents Summary . ........................................................................................................ viii ............................................ 1 Heterogeneous Computing with CUDA . ......................................................................... 1 1.1.1 Differences Between Host and Device . ................................................................... 1 1.1.2 What Runs on a CUDA-Enabled Device? . ................................................................ 2 1.1.3 Maximum Performance Benefit . ............................................................................. 3 standing the Programming Environment . .............................................................. 4 1.2.1 CUDA Compute Capability . .................................................................................... 4 1.2.2 Additional Hardware Data . ..................................................................................... 5 1.2.3 C Runtime for CUDA and Driver API Version . ........................................................... 5 1.2.4 Which Version to Target . ....................................................................................... 6 s . ................................................................................................................ 6 1.3.1 C Runtime for CUDA . ............................................................................................ 7 1.3.2 CUDA Driver API . .................................................................................................. 7 1.3.3 When to Use Which API . ....................................................................................... 8 1.3.4 Comparing Code for Different APIs . ........................................................................ 8 ................................................................................... 11 ..................................................................................................................... 11 2.1.1 Using CPU Timers . .............................................................................................. 11 2.1.2 Using CUDA GPU Timers .
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Image of page 4
This is the end of the preview. Sign up to access the rest of the document.

Page1 / 66

NVIDIA_CUDA_BestPracticesGuide_2.3 - Optimization NVIDIA...

This preview shows document pages 1 - 4. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online