jagomart
digital resources
picture1_Nvhpc Cuda


 179x       Filetype PDF       File size 0.39 MB       Source: www.nersc.gov


File: Nvhpc Cuda
cuda c basics what is cuda cuda architecture expose gpu parallelism for general purpose computing expose enable performance cuda c based on industry standard c set of extensions to enable ...

icon picture PDF Filetype PDF | Posted on 08 Feb 2023 | 2 years ago
Partial capture of text on file.
  CUDA C++ BASICS
                                                     WHAT IS CUDA?
           CUDA Architecture
                  Expose GPU parallelism for general-purpose computing
                  Expose/Enable performance
           CUDA C++
                  Based on industry-standard C++
                  Set of extensions to enable heterogeneous programming
                  Straightforward APIs to manage devices, memory etc.
           This session introduces CUDA C++
                  Other languages/bindings available: Fortran, Python, Matlab, etc.
                                                                                                                             2
                                                GPU KERNELS: DEVICE CODE
                       __global__ void mykernel(void) {
                       }
             CUDA C++ keyword __global__ indicates a function that:
                     Runs on the device
                     Is called from host code (can also be called from other device code)
             nvccseparates source code into host and device components
                     Device functions (e.g. mykernel()) processed by NVIDIA compiler
                     Host functions (e.g. main()) processed by standard host compiler (e.g. gcc)
                                                                                                                                                    3
                                                GPU KERNELS: DEVICE CODE
                       mykernel<<<1,1>>>();
             Triple angle brackets mark a call to device code
                     Also called a “kernel launch”
                     We’ll return to the parameters (1,1) in a moment
                     The parameters inside the triple angle brackets are the CUDA kernel execution configuration
                                                                                                                                                   4
The words contained in this file might help you see if this file matches what you are looking for:

...Cuda c basics what is architecture expose gpu parallelism for general purpose computing enable performance based on industry standard set of extensions to heterogeneous programming straightforward apis manage devices memory etc this session introduces other languages bindings available fortran python matlab kernels device code global void mykernel keyword indicates a function that runs the called from host can also be nvccseparates source into and components functions e g processed by nvidia compiler main gcc triple angle brackets mark call kernel launch we ll return parameters in moment inside are execution configuration...

no reviews yet
Please Login to review.