jagomart
digital resources
picture1_Simon3


 130x       Filetype PDF       File size 2.25 MB       Source: wrigstad.com


File: Simon3
an introduction to many core parallel computing with opencl simon mcintosh smith twitter simonmcs upmarc summer school th uppsala 28 29 2014 recap 5 simple steps in a basic opencl ...

icon picture PDF Filetype PDF | Posted on 03 Feb 2023 | 2 years ago
Partial capture of text on file.
      An introduction to 
      many-core parallel 
      computing with 
      OpenCL  
     Simon McIntosh-Smith 
      Twitter: @simonmcs 
                       UPMARC summer school 
                                   th
                       Uppsala 28-29  2014 
          Recap 
 5 simple steps in a basic OpenCL program: 
 1.  Define the platform = devices + context + 
  queues 
 2.  Create and Build the program (dynamic 
  library of kernels) 
 3.  Setup memory objects 
 4.  Define the kernels 
 5.  Submit commands … transfer memory 
  objects and execute kernels 
                       We have now covered the basic 
                    platform runtime APIs in OpenCL 
                                                                                                                                       
                                                                          CPU                                                    GPU 
                                                                                               Context 
                              Programs                                           Kernels                            Memory Objects                              Command Queues 
                              Programs 
        __kernel void                                                                dp_mul                     Buffers               Images 
                                                                                      arg [0] 
        dp_mul(global const float *a,                  dp_mul                        arg [0]                                                                          In               Out of 
                                                                                        value                                                                        In               Out of 
                                                                                    arg[0] value 
               global const float *b,           CPU program binary                    value                                                                       Order                Order 
               global float *c)                                                       arg [1]                                                                     Order               Order 
        {                                                                            arg [1] 
                                                       dp_mul                           value                                                                     Queue               Queue 
                                                                                    arg[1] value                                                                                      Queue 
          int id = get_global_id(0);            GPU program binary                    value                                                                      Queue 
          c[id] = a[id] * b[id];                                                      arg [2] 
                                                                                     arg [2] 
        }                                                                               value 
                                                                                    arg[2] value 
                                                                                      value                                                                      GPU 
                                                                                                                                                               Compute Device 
  OPENCL KERNEL PROGRAMMING 
The words contained in this file might help you see if this file matches what you are looking for:

...An introduction to many core parallel computing with opencl simon mcintosh smith twitter simonmcs upmarc summer school th uppsala recap simple steps in a basic program define the platform devices context queues create and build dynamic library of kernels setup memory objects submit commands transfer execute we have now covered runtime apis cpu gpu programs command kernel void dp mul buffers images arg global const float out value b binary order c queue int id get compute device programming...

no reviews yet
Please Login to review.