Changes between Version 32 and Version 33 of OpenCL_Memo


Timestamp:
Dec 12, 2011 6:57:12 PM (13 years ago)
Author:
nakasato
Comment:

--

  • OpenCL_Memo

    v32 v33  
    1 == Latest Driver == 
    2 http://support.amd.com/us/gpudownload/linux/Pages/radeon_linux.aspx?type=2.4.1&product=2.4.1.3.42&lang=English 
     1= Tips = 
     2== Disable auto vectorization == 
     3Section 6.7.2 (p. 187) of the OpenCL Specification, Version 1.1 (Revision 33), describes "__attribute__((vec_type_hint(<typen>)))". This hint controls the autovectorizer in the OpenCL C compiler. To see its effect, I dumped the assembly code generated for a kernel targeting AVX instructions with the Intel SDK (version 1.5). 
     4 
     5||                  || lines of assembly || performance || 
     6|| without the hint || 1567              ||             || 
     7|| with the hint    || 333               ||             || 
     8 
     9This hint dramatically reduces the size of the generated kernel code: from 1567 lines to 333. 
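For reference, here is a minimal sketch of where the attribute goes in a kernel declaration. The memo does not say which type was passed to the hint, so float4 is an assumption, and the kernel name scale4 and its arguments are hypothetical. Whether a given hint type actually suppresses autovectorization depends on the OpenCL implementation. The kernel is embedded as a C string, as a host program would hold it before passing it to the runtime (for example through clCreateProgramWithSource).

```c
#include <stddef.h>

/* OpenCL C kernel source, kept as a string so that a host program can hand
 * it to the OpenCL runtime. The hint type float4 is an assumption; the
 * kernel name scale4 and its arguments are hypothetical. */
static const char kernel_src[] =
    "__kernel __attribute__((vec_type_hint(float4)))\n"
    "void scale4(__global const float4 *in,\n"
    "            __global float4 *out,\n"
    "            const float a)\n"
    "{\n"
    "    /* One work-item handles one float4 element. */\n"
    "    size_t i = get_global_id(0);\n"
    "    out[i] = a * in[i];\n"
    "}\n";

/* Length of the source without the terminating NUL. */
static const size_t kernel_src_len = sizeof(kernel_src) - 1;
```

The attribute sits between __kernel and the return type, so comparing the dumped assembly with and without it only requires editing that one line.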
     10 
     11 
     12= SDK and driver =  
    313 
    414== Latest SDK == 
     
    717Intel http://software.intel.com/en-us/articles/vcsource-tools-opencl-sdk/ 
    818 
     19== Latest Driver for AMD == 
     20http://support.amd.com/us/gpudownload/linux/Pages/radeon_linux.aspx?type=2.4.1&product=2.4.1.3.42&lang=English 
     21 
     22 
     23= old info = 
    924== Standard Compute Layer Library == 
    1025http://www.browndeertechnology.com/stdcl.html 
     
    4762Note this option is generally regarded as "bad" for security. Be careful. 
    4863 
    49 === icc === 
     64= Random notes = 
     65== icc == 
    5066http://software.intel.com/en-us/articles/using-intel-compilers-for-linux-with-ubuntu/ 
    5167 
    52  
    53 === packages === 
     68== packages == 
    5469Ubuntu 10.04.1 LTS 
    5570{{{ 
     
    6580http://forums.amd.com/devforum/messageview.cfm?catid=390&threadid=147002 
    6681 
    67  
    68 === process affinity === 
     82== process affinity == 
    6983From command line: http://www.cyberciti.biz/tips/setting-processor-affinity-certain-task-or-process.html 
    7084 
    7185http://www.open-mpi.org/projects/hwloc/ 
    7286 
    73 === DOUBLE === 
     87== DOUBLE == 
    7488http://developer.amd.com/support/KnowledgeBase/Lists/KnowledgeBase/DispForm.aspx?ID=92