Changes between Version 4 and Version 5 of UGT2011


Ignore:
Timestamp:
Nov 5, 2014 9:33:58 AM (10 years ago)
Author:
nakasato
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • UGT2011

    v4 v5  
    1717the kernel program. 
    1818 
    19  
    20  
     19file:///home/committee/aac/Thesis2011/s1140123 
    2120 
    2221= K.Kamijima = 
    2322== Performance Evaluation of the Octree Method on GPU == 
     23The purpose of my research is to examine the performance of graphic processing unit (GPU) with a numerical algorithm for particle simulations. Specifically, I adopt the Octree method that requires many branch instructions. In general, GPU is intrinsically not good at dealing with branch instructions. I have implemented the Octree method in single and double precision on Cypress GPU. With the GPU, the peak performance of single precision operations is five times better than that of double precision. Although it was expected that the performance of the Octree method with double precision is much slower, I found that it is not the case. On GPU, I found that the performance of the Octree method is constrained by not the computing power of the GPU but the performance penalty due to branch instructions. 
     24 
     25 
     26file:///home/committee/aac/Thesis2011/s1150062 
     27 
    2428 
    2529= K.Seiwa =