Version 4 (modified by nakasato, 10 years ago)
Y.Suzuki
Fast N-Body Calculation Implemented by OpenCL with Vectorization
I compared the performance of N-body simulations on a CPU and a GPU using several optimization techniques. Each program is written in OpenCL, which provides a standardized API for devices such as GPUs. An important optimization technique in OpenCL is vectorization, which lets a program treat multiple values as a single vector variable. As a result, the program that treated four variables as one achieved the best performance. I then optimized the program using the shuffle function and found that the N-body calculation with the shuffle function was about 1.3 times faster than without it. I also found that the Intel SDK is able to vectorize the kernel program efficiently.