wiki:WikiStart

Version 57 (modified by nakasato, 10 years ago) (diff)

--

Members

Faculty Member

Senior Associate Professor N.Nakasato

High performance computing and simulations in astronomy and astrophysics.

http://galaxy.u-aizu.ac.jp/note/

Honorary Professor S.G.Sedukhin

http://web-ext.u-aizu.ac.jp/~sedukhin/

Graduate Students

Undergraduate Students

Former Members

Research Project

Parallel Algorithm

Numerical Simulations on GPU

Innovative Hardware Design

Our Recent Papers

2014

  • GPU accelerated Hybrid Tree Algorithm for Collision-less N-body Simulations, T.Watanabe and N.Nakasato, 2014, Fifth International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies (HEART2014), preprint http://arxiv.org/abs/1406.6158

2013

  • Studying the core-cusp problem in cold dark matter halos using N-body simulations on GPU clusters, G.Ogiya, M.Mori, Y.Miki, T.Boku, & N.Nakasato, 2013, Journal of Physics: Conference Series, 454, 012014

  • Acceleration of Feynman loop integrals in high-energy physics on many core GPUs, F.Yuasa , T.Ishikawa, N.Hamaguchi, T.Koike and N.Nakasato, 2013, Journal of Physics: Conference Series, 454, 012081

2012

  • Blocked United Algorithm for the All-Pairs Shortest Paths Problem on Hybrid CPU-GPU Systems,K.Matsumoto, N. Nakasato, & S.Sedukhin, 2012, IEICE Transactions, Vol.E95- D, No.12, pp. 2759-2768,Dec. 2012.
  • Performance tuning of matrix multiplication in OpenCL on different GPUs and CPUs,Kazuya Matsumoto, Naohito Nakasato, Stanislav G. Sedukhin, In the 3rd International Workshop on Performace Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS12) - Proceedings of the 2012 SC Companion: High Performance Computing, Networking, Storage and Analysis (SCC), IEEE CS's Conference Publishing Service, pp. 396-405, Salt Palace Convention Center, Salt Lake City, Utah, USA, November 12, 2012. DOI:10.1109/SC.Companion.2012.59
  • GRAPE-MPs: Implementation of an SIMD for quadruple/hexuple/octuple-precision arithmetic operation on a structured ASIC and an FPGA, N.Nakasato, H.Daisaka, T.Fukushige, A.Kawai, J.Makino, F.Yuasa & T.Ishikawa, 2012, IEEE MCSoC 2012, pp.75–83
  • Implementing a Code Generator for Fast Matrix Multiplication in OpenCL on the GPU, K.Matsumoto, N.Nakasato, & S.G.Sedukhin, 2012, IEEE MCSoC 2012, pp.198–204

2011

2010

  • A fast GEMM implementation on the cypress GPU, N.Nakasato, 1st International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS 10), paper&slide

2009

Local access Only

Local_Information