[[PageOutline(1)]] = __Members__ = == Faculty Member == === Senior Associate Professor N.Nakasato === High performance computing and numerical simulations in astronomy and astrophysics. http://galaxy.u-aizu.ac.jp/note/ === Honorary Professor S.G.Sedukhin === http://web-ext.u-aizu.ac.jp/~sedukhin/ == Graduate Students == == Undergraduate Students == == Former Members == = __Research Project__ = == Parallel Algorithm == == Numerical Simulations on GPU == == Innovative Hardware Design == = __Our Recent Papers__ = == 2014 == * '''A development of an accelerator board dedicated for multi-precision arithmetic operations and its application to Feynman loop integrals''', __S.Motoki__, __H.Daisaka__, __N.Nakasato__, __T.Ishikawa__, __F.Yuasa__, __T.Fukushige__, __A.Kawai__ and __J.Makino__, 2014, submitted to the proceedings of the 16th International workshop on Advanced Computing and Analysis Techniques in physics research (ACAT 2014), Prague, preprint http://arxiv.org/abs/1410.3252 * '''GPU accelerated Hybrid Tree Algorithm for Collision-less N-body Simulations''', __T.Watanabe__ and __N.Nakasato__, 2014, Fifth International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies (HEART2014), preprint http://arxiv.org/abs/1406.6158 == 2013 == * '''Studying the core-cusp problem in cold dark matter halos using N-body simulations on GPU clusters''', G.Ogiya, M.Mori, Y.Miki, T.Boku, & __N.Nakasato__, 2013, Journal of Physics: Conference Series, 454, 012014 * '''Acceleration of Feynman loop integrals in high-energy physics on many core GPUs''', F.Yuasa , T.Ishikawa, N.Hamaguchi, T.Koike and __N.Nakasato__, 2013, Journal of Physics: Conference Series, 454, 012081 == 2012 == * '''Blocked United Algorithm for the All-Pairs Shortest Paths Problem on Hybrid CPU-GPU Systems''',__K.Matsumoto__, __N. Nakasato__, & __S.Sedukhin__, 2012, IEICE Transactions, Vol.E95- D, No.12, pp. 2759-2768,Dec. 2012. * '''Performance tuning of matrix multiplication in OpenCL on different GPUs and CPUs''',__Kazuya Matsumoto__, __Naohito Nakasato__, __Stanislav G. Sedukhin__, In the 3rd International Workshop on Performace Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS12) - Proceedings of the 2012 SC Companion: High Performance Computing, Networking, Storage and Analysis (SCC), IEEE CS's Conference Publishing Service, pp. 396-405, Salt Palace Convention Center, Salt Lake City, Utah, USA, November 12, 2012. DOI:10.1109/SC.Companion.2012.59 * '''GRAPE-MPs: Implementation of an SIMD for quadruple/hexuple/octuple-precision arithmetic operation on a structured ASIC and an FPGA''', __N.Nakasato__, H.Daisaka, T.Fukushige, A.Kawai, J.Makino, F.Yuasa & T.Ishikawa, 2012, IEEE MCSoC 2012, pp.75–83 * '''Implementing a Code Generator for Fast Matrix Multiplication in OpenCL on the GPU''', __K.Matsumoto__, __N.Nakasato__, & __S.G.Sedukhin__, 2012, IEEE MCSoC 2012, pp.198–204 == 2011 == * '''Blocked All-Pairs Shortest Paths Algorithm for Hybrid CPU-GPU System''', __K.Matsumoto__, __N.Nakasato__, __S.G.Sedukhin__, HPCC 2011: 145-152., http://dx.doi.org/10.1109/HPCC.2011.28 * '''Multi-level Optimization of Matrix Multiplication for GPU-equipped Systems''', __K.Matsumoto__, __N.Nakasato__, __T.Sakai__, H.Yahagi, __S.G.Sedukhin__, Procedia CS 4: 342-351 (2011), http://dx.doi.org/10.1016/j.procs.2011.04.036 * '''GRAPE-MP: An SIMD Accelerator Board for Multi-precision Arithmetic''', H.Daisaka, __N.Nakasato__, J.Makino, F.Yuasa, T.Ishikawa,2011, http://dx.doi.org/10.1016/j.procs.2011.04.093 * '''Implementation of a Parallel Tree Method on a GPU''', __N.Nakasato__, Journal of Computational Science, 2011, http://dx.doi.org/10.1016/j.jocs.2011.01.006, [http://galaxy.u-aizu.ac.jp/note/wiki/Octree_On_GPU Recent Results] * '''A fast GEMM implementation on the cypress GPU''', __N.Nakasato__, ACM SIGMETRICS Performance Evaluation Review, 2011, http://dx.doi.org/10.1145/1964218.1964227 * '''Chemodynamical Simulations of the Milky Way Galaxy''', C.Kobayashi, __N.Nakasato__, Astrophysical Journal, 2011, http://dx.doi.org/10.1088/0004-637X/729/1/16 == 2010 == * '''A fast GEMM implementation on the cypress GPU''', __N.Nakasato__, 1st International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS 10), [http://galaxy.u-aizu.ac.jp/note/wiki/Fast_GEMM_Implementation_On_Cypress paper&slide] * '''Application of Many-core Accelerators for Problems in Astronomy and Physics ''', __N.Nakasato__, Plenary Talk ACAT2010, 2010, http://adsabs.harvard.edu//abs/2010acat.confE..15N == 2009 == * '''A compiler for high performance computing with many-core accelerators''', __N.Nakasato__, J.Makino, Cluster Computing and Workshops, 2009. CLUSTER '09, 2009, http://dx.doi.org/10.1109/CLUSTR.2009.5289127 = __Local access Only__ = [wiki:Local_Information]