[[PageOutline(1)]]

= __Members__ =

== Faculty Member ==

=== Senior Associate Professor N.Nakasato ===

High performance computing and numerical simulations in astronomy and astrophysics.

http://galaxy.u-aizu.ac.jp/note/

=== Honorary Professor S.G.Sedukhin ===

http://web-ext.u-aizu.ac.jp/~sedukhin/

== Graduate Students ==

== Undergraduate Students ==

== Former Members ==

= __Research Project__ =

== Parallel Algorithm ==

== Numerical Simulations on GPU ==

== Innovative Hardware Design ==

= __Our Recent Papers__ =

== 2015 ==

 * '''Stream Computation of Shallow Water Equation Solver for FPGA-based 1D Tsunami Simulation''', Kentaro Sano, Fumiya Kono, Naohito Nakasato, Alexander Vazhenin and Stanislav Sedukhin, 2015, International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, Boston, Jun. 1-2, 2015
 * '''A systematic study of carbon-oxygen white dwarf mergers: mass combinations for Type Ia supernovae''', Yushi Sato, Naohito Nakasato, Ataru Tanikawa, Ken'ichi Nomoto, Keiichi Maeda, Izumi Hachisu, 2015, accepted for publication in ApJ, http://arxiv.org/abs/1505.01646
 * '''Hydrodynamical evolution of merging carbon-oxygen white dwarfs: their pre-supernova structure and observational counterparts''', Ataru Tanikawa, Naohito Nakasato, Yushi Sato, Ken'ichi Nomoto, Keiichi Maeda, Izumi Hachisu, 2015, accepted for publication in ApJ, http://arxiv.org/abs/1504.06035
 * '''Implementation and Evaluation of an Efficient Parallel Architecture for Matrix Calculations''', Yuki Murakami, Naohito Nakasato, Stanislav Sedukhin, 2015, IEEE Symposium on Low-Power and High-Speed Chips, Yokohama, Apr. 13-15, 2015
 * '''OpenMP Parallelization of Tsunami Simulation by MOST''', Fumiya Kono, Naohito Nakasato, Kensaku Hayashi, Alexander Vazhenin, and Stanislav Sedukhin, 2015, Annual Meeting on Advanced Computing System and Infrastructure (ACSI) 2015, Tsukuba, Jan. 26-28, 2015 (poster paper)

== 2014 ==

 * '''A development of an accelerator board dedicated for multi-precision arithmetic operations and its application to Feynman loop integrals''', S.Motoki, H.Daisaka, __N.Nakasato__, T.Ishikawa, F.Yuasa, T.Fukushige, A.Kawai and J.Makino, 2014, submitted to the proceedings of the 16th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2014), Prague, preprint http://arxiv.org/abs/1410.3252
 * '''GPU accelerated Hybrid Tree Algorithm for Collision-less N-body Simulations''', __T.Watanabe__ and __N.Nakasato__, 2014, Fifth International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies (HEART2014), preprint http://arxiv.org/abs/1406.6158

== 2013 ==

 * '''Studying the core-cusp problem in cold dark matter halos using N-body simulations on GPU clusters''', G.Ogiya, M.Mori, Y.Miki, T.Boku, & __N.Nakasato__, 2013, Journal of Physics: Conference Series, 454, 012014, http://dx.doi.org/10.1088/1742-6596/454/1/012014
 * '''Acceleration of Feynman loop integrals in high-energy physics on many core GPUs''', F.Yuasa, T.Ishikawa, N.Hamaguchi, T.Koike and __N.Nakasato__, 2013, Journal of Physics: Conference Series, 454, 012081, http://dx.doi.org/10.1088/1742-6596/454/1/012081
 * '''Implementation and Performance Evaluation of Astrophysical Tree-code for GPU Clusters''', G.Ogiya, Y.Miki, T.Boku, M.Mori, & __N.Nakasato__, 2013, http://id.nii.ac.jp/1001/00095272/

== 2012 ==

 * '''Blocked United Algorithm for the All-Pairs Shortest Paths Problem on Hybrid CPU-GPU Systems''', __K.Matsumoto__, __N.Nakasato__, & __S.Sedukhin__, 2012, IEICE Transactions, Vol.E95-D, No.12, pp. 2759-2768, Dec. 2012, http://search.ieice.org/bin/summary.php?id=e95-d_12_2759
 * '''Performance tuning of matrix multiplication in OpenCL on different GPUs and CPUs''', __Kazuya Matsumoto__, __Naohito Nakasato__, __Stanislav G. Sedukhin__, in the 3rd International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS12), Proceedings of the 2012 SC Companion: High Performance Computing, Networking, Storage and Analysis (SCC), IEEE CS's Conference Publishing Service, pp. 396-405, Salt Palace Convention Center, Salt Lake City, Utah, USA, November 12, 2012. DOI:10.1109/SC.Companion.2012.59
 * '''GRAPE-MPs: Implementation of an SIMD for quadruple/hexuple/octuple-precision arithmetic operation on a structured ASIC and an FPGA''', __N.Nakasato__, H.Daisaka, T.Fukushige, A.Kawai, J.Makino, F.Yuasa & T.Ishikawa, 2012, IEEE MCSoC 2012, pp.75–83, http://dx.doi.org/10.1109/MCSoC.2012.31
 * '''Implementing a Code Generator for Fast Matrix Multiplication in OpenCL on the GPU''', __K.Matsumoto__, __N.Nakasato__, & __S.G.Sedukhin__, 2012, IEEE MCSoC 2012, pp.198–204, http://dx.doi.org/10.1109/MCSoC.2012.30

== 2011 ==

 * '''Blocked All-Pairs Shortest Paths Algorithm for Hybrid CPU-GPU System''', __K.Matsumoto__, __N.Nakasato__, __S.G.Sedukhin__, HPCC 2011: 145-152, http://dx.doi.org/10.1109/HPCC.2011.28
 * '''Multi-level Optimization of Matrix Multiplication for GPU-equipped Systems''', __K.Matsumoto__, __N.Nakasato__, __T.Sakai__, H.Yahagi, __S.G.Sedukhin__, Procedia CS 4: 342-351 (2011), http://dx.doi.org/10.1016/j.procs.2011.04.036
 * '''GRAPE-MP: An SIMD Accelerator Board for Multi-precision Arithmetic''', H.Daisaka, __N.Nakasato__, J.Makino, F.Yuasa, T.Ishikawa, 2011, http://dx.doi.org/10.1016/j.procs.2011.04.093
 * '''Implementation of a Parallel Tree Method on a GPU''', __N.Nakasato__, Journal of Computational Science, 2011, http://dx.doi.org/10.1016/j.jocs.2011.01.006, [http://galaxy.u-aizu.ac.jp/note/wiki/Octree_On_GPU Recent Results]
 * '''A fast GEMM implementation on the cypress GPU''', __N.Nakasato__, ACM SIGMETRICS Performance Evaluation Review, 2011, http://dx.doi.org/10.1145/1964218.1964227
 * '''Chemodynamical Simulations of the Milky Way Galaxy''', C.Kobayashi, __N.Nakasato__, Astrophysical Journal, 2011, http://dx.doi.org/10.1088/0004-637X/729/1/16

== 2010 ==

 * '''A fast GEMM implementation on the cypress GPU''', __N.Nakasato__, 1st International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS 10), [http://galaxy.u-aizu.ac.jp/note/wiki/Fast_GEMM_Implementation_On_Cypress paper&slide]
 * '''Application of Many-core Accelerators for Problems in Astronomy and Physics''', __N.Nakasato__, Plenary Talk ACAT2010, 2010, http://adsabs.harvard.edu//abs/2010acat.confE..15N

== 2009 ==

 * '''A compiler for high performance computing with many-core accelerators''', __N.Nakasato__, J.Makino, Cluster Computing and Workshops, 2009. CLUSTER '09, 2009, http://dx.doi.org/10.1109/CLUSTR.2009.5289127

= __Local access Only__ =

[wiki:Local_Information]