[[PageOutline(1)]]

= __Members__ =

== Faculty Member ==

=== Senior Associate Professor N.Nakasato ===
High-performance computing and numerical simulations in astronomy and astrophysics.

http://galaxy.u-aizu.ac.jp/note/

=== Honorary Professor S.G.Sedukhin ===
http://web-ext.u-aizu.ac.jp/~sedukhin/

== Graduate Students ==

== Undergraduate Students ==

== Former Members ==

= __Research Project__ =

== Parallel Algorithm ==

== Numerical Simulations on GPU ==

== Innovative Hardware Design ==

= __Our Recent Papers__ =

== 2015 ==
 * '''Application of GRAPE9-MPX for high precision calculation in particle physics and performance results''', Hiroshi Daisaka, __Naohito Nakasato__, Tadashi Ishikawa, Fukuko Yuasa, 2015, Large Scale Computational Physics Workshop (International Conference on Computational Science), Reykjavík, Jun. 1-3, 2015
 * '''Stream Computation of Shallow Water Equation Solver for FPGA-based 1D Tsunami Simulation''', Kentaro Sano, __Fumiya Kono__, __Naohito Nakasato__, Alexander Vazhenin and __Stanislav Sedukhin__, 2015, International Symposium on Highly-Energy Efficient Accelerators and Reconfigurable Technologies, Boston, Jun. 1-2, 2015
 * '''A systematic study of carbon-oxygen white dwarf mergers: mass combinations for Type Ia supernovae''', Yushi Sato, __Naohito Nakasato__, Ataru Tanikawa, Ken'ichi Nomoto, Keiichi Maeda, Izumi Hachisu, 2015, accepted for publication in ApJ, http://arxiv.org/abs/1505.01646 http://dx.doi.org/10.1088/0004-637X/807/1/105
 * '''Hydrodynamical evolution of merging carbon-oxygen white dwarfs: their pre-supernova structure and observational counterparts''', Ataru Tanikawa, __Naohito Nakasato__, Yushi Sato, Ken'ichi Nomoto, Keiichi Maeda, Izumi Hachisu, 2015, accepted for publication in ApJ, http://arxiv.org/abs/1504.06035 http://dx.doi.org/10.1088/0004-637X/807/1/40
 * '''Implementation and Evaluation of an Efficient Parallel Architecture for Matrix Calculations''', __Yuki Murakami__, __Naohito Nakasato__, __Stanislav Sedukhin__, 2015, IEEE Symposium on Low-Power and High-Speed Chips, Yokohama, Apr. 13-15, 2015
 * '''OpenMP Parallelization of Tsunami Simulation by MOST''', __Fumiya Kono__, __Naohito Nakasato__, Kensaku Hayashi, Alexander Vazhenin, and __Stanislav Sedukhin__, 2015, Annual Meeting on Advanced Computing System and Infrastructure (ACSI) 2015, Tsukuba, Jan. 26-28, 2015 (poster paper)

== 2014 ==
 * '''A development of an accelerator board dedicated for multi-precision arithmetic operations and its application to Feynman loop integrals''', S.Motoki, H.Daisaka, __N.Nakasato__, T.Ishikawa, F.Yuasa, T.Fukushige, A.Kawai and J.Makino, 2014, submitted to the proceedings of the 16th International workshop on Advanced Computing and Analysis Techniques in physics research (ACAT 2014), Prague, preprint http://arxiv.org/abs/1410.3252
 * '''GPU accelerated Hybrid Tree Algorithm for Collision-less N-body Simulations''', __T.Watanabe__ and __N.Nakasato__, 2014, Fifth International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies (HEART2014), preprint http://arxiv.org/abs/1406.6158 http://dx.doi.org/10.1145/2693714.2693718

== 2013 ==
 * '''Studying the core-cusp problem in cold dark matter halos using N-body simulations on GPU clusters''', G.Ogiya, M.Mori, Y.Miki, T.Boku, & __N.Nakasato__, 2013, Journal of Physics: Conference Series, 454, 012014, http://dx.doi.org/10.1088/1742-6596/454/1/012014
 * '''Acceleration of Feynman loop integrals in high-energy physics on many core GPUs''', F.Yuasa, T.Ishikawa, N.Hamaguchi, T.Koike and __N.Nakasato__, 2013, Journal of Physics: Conference Series, 454, 012081, http://dx.doi.org/10.1088/1742-6596/454/1/012081
 * '''Implementation and Performance Evaluation of Astrophysical Tree-code for GPU Clusters''', G.Ogiya, Y.Miki, T.Boku, M.Mori, & __N.Nakasato__, 2013, http://id.nii.ac.jp/1001/00095272/

== 2012 ==
 * '''Blocked United Algorithm for the All-Pairs Shortest Paths Problem on Hybrid CPU-GPU Systems''', __K.Matsumoto__, __N.Nakasato__, & __S.Sedukhin__, 2012, IEICE Transactions, Vol.E95-D, No.12, pp. 2759-2768, Dec. 2012, http://search.ieice.org/bin/summary.php?id=e95-d_12_2759
 * '''Performance tuning of matrix multiplication in OpenCL on different GPUs and CPUs''', __Kazuya Matsumoto__, __Naohito Nakasato__, __Stanislav G. Sedukhin__, In the 3rd International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS12) - Proceedings of the 2012 SC Companion: High Performance Computing, Networking, Storage and Analysis (SCC), IEEE CS's Conference Publishing Service, pp. 396-405, Salt Palace Convention Center, Salt Lake City, Utah, USA, November 12, 2012. DOI:10.1109/SC.Companion.2012.59
 * '''GRAPE-MPs: Implementation of an SIMD for quadruple/hexuple/octuple-precision arithmetic operation on a structured ASIC and an FPGA''', __N.Nakasato__, H.Daisaka, T.Fukushige, A.Kawai, J.Makino, F.Yuasa & T.Ishikawa, 2012, IEEE MCSoC 2012, pp.75–83, http://dx.doi.org/10.1109/MCSoC.2012.31
 * '''Implementing a Code Generator for Fast Matrix Multiplication in OpenCL on the GPU''', __K.Matsumoto__, __N.Nakasato__, & __S.G.Sedukhin__, 2012, IEEE MCSoC 2012, pp.198–204, http://dx.doi.org/10.1109/MCSoC.2012.30

== 2011 ==
 * '''Blocked All-Pairs Shortest Paths Algorithm for Hybrid CPU-GPU System''', __K.Matsumoto__, __N.Nakasato__, __S.G.Sedukhin__, HPCC 2011: 145-152, http://dx.doi.org/10.1109/HPCC.2011.28
 * '''Multi-level Optimization of Matrix Multiplication for GPU-equipped Systems''', __K.Matsumoto__, __N.Nakasato__, __T.Sakai__, H.Yahagi, __S.G.Sedukhin__, Procedia CS 4: 342-351 (2011), http://dx.doi.org/10.1016/j.procs.2011.04.036
 * '''GRAPE-MP: An SIMD Accelerator Board for Multi-precision Arithmetic''', H.Daisaka, __N.Nakasato__, J.Makino, F.Yuasa, T.Ishikawa, 2011, http://dx.doi.org/10.1016/j.procs.2011.04.093
 * '''Implementation of a Parallel Tree Method on a GPU''', __N.Nakasato__, Journal of Computational Science, 2011, http://dx.doi.org/10.1016/j.jocs.2011.01.006, [http://galaxy.u-aizu.ac.jp/note/wiki/Octree_On_GPU Recent Results]
 * '''A fast GEMM implementation on the cypress GPU''', __N.Nakasato__, ACM SIGMETRICS Performance Evaluation Review, 2011, http://dx.doi.org/10.1145/1964218.1964227
 * '''Chemodynamical Simulations of the Milky Way Galaxy''', C.Kobayashi, __N.Nakasato__, Astrophysical Journal, 2011, http://dx.doi.org/10.1088/0004-637X/729/1/16

== 2010 ==
 * '''A fast GEMM implementation on the cypress GPU''', __N.Nakasato__, 1st International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS 10), [http://galaxy.u-aizu.ac.jp/note/wiki/Fast_GEMM_Implementation_On_Cypress paper&slide]
 * '''Application of Many-core Accelerators for Problems in Astronomy and Physics''', __N.Nakasato__, Plenary Talk ACAT2010, 2010, http://adsabs.harvard.edu//abs/2010acat.confE..15N

== 2009 ==
 * '''A compiler for high performance computing with many-core accelerators''', __N.Nakasato__, J.Makino, Cluster Computing and Workshops, 2009. CLUSTER '09, 2009, http://dx.doi.org/10.1109/CLUSTR.2009.5289127

= __Local access Only__ =
[wiki:Local_Information]