High-performance and memory-saving sparse general matrix-matrix multiplication for NVIDIA Pascal GPU Y Nagasaka, A Nukada, S Matsuoka 2017 46th International Conference on Parallel Processing (ICPP), 101-110, 2017 | 84 | 2017 |
High-performance sparse matrix-matrix products on Intel KNL and multicore architectures Y Nagasaka, S Matsuoka, A Azad, A Buluç Workshop Proceedings of the 47th International Conference on Parallel …, 2018 | 68 | 2018 |
Performance optimization, modeling and analysis of sparse matrix-matrix products on multi-core and many-core processors Y Nagasaka, S Matsuoka, A Azad, A Buluç Parallel Computing 90, 102545, 2019 | 50 | 2019 |
Adaptive multi-level blocking optimization for sparse matrix vector multiplication on GPU Y Nagasaka, A Nukada, S Matsuoka Procedia Computer Science 80, 131-142, 2016 | 27 | 2016 |
Cache-aware sparse matrix formats for Kepler GPU Y Nagasaka, A Nukada, S Matsuoka 2014 20th IEEE International Conference on Parallel and Distributed Systems …, 2014 | 22 | 2014 |
Batched sparse matrix multiplication for accelerating graph convolutional networks Y Nagasaka, A Nukada, R Kojima, S Matsuoka 2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2019 | 9 | 2019 |
A traffic-aware memory-cube network using bypassing Y Shikama, R Kawano, H Matsutani, H Amano, Y Nagasaka, N Fukumoto, ... Microprocessors and Microsystems 90, 104471, 2022 | 3 | 2022 |
Efficient collision-free MTTKRP algorithm for multi-core CPUs with less memory usage Y Nagasaka, N Fukumoto 2022 22nd IEEE International Symposium on Cluster, Cloud and Internet …, 2022 | 1 | 2022 |
Low-latency low-energy memory-cube networks using dual-voltage datapaths Y Shikama, R Kawano, H Matsutani, H Amano, Y Nagasaka, N Fukumoto, ... 2021 29th Euromicro International Conference on Parallel, Distributed and …, 2021 | 1 | 2021 |
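Performance evaluation of a cache-aware sparse matrix-vector multiplication method for GPUs Y Nagasaka, A Nukada, S Matsuoka Research Reports on High Performance Computing (HPC) 2014 (5), 1-9, 2014 | 1 | 2014 |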
Performance Evaluation on Auto Tuned MPI Communication Y Hu, S Hirasawa, T Honda, Y Nagasaka, N Fukumoto, M Koibuchi IEICE Technical Report; IEICE Tech. Rep. 121 (425), 97-102, 2022 | | 2022 |
MRG8 Random Number Generator Library (MRG8) v1.0 JM Shalf, K Miura, Y Nagasaka Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States), 2019 | | 2019 |
MRG8 J Shalf, Y Nagasaka, K Miura, S Matsuoka, A Nukada Proceedings of the Platform for Advanced Scientific Computing Conference, 2018 | | 2018 |
MRG8: Random Number Generation for the Exascale Era Y Nagasaka, A Nukada, S Matsuoka, K Miura, J Shalf Proceedings of the Platform for Advanced Scientific Computing Conference, 1-11, 2018 | | 2018 |
A memory access reduction method for GPUs targeting sparse matrix-vector multiplication Y Nagasaka, A Nukada, S Matsuoka Research Reports on High Performance Computing (HPC) 2015 (8), 1-7, 2015 | | 2015 |
Multi-level Blocking Optimization for Fast Sparse Matrix Vector Multiplication on GPUs Y Nagasaka, A Nukada, S Matsuoka | | 2015 |
Cache-aware Sparse Matrix Format for GPU Y Nagasaka, A Nukada, S Matsuoka Cache-aware Sparse Matrix Format for GPU, 2014 | | 2014 |
Communication Optimization by Autotuning in Parallel Computers Y Hu, S Hirasawa, T Honda, Y Nagasaka, N Fukumoto, M Koibuchi IEICE Technical Report; IEICE Tech. Rep., 0 | | |
Fast Sparse General Matrix-Matrix Multiplication on GPU with Low Memory Usage Y Nagasaka, A Nukada, S Matsuoka | | |