Yusuke Nagasaka
Verified email at fujitsu.com - Homepage
Title
Cited by
High-performance and memory-saving sparse general matrix-matrix multiplication for NVIDIA Pascal GPU
Y Nagasaka, A Nukada, S Matsuoka
2017 46th International Conference on Parallel Processing (ICPP), 101-110, 2017
Cited by 70, 2017
High-performance sparse matrix-matrix products on Intel KNL and multicore architectures
Y Nagasaka, S Matsuoka, A Azad, A Buluç
Workshop Proceedings of the 47th International Conference on Parallel …, 2018
Cited by 60, 2018
Performance optimization, modeling and analysis of sparse matrix-matrix products on multi-core and many-core processors
Y Nagasaka, S Matsuoka, A Azad, A Buluç
Parallel Computing 90, 102545, 2019
Cited by 39, 2019
Adaptive multi-level blocking optimization for sparse matrix vector multiplication on GPU
Y Nagasaka, A Nukada, S Matsuoka
Procedia Computer Science 80, 131-142, 2016
Cited by 24, 2016
Cache-aware sparse matrix formats for Kepler GPU
Y Nagasaka, A Nukada, S Matsuoka
2014 20th IEEE International Conference on Parallel and Distributed Systems …, 2014
Cited by 21, 2014
Batched sparse matrix multiplication for accelerating graph convolutional networks
Y Nagasaka, A Nukada, R Kojima, S Matsuoka
2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2019
Cited by 8, 2019
A traffic-aware memory-cube network using bypassing
Y Shikama, R Kawano, H Matsutani, H Amano, Y Nagasaka, N Fukumoto, ...
Microprocessors and Microsystems 90, 104471, 2022
Cited by 2, 2022
Efficient collision-free MTTKRP algorithm for multi-core CPUs with less memory usage
Y Nagasaka, N Fukumoto
2022 22nd IEEE International Symposium on Cluster, Cloud and Internet …, 2022
Cited by 1, 2022
Low-latency low-energy memory-cube networks using dual-voltage datapaths
Y Shikama, R Kawano, H Matsutani, H Amano, Y Nagasaka, N Fukumoto, ...
2021 29th Euromicro International Conference on Parallel, Distributed and …, 2021
Cited by 1, 2021
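Performance evaluation of a cache-aware sparse matrix-vector multiplication method for GPUs (original title: GPU のキャッシュを考慮した疎行列ベクトル積計算手法の性能評価)
Y Nagasaka, A Nukada, S Matsuoka
Research Reports on High Performance Computing (HPC) 2014 (5), 1-9, 2014
Cited by 1, 2014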
Performance Evaluation on Auto Tuned MPI Communication
Y Hu, S Hirasawa, T Honda, Y Nagasaka, N Fukumoto, M Koibuchi
IEICE Technical Report; IEICE Tech. Rep. 121 (425), 97-102, 2022
2022
MRG8 Random Number Generator Library (MRG8) v1.0
JM Shalf, K Miura, Y Nagasaka
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States), 2019
2019
MRG8: Random Number Generation for the Exascale Era
Y Nagasaka, A Nukada, S Matsuoka, K Miura, J Shalf
Proceedings of the Platform for Advanced Scientific Computing Conference, 1-11, 2018
2018
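A memory access reduction method for sparse matrix-vector multiplication on GPUs (original title: 疎行列ベクトル積計算を対象とした GPU 向けメモリアクセス削減手法)
Y Nagasaka, A Nukada, S Matsuoka
Research Reports on High Performance Computing (HPC) 2015 (8), 1-7, 2015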
2015
Multi-level Blocking Optimization for Fast Sparse Matrix Vector Multiplication on GPUs
Y Nagasaka, A Nukada, S Matsuoka
2015
Cache-aware Sparse Matrix Format for GPU
Y Nagasaka, A Nukada, S Matsuoka
Cache-aware Sparse Matrix Format for GPU, 2014
2014
Communication Optimization by Autotuning in Parallel Computers
Y Hu, S Hirasawa, T Honda, Y Nagasaka, N Fukumoto, M Koibuchi
IEICE Technical Report; IEICE Tech. Rep., 0
Fast Sparse General Matrix-Matrix Multiplication on GPU with Low Memory Usage
Y Nagasaka, A Nukada, S Matsuoka