Toshio Endo
TitleCited byYear
Peta-scale phase-field simulation for dendritic solidification on the TSUBAME 2.0 supercomputer
T Shimokawabe, T Aoki, T Takaki, T Endo, A Yamanaka, N Maruyama, ...
Proceedings of 2011 International Conference for High Performance Computing …, 2011
1882011
Statistical power modeling of GPU kernels using performance counters
H Nagasaka, N Maruyama, A Nukada, T Endo, S Matsuoka
International conference on green computing, 115-122, 2010
1712010
Bandwidth intensive 3-D FFT kernel for GPUs using CUDA
A Nukada, Y Ogata, T Endo, S Matsuoka
Proceedings of the 2008 ACM/IEEE conference on Supercomputing, 5, 2008
1502008
An 80-fold speedup, 15.0 TFlops full GPU acceleration of non-hydrostatic weather model ASUCA production code
T Shimokawabe, T Aoki, C Muroi, J Ishida, K Kawano, T Endo, A Nukada, ...
SC'10: Proceedings of the 2010 ACM/IEEE International Conference for High …, 2010
1392010
A scalable mark-sweep garbage collector on large-scale shared-memory machines
T Endo, K Taura, A Yonezawa
Proceedings of the 1997 ACM/IEEE conference on Supercomputing, 1-14, 1997
981997
Phoenix: a parallel programming model for accommodating dynamically joining/leaving resources
K Taura, K Kaneda, T Endo, A Yonezawa
ACM SIGPLAN Notices 38 (10), 216-229, 2003
942003
An efficient, model-based CPU-GPU heterogeneous FFT library
Y Ogata, T Endo, N Maruyama, S Matsuoka
2008 IEEE International Symposium on Parallel and Distributed Processing, 1-10, 2008
872008
Linpack evaluation on a supercomputer with heterogeneous accelerators
T Endo, S Matsuoka, A Nukada, N Maruyama
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
642010
Massive supercomputing coping with heterogeneity of modern accelerators
T Endo, S Matsuoka
2008 IEEE International Symposium on Parallel and Distributed Processing, 1-10, 2008
552008
Petaflop biofluidics simulations on a two million-core system
M Bernaschi, M Bisson, T Endo, S Matsuoka, M Fatica, S Melchionna
Proceedings of 2011 International Conference for High Performance Computing …, 2011
442011
Exploration of lossy compression for application-level checkpoint/restart
N Sasaki, K Sato, T Endo, S Matsuoka
2015 IEEE International Parallel and Distributed Processing Symposium, 914-922, 2015
432015
GPU accelerated computing–from hype to mainstream, the rebirth of vector computing
S Matsuoka, T Aoki, T Endo, A Nukada, T Kato, A Hasegawa
Journal of Physics: Conference Series 180 (1), 012043, 2009
422009
Power-aware dynamic task scheduling for heterogeneous accelerated clusters
T Hamano, T Endo, S Matsuoka
2009 IEEE International Symposium on Parallel & Distributed Processing, 1-8, 2009
342009
Access-pattern and bandwidth aware file replication algorithm in a grid environment
H Sato, S Matsuoka, T Endo, N Maruyama
Proceedings of the 2008 9th IEEE/ACM international Conference on Grid …, 2008
332008
A parallel optimization method for stencil computation on the domain that is bigger than memory capacity of GPUs
G Jin, T Endo, S Matsuoka
2013 IEEE International Conference on Cluster Computing (CLUSTER), 1-8, 2013
302013
File clustering based replication algorithm in a grid environment
H Sato, S Matsuoka, T Endo
Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster …, 2009
252009
ABARIS: An adaptable fault detection/recovery component framework for MPIs
H Jitsumoto, T Endo, S Matsuoka
2007 IEEE International Parallel and Distributed Processing Symposium, 1-8, 2007
252007
A multi-level optimization method for stencil computation on the domain that is bigger than memory capacity of GPU
G Jin, T Endo, S Matsuoka
2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013
242013
Software technologies coping with memory hierarchy of GPGPU clusters for stencil computations
T Endo, G Jin
2014 IEEE International Conference on Cluster Computing (CLUSTER), 132-139, 2014
232014
An evaluation of the potential of flash SSD as large and slow memory for stencil computations
H Midorikawa, H Tan, T Endo
2014 International Conference on High Performance Computing & Simulation …, 2014
232014
The system can't perform the operation now. Try again later.
Articles 1–20