Pedro VALERO-LARA
Computer Scientist at ORNL
Verified email at ornl.gov
Title
Cited by
Year
The design and performance of batched BLAS on modern high-performance computing systems
J Dongarra, S Hammarling, NJ Higham, SD Relton, P Valero-Lara, ...
Procedia Computer Science 108, 495-504, 2017
Cited by: 41; Year: 2017
Fast finite difference Poisson solvers on heterogeneous architectures
P Valero-Lara, A Pinelli, M Prieto-Matias
Computer Physics Communications 185 (4), 1265-1272, 2014
Cited by: 33; Year: 2014
Accelerating fluid–solid simulations (Lattice-Boltzmann & Immersed-Boundary) on heterogeneous architectures
P Valero-Lara, FD Igual, M Prieto-Matías, A Pinelli, J Favier
Journal of Computational Science 10, 249-261, 2015
Cited by: 32; Year: 2015
Accelerating solid-fluid interaction using lattice-Boltzmann and immersed boundary coupled simulations on heterogeneous platforms
P Valero-Lara, A Pinelli, M Prieto-Matias
Procedia Computer Science 29, 50-61, 2014
Cited by: 27; Year: 2014
A proposed API for batched basic linear algebra subprograms
J Dongarra, I Duff, M Gates, A Haidar, S Hammarling, NJ Higham, J Hogg, ...
Manchester Institute for Mathematical Sciences, University of Manchester, 2016
Cited by: 24; Year: 2016
Block tridiagonal solvers on heterogeneous architectures
P Valero-Lara, A Pinelli, J Favier, MP Matias
2012 IEEE 10th International Symposium on Parallel and Distributed …, 2012
Cited by: 24; Year: 2012
Heterogeneous CPU+GPU approaches for mesh refinement over Lattice‐Boltzmann simulations
P Valero‐Lara, J Jansson
Concurrency and Computation: Practice and Experience 29 (7), e3919, 2017
Cited by: 20; Year: 2017
Similarity search implementations for multi-core and many-core processors
R Uribe-Paredes, P Valero-Lara, E Arias, JL Sánchez, D Cazorla
2011 International Conference on High Performance Computing & Simulation …, 2011
Cited by: 20; Year: 2011
Accelerating solid–fluid interaction based on the immersed boundary method on multicore and GPU architectures
P Valero-Lara
The Journal of Supercomputing 70 (2), 799-815, 2014
Cited by: 18; Year: 2014
cuHinesBatch: Solving multiple Hines systems on GPUs Human Brain Project
P Valero-Lara, I Martínez-Perez, AJ Pena, X Martorell, R Sirvent, ...
Procedia Computer Science 108, 566-575, 2017
Cited by: 17; Year: 2017
A GPU-based implementation for range queries on Spaghettis data structure
R Uribe-Paredes, P Valero-Lara, E Arias, JL Sánchez, D Cazorla
International Conference on Computational Science and Its Applications, 615-629, 2011
Cited by: 17; Year: 2011
cuThomasBatch and cuThomasVBatch, CUDA Routines to compute batch of tridiagonal systems on NVIDIA GPUs
P Valero‐Lara, I Martínez‐Pérez, R Sirvent, X Martorell, AJ Peña
Concurrency and Computation: Practice and Experience 30 (24), e4909, 2018
Cited by: 16; Year: 2018
Improving the performance for the range search on metric spaces using a multi-GPU platform
R Uribe-Paredes, E Arias, JL Sánchez, D Cazorla, P Valero-Lara
International Conference on Database and Expert Systems Applications, 442-449, 2012
Cited by: 15; Year: 2012
Performance evaluation of cuDNN convolution algorithms on NVIDIA Volta GPUs
M Jorda, P Valero-Lara, AJ Peña
IEEE Access 7, 70461-70473, 2019
Cited by: 14; Year: 2019
Many-task computing on many-core architectures
P Valero-Lara, P Nookala, FL Pelayo, J Jansson, S Dimitropoulos, I Raicu
Scalable Computing: Practice and Experience 17 (1), 32-46, 2016
Cited by: 14; Year: 2016
Multi-GPU acceleration of DARTEL (early detection of Alzheimer)
P Valero-Lara
2014 IEEE International Conference on Cluster Computing (CLUSTER), 346-354, 2014
Cited by: 14; Year: 2014
Towards a more efficient use of GPUs
P Valero, FL Pelayo
2011 International Conference on Computational Science and Its Applications, 3-9, 2011
Cited by: 14; Year: 2011
A non-uniform Staggered Cartesian grid approach for Lattice-Boltzmann method
P Valero-Lara, J Jansson
Procedia Computer Science 51, 296-305, 2015
Cited by: 12; Year: 2015
A GPU-based implementation of the MRF algorithm in ITK package
P Valero, JL Sánchez, D Cazorla, E Arias
The Journal of Supercomputing 58 (3), 403-410, 2011
Cited by: 11; Year: 2011
Reducing memory requirements for large size LBM simulations on GPUs
P Valero‐Lara
Concurrency and Computation: Practice and Experience 29 (24), e4221, 2017
Cited by: 10; Year: 2017
Articles 1–20