Xiaonan Tian
Xiaonan Tian
Compiler Engineer@NVIDIA
Verified email at uh.edu - Homepage
Title
Cited by
Cited by
Year
Compiling a High-level Directive-Based Programming Model for GPGPUs
X Tian, R Xu, Y Yan, Z Yun, S Chandrasekaran, B Chapman
The 26th International Workshop on Languages and Compilers for Parallel …, 2013
592013
Nas parallel benchmarks for gpgpus using a directive-based programming model
R Xu, X Tian, S Chandrasekaran, Y Yan, B Chapman
International Workshop on Languages and Compilers for Parallel Computing, 67-81, 2014
272014
Compiler transformation of nested loops for general purpose GPUs
X Tian, R Xu, Y Yan, S Chandrasekaran, D Eachempati, B Chapman
Concurrency and Computation: Practice and Experience 28 (2), 537-556, 2016
122016
Multi-GPU support on single node using directive-based programming model
R Xu, X Tian, S Chandrasekaran, B Chapman
Scientific Programming 2015, 2015
112015
Implementing the OpenACC data model
M Wolfe, S Lee, J Kim, X Tian, R Xu, S Chandrasekaran, B Chapman
2017 IEEE International Parallel and Distributed Processing Symposium …, 2017
82017
The OpenACC data model: Preliminary study on its major challenges and implementations
M Wolfe, S Lee, J Kim, X Tian, R Xu, B Chapman, S Chandrasekaran
Parallel Computing 78, 15-27, 2018
72018
Reduction operations in parallel loops for GPGPUs
R Xu, X Tian, Y Yan, S Chandrasekaran, B Chapman
Proceedings of Programming Models and Applications on Multicores and …, 2014
72014
OpenUH: open source OpenACC compiler
X Tian, R Xu, B Chapman
GTC2014, HPCTools Group Computer Science Department University of Houston, 2014
62014
OpenACC Parallelization and optimization of NAS parallel benchmarks
R Xu, X Tian, S Chandrasekaran, Y Yan, B Chapman
Proc. GPU Technol. Conf., 1-27, 2014
62014
Optimizing GPU Register Usage: Extensions to OpenACC and Compiler Optimizations
X Tian, D Khaldi, D Eachempati, R Xu, B Chapman
2016 45th International Conference on Parallel Processing (ICPP), 572 - 581, 2016
42016
An analytical model-based auto-tuning framework for locality-aware loop scheduling
R Xu, S Chandrasekaran, X Tian, B Chapman
International Conference on High Performance Computing, 3-20, 2016
42016
Assessing one-to-one parallelism levels mapping for openmp offloading to gpus
C Shen, X Tian, D Khaldi, B Chapman
Proceedings of the 8th International Workshop on Programming Models and …, 2017
32017
Performance and power characteristics of matrix multiplication algorithms on multicore and shared memory machines
Y Yan, J Kemp, X Tian, AM Malik, B Chapman
2012 SC Companion: High Performance Computing, Networking Storage and …, 2012
32012
Acceleration of bulk memory operations in a heterogeneous multicore architecture
JH Lee, Z Liu, X Tian, DH Woo, W Shi, D Boumber, Y Yan, KA Kwon
Proceedings of the 21st international conference on Parallel architectures …, 2012
12012
A Compiler Optimization Framework for Directive-Based GPU Computing
X Tian
2016
The system can't perform the operation now. Try again later.
Articles 1–15