Wen-mei W. Hwu
Wen-mei W. Hwu
Professor and Sanders-AMD Chair of Electrical and Computer Engineering, University of Illinois at
確認したメール アドレス: illinois.edu - ホームページ
タイトル
引用先
引用先
Programming massively parallel processors: a hands-on approach
DB Kirk, WH Wen-Mei
Morgan kaufmann, 2016
34182016
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
S Ryoo, CI Rodrigues, SS Baghsorkhi, SS Stone, DB Kirk, WW Hwu
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of …, 2008
11742008
A power controlled multiple access protocol for wireless packet networks
JP Monks, V Bharghavan, WMW Hwu
Proceedings IEEE INFOCOM 2001. Conference on Computer Communications …, 2001
9282001
Effective compiler support for predicated execution using the hyperblock
SA Mahlke, DC Lin, WY Chen, RE Hank, RA Bringmann
ACM SIGMICRO Newsletter 23 (1-2), 45-54, 1992
8311992
The superblock: An effective technique for VLIW and superscalar compilation
WMW Hwu, SA Mahlke, WY Chen, PP Chang, NJ Warter, RA Bringmann, ...
Instruction-Level Parallelism, 229-248, 1993
8191993
Parboil: A revised benchmark suite for scientific and commercial throughput computing
JA Stratton, C Rodrigues, IJ Sung, N Obeid, LW Chang, N Anssari, GD Liu, ...
Center for Reliable and High-Performance Computing 127, 2012
6342012
IMPACT: An architectural framework for multiple-instruction-issue processors
PP Chang, SA Mahlke, WY Chen, NJ Warter, WW Hwu
ACM SIGARCH Computer Architecture News 19 (3), 266-275, 1991
4661991
Accelerating advanced MRI reconstructions on GPUs
SS Stone, JP Haldar, SC Tsao, BP Sutton, ZP Liang
Journal of parallel and distributed computing 68 (10), 1307-1318, 2008
3662008
An adaptive performance modeling tool for GPU architectures
SS Baghsorkhi, M Delahaye, SJ Patel, WD Gropp, WW Hwu
Proceedings of the 15th ACM SIGPLAN symposium on Principles and practice of …, 2010
3472010
Program optimization space pruning for a multithreaded GPU
S Ryoo, CI Rodrigues, SS Stone, SS Baghsorkhi, SZ Ueng, JA Stratton, ...
Proceedings of the 6th annual IEEE/ACM international symposium on Code …, 2008
3432008
Checkpoint repair for high-performance out-of-order execution machines
WMW Hwu, YN Patt
IEEE Transactions on Computers 100 (12), 1496-1514, 1987
3211987
MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs
JA Stratton, SS Stone, WH Wen-mei
International Workshop on Languages and Compilers for Parallel Computing, 16-30, 2008
3152008
Using profile information to assist classic code optimizations
PP Chang, SA Mahlke, WMW Hwu
Software: Practice and Experience 21 (12), 1301-1321, 1991
3101991
GPU computing gems emerald edition
WMW Hwu
Elsevier, 2011
3052011
CUDA-lite: Reducing GPU programming complexity
SZ Ueng, M Lathara, SS Baghsorkhi, WH Wen-mei
International Workshop on Languages and Compilers for Parallel Computing, 1-15, 2008
2792008
Achieving high instruction cache performance with an optimizing compiler
WW Hwu, PP Chang
Proceedings of the 16th annual international symposium on Computer …, 1989
2741989
GPU clusters for high-performance computing
VV Kindratenko, JJ Enos, G Shi, MT Showerman, GW Arnold, JE Stone, ...
2009 IEEE International Conference on Cluster Computing and Workshops, 1-8, 2009
2712009
An effective GPU implementation of breadth-first search
L Luo, M Wong, W Hwu
Design Automation Conference, 52-55, 2010
2472010
A comparison of full and partial predicated execution support for ILP processors
SA Mahlke, RE Hank, JE McCormick, DI August, WMW Hwu
Proceedings of the 22nd annual international symposium on Computer …, 1995
2401995
Run-time adaptive cache hierarchy management via reference analysis
TL Johnson, WMW Hwu
ACM SIGARCH Computer Architecture News 25 (2), 315-326, 1997
2341997
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20