关注
Torsten Hoefler
Torsten Hoefler
Professor of Computer Science at ETH Zurich
在 inf.ethz.ch 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Demystifying parallel and distributed deep learning: An in-depth concurrency analysis
T Ben-Nun, T Hoefler
ACM Computing Surveys (CSUR) 52 (4), 1-43, 2019
8702019
Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks
T Hoefler, D Alistarh, T Ben-Nun, N Dryden, A Peste
Journal of Machine Learning Research 22 (241), 1-124, 2021
8102021
Gptq: Accurate post-training quantization for generative pre-trained transformers
E Frantar, S Ashkboos, T Hoefler, D Alistarh
arXiv preprint arXiv:2210.17323, 2022
6202022
The convergence of sparsified gradient methods
D Alistarh, T Hoefler, M Johansson, N Konstantinov, S Khirirat, C Renggli
Advances in Neural Information Processing Systems 31, 2018
5942018
Graph of thoughts: Solving elaborate problems with large language models
M Besta, N Blach, A Kubicek, R Gerstenberger, M Podstawski, ...
Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 17682 …, 2024
5472024
MPI: A Message-Passing Interface Standard
MPI Forum
Technical Report, 2012
458*2012
Slim fly: A cost effective low-diameter network topology
M Besta, T Hoefler
SC'14: proceedings of the international conference for high performance …, 2014
3592014
Scientific benchmarking of parallel computing systems: twelve ways to tell the masses when reporting performance results
T Hoefler, R Belli
Proceedings of the international conference for high performance computing …, 2015
3222015
Characterizing the influence of system noise on large-scale applications by simulation
T Hoefler, T Schneider, A Lumsdaine
SC'10: Proceedings of the 2010 ACM/IEEE International Conference for High …, 2010
3222010
Neural code comprehension: A learnable representation of code semantics
T Ben-Nun, AS Jakobovits, T Hoefler
Advances in neural information processing systems 31, 2018
3082018
Generic topology mapping strategies for large-scale parallel architectures
T Hoefler, M Snir
Proceedings of the international conference on Supercomputing, 75-84, 2011
3062011
The PERCS high-performance interconnect
B Arimilli, R Arimilli, V Chung, S Clark, W Denzel, B Drerup, T Hoefler, ...
2010 18th IEEE Symposium on High Performance Interconnects, 75-82, 2010
2992010
Implementation and performance analysis of non-blocking collective operations for MPI
T Hoefler, A Lumsdaine, W Rehm
Proceedings of the 2007 ACM/IEEE conference on Supercomputing, 1-10, 2007
2832007
Augment your batch: Improving generalization through instance repetition
E Hoffer, T Ben-Nun, I Hubara, N Giladi, T Hoefler, D Soudry
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
2522020
LogGOPSim: simulating large-scale applications in the LogGOPS model
T Hoefler, T Schneider, A Lumsdaine
Proceedings of the 19th ACM International Symposium on High Performance …, 2010
2292010
The digital revolution of Earth-system science
P Bauer, PD Dueben, T Hoefler, T Quintino, TC Schulthess, NP Wedi
Nature Computational Science 1 (2), 104-113, 2021
2202021
Dare: High-performance state machine replication on rdma networks
M Poke, T Hoefler
Proceedings of the 24th International Symposium on High-Performance Parallel …, 2015
1932015
Using automated performance modeling to find scalability bugs in complex codes
A Calotoiu, T Hoefler, M Poke, F Wolf
Proceedings of the International Conference on High Performance Computing …, 2013
1922013
Kilometer-scale climate models: Prospects and challenges
C Schär, O Fuhrer, A Arteaga, N Ban, C Charpilloz, S Di Girolamo, ...
Bulletin of the American Meteorological Society 101 (5), E567-E587, 2020
190*2020
OPTQ: Accurate quantization for generative pre-trained transformers
E Frantar, S Ashkboos, T Hoefler, D Alistarh
The Eleventh International Conference on Learning Representations, 2022
1872022
系统目前无法执行此操作,请稍后再试。
文章 1–20