Verified instruction-level energy consumption measurement for NVIDIA GPUs Y Arafa, A ElWazir, A ElKanishy, Y Aly, A Elsayed, AH Badawy, ... Proceedings of the 17th ACM International Conference on Computing Frontiers …, 2020 | 40 | 2020 |
Low Overhead Instruction Latency Characterization for NVIDIA GPGPUs Y Arafa, AHA Badawy, G Chennupati, N Santhi, S Eidenbenz 2019 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2019 | 38* | 2019 |
PPT-GPU: Scalable GPU performance modeling Y Arafa, AHA Badawy, G Chennupati, N Santhi, S Eidenbenz IEEE Computer Architecture Letters 18 (1), 55-58, 2019 | 38 | 2019 |
Hybrid, scalable, trace-driven performance modeling of GPGPUs Y Arafa, AH Badawy, A ElWazir, A Barai, A Eker, G Chennupati, N Santhi, ... Proceedings of the International Conference for High Performance Computing …, 2021 | 21 | 2021 |
Fast, accurate, and scalable memory modeling of GPGPUs using reuse profiles Y Arafa, AH Badawy, G Chennupati, A Barai, N Santhi, S Eidenbenz Proceedings of the 34th ACM International Conference on Supercomputing, 1-12, 2020 | 20 | 2020 |
Demystifying the NVIDIA Ampere architecture through microbenchmarking and instruction-level analysis H Abdelkhalik, Y Arafa, N Santhi, AHA Badawy 2022 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2022 | 12 | 2022 |
GPUs cache performance estimation using reuse distance analysis Y Arafa, G Chennupati, A Barai, AHA Badawy, N Santhi, S Eidenbenz 2019 IEEE 38th International Performance Computing and Communications …, 2019 | 11 | 2019 |
Fault tolerance performance evaluation of large-scale distributed storage systems HDFS and Ceph case study Y Arafa, A Barai, M Zheng, AHA Badawy 2018 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2018 | 10 | 2018 |
PPT-SASMM: Scalable analytical shared memory model: Predicting the performance of multicore caches from a single-threaded execution trace A Barai, G Chennupati, N Santhi, AH Badawy, Y Arafa, S Eidenbenz Proceedings of the International Symposium on Memory Systems, 341-351, 2020 | 8 | 2020 |
Efficient intra-rack resource disaggregation for HPC using co-packaged DWDM photonics G Michelogiannakis, Y Arafa, B Cook, LY Dai, AHH Badawy, M Glick, ... 2023 IEEE International Conference on Cluster Computing (CLUSTER), 158-172, 2023 | 7 | 2023 |
Load-aware dynamic time synchronization in parallel discrete event simulation A Eker, Y Arafa, AHA Badawy, N Santhi, S Eidenbenz, D Ponomarev Proceedings of the 2021 ACM SIGSIM Conference on Principles of Advanced …, 2021 | 7 | 2021 |
PPT-Multicore: performance prediction of OpenMP applications using reuse profiles and analytical modeling A Barai, Y Arafa, AH Badawy, G Chennupati, N Santhi, S Eidenbenz The Journal of Supercomputing, 1-32, 2022 | 6 | 2022 |
PPT-GPU: Performance prediction toolkit for GPUs identifying the impact of caches Y Arafa, AHA Badawy, G Chennupati, N Santhi, S Eidenbenz Proceedings of the International Symposium on Memory Systems, 301-302, 2018 | 5 | 2018 |
Evaluating the fault tolerance performance of HDFS and Ceph Y Arafa, A Barai, M Zheng, AHA Badawy Proceedings of the Practice and Experience on Advanced Research Computing, 1-3, 2018 | 3 | 2018 |
NVIDIA GPGPUs Instructions Energy Consumption Y Arafa, A ElWazir, A Elkanishy, Y Aly, A Elsayed, AH Badawy, ... 2020 IEEE International Symposium on Performance Analysis of Systems and …, 2020 | 1 | 2020 |
PPT-SASMM: Scalable analytical shared memory model A Barai, G Chennupati, N Santhi, AHA Badawy, Y Arafa, SJ Eidenbenz Press of the 6th International Symposium on Memory Systems (MEMSYS). ACM …, 2020 | 1 | 2020 |
BB-ML: Basic Block Performance Prediction using Machine Learning Techniques H Abdelkhalik, S Aktar, Y Arafa, A Barai, G Chennupati, N Santhi, ... 2023 IEEE 29th International Conference on Parallel and Distributed Systems …, 2023 | | 2023 |
Modeling and Characterizing Shared and Local Memories of the Ampere GPUs H Abdelkhalik, Y Arafa, N Santhi, N Prajapati, AHA Badawy Proceedings of the International Symposium on Memory Systems, 1-3, 2023 | | 2023 |
BB-ML: Basic Block Performance Prediction using Machine Learning Techniques S Aktar, H Abdelkhalik, NH Turja, Y Arafa, A Barai, N Panda, ... arXiv preprint arXiv:2202.07798, 2022 | | 2022 |
PPT-Multicore: performance prediction of OpenMP applications using reuse profiles and analytical modeling A Barai, Y Arafa, AH Badawy, G Chennupati, N Santhi, S Eidenbenz Journal of Supercomputing 78 (LA-UR-21-22749), 2021 | | 2021 |