Nexus: a GPU cluster engine for accelerating DNN-based video analysis H Shen, L Chen, Y Jin, L Zhao, B Kong, M Philipose, A Krishnamurthy, ... Proceedings of the 27th ACM Symposium on Operating Systems Principles, 322-337, 2019 | 206* | 2019 |
AutoLRS: Automatic Learning-Rate Schedule by Bayesian Optimization on the Fly Y Jin, T Zhou, L Zhao, Y Zhu, C Guo, M Canini, A Krishnamurthy arXiv preprint arXiv:2105.10762, 2021 | 19 | 2021 |
Efficient Direct-Connect Topologies for Collective Communications L Zhao, S Pal, T Chugh, W Wang, J Fantl, P Basu, J Khoury, ... arXiv preprint arXiv:2202.03356, 2022 | 3* | 2022 |
Bandwidth Optimal Pipeline Schedule for Collective Communication L Zhao, A Krishnamurthy arXiv preprint arXiv:2305.18461, 2023 | 2 | 2023 |
ForestColl: Efficient Collective Communications on Heterogeneous Network Fabrics L Zhao, S Maleki, Z Yang, H Pourreza, A Shah, C Hwang, ... arXiv preprint arXiv:2402.06787, 2024 | | 2024 |
Efficient All-to-All Collective Communication Schedules for Direct-Connect Topologies S Pal, L Zhao, J Fantl, J Khoury, A Krishnamurthy, P Basu arXiv preprint arXiv:2309.13541, 2023 | | 2023 |
Nexus: A GPU Cluster Engine for Accelerating Neural Networks Based Video Analysis H Shen, L Chen, Y Jin, L Zhao, B Kong, M Philipose, A Krishnamurthy, ... | | |