Memcached design on high performance RDMA capable interconnects J Jose, H Subramoni, M Luo, M Zhang, J Huang, M Wasi-ur-Rahman, ... 2011 International Conference on Parallel Processing, 743-752, 2011 | 263 | 2011 |
High performance RDMA-based design of HDFS over InfiniBand NS Islam, MW Rahman, J Jose, R Rajachandrasekar, H Wang, ... SC'12: Proceedings of the International Conference on High Performance …, 2012 | 222 | 2012 |
High-performance design of Hadoop RPC with RDMA over InfiniBand X Lu, NS Islam, M Wasi-Ur-Rahman, J Jose, H Subramoni, H Wang, ... 2013 42nd International Conference on Parallel Processing, 641-650, 2013 | 159 | 2013 |
High-performance design of HBase with RDMA over InfiniBand J Huang, X Ouyang, J Jose, M Wasi-ur-Rahman, H Wang, M Luo, ... 2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012 | 109 | 2012 |
Designing topology-aware collective communication algorithms for large scale InfiniBand clusters: Case studies with scatter and gather K Kandalla, H Subramoni, A Vishnu, DK Panda 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010 | 106 | 2010 |
High-performance RDMA-based design of Hadoop MapReduce over InfiniBand M Wasi-ur-Rahman, NS Islam, X Lu, J Jose, H Subramoni, H Wang, ... 2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013 | 86 | 2013 |
Design of a scalable InfiniBand topology service to enable network-topology-aware placement of processes H Subramoni, S Potluri, K Kandalla, B Barth, J Vienne, J Keasler, ... SC'12: Proceedings of the International Conference on High Performance …, 2012 | 83 | 2012 |
Performance analysis and evaluation of InfiniBand FDR and 40GigE RoCE on HPC and cloud computing systems J Vienne, J Chen, M Wasi-Ur-Rahman, NS Islam, H Subramoni, ... 2012 IEEE 20th Annual Symposium on High-Performance Interconnects, 48-55, 2012 | 83 | 2012 |
An in-depth performance characterization of CPU- and GPU-based DNN training on modern architectures AA Awan, H Subramoni, DK Panda Proceedings of the Machine Learning on HPC Environments, 1-8, 2017 | 80 | 2017 |
Scalable Memcached design for InfiniBand clusters using hybrid transports J Jose, H Subramoni, K Kandalla, M Wasi-ur-Rahman, H Wang, ... 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2012 | 78 | 2012 |
High-performance and scalable non-blocking all-to-all with collective offload on InfiniBand clusters: a study with parallel 3D FFT K Kandalla, H Subramoni, K Tomko, D Pekurovsky, S Sur, DK Panda Computer Science-Research and Development 26 (3), 237-246, 2011 | 78 | 2011 |
Designing multi-leader-based allgather algorithms for multi-core clusters K Kandalla, H Subramoni, G Santhanaraman, M Koop, DK Panda 2009 IEEE International Symposium on Parallel & Distributed Processing, 1-8, 2009 | 67 | 2009 |
RDMA over Ethernet - a preliminary study H Subramoni, P Lai, M Luo, DK Panda 2009 IEEE International Conference on Cluster Computing and Workshops, 1-9, 2009 | 65 | 2009 |
The MVAPICH project: Transforming research into high-performance MPI library for HPC community DK Panda, H Subramoni, CH Chu, M Bayatpour Journal of Computational Science 52, 101208, 2021 | 61 | 2021 |
Design and evaluation of benchmarks for financial applications using Advanced Message Queuing Protocol (AMQP) over InfiniBand H Subramoni, G Marsh, S Narravula, P Lai, DK Panda 2008 Workshop on High Performance Computational Finance, 1-8, 2008 | 59 | 2008 |
Design and evaluation of network topology-/speed-aware broadcast algorithms for InfiniBand clusters H Subramoni, K Kandalla, J Vienne, S Sur, B Barth, K Tomko, R Mclay, ... 2011 IEEE International Conference on Cluster Computing, 317-325, 2011 | 56 | 2011 |
Scalable distributed DNN training using TensorFlow and CUDA-aware MPI: Characterization, designs, and performance evaluation AA Awan, J Bédorf, CH Chu, H Subramoni, DK Panda 2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2019 | 55 | 2019 |
MVAPICH-PRISM: A proxy-based communication framework using InfiniBand and SCIF for Intel MIC clusters S Potluri, D Bureddy, K Hamidouche, A Venkatesh, K Kandalla, ... Proceedings of the International Conference on High Performance Computing …, 2013 | 52 | 2013 |
Optimized broadcast for deep learning workloads on dense-GPU InfiniBand clusters: MPI or NCCL? AA Awan, CH Chu, H Subramoni, DK Panda Proceedings of the 25th European MPI Users' Group Meeting, 1-9, 2018 | 51 | 2018 |
High performance data transfer in grid environment using GridFTP over InfiniBand H Subramoni, P Lai, R Kettimuthu, DK Panda 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid …, 2010 | 47 | 2010 |