Year |
Citation |
Score |
2018 |
Agostini E, Rossetti D, Potluri S. GPUDirect Async: Exploring GPU synchronous communication techniques for InfiniBand clusters Journal of Parallel and Distributed Computing. 114: 28-45. DOI: 10.1016/J.Jpdc.2017.12.007 |
0.488 |
|
2015 |
Shamis P, Venkata MG, Lopez MG, Baker MB, Hernandez O, Itigin Y, Dubman M, Shainer G, Graham RL, Liss L, Shahar Y, Potluri S, Rossetti D, Becker D, Poole D, et al. UCX: An Open Source Framework for HPC Network APIs and Beyond Proceedings - 2015 Ieee 23rd Annual Symposium On High-Performance Interconnects, Hoti 2015. 40-43. DOI: 10.1109/HOTI.2015.13 |
0.339 |
|
2014 |
Jose J, Potluri S, Subramoni H, Lu X, Hamidouche K, Schulz K, Sundar H, Panda DK. Designing scalable out-of-core sorting with hybrid MPI+PGAS programming models Acm International Conference Proceeding Series. 2014. DOI: 10.1145/2676870.2676880 |
0.626 |
|
2014 |
Potluri S, Hamidouche K, Bureddy D, Panda DK. MVAPICH2-MIC: A high performance MPI library for Xeon Phi clusters with infiniband Proceedings - 2013 Extreme Scaling Workshop, Xsw 2013. 25-32. DOI: 10.1109/XSW.2013.8 |
0.608 |
|
2014 |
Wang H, Potluri S, Bureddy D, Rosales C, Panda DK. GPU-aware MPI on RDMA-enabled clusters: Design, implementation and evaluation Ieee Transactions On Parallel and Distributed Systems. 25: 2595-2605. DOI: 10.1109/Tpds.2013.222 |
0.616 |
|
2014 |
Venkatesh A, Potluri S, Rajachandrasekar R, Luo M, Hamidouche K, Panda DK. High performance alltoall and allgather designs for infiniband MIC clusters Proceedings of the International Parallel and Distributed Processing Symposium, Ipdps. 637-646. DOI: 10.1109/IPDPS.2014.72 |
0.425 |
|
2014 |
Shi R, Potluri S, Hamidouche K, Perkins J, Li M, Rossetti D, Panda DKDK. Designing efficient small message transfer mechanism for inter-node MPI communication on InfiniBand GPU clusters 2014 21st International Conference On High Performance Computing, Hipc 2014. DOI: 10.1109/HiPC.2014.7116873 |
0.463 |
|
2014 |
Jose J, Hamidouche K, Lu X, Potluri S, Zhang J, Tomko K, Panda DK. High performance OpenSHMEM for Xeon Phi clusters: Extensions, runtime designs and application co-design 2014 Ieee International Conference On Cluster Computing, Cluster 2014. 10-18. DOI: 10.1109/CLUSTER.2014.6968754 |
0.416 |
|
2014 |
Jose J, Zhang J, Venkatesh A, Potluri S, Panda DDK. A comprehensive performance evaluation of OpenSHMEM libraries on InfiniBand clusters Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8356: 14-28. DOI: 10.1007/978-3-319-05215-1-2 |
0.364 |
|
2013 |
Potluri S, Bureddy D, Hamidouche K, Venkatesh A, Kandalla K, Subramoni H, Panda DK. MVAPICH-PRISM: A proxy-based communication framework using InfiniBand and SCIF for Intel MIC clusters International Conference For High Performance Computing, Networking, Storage and Analysis, Sc. DOI: 10.1145/2503210.2503288 |
0.767 |
|
2013 |
Li M, Potluri S, Hamidouche K, Jose J, Panda DK. Efficient and truly passive MPI-3 RMA using InfiniBand atomics Acm International Conference Proceeding Series. 91-96. DOI: 10.1145/2488551.2488573 |
0.45 |
|
2013 |
Hamidouche K, Potluri S, Subramoni H, Kandalla K, Panda DK. MIC-RO: Enabling efficient remote offload on heterogeneous many integrated core (MIC) clusters with InfiniBand Proceedings of the International Conference On Supercomputing. 399-408. DOI: 10.1145/2464996.2465445 |
0.742 |
|
2013 |
Potluri S, Bureddy D, Wang H, Subramoni H, Panda DK. Extending OpenSHMEM for GPU computing Proceedings - Ieee 27th International Parallel and Distributed Processing Symposium, Ipdps 2013. 1001-1012. DOI: 10.1109/IPDPS.2013.104 |
0.591 |
|
2013 |
Potluri S, Hamidouche K, Venkatesh A, Bureddy D, Panda DK. Efficient inter-Node MPI communication using GPUDirect RDMA for InfiniBand clusters with NVIDIA GPUs Proceedings of the International Conference On Parallel Processing. 80-89. DOI: 10.1109/ICPP.2013.17 |
0.468 |
|
2013 |
Kandalla K, Venkatesh A, Hamidouche K, Potluri S, Bureddy D, Panda DK. Designing optimized MPI broadcast and allreduce for Many Integrated Core (MIC) InfiniBand clusters Proceedings - Ieee 21st Annual Symposium On High-Performance Interconnects, Hoti 2013. 63-70. DOI: 10.1109/HOTI.2013.26 |
0.773 |
|
2013 |
Shi R, Potluri S, Hamidouche K, Lu X, Tomko K, Panda DK. A scalable and portable approach to accelerate hybrid HPL on heterogeneous CPU-GPU clusters Proceedings - Ieee International Conference On Cluster Computing, Iccc. DOI: 10.1109/CLUSTER.2013.6702619 |
0.367 |
|
2013 |
Potluri S, Venkatesh A, Bureddy D, Kandalla K, Panda DK. Efficient intra-node communication on Intel-MIC clusters Proceedings - 13th Ieee/Acm International Symposium On Cluster, Cloud, and Grid Computing, Ccgrid 2013. 128-135. DOI: 10.1109/CCGrid.2013.86 |
0.739 |
|
2013 |
Jose J, Potluri S, Tomko K, Panda DK. Designing scalable Graph500 benchmark with hybrid MPI+OpenSHMEM programming models Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 7905: 109-124. DOI: 10.1007/978-3-642-38750-0_9 |
0.479 |
|
2012 |
Subramoni H, Potluri S, Kandalla K, Barth B, Vienne J, Keasler J, Tomko K, Schulz K, Moody A, Panda DK. Design of a scalable InfiniBand topology service to enable network-topology-aware placement of processes International Conference For High Performance Computing, Networking, Storage and Analysis, Sc. DOI: 10.1109/SC.2012.47 |
0.723 |
|
2012 |
Potluri S, Wang H, Bureddy D, Singh AK, Rosales C, Panda DK. Optimizing MPI communication on multi-GPU systems using CUDA inter-process communication Proceedings of the 2012 Ieee 26th International Parallel and Distributed Processing Symposium Workshops, Ipdpsw 2012. 1848-1857. DOI: 10.1109/IPDPSW.2012.228 |
0.47 |
|
2012 |
Bureddy D, Wang H, Venkatesh A, Potluri S, Panda DK. OMB-GPU: A micro-benchmark suite for evaluating MPI libraries on GPU clusters Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 7490: 110-120. DOI: 10.1007/978-3-642-33518-1_16 |
0.467 |
|
2011 |
Sur S, Potluri S, Kandalla KC, Subramoni H, Panda DK, Tomko K. Codesign for infiniband clusters Computer. 44: 31-36. DOI: 10.1109/Mc.2011.265 |
0.761 |
|
2011 |
Singh AK, Potluri S, Wang H, Kandalla K, Sur S, Panda DK. MPI alltoall personalized exchange on GPGPU clusters: Design alternatives and benefit Proceedings - Ieee International Conference On Cluster Computing, Iccc. 420-427. DOI: 10.1109/CLUSTER.2011.67 |
0.723 |
|
2011 |
Wang H, Potluri S, Luo M, Singh AK, Ouyang X, Sur S, Panda DK. Optimized non-contiguous MPI datatype communication for GPU clusters: Design, implementation and evaluation with MVAPICH2 Proceedings - Ieee International Conference On Cluster Computing, Iccc. 308-316. DOI: 10.1109/CLUSTER.2011.42 |
0.497 |
|
2011 |
Wang H, Potluri S, Luo M, Singh AK, Sur S, Panda DK. MVAPICH2-GPU: Optimized GPU to GPU communication for InfiniBand clusters Computer Science - Research and Development. 26: 257-266. DOI: 10.1007/S00450-011-0171-3 |
0.732 |
|
2011 |
Potluri S, Sur S, Bureddy D, Panda DK. Design and implementation of key proposed MPI-3 one-sided communication semantics on infiniband Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6960: 321-324. DOI: 10.1007/978-3-642-24449-0_38 |
0.588 |
|
2011 |
Potluri S, Wang H, Dhanraj V, Sur S, Panda DK. Optimizing MPI one sided communication on multi-core InfiniBand clusters using shared memory backed windows Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6960: 99-109. DOI: 10.1007/978-3-642-24449-0_13 |
0.413 |
|
2010 |
Potluri S, Lai P, Tomko K, Sur S, Cui Y, Tatineni M, Schulz KW, Barth WL, Majumdar A, Panda DK. Quantifying performance benefits of overlap using MPI-2 in a seismic modeling application Proceedings of the International Conference On Supercomputing. 17-25. DOI: 10.1145/1810085.1810092 |
0.474 |
|
2010 |
Luo M, Potluri S, Lai P, Mancini EP, Subramoni H, Kandalla K, Sur S, Panda DK. High performance design and implementation of Nemesis communication layer for two-sided and one-sided MPI semantics in MVAPICH2 Proceedings of the International Conference On Parallel Processing Workshops. 377-386. DOI: 10.1109/ICPPW.2010.58 |
0.736 |
|
Show low-probability matches. |