Sreeram Potluri, Ph.D. - Publications

Affiliations:

2014

Computer Science and Engineering

Ohio State University, Columbus, Columbus, OH

Area:

Computer Science, Computer Engineering

Year	Citation	Score
2018	Agostini E, Rossetti D, Potluri S. GPUDirect Async: Exploring GPU synchronous communication techniques for InfiniBand clusters Journal of Parallel and Distributed Computing. 114: 28-45. DOI: 10.1016/J.Jpdc.2017.12.007	0.488
2015	Shamis P, Venkata MG, Lopez MG, Baker MB, Hernandez O, Itigin Y, Dubman M, Shainer G, Graham RL, Liss L, Shahar Y, Potluri S, Rossetti D, Becker D, Poole D, et al. UCX: An Open Source Framework for HPC Network APIs and Beyond Proceedings - 2015 Ieee 23rd Annual Symposium On High-Performance Interconnects, Hoti 2015. 40-43. DOI: 10.1109/HOTI.2015.13	0.339
2014	Jose J, Potluri S, Subramoni H, Lu X, Hamidouche K, Schulz K, Sundar H, Panda DK. Designing scalable out-of-core sorting with hybrid MPI+PGAS programming models Acm International Conference Proceeding Series. 2014. DOI: 10.1145/2676870.2676880	0.626
2014	Potluri S, Hamidouche K, Bureddy D, Panda DK. MVAPICH2-MIC: A high performance MPI library for Xeon Phi clusters with infiniband Proceedings - 2013 Extreme Scaling Workshop, Xsw 2013. 25-32. DOI: 10.1109/XSW.2013.8	0.608
2014	Wang H, Potluri S, Bureddy D, Rosales C, Panda DK. GPU-aware MPI on RDMA-enabled clusters: Design, implementation and evaluation Ieee Transactions On Parallel and Distributed Systems. 25: 2595-2605. DOI: 10.1109/Tpds.2013.222	0.616
2014	Venkatesh A, Potluri S, Rajachandrasekar R, Luo M, Hamidouche K, Panda DK. High performance alltoall and allgather designs for infiniband MIC clusters Proceedings of the International Parallel and Distributed Processing Symposium, Ipdps. 637-646. DOI: 10.1109/IPDPS.2014.72	0.425
2014	Shi R, Potluri S, Hamidouche K, Perkins J, Li M, Rossetti D, Panda DKDK. Designing efficient small message transfer mechanism for inter-node MPI communication on InfiniBand GPU clusters 2014 21st International Conference On High Performance Computing, Hipc 2014. DOI: 10.1109/HiPC.2014.7116873	0.463
2014	Jose J, Hamidouche K, Lu X, Potluri S, Zhang J, Tomko K, Panda DK. High performance OpenSHMEM for Xeon Phi clusters: Extensions, runtime designs and application co-design 2014 Ieee International Conference On Cluster Computing, Cluster 2014. 10-18. DOI: 10.1109/CLUSTER.2014.6968754	0.416
2014	Jose J, Zhang J, Venkatesh A, Potluri S, Panda DDK. A comprehensive performance evaluation of OpenSHMEM libraries on InfiniBand clusters Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8356: 14-28. DOI: 10.1007/978-3-319-05215-1-2	0.364
2013	Potluri S, Bureddy D, Hamidouche K, Venkatesh A, Kandalla K, Subramoni H, Panda DK. MVAPICH-PRISM: A proxy-based communication framework using InfiniBand and SCIF for Intel MIC clusters International Conference For High Performance Computing, Networking, Storage and Analysis, Sc. DOI: 10.1145/2503210.2503288	0.767
2013	Li M, Potluri S, Hamidouche K, Jose J, Panda DK. Efficient and truly passive MPI-3 RMA using InfiniBand atomics Acm International Conference Proceeding Series. 91-96. DOI: 10.1145/2488551.2488573	0.45
2013	Hamidouche K, Potluri S, Subramoni H, Kandalla K, Panda DK. MIC-RO: Enabling efficient remote offload on heterogeneous many integrated core (MIC) clusters with InfiniBand Proceedings of the International Conference On Supercomputing. 399-408. DOI: 10.1145/2464996.2465445	0.742
2013	Potluri S, Bureddy D, Wang H, Subramoni H, Panda DK. Extending OpenSHMEM for GPU computing Proceedings - Ieee 27th International Parallel and Distributed Processing Symposium, Ipdps 2013. 1001-1012. DOI: 10.1109/IPDPS.2013.104	0.591
2013	Potluri S, Hamidouche K, Venkatesh A, Bureddy D, Panda DK. Efficient inter-Node MPI communication using GPUDirect RDMA for InfiniBand clusters with NVIDIA GPUs Proceedings of the International Conference On Parallel Processing. 80-89. DOI: 10.1109/ICPP.2013.17	0.468
2013	Kandalla K, Venkatesh A, Hamidouche K, Potluri S, Bureddy D, Panda DK. Designing optimized MPI broadcast and allreduce for Many Integrated Core (MIC) InfiniBand clusters Proceedings - Ieee 21st Annual Symposium On High-Performance Interconnects, Hoti 2013. 63-70. DOI: 10.1109/HOTI.2013.26	0.773
2013	Shi R, Potluri S, Hamidouche K, Lu X, Tomko K, Panda DK. A scalable and portable approach to accelerate hybrid HPL on heterogeneous CPU-GPU clusters Proceedings - Ieee International Conference On Cluster Computing, Iccc. DOI: 10.1109/CLUSTER.2013.6702619	0.367
2013	Potluri S, Venkatesh A, Bureddy D, Kandalla K, Panda DK. Efficient intra-node communication on Intel-MIC clusters Proceedings - 13th Ieee/Acm International Symposium On Cluster, Cloud, and Grid Computing, Ccgrid 2013. 128-135. DOI: 10.1109/CCGrid.2013.86	0.739
2013	Jose J, Potluri S, Tomko K, Panda DK. Designing scalable Graph500 benchmark with hybrid MPI+OpenSHMEM programming models Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 7905: 109-124. DOI: 10.1007/978-3-642-38750-0_9	0.479
2012	Subramoni H, Potluri S, Kandalla K, Barth B, Vienne J, Keasler J, Tomko K, Schulz K, Moody A, Panda DK. Design of a scalable InfiniBand topology service to enable network-topology-aware placement of processes International Conference For High Performance Computing, Networking, Storage and Analysis, Sc. DOI: 10.1109/SC.2012.47	0.723
2012	Potluri S, Wang H, Bureddy D, Singh AK, Rosales C, Panda DK. Optimizing MPI communication on multi-GPU systems using CUDA inter-process communication Proceedings of the 2012 Ieee 26th International Parallel and Distributed Processing Symposium Workshops, Ipdpsw 2012. 1848-1857. DOI: 10.1109/IPDPSW.2012.228	0.47
2012	Bureddy D, Wang H, Venkatesh A, Potluri S, Panda DK. OMB-GPU: A micro-benchmark suite for evaluating MPI libraries on GPU clusters Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 7490: 110-120. DOI: 10.1007/978-3-642-33518-1_16	0.467
2011	Sur S, Potluri S, Kandalla KC, Subramoni H, Panda DK, Tomko K. Codesign for infiniband clusters Computer. 44: 31-36. DOI: 10.1109/Mc.2011.265	0.761
2011	Singh AK, Potluri S, Wang H, Kandalla K, Sur S, Panda DK. MPI alltoall personalized exchange on GPGPU clusters: Design alternatives and benefit Proceedings - Ieee International Conference On Cluster Computing, Iccc. 420-427. DOI: 10.1109/CLUSTER.2011.67	0.723
2011	Wang H, Potluri S, Luo M, Singh AK, Ouyang X, Sur S, Panda DK. Optimized non-contiguous MPI datatype communication for GPU clusters: Design, implementation and evaluation with MVAPICH2 Proceedings - Ieee International Conference On Cluster Computing, Iccc. 308-316. DOI: 10.1109/CLUSTER.2011.42	0.497
2011	Wang H, Potluri S, Luo M, Singh AK, Sur S, Panda DK. MVAPICH2-GPU: Optimized GPU to GPU communication for InfiniBand clusters Computer Science - Research and Development. 26: 257-266. DOI: 10.1007/S00450-011-0171-3	0.732
2011	Potluri S, Sur S, Bureddy D, Panda DK. Design and implementation of key proposed MPI-3 one-sided communication semantics on infiniband Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6960: 321-324. DOI: 10.1007/978-3-642-24449-0_38	0.588
2011	Potluri S, Wang H, Dhanraj V, Sur S, Panda DK. Optimizing MPI one sided communication on multi-core InfiniBand clusters using shared memory backed windows Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6960: 99-109. DOI: 10.1007/978-3-642-24449-0_13	0.413
2010	Potluri S, Lai P, Tomko K, Sur S, Cui Y, Tatineni M, Schulz KW, Barth WL, Majumdar A, Panda DK. Quantifying performance benefits of overlap using MPI-2 in a seismic modeling application Proceedings of the International Conference On Supercomputing. 17-25. DOI: 10.1145/1810085.1810092	0.474
2010	Luo M, Potluri S, Lai P, Mancini EP, Subramoni H, Kandalla K, Sur S, Panda DK. High performance design and implementation of Nemesis communication layer for two-sided and one-sided MPI semantics in MVAPICH2 Proceedings of the International Conference On Parallel Processing Workshops. 377-386. DOI: 10.1109/ICPPW.2010.58	0.736
Show low-probability matches.