John Mellor-Crummey - Publications

Affiliations: 
Rice University, Houston, TX 
Area:
Computer Science

86 high-probability publications. We are testing a new system for linking publications to authors. You can help! If you notice any inaccuracies, please sign in and mark papers as correct or incorrect matches. If you identify any major omissions or other inaccuracies in the publication list, please let us know.

Year Citation  Score
2016 Yang C, Mellor-Crummey J. A practical solution to the cactus stack problem Annual Acm Symposium On Parallelism in Algorithms and Architectures. 11: 61-70. DOI: 10.1145/2935764.2935787  0.356
2016 Yang C, Mellor-Crummey J. A wait-free queue as fast as fetch-and-add Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 12. DOI: 10.1145/2851141.2851168  0.31
2016 Aji AM, Panwar LS, Ji F, Murthy K, Chabbi M, Balaji P, Bisset KR, Dinan J, Feng WC, Mellor-Crummey J, Ma X, Thakur R. MPI-ACC: Accelerator-Aware MPI for Scientific Applications Ieee Transactions On Parallel and Distributed Systems. 27: 1401-1414. DOI: 10.1109/Tpds.2015.2446479  0.482
2016 Murthy K, Mellor-Crummey J. Communication Avoiding Algorithms: Analysis and Code Generation for Parallel Systems Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 2016: 150-162. DOI: 10.1109/PACT.2015.41  0.345
2016 Paul SR, Araya-Polo M, Mellor-Crummey J, Hohl D. Performance analysis and optimization of a hybrid seismic imaging application Procedia Computer Science. 80: 8-18. DOI: 10.1016/j.procs.2016.05.293  0.481
2015 Chabbi M, Fagan M, Mellor-Crummey J. High performance locks for multi-level NUMA systems Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 2015: 215-226. DOI: 10.1145/2688500.2688503  0.3
2014 Liu X, Sharma K, Mellor-Crummey J. ArrayTool: A lightweight profiler to guide array regrouping Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 405-415. DOI: 10.1145/2628071.2628102  0.343
2014 Mellor-Crummey J, Hiranandani S, Sethi A. Author retrospective: Compilation techniques for block-cyclic distributions Proceedings of the International Conference On Supercomputing. 29-31. DOI: 10.1145/2591635.2591651  0.437
2014 Liu X, Mellor-Crummey J. A tool to analyze the performance of multithreaded programs on NUMA architectures Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 259-271. DOI: 10.1145/2555243.2555271  0.383
2014 Yang C, Bland W, Mellor-Crummey J, Balaji P. Portable, MPI-interoperable Coarray Fortran Acm Sigplan Notices. 49: 81-92. DOI: 10.1145/2555243.2555270  0.418
2014 Chabbi M, Liu X, Mellor-Crummey J. Call paths for pin tools Proceedings of the 12th Acm/Ieee International Symposium On Code Generation and Optimization, Cgo 2014. 76-86. DOI: 10.1145/2544137.2544164  0.461
2014 Wei L, Mellor-Crummey J. Autotuning tensor transposition Proceedings of the International Parallel and Distributed Processing Symposium, Ipdps. 342-351. DOI: 10.1109/IPDPSW.2014.43  0.372
2013 Chabbi M, Murthy K, Fagan M, Mellor-Crummey J. Effective sampling-driven performance tools for GPU-accelerated supercomputers International Conference For High Performance Computing, Networking, Storage and Analysis, Sc. DOI: 10.1145/2503210.2503299  0.37
2013 Liu X, Mellor-Crummey J. A data-centric profiler for parallel programs International Conference For High Performance Computing, Networking, Storage and Analysis, Sc. DOI: 10.1145/2503210.2503297  0.448
2013 Liu X, Mellor-Crummey J, Fagan M. A new approach for performance analysis of openMP programs Proceedings of the International Conference On Supercomputing. 69-80. DOI: 10.1145/2464996.2465433  0.522
2013 Liu X, Mellor-Crummey J. Pinpointing data locality bottlenecks with low overhead Ispass 2013 - Ieee International Symposium On Performance Analysis of Systems and Software. 183-193. DOI: 10.1109/ISPASS.2013.6557169  0.392
2013 Yang C, Murthy K, Mellor-Crummey J. Managing asynchronous operations in Coarray Fortran 2.0 Proceedings - Ieee 27th International Parallel and Distributed Processing Symposium, Ipdps 2013. 1321-1332. DOI: 10.1109/IPDPS.2013.17  0.311
2013 Eichenberger AE, Mellor-Crummey J, Schulz M, Wong M, Copty N, Dietrich R, Liu X, Loh E, Lorenz D. OMPT: An OpenMP tools application programming interface for performance analysis Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 8122: 171-185. DOI: 10.1007/978-3-642-40698-0_13  0.423
2012 Chabbi M, Mellor-Crummey J. DeadSpy: A tool to pinpoint program inefficiencies Proceedings - International Symposium On Code Generation and Optimization, Cgo 2012. 124-134. DOI: 10.1145/2259016.2259033  0.424
2012 Tallent NR, Mellor-Crummey J. Using sampling to understand parallel program performance Proceedings of the 5th International Workshop On Parallel Tools For High Performance Computing 2011. 13-25. DOI: 10.1007/978-3-642-31476-6_2  0.818
2011 Chabbi MM, Mellor-Crummey JM, Cooper KD. Efficiently exploring compiler optimization sequences with pairwise pruning Acm International Conference Proceeding Series. 34-45. DOI: 10.1145/2000417.2000421  0.36
2011 Tallent NR, Mellor-Crummey J, Franco M, Landrum R, Adhianto L. Scalable fine-grained call path tracing Proceedings of the International Conference On Supercomputing. 63-74. DOI: 10.1145/1995896.1995908  0.799
2011 Jin G, Mellor-Crummey J, Adhianto L, Scherer WN, Yang C. Implementation and performance evaluation of the HPC challenge benchmarks in Coarray Fortran 2.0 Proceedings - 25th Ieee International Parallel and Distributed Processing Symposium, Ipdps 2011. 1089-1100. DOI: 10.1109/IPDPS.2011.104  0.411
2011 Liu X, Mellor-Crummey J. Pinpointing data locality problems using data-centric analysis Proceedings - International Symposium On Code Generation and Optimization, Cgo 2011. 171-180. DOI: 10.1109/CGO.2011.5764685  0.313
2010 Scherer WN, Adhianto L, Jin G, Mellor-Crummey J, Yang C. Hiding latency in Coarray Fortran 2.0 Acm International Conference Proceeding Series. DOI: 10.1145/2020373.2020387  0.37
2010 Mellor-Crummey J, Gropp W, Herlihy M. Teaching parallel programming Xrds: Crossroads, the Acm Magazine For Students. 17: 28-30. DOI: 10.1145/1836543.1836553  0.347
2010 Tallent NR, Mellor-Crummey JM, Porterfield A. Analyzing lock contention in multithreaded applications Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 269-279. DOI: 10.1145/1693453.1693489  0.787
2010 Tallent NR, Adhianto L, Mellor-Crummey JM. Scalable identification of load imbalance in parallel executions using call path profiles 2010 Acm/Ieee International Conference For High Performance Computing, Networking, Storage and Analysis, Sc 2010. DOI: 10.1109/SC.2010.47  0.777
2010 Adhianto L, Mellor-Crummey J, Tallent NR. Effectively presenting call path profiles of application performance Proceedings of the International Conference On Parallel Processing Workshops. 179-188. DOI: 10.1109/ICPPW.2010.35  0.798
2010 Adhianto L, Banerjee S, Fagan M, Krentel M, Marin G, Mellor-Crummey J, Tallent NR. HPCTOOLKIT: Tools for performance analysis of optimized parallel programs Concurrency Computation Practice and Experience. 22: 685-701. DOI: 10.1002/cpe  0.819
2009 Tallent NR, Mellor-Crummey JM, Adhianto L, Fagan MW, Krentel M. Diagnosing performance bottlenecks in emerging petascale applications Proceedings of the Conference On High Performance Computing Networking, Storage and Analysis, Sc '09. DOI: 10.1145/1654059.1654111  0.786
2009 Tallent NR, Mellor-Crummey JM, Fagan MW. Binary analysis for measurement and attribution of program performance Acm Sigplan Notices. 44: 441-452. DOI: 10.1145/1542476.1542526  0.802
2009 Tallent NR, Mellor-Crummey JM. Effective performance measurement and analysis of multithreaded applications Acm Sigplan Notices. 44: 229-239. DOI: 10.1145/1504176.1504210  0.812
2009 Tallent NR, Mellor-Crummey JM. Identifying performance bottlenecks in work-stealing computations Computer. 42: 44-50. DOI: 10.1109/Mc.2009.396  0.766
2009 Chen JH, Choudhary A, De Supinski B, Devries M, Hawkes ER, Klasky S, Liao WK, Ma KL, Mellor-Crummey J, Podhorszki N, Sankaran R, Shende S, Yoo CS. Terascale direct numerical simulations of turbulent combustion using S3D Computational Science and Discovery. 2. DOI: 10.1088/1749-4699/2/1/015001  0.391
2009 Fowler R, Adhianto L, De Supinski B, Fagan M, Gamblin T, Krentel M, Mellor-Crummey J, Schulz M, Tallent N. Frontiers of performance analysis on leadership-class systems Journal of Physics: Conference Series. 180. DOI: 10.1088/1742-6596/180/1/012041  0.336
2009 De Supinski BR, Alam S, Bailey DH, Carrington L, Daley C, Dubey A, Gamblin T, Gunter D, Hovland PD, Jagode H, Karavanic K, Marin G, Mellor-Crummey J, Moore S, Norris B, et al. Modeling the Office of Science ten year facilities plan: The PERI Architecture Tiger Team Journal of Physics: Conference Series. 180. DOI: 10.1088/1742-6596/180/1/012039  0.651
2008 Marin G, Mellor-Crummey J. Pinpointing and exploiting opportunities for enhancing data reuse Ispass 2008 - Ieee International Symposium On Performance Analysis of Systems and Software. 115-126. DOI: 10.1109/ISPASS.2008.4510744  0.383
2008 Tallent N, Mellor-Crummey J, Adhianto L, Fagan M, Krentel M. HPCToolkit: Performance tools for scientific computing Journal of Physics: Conference Series. 125. DOI: 10.1088/1742-6596/125/1/012088  0.442
2008 Marin G, Jin G, Mellor-Crummey J. Managing locality in grand challenge applications: A case study of the gyrokinetic toroidal code Journal of Physics: Conference Series. 125. DOI: 10.1088/1742-6596/125/1/012087  0.405
2007 Coarfa C, Mellor-Crummey J, Froyd N, Dotsenko Y. Scalability analysis of SPMD codes using expectations Proceedings of the International Conference On Supercomputing. 13-22. DOI: 10.1145/1274971.1274976  0.821
2007 Marin G, Mellor-Crummey J. Application insight through performance modeling Conference Proceedings of the Ieee International Performance, Computing, and Communications Conference. 65-74. DOI: 10.1109/PCCC.2007.358880  0.449
2007 Mellor-Crummey J. Harnessing the power of emerging petascale platforms Journal of Physics: Conference Series. 78. DOI: 10.1088/1742-6596/78/1/012048  0.444
2006 Dotsenko Y, Coarfa C, Nakhleh L, Mellor-Crummey J, Roshan U. PRec-I-DCM3: a parallel framework for fast and accurate large-scale phylogeny reconstruction. International Journal of Bioinformatics Research and Applications. 2: 407-19. PMID 18048181 DOI: 10.1504/Ijbra.2006.011039  0.757
2006 Qasem A, Kennedy K, Mellor-Crummey J. Automatic tuning of whole applications using direct search and a performance-based transformation system Journal of Supercomputing. 36: 183-196. DOI: 10.1007/S11227-006-7957-2  0.541
2006 Coarfa C, Dotsenko Y, Mellor-Crummey J. Experiences with Sweep3D implementations in Co-array Fortran Journal of Supercomputing. 36: 101-121. DOI: 10.1007/S11227-006-7952-7  0.817
2006 Froyd N, Tallent N, Mellor-Crummey J, Fowler R. Call path profiling for unmodified, optimized binaries Proceedings of the Gcc Developers' Summit 2006. 21-35.  0.352
2005 Jin G, Mellor-Crummey J. Improving performance by reducing the memory footprint of scientific applications International Journal of High Performance Computing Applications. 19: 433-451. DOI: 10.1177/1094342005056138  0.308
2005 Strout MM, Mellor-Crummey J, Hovland P. Representation-independent program analysis Acm Sigplan/Sigsoft Workshop On Program Analysis For Software Tools and Engineering. 67-74. DOI: 10.1145/1108792.1108810  0.415
2005 Froyd N, Mellor-Crummey J, Fowler R. Low-overhead call path profiling of unmodified, optimized code Proceedings of the International Conference On Supercomputing. 81-90. DOI: 10.1145/1088149.1088161  0.366
2005 Coarfa C, Dotsenko Y, Mellor-Crummey J, Cantonnet F, Ei-Ghazawi T, Mohanti A, Yao Y, Chavarría-Miranda D. An evaluation of global address space languages: Co-array fortran and Unified Parallel C Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 36-47. DOI: 10.1145/1065944.1065950  0.808
2005 Jin G, Mellor-Crummey J. SFCGen: A framework for efficient generation of multi-dimensional space-filling curves by recursion Acm Transactions On Mathematical Software. 31: 120-148. DOI: 10.1145/1055531.1055537  0.367
2005 Chavarría-Miranda D, Jin G, Mellor-Crummey J. COTS clusters vs. the earth simulator: An application study using IMPACT-3D Proceedings - 19th Ieee International Parallel and Distributed Processing Symposium, Ipdps 2005. 2005. DOI: 10.1109/IPDPS.2005.156  0.368
2005 Coarfa C, Dotsenko Y, Mellor-Crummey J, Nakhleh L, Roshan U. PRec-I-DCM3: A parallel framework for fast and accurate large scale phylogeny reconstruction Proceedings of the International Conference On Parallel and Distributed Systems - Icpads. 2: 346-350. DOI: 10.1109/ICPADS.2005.240  0.764
2005 Chavarría-Miranda D, Mellor-Crummey J. Effective communication coalescing for data-parallel applications Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 13-25.  0.363
2005 Dotsenko Y, Coarfa C, Mellor-Crummey J, Chavarría-Miranda D. Experiences with Co-array Fortran on hardware shared memory platforms Lecture Notes in Computer Science. 3602: 332-347.  0.8
2005 Mandal A, Kennedy K, Koelbel C, Marin G, Mellor-Crummey J, Liu B, Johnsson L. Scheduling strategies for mapping application workflows onto the grid Proceedings of the Ieee International Symposium On High Performance Distributed Computing. 125-134.  0.322
2004 Dotsenko Y, Coarfa C, Mellor-Crummey J. A multi-platform Co-array Fortran compiler Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 29-40. DOI: 10.1109/PACT.2004.1342539  0.811
2004 Cooper K, Dasgupta A, Kennedy K, Koelbel C, Mandal A, Marin G, Mazina M, Mellor-Crummey J, Berman F, Casanova H, Chien A, Dail H, Liu X, Olugbile A, Sievert O, et al. New grid scheduling and rescheduling methods in the GrADS project Proceedings - International Parallel and Distributed Processing Symposium, Ipdps 2004 (Abstracts and Cd-Rom). 18: 2775-2782. DOI: 10.1007/S10766-005-3584-4  0.664
2004 Marin G, Mellor-Crummey J. Cross-architecture performance predictions for scientific applications using parameterized models Performance Evaluation Review. 32: 2-13.  0.341
2004 Coarfa C, Dotsenko Y, Eckhardt J, Mellor-Crummey J. Co-array fortran performance and potential: An NPB experimental study Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2958: 177-193.  0.812
2002 Chavarría-Miranda D, Mellor-Crummey J. An evaluation of data-parallel compiler support for line-sweep applications Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 2002: 7-17. DOI: 10.1109/PACT.2002.1105969  0.444
2002 Kennedy K, Mazina M, Mellor-Crummey J, Cooper K, Torczon L, Berman F, Chien A, Dail H, Sievert O, Angulo D, Foster I, Aydt R, Reed D, Gannon D, Dongarra J, et al. Toward a framework for preparing and executing adaptive grid programs Proceedings - International Parallel and Distributed Processing Symposium, Ipdps 2002. 171. DOI: 10.1109/IPDPS.2002.1016570  0.452
2002 Darte A, Chavarría-Miranda D, Fowler R, Mellor-Crummey J. Generalized multipartitioning for multi-dimensional arrays Proceedings - International Parallel and Distributed Processing Symposium, Ipdps 2002. 246-255. DOI: 10.1109/IPDPS.2002.1015501  0.383
2002 Mellor-Crummey J, Adve V, Broom B, Chavarría-Miranda D, Fowler R, Jin G, Kennedy K, Yi Q. Advanced optimization strategies in the Rice dHPF compiler Concurrency Computation Practice and Experience. 14: 741-767. DOI: 10.1002/Cpe.647  0.805
2002 Jin G, Mellor-Crummey J. Experiences tuning SMG98 - A semicoarsening multigrid benchmark based on the hypre library Proceedings of the International Conference On Supercomputing. 305-314.  0.458
2001 Mellor-Crummey J, Whalley D, Kennedy K. Improving Memory Hierarchy Performance for Irregular Applications Using Data and Computation Reorderings International Journal of Parallel Programming. 29: 217-247. DOI: 10.1023/A:1011119519789  0.447
2001 Kennedy K, Broom B, Cooper K, Dongarra J, Fowler R, Gannon D, Johnsson L, Mellor-Crummey J, Torczon L. Telescoping languages: A strategy for automatic generation of scientific problem-solving systems from annotated libraries Journal of Parallel and Distributed Computing. 61: 1803-1826. DOI: 10.1006/Jpdc.2001.1724  0.517
2001 Mellor-Crummey J, Whalley D, Kennedy K. Improving memory hierarchy performance for irregular applications using data and computation reorderings International Journal of Parallel Programming. 29: 217-247.  0.344
2001 Chavarría-Miranda D, Mellor-Crummey J, Sarang T. Data-parallel compiler support for multipartitioning Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2150: 241-253.  0.472
2001 Mellor-Crummey J, Fowler R, Whalley D. On providing useful information for analyzing and tuning applications Performance Evaluation Review. 29: 332-333.  0.443
2001 Mellor-Crummey J, Fowler R, Whalley D. Tools for application-oriented performance tuning Proceedings of the International Conference On Supercomputing. 154-165.  0.419
2000 Chavarría-Miranda D, Mellor-Crummey J. Toward compiler support for scalable parallelism using multipartitioning Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1915: 272-284. DOI: 10.1007/3-540-40889-4_21  0.803
2000 Zhang K, Mellor-Crummey J, Fowler RJ. Compilation and runtime optimizations for software distributed shared memory Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1915: 182-191. DOI: 10.1007/3-540-40889-4_14  0.37
1999 McCurdy C, Mellor-Crummey J. An evaluation of computing paradigms for N-body simulations on distributed memory architectures Sigplan Notices (Acm Special Interest Group On Programming Languages). 34: 25-36.  0.367
1998 Mellor-Crummey J, Adve V. Simplifying control flow in compiler-generated parallel code Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1366: 235-239. DOI: 10.1007/BFb0032695  0.384
1998 Adve V, Mellor-Crummey J. Using Integer Sets for Data-Parallel Program Analysis and Optimization Sigplan Notices (Acm Special Interest Group On Programming Languages). 33: 186-198.  0.444
1997 Roth G, Mellor-Crummey J, Kennedy K, Brickner RG. Compiling stencils in high performance fortran Proceedings of the International Conference On Supercomputing. DOI: 10.1145/509593.509605  0.471
1994 Adve V, Tseng CW, Carle A, Granston E, Hiranandani S, Kennedy K, Koelbel C, Kremer U, Mellor-Crummey J, Warren S. Requirements for Data-Parallel Programming Environments Ieee Parallel and Distributed Technology. 2: 48-58. DOI: 10.1109/M-Pdt.1994.329801  0.539
1993 Mellor-Crummey J. Compile-time Support for Efficient Data Race Detection in Shared-Memory Parallel Programs Acm Sigplan Notices. 28: 129-139. DOI: 10.1145/174267.171370  0.338
1993 Cooper KD, Kennedy K, Mckinley KS, Mellor-Crummey JM, Torczon L, Hall MW, Hood RT, Warren SK. The ParaScope Parallel Programming Environment Proceedings of the Ieee. 81: 244-263. DOI: 10.1109/5.214549  0.433
1992 Lin S, Mellor-Crummey J, Pettitt B, Phillips G. Molecular dynamics on a distributed-memory multiprocessor Journal of Computational Chemistry. 13: 1022-1035. DOI: 10.1002/Jcc.540130813  0.43
1990 Leblanc TJ, Mellor-Crummey JM, Fowler RJ. Analyzing parallel program executions using multiple views Journal of Parallel and Distributed Computing. 9: 203-217. DOI: 10.1016/0743-7315(90)90046-R  0.415
1989 Fowler RJ, LeBlanc TJ, Mellor-Crummey JM. An Integrated Approach to Parallel Program Debugging and Performance Analysis on Large-Scale Multiprocessors Acm Sigplan Notices. 24: 163-173. DOI: 10.1145/69215.69231  0.467
1989 Mellor-Crummey JM, LeBlanc TJ. Software instruction counter . 78-86.  0.385
1987 Leblanc TJ, Mellor-Crummey JM. Debugging Parallel Programs with Instant Replay Ieee Transactions On Computers. 471-482. DOI: 10.1109/TC.1987.1676929  0.364
Show low-probability matches.