John A. Gunnels, Ph.D. - Publications

Affiliations: 
2001 University of Texas at Austin, Austin, Texas, U.S.A. 
Area:
Computer Science

34 high-probability publications. We are testing a new system for linking publications to authors. You can help! If you notice any inaccuracies, please sign in and mark papers as correct or incorrect matches. If you identify any major omissions or other inaccuracies in the publication list, please let us know.

Year Citation  Score
2017 Chung I, Sainath TN, Ramabhadran B, Picheny M, Gunnels J, Austel V, Chauhari U, Kingsbury B. Parallel Deep Neural Network Training for Big Data on Blue Gene/Q Ieee Transactions On Parallel and Distributed Systems. 28: 1703-1714. DOI: 10.1109/Tpds.2016.2626289  0.399
2017 Draeger EW, Andrade X, Gunnels JA, Bhatele A, Schleife A, Correa AA. Massively parallel first-principles simulation of electron dynamics in materials Journal of Parallel and Distributed Computing. 106: 205-214. DOI: 10.1016/J.Jpdc.2017.02.005  0.317
2016 Van Zee FG, Smith TM, Marker B, Low TM, Van De Geijn RA, Igual FD, Smelyanskiy M, Zhang X, Kistler M, Austel V, Gunnels JA, Killough L. The BLIS framework: Experiments in portability Acm Transactions On Mathematical Software. 42. DOI: 10.1145/2755561  0.419
2015 Nair R, Antao SF, Bertolli C, Bose P, Brunheroto JR, Chen T, Cher CY, Costa CHA, Doi J, Evangelinos C, Fleischer BM, Fox TW, Gallo DS, Grinberg L, Gunnels JA, et al. Active memory cube: A processing-in-memory architecture for exascale systems Ibm Journal of Research and Development. 59. DOI: 10.1147/Jrd.2015.2409732  0.342
2015 Buono D, Gunnels JA, Que X, Checconi F, Petrini F, Tuan TC, Long C. Optimizing sparse linear algebra for large-scale graph analytics Computer. 48: 26-34. DOI: 10.1109/Mc.2015.228  0.421
2015 Que X, Checconi F, Petrini F, Gunnels JA. Scalable Community Detection with the Louvain Algorithm Proceedings - 2015 Ieee 29th International Parallel and Distributed Processing Symposium, Ipdps 2015. 28-37. DOI: 10.1109/IPDPS.2015.59  0.392
2014 Chung IH, Sainath TN, Ramabhadran B, Pichen M, Gunnels J, Austel V, Chauhari U, Kingsbury B. Parallel Deep Neural Network Training for Big Data on Blue Gene/Q International Conference For High Performance Computing, Networking, Storage and Analysis, Sc. 2015: 745-753. DOI: 10.1109/SC.2014.66  0.309
2013 Ghoting AN, Gunnels JA, Kambadur P, Pednault EP, Squillante MS. Trends and outlook for the massive-scale analytics stack Ibm Journal of Research and Development. 57: 2:1-2:11. DOI: 10.1147/Jrd.2013.2242673  0.393
2013 Carnes B, Chan B, Draeger EW, Fattebert J, Fried L, Glosli J, Krauss WD, Langer SH, McCallen R, Mirin AA, Najjar F, Nichols AL, Oppelstrup T, Rathkopf JA, Richards D, ... ... Gunnels JA, et al. Science at LLNL with IBM Blue Gene/Q Ibm Journal of Research and Development. 57: 11:1-11:18. DOI: 10.1147/Jrd.2012.2233371  0.302
2013 Bientinesi P, Gunnels JA, Myers ME, Quintana-Ortí ES, Rhodes T, Van De Geijn RA, Van Zee FG. Deriving dense linear algebra libraries Formal Aspects of Computing. 25: 933-945. DOI: 10.1007/S00165-011-0221-4  0.641
2010 Gunnels J, Lee J, Margulies S. Efficient high-precision matrix algebra on parallel architectures for nonlinear combinatorial optimization Mathematical Programming Computation. 2: 103-124. DOI: 10.1007/S12532-010-0014-4  0.527
2009 Kistler M, Gunnels J, Brokenshire D, Benton B. Programming the Linpack benchmark for the IBM PowerXCell 8i processor Scientific Programming. 17: 43-57. DOI: 10.3233/SPR-2009-0278  0.317
2009 Kistler M, Gunnels J, Brokenshire D, Benton B. Programming the Linpack Benchmark for the IBM PowerXCell 8i Processor Scientific Programming. 17: 43-57. DOI: 10.1155/2009/401691  0.316
2009 Kistler M, Gunnels J, Brokenshire D, Benton B. Programming the Linpack benchmark for Roadrunner Ibm Journal of Research and Development. 53: 9:1-9:11. DOI: 10.1147/Jrd.2009.5429075  0.411
2009 Kistler M, Gunnels J, Brokenshire D, Benton B. Petascale computing with accelerators Acm Sigplan Notices. 44: 241-249. DOI: 10.1145/1504176.1504212  0.319
2008 Bohm E, Bhatele A, Kalé LV, Tuckerman ME, Kumar S, Gunnels JA, Martyna GJ. Fine-grained parallelization of the Car-Parrinello ab initio molecular dynamics method on the IBM Blue Gene/L supercomputer Ibm Journal of Research and Development. 52: 159-176. DOI: 10.1147/Rd.521.0159  0.381
2008 Saxena V, Agrawal P, Sabharwal Y, Garg VK, Kuruvilla VA, Gunnels JA. Optimization of BLAS on the cell processor Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5374: 18-29. DOI: 10.1007/978-3-540-89894-8_6  0.331
2008 Sabharwal Y, Garg SK, Garg R, Gunnels JA, Sahoo RK. Optimization of fast fourier transforms on the blue gene/L supercomputer Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5374: 309-322. DOI: 10.1007/978-3-540-89894-8_29  0.344
2007 Yotov K, Roeder T, Pingali K, Gunnels J, Gustavson F. An experimental comparison of cache-oblivious and cache-conscious programs Annual Acm Symposium On Parallelism in Algorithms and Architectures. 93-104. DOI: 10.1145/1248377.1248394  0.357
2007 Gustavson FG, Gunnels JA, Sexton JC. Minimal data copy for dense linear algebra factorization Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4699: 540-549.  0.401
2006 Bientinesi P, Gunnels JA, Gustavson FG, Henry GM, Myers M, Quintana-Ortí ES, Van De Geijn RA. Rapid development of high-performance linear algebra libraries Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 3732: 376-384. DOI: 10.1007/11558958_45  0.655
2006 Gunnels JA, Gustavson FG. A new array format for symmetric and triangular matrices Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 3732: 247-255. DOI: 10.1007/11558958_29  0.305
2005 Martorell X, Smeds N, Walkup R, Brunheroto JR, Almási G, Gunnels JA, DeRose L, Labarta J, Escalé F, Giménez J, Servat H, Moreira JE. Blue Gene/L performance tools Ibm Journal of Research and Development. 49: 407-424. DOI: 10.1147/Rd.492.0407  0.361
2005 Almási G, Archer C, Castaños JG, Gunnels JA, Erway CC, Heidelberger P, Martorell X, Moreira JE, Pinnow K, Ratterman J, Steinmacher-Burow BD, Gropp W, Toonen B. Design and implementation of message-passing services for the Blue Gene/L supercomputer Ibm Journal of Research and Development. 49: 393-406. DOI: 10.1147/Rd.492.0393  0.35
2005 Chatterjee S, Bachega LR, Bergner P, Dockser KA, Gunnels JA, Gupta M, Gustavson FG, Lapkowski CA, Liu GK, Mendell M, Nair R, Wait CD, Ward TJC, Wu P. Design and exploitation of a high-performance SIMD floating-point unit for Blue Gene/L Ibm Journal of Research and Development. 49: 377-391. DOI: 10.1147/Rd.492.0377  0.489
2005 Andersen BS, Gunnels JA, Gustavson FG, Reid JK, Wasniewski J. A fully portable high performance minimal storage hybrid format cholesky algorithm Acm Transactions On Mathematical Software. 31: 201-227. DOI: 10.1145/1067967.1067969  0.404
2005 Bientinesi P, Gunnels JA, Myers ME, Quintana-ORTÍ ES, Van Geijn RADE. The science of deriving dense linear algebra algorithms Acm Transactions On Mathematical Software. 31: 1-26. DOI: 10.1145/1055531.1055532  0.634
2005 Chatterjee S, Bachega LR, Bergner P, Dockser KA, Gunnels JA, Gupta M, Gustavson FG, Lapkowski CA, Liu GK, Mendell M, Nair R, Wait CD, Ward TJC, Wu P. Design and exploitation of a high-performance SIMD floating-point unit for Blue Gene/L Ibm Journal of Research and Development. 49: 377-391.  0.407
2004 Bachega L, Chatterjee S, Dockser KA, Gunnels JA, Gupta M, Gustavson FG, Lapkowski CA, Liu GK, Mendell MP, Wait CD, Ward TJC. A high-performance SIMD floating point unit for BlueGene/L: Architecture, compilation, and algorithm design Parallel Architectures and Compilation Techniques - Conference Proceedings, Pact. 85-96. DOI: 10.1109/PACT.2004.1342544  0.426
2004 Almási G, Archer C, Gunnels JA, Heidelberger P, Martorell X, Moreira JE. Architecture and Performance of the BlueGene/L Message Layer Lecture Notes in Computer Science. 405-414. DOI: 10.1007/978-3-540-30218-6_55  0.372
2002 Andersen BS, Gunnels JA, Gustavson F, Waśniewski J. A recursive formulation of the inversion of symmetric positive definite matrices in packed storage data format Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2367: 287-296.  0.371
2001 Gunnels JA, Gustavson FG, Henry GM, Van De Geijn RA. FLAME: Formal linear algebra methods environment Acm Transactions On Mathematical Software. 27: 422-455. DOI: 10.1145/504210.504213  0.48
2001 Gunnels JA, van de Geijn RA. Formal methods for high-performance linear algebra libraries Ifip Advances in Information and Communication Technology. 60: 193-208.  0.33
1997 Chtchelkanova A, Gunnels J, Morrow G, Overfelt J, van de Geijn RA. Parallel implementation of BLAS: general techniques for Level 3 BLAS Concurrency: Practice and Experience. 9: 837-857. DOI: 10.1002/(Sici)1096-9128(199709)9:9<837::Aid-Cpe267>3.0.Co;2-2  0.332
Show low-probability matches.