Year |
Citation |
Score |
2017 |
Chung I, Sainath TN, Ramabhadran B, Picheny M, Gunnels J, Austel V, Chauhari U, Kingsbury B. Parallel Deep Neural Network Training for Big Data on Blue Gene/Q. IEEE Transactions on Parallel and Distributed Systems. 28: 1703-1714. DOI: 10.1109/Tpds.2016.2626289 |
0.399 |
|
2017 |
Draeger EW, Andrade X, Gunnels JA, Bhatele A, Schleife A, Correa AA. Massively parallel first-principles simulation of electron dynamics in materials. Journal of Parallel and Distributed Computing. 106: 205-214. DOI: 10.1016/J.Jpdc.2017.02.005 |
0.317 |
|
2016 |
Van Zee FG, Smith TM, Marker B, Low TM, Van De Geijn RA, Igual FD, Smelyanskiy M, Zhang X, Kistler M, Austel V, Gunnels JA, Killough L. The BLIS framework: Experiments in portability. ACM Transactions on Mathematical Software. 42. DOI: 10.1145/2755561 |
0.419 |
|
2015 |
Nair R, Antao SF, Bertolli C, Bose P, Brunheroto JR, Chen T, Cher CY, Costa CHA, Doi J, Evangelinos C, Fleischer BM, Fox TW, Gallo DS, Grinberg L, Gunnels JA, et al. Active memory cube: A processing-in-memory architecture for exascale systems. IBM Journal of Research and Development. 59. DOI: 10.1147/Jrd.2015.2409732 |
0.342 |
|
2015 |
Buono D, Gunnels JA, Que X, Checconi F, Petrini F, Tuan TC, Long C. Optimizing sparse linear algebra for large-scale graph analytics. Computer. 48: 26-34. DOI: 10.1109/Mc.2015.228 |
0.421 |
|
2015 |
Que X, Checconi F, Petrini F, Gunnels JA. Scalable Community Detection with the Louvain Algorithm. Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015. 28-37. DOI: 10.1109/IPDPS.2015.59 |
0.392 |
|
2014 |
Chung IH, Sainath TN, Ramabhadran B, Picheny M, Gunnels J, Austel V, Chauhari U, Kingsbury B. Parallel Deep Neural Network Training for Big Data on Blue Gene/Q. International Conference for High Performance Computing, Networking, Storage and Analysis, SC. 2015: 745-753. DOI: 10.1109/SC.2014.66 |
0.309 |
|
2013 |
Ghoting AN, Gunnels JA, Kambadur P, Pednault EP, Squillante MS. Trends and outlook for the massive-scale analytics stack. IBM Journal of Research and Development. 57: 2:1-2:11. DOI: 10.1147/Jrd.2013.2242673 |
0.393 |
|
2013 |
Carnes B, Chan B, Draeger EW, Fattebert J, Fried L, Glosli J, Krauss WD, Langer SH, McCallen R, Mirin AA, Najjar F, Nichols AL, Oppelstrup T, Rathkopf JA, Richards D, ... Gunnels JA, et al. Science at LLNL with IBM Blue Gene/Q. IBM Journal of Research and Development. 57: 11:1-11:18. DOI: 10.1147/Jrd.2012.2233371 |
0.302 |
|
2013 |
Bientinesi P, Gunnels JA, Myers ME, Quintana-Ortí ES, Rhodes T, Van De Geijn RA, Van Zee FG. Deriving dense linear algebra libraries. Formal Aspects of Computing. 25: 933-945. DOI: 10.1007/S00165-011-0221-4 |
0.641 |
|
2010 |
Gunnels J, Lee J, Margulies S. Efficient high-precision matrix algebra on parallel architectures for nonlinear combinatorial optimization. Mathematical Programming Computation. 2: 103-124. DOI: 10.1007/S12532-010-0014-4 |
0.527 |
|
2009 |
Kistler M, Gunnels J, Brokenshire D, Benton B. Programming the Linpack benchmark for the IBM PowerXCell 8i processor. Scientific Programming. 17: 43-57. DOI: 10.3233/SPR-2009-0278 |
0.317 |
|
2009 |
Kistler M, Gunnels J, Brokenshire D, Benton B. Programming the Linpack Benchmark for the IBM PowerXCell 8i Processor. Scientific Programming. 17: 43-57. DOI: 10.1155/2009/401691 |
0.316 |
|
2009 |
Kistler M, Gunnels J, Brokenshire D, Benton B. Programming the Linpack benchmark for Roadrunner. IBM Journal of Research and Development. 53: 9:1-9:11. DOI: 10.1147/Jrd.2009.5429075 |
0.411 |
|
2009 |
Kistler M, Gunnels J, Brokenshire D, Benton B. Petascale computing with accelerators. ACM SIGPLAN Notices. 44: 241-249. DOI: 10.1145/1504176.1504212 |
0.319 |
|
2008 |
Bohm E, Bhatele A, Kalé LV, Tuckerman ME, Kumar S, Gunnels JA, Martyna GJ. Fine-grained parallelization of the Car-Parrinello ab initio molecular dynamics method on the IBM Blue Gene/L supercomputer. IBM Journal of Research and Development. 52: 159-176. DOI: 10.1147/Rd.521.0159 |
0.381 |
|
2008 |
Saxena V, Agrawal P, Sabharwal Y, Garg VK, Kuruvilla VA, Gunnels JA. Optimization of BLAS on the Cell processor. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5374: 18-29. DOI: 10.1007/978-3-540-89894-8_6 |
0.331 |
|
2008 |
Sabharwal Y, Garg SK, Garg R, Gunnels JA, Sahoo RK. Optimization of fast Fourier transforms on the Blue Gene/L supercomputer. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5374: 309-322. DOI: 10.1007/978-3-540-89894-8_29 |
0.344 |
|
2007 |
Yotov K, Roeder T, Pingali K, Gunnels J, Gustavson F. An experimental comparison of cache-oblivious and cache-conscious programs. Annual ACM Symposium on Parallelism in Algorithms and Architectures. 93-104. DOI: 10.1145/1248377.1248394 |
0.357 |
|
2007 |
Gustavson FG, Gunnels JA, Sexton JC. Minimal data copy for dense linear algebra factorization. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4699: 540-549. |
0.401 |
|
2006 |
Bientinesi P, Gunnels JA, Gustavson FG, Henry GM, Myers M, Quintana-Ortí ES, Van De Geijn RA. Rapid development of high-performance linear algebra libraries. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 3732: 376-384. DOI: 10.1007/11558958_45 |
0.655 |
|
2006 |
Gunnels JA, Gustavson FG. A new array format for symmetric and triangular matrices. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 3732: 247-255. DOI: 10.1007/11558958_29 |
0.305 |
|
2005 |
Martorell X, Smeds N, Walkup R, Brunheroto JR, Almási G, Gunnels JA, DeRose L, Labarta J, Escalé F, Giménez J, Servat H, Moreira JE. Blue Gene/L performance tools. IBM Journal of Research and Development. 49: 407-424. DOI: 10.1147/Rd.492.0407 |
0.361 |
|
2005 |
Almási G, Archer C, Castaños JG, Gunnels JA, Erway CC, Heidelberger P, Martorell X, Moreira JE, Pinnow K, Ratterman J, Steinmacher-Burow BD, Gropp W, Toonen B. Design and implementation of message-passing services for the Blue Gene/L supercomputer. IBM Journal of Research and Development. 49: 393-406. DOI: 10.1147/Rd.492.0393 |
0.35 |
|
2005 |
Chatterjee S, Bachega LR, Bergner P, Dockser KA, Gunnels JA, Gupta M, Gustavson FG, Lapkowski CA, Liu GK, Mendell M, Nair R, Wait CD, Ward TJC, Wu P. Design and exploitation of a high-performance SIMD floating-point unit for Blue Gene/L. IBM Journal of Research and Development. 49: 377-391. DOI: 10.1147/Rd.492.0377 |
0.489 |
|
2005 |
Andersen BS, Gunnels JA, Gustavson FG, Reid JK, Wasniewski J. A fully portable high performance minimal storage hybrid format Cholesky algorithm. ACM Transactions on Mathematical Software. 31: 201-227. DOI: 10.1145/1067967.1067969 |
0.404 |
|
2005 |
Bientinesi P, Gunnels JA, Myers ME, Quintana-Ortí ES, Van De Geijn RA. The science of deriving dense linear algebra algorithms. ACM Transactions on Mathematical Software. 31: 1-26. DOI: 10.1145/1055531.1055532 |
0.634 |
|
2004 |
Bachega L, Chatterjee S, Dockser KA, Gunnels JA, Gupta M, Gustavson FG, Lapkowski CA, Liu GK, Mendell MP, Wait CD, Ward TJC. A high-performance SIMD floating point unit for BlueGene/L: Architecture, compilation, and algorithm design. Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT. 85-96. DOI: 10.1109/PACT.2004.1342544 |
0.426 |
|
2004 |
Almási G, Archer C, Gunnels JA, Heidelberger P, Martorell X, Moreira JE. Architecture and Performance of the BlueGene/L Message Layer. Lecture Notes in Computer Science. 405-414. DOI: 10.1007/978-3-540-30218-6_55 |
0.372 |
|
2002 |
Andersen BS, Gunnels JA, Gustavson F, Waśniewski J. A recursive formulation of the inversion of symmetric positive definite matrices in packed storage data format. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2367: 287-296. |
0.371 |
|
2001 |
Gunnels JA, Gustavson FG, Henry GM, Van De Geijn RA. FLAME: Formal linear algebra methods environment. ACM Transactions on Mathematical Software. 27: 422-455. DOI: 10.1145/504210.504213 |
0.48 |
|
2001 |
Gunnels JA, van de Geijn RA. Formal methods for high-performance linear algebra libraries. IFIP Advances in Information and Communication Technology. 60: 193-208. |
0.33 |
|
1997 |
Chtchelkanova A, Gunnels J, Morrow G, Overfelt J, van de Geijn RA. Parallel implementation of BLAS: general techniques for Level 3 BLAS. Concurrency: Practice and Experience. 9: 837-857. DOI: 10.1002/(Sici)1096-9128(199709)9:9<837::Aid-Cpe267>3.0.Co;2-2 |
0.332 |
|