Year |
Citation |
Score |
2015 |
Wang Y, Cammarota R, Nicolau A. Fault Tolerant Scheduling for Parallel Loops on Shared Memory Systems Journal of Information Science and Engineering. 31: 1937-1959. DOI: 10.6688/Jise.2015.31.6.7 |
0.348 |
|
2015 |
Wanner L, Lai L, Rahimi A, Gottscho M, Mercati P, Huang C, Sala F, Agarwal Y, Dolecek L, Dutt N, Gupta P, Gupta R, Jhala R, Kumar R, Lerner S, ... ... Nicolau A, et al. NSF expedition on variability-aware software: Recent results and contributions It - Information Technology. 57. DOI: 10.1515/Itit-2014-1085 |
0.372 |
|
2015 |
Gottscho M, BanaiyanMofrad A, Dutt N, Nicolau A, Gupta P. DPCS Acm Transactions On Architecture and Code Optimization. 12: 1-26. DOI: 10.1145/2792982 |
0.328 |
|
2015 |
Gottscho M, Bathen LAD, Dutt N, Nicolau A, Gupta P. ViPZonE: Hardware power variability-aware virtual memory management for energy savings Ieee Transactions On Computers. 64: 1483-1496. DOI: 10.1109/Tc.2014.2329675 |
0.366 |
|
2014 |
Ebcioglu K, Nicolau A. A global resource-constrained parallelization technique Proceedings of the International Conference On Supercomputing. 112-121. DOI: 10.1145/2591635.2667160 |
0.36 |
|
2013 |
Gupta P, Agarwal Y, Dolecek L, Dutt N, Gupta RK, Kumar R, Mitra S, Nicolau A, Rosing TS, Srivastava MB, Swanson S, Sylvester D. Underdesigned and opportunistic computing in presence of hardware variability Ieee Transactions On Computer-Aided Design of Integrated Circuits and Systems. 32: 8-23. DOI: 10.1109/Tcad.2012.2223467 |
0.396 |
|
2013 |
Cammarota R, Nicolau A, Veidenbaum AV. Just in time load balancing Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 7760: 1-16. DOI: 10.1007/978-3-642-37658-0_1 |
0.373 |
|
2011 |
Badin M, Bic L, Dillencourt M, Nicolau A. Improving accuracy for matrix multiplications on GPUs Scientific Programming. 19: 3-11. DOI: 10.1155/2011/417569 |
0.428 |
|
2011 |
D'Alberto P, Bodrato M, Nicolau A. Exploiting parallelism in matrix-computation kernels for symmetric multiprocessor systems: Matrix-multiplication and matrix-addition algorithm optimizations by software pipelining and threads allocation Acm Transactions On Mathematical Software. 38. DOI: 10.1145/2049662.2049664 |
0.753 |
|
2011 |
Nicolau A, Kejariwal A. How many threads to spawn during program multithreading? Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6548: 166-183. DOI: 10.1007/978-3-642-19595-2_12 |
0.655 |
|
2009 |
Nicolau A, Li G, Veidenbaum AV, Kejariwal A. Synchronization optimizations for efficient execution on multi-cores Proceedings of the International Conference On Supercomputing. 169-180. DOI: 10.1145/1542275.1542303 |
0.683 |
|
2009 |
Nicolau A, Li G, Kejariwal A. Techniques for efficient placement of synchronization primitives Acm Sigplan Notices. 44: 199-208. DOI: 10.1145/1504176.1504207 |
0.686 |
|
2009 |
D'Alberto P, Nicolau A. Adaptive Winograd's matrix multiplications Acm Transactions On Mathematical Software. 36. DOI: 10.1145/1486525.1486528 |
0.766 |
|
2009 |
Kejariwal A, Veidenbaum AV, Nicolau A, Girkar M, Tian X, Saito H. On the exploitation of loop-level parallelism in embedded applications Transactions On Embedded Computing Systems. 8. DOI: 10.1145/1457255.1457257 |
0.726 |
|
2009 |
Moorkanikara Nageswaran J, Felch A, Chandrasekhar A, Dutt N, Granger R, Nicolau A, Veidenbaum A. Brain derived vision algorithm on high performance architectures International Journal of Parallel Programming. 37: 345-369. DOI: 10.1007/S10766-009-0106-9 |
0.459 |
|
2009 |
Ghodrat MA, Givargis T, Nicolau A. Optimizing control flow in loops using interval and dependence analysis Design Automation For Embedded Systems. 13: 193-221. DOI: 10.1007/S10617-009-9043-5 |
0.386 |
|
2008 |
Park S, Shrivastava A, Dutt N, Nicolau A, Paek Y, Earlie E. Register File Power Reduction Using Bypass Sensitive Compiler Ieee Transactions On Computer-Aided Design of Integrated Circuits and Systems. 27: 1155-1159. DOI: 10.1109/Tcad.2008.923254 |
0.34 |
|
2008 |
D'Alberto P, Nicolau A. Using recursion to boost ATLAS's performance Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4759: 142-151. |
0.315 |
|
2008 |
Kejariwal A, Nicolau A, Polychronopoulos CD. Enhanced loop coalescing: A compiler technique for transforming non-uniform iteration spaces Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4759: 17-32. |
0.601 |
|
2007 |
Badea C, Nicolau A, Veidenbaum AV. A simplified java bytecode compilation system for resource-constrained embedded processors Cases'07: Proceedings of the 2007 International Conference On Compilers, Architecture, and Synthesis For Embedded Systems. 218-228. DOI: 10.1145/1289881.1289920 |
0.318 |
|
2007 |
D'Alberto P, Nicolau A. Adaptive Strassen's matrix multiplication Proceedings of the International Conference On Supercomputing. 284-292. DOI: 10.1145/1274971.1275010 |
0.407 |
|
2007 |
Tang W, Kejariwal A, Veidenbaum AV, Nicolau A. A predictive decode filter cache for reducing power consumption in embedded processors Acm Transactions On Design Automation of Electronic Systems. 12. DOI: 10.1145/1230800.1230806 |
0.658 |
|
2007 |
Shrivastava A, Park S, Earlie E, Dutt ND, Nicolau A, Paek Y. Automatic Design Space Exploration of Register Bypasses in Embedded Processors Ieee Transactions On Computer-Aided Design of Integrated Circuits and Systems. 26: 2102-2115. DOI: 10.1109/Tcad.2007.907066 |
0.37 |
|
2007 |
Mohapatra S, Dutt N, Nicolau A, Venkatasubramanian N. DYNAMO: A cross-layer framework for end-to-end QoS and energy optimization in mobile handheld devices Ieee Journal On Selected Areas in Communications. 25: 722-737. DOI: 10.1109/Jsac.2007.070509 |
0.34 |
|
2007 |
D'Alberto P, Nicolau A. R-Kleene: A high-performance divide-and-conquer algorithm for the all-pair shortest path for densely connected networks Algorithmica (New York). 47: 203-213. DOI: 10.1007/S00453-006-1224-Z |
0.74 |
|
2006 |
Kim M, Oh H, Dutt N, Nicolau A, Venkatasubramanian N. PBPAIR Acm Sigmobile Mobile Computing and Communications Review. 10: 58-69. DOI: 10.1145/1148094.1148100 |
0.304 |
|
2006 |
Shrivastava A, Biswas P, Halambi A, Dutt N, Nicolau A. Compilation framework for code size reduction using reduced bit-width ISAs (rISAs) Acm Transactions On Design Automation of Electronic Systems. 11: 123-146. DOI: 10.1145/1124713.1124722 |
0.371 |
|
2006 |
Ghodrat MA, Givargis T, Nicolau A. Expression equivalence checking using interval analysis Ieee Transactions On Very Large Scale Integration (Vlsi) Systems. 14: 830-842. DOI: 10.1109/Tvlsi.2006.878471 |
0.35 |
|
2006 |
Shrivastava A, Earlie E, Dutt N, Nicolau A. Retargetable pipeline hazard detection for partially bypassed processors Ieee Transactions On Very Large Scale Integration (Vlsi) Systems. 14: 791-801. DOI: 10.1109/Tvlsi.2006.878468 |
0.412 |
|
2006 |
Kejariwal A, Gupta S, Nicolau A, Dutt ND, Gupta R. Energy efficient watermarking on mobile devices using proxy-based partitioning Ieee Transactions On Very Large Scale Integration (Vlsi) Systems. 14: 625-635. DOI: 10.1109/Tvlsi.2006.878218 |
0.653 |
|
2006 |
Kejariwal A, Nicolau A, Polychronopoulos CD. An efficient approach for self-scheduling parallel loops on multiprogrammed parallel computers Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4339: 441-449. DOI: 10.1007/978-3-540-69330-7_31 |
0.684 |
|
2005 |
Kejariwal A, Nicolau A, Banerjee U, Polychronopoulos CD. A novel approach for partitioning iteration spaces with variable densities Proceedings of the Acm Sigplan Symposium On Principles and Practice of Parallel Programming, Ppopp. 120-131. DOI: 10.1145/1065944.1065962 |
0.642 |
|
2005 |
D'Alberto P, Nicolau A, Veidenbaum A, Gupta R. Line size adaptivity analysis of parameterized loop nests for direct mapped data cache Ieee Transactions On Computers. 54: 185-197. DOI: 10.1109/Tc.2005.28 |
0.735 |
|
2005 |
Kejariwal A, Nicolau A. An efficient load balancing scheme for grid-based high performance scientific computing Ispdc 2005: 4th International Symposium On Parallel and Distributed Computing. 2005: 217-225. DOI: 10.1109/ISPDC.2005.14 |
0.693 |
|
2005 |
D'Alberto P, Nicolau A. Adaptive Strassen and ATLAS's DGEMM: A Fast Square-Matrix Multiply for Modern High-Performance Systems Proceedings - Eighth International Conference On High-Performance Computing in Asia-Pacific Region, Hpc Asia 2005. 2005: 45-52. DOI: 10.1109/HPCASIA.2005.18 |
0.396 |
|
2005 |
D'Alberto P, Nicolau A. JuliusC: A practical approach for the analysis of divide-and-conquer algorithms Lecture Notes in Computer Science. 3602: 117-131. |
0.4 |
|
2004 |
Gupta S, Gupta RK, Dutt ND, Nicolau A. Coordinated parallelizing compiler optimizations and high-level synthesis Acm Transactions On Design Automation of Electronic Systems (Todaes). 9: 441-470. DOI: 10.1145/1027084.1027087 |
0.382 |
|
2004 |
Gupta S, Savoiu N, Dutt N, Gupta R, Nicolau A. Using Global Code Motions to Improve the Quality of Results for High-Level Synthesis Ieee Transactions On Computer-Aided Design of Integrated Circuits and Systems. 23: 302-312. DOI: 10.1109/Tcad.2003.822105 |
0.352 |
|
2003 |
Grun P, Dutt N, Nicolau A. Access pattern-based memory and connectivity architecture exploration Acm Transactions On Embedded Computing Systems (Tecs). 2: 33-73. DOI: 10.1145/605459.605462 |
0.345 |
|
2002 |
Mishra P, Tomiyama H, Dutt N, Nicolau A. Automatic verification of in-order execution in microprocessors with fragmented pipelines and multicycle functional units Proceedings -Design, Automation and Test in Europe, Date. 36-43. DOI: 10.1109/DATE.2002.998247 |
0.315 |
|
2001 |
Khare A, Halambi A, Savoiu N, Grun P, Dutt N, Nicolau A. V-SAT: A visual specification and analysis tool for system-on-chip exploration Journal of Systems Architecture. 47: 263-275. DOI: 10.1016/S1383-7621(00)00049-7 |
0.342 |
|
2000 |
Panda PR, Dutt ND, Nicolau A. On-chip vs. off-chip memory: the data partitioning problem in embedded processor-based systems Acm Transactions On Design Automation of Electronic Systems. 5: 682-704. DOI: 10.1145/348019.348570 |
0.392 |
|
2000 |
Azevedo A, Nicolau A, Hummel J. An annotation‐aware Java virtual machine implementation Concurrency and Computation: Practice and Experience. 12: 423-444. DOI: 10.1002/1096-9128(200005)12:6<423::Aid-Cpe483>3.0.Co;2-L |
0.537 |
|
1999 |
Panda P, Nakamura H, Dutt N, Nicolau A. Augmenting loop tiling with data alignment for improved cache performance Ieee Transactions On Computers. 48: 142-149. DOI: 10.1109/12.752655 |
0.406 |
|
1999 |
Kolson DJ, Nicolau A, Dutt N. Copy elimination for parallelizing compilers Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1656: 275-289. |
0.32 |
|
1997 |
Panda PR, Dutt ND, Nicolau A. Memory data organization for improved cache performance in embedded processor applications Acm Transactions On Design Automation of Electronic Systems. 2: 384-409. DOI: 10.1145/268424.268464 |
0.402 |
|
1997 |
Novack S, Nicolau A. Resource-directed loop pipelining: Exposing just enough parallelism Computer Journal. 40. DOI: 10.1093/Comjnl/40.6.311 |
0.451 |
|
1997 |
Panda PR, Nakamura H, Dutt ND, Nicolau A. Improving cache performance through tiling and data alignment Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1253: 168-185. DOI: 10.1007/3-540-63138-0_16 |
0.448 |
|
1997 |
Hummel J, Azevedo A, Kolson D, Nicolau A. Annotating the Java bytecodes in support of optimization Concurrency Practice and Experience. 9: 1003-1016. DOI: 10.1002/(Sici)1096-9128(199711)9:11<1003::Aid-Cpe346>3.0.Co;2-G |
0.588 |
|
1996 |
Kolson DJ, Nicolau A, Dutt N, Kennedy K. Optimal register assignment to loops for embedded code generation Acm Transactions On Design Automation of Electronic Systems (Todaes). 1: 251-279. DOI: 10.1145/233539.233542 |
0.444 |
|
1996 |
Wang H, Nicolau A, Keung S, Siu K. Computing programs containing band linear recurrences on vector supercomputers Ieee Transactions On Parallel and Distributed Systems. 7: 769-782. DOI: 10.1109/71.532109 |
0.462 |
|
1996 |
Kolson DJ, Nicolau A, Dutt N. Elimination of redundant memory traffic in high-level synthesis Ieee Transactions On Computer-Aided Design of Integrated Circuits and Systems. 15: 1354-1364. DOI: 10.1109/43.543768 |
0.343 |
|
1995 |
Gong J, Gajski DD, Nicolau A. Performance Evaluation for Application-Specific Architectures Ieee Transactions On Very Large Scale Integration (Vlsi) Systems. 3: 483-490. DOI: 10.1109/92.475967 |
0.408 |
|
1995 |
Aiken A, Nicolau A, Novack S. Resource-Constrained Software Pipelining Ieee Transactions On Parallel and Distributed Systems. 6: 1248-1270. DOI: 10.1109/71.476167 |
0.763 |
|
1995 |
Capitanio A, Nicolau A, Dutt N. A hypergraph-based model for port allocation on multiple-register-file VLIW architectures International Journal of Parallel Programming. 23: 499-513. DOI: 10.1007/Bf02577864 |
0.377 |
|
1995 |
Novack S, Nicolau A. A hierarchical approach to instruction-level parallelization International Journal of Parallel Programming. 23: 35-62. DOI: 10.1007/Bf02577783 |
0.379 |
|
1993 |
Nicolau A. Massive Parallelism And Fine-Grain Parallelism: Are They Incompatible? International Journal of High Speed Computing. 5: 271-292. DOI: 10.1142/S0129053393000128 |
0.347 |
|
1993 |
Yang J, Bic L, Nicolau A. A Mapping Strategy For Mimd Computers International Journal of High Speed Computing. 5: 89-123. DOI: 10.1142/S0129053393000062 |
0.346 |
|
1992 |
Hummel J, Hendren LJ, Nicolau A. Abstract Description of Pointer Data Structures: An Approach for Improving the Analysis and Optimization of Imperative Programs Acm Letters On Programming Languages and Systems (Loplas). 1: 243-260. DOI: 10.1145/151640.151644 |
0.349 |
|
1990 |
Hendren LJ, Nicolau A. Parallelizing Programs with Recursive Data Structures Ieee Transactions On Parallel and Distributed Systems. 1: 35-47. DOI: 10.1109/71.80123 |
0.378 |
|
1990 |
Beck M, Pingali KK, Nicolau A. Static scheduling for dynamic dataflow machines Journal of Parallel and Distributed Computing. 10: 279-288. DOI: 10.1016/0743-7315(90)90030-S |
0.396 |
|
1989 |
Bilardi G, Nicolau A. Adaptive Bitonic Sorting: An Optimal Parallel Algorithm for Shared Memory Machines Siam Journal On Computing. 18: 216-228. DOI: 10.1137/0218014 |
0.381 |
|
1989 |
Nicolau A. Runtime Disambiguation: Coping with Statically Unpredictable Dependencies Ieee Transactions On Computers. 38: 663-678. DOI: 10.1109/12.24269 |
0.434 |
|
1988 |
Aiken A, Nicolau A. A Development Environment for Horizontal Microcode Ieee Transactions On Software Engineering. 14: 584-594. DOI: 10.1109/32.6136 |
0.721 |
|
1988 |
Nicolau A. Loop quantization: A generalized loop unwinding technique Journal of Parallel and Distributed Computing. 5: 568-586. DOI: 10.1016/0743-7315(88)90013-5 |
0.336 |
|
1988 |
Nicolau A, Pingali K, Aiken A. Fine-grain compilation for pipelined machines The Journal of Supercomputing. 2: 279-295. DOI: 10.1007/Bf00129781 |
0.314 |
|
1986 |
Nicolau A. A development environment for scientific parallel programs Applied Mathematics and Computation. 20: 175-183. DOI: 10.1016/0096-3003(86)90132-3 |
0.396 |
|
1986 |
Karplus K, Nicolau A. A compiler-driven supercomputer Applied Mathematics and Computation. 20: 95-110. DOI: 10.1016/0096-3003(86)90128-1 |
0.449 |
|
1984 |
Nicolau A, Fisher JA. Measuring the Parallelism Available for Very Long Instruction Word Architectures Ieee Transactions On Computers. 968-976. DOI: 10.1109/TC.1984.1676371 |
0.332 |
|
1983 |
Cohen J, Nicolau A. Comparison of Compacting Algorithms for Garbage Collection Acm Transactions On Programming Languages and Systems (Toplas). 5: 532-553. DOI: 10.1145/69575.357226 |
0.442 |
|