Pencil: A platform-neutral compute intermediate language for accelerator programming R Baghdadi, U Beaugnon, A Cohen, T Grosser, M Kruse, C Reddy, ... 2015 International Conference on Parallel Architecture and Compilation (PACT …, 2015 | 168 | 2015 |
Autotuning polybench benchmarks with llvm clang/polly loop optimization pragmas using bayesian optimization X Wu, M Kruse, P Balaprakash, H Finkel, P Hovland, V Taylor, M Hall Concurrency and Computation: Practice and Experience 34 (20), e6683, 2022 | 44 | 2022 |
High-performance generalized tensor operations: A compiler-oriented approach R Gareev, T Grosser, M Kruse ACM Transactions on Architecture and Code Optimization (TACO) 15 (3), 1-27, 2018 | 44 | 2018 |
Reduction drawing: Language constructs and polyhedral compilation for reductions on gpu C Reddy, M Kruse, A Cohen Proceedings of the 2016 International Conference on Parallel Architectures …, 2016 | 34 | 2016 |
Lattice QCD estimate of the decay rate D Becirevic, M Kruse, F Sanfilippo arXiv preprint arXiv:1411.6426, 2014 | 25 | 2014 |
A polyhedral compilation framework for loops with dynamic data-dependent bounds J Zhao, M Kruse, A Cohen Proceedings of the 27th International Conference on Compiler Construction, 14-24, 2018 | 19 | 2018 |
Qiral: A high level language for lattice qcd code generation D Barthou, G Grosdidier, M Kruse, O Pene, C Tadonki arXiv preprint arXiv:1208.4035, 2012 | 18 | 2012 |
ytopt: Autotuning scientific applications for energy efficiency at large scales X Wu, P Balaprakash, M Kruse, J Koo, B Videau, P Hovland, V Taylor, ... Concurrency and Computation: Practice and Experience 37 (1), e8322, 2025 | 15 | 2025 |
DeLICM: scalar dependence removal at zero memory cost M Kruse, T Grosser Proceedings of the 2018 International Symposium on Code Generation and …, 2018 | 15 | 2018 |
Outcomes of openMP hackathon: openMP application experiences with the offloading model (part II) B Chapman, B Pham, C Yang, C Daley, C Bertoni, D Kulkarni, ... OpenMP: Enabling Massive Node-Level Parallelism: 17th International Workshop …, 2021 | 14 | 2021 |
Customized Monte Carlo tree search for LLVM/Polly's composable loop optimization transformations J Koo, P Balaprakash, M Kruse, X Wu, P Hovland, M Hall 2021 International Workshop on Performance Modeling, Benchmarking and …, 2021 | 11 | 2021 |
Autotuning search space for loop transformations M Kruse, H Finkel, X Wu 2020 IEEE/ACM 6th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM …, 2020 | 11 | 2020 |
User-directed loop-transformations in Clang M Kruse, H Finkel 2018 IEEE/ACM 5th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM …, 2018 | 11 | 2018 |
A proposal for loop-transformation pragmas M Kruse, H Finkel Evolving OpenMP for Evolving Architectures: 14th International Workshop on …, 2018 | 10 | 2018 |
Lattice QCD estimate of the η c (2S)→ J/ψγ decay rate D Bečirević, M Kruse, F Sanfilippo Journal of High Energy Physics 2015 (5), 1-19, 2015 | 9 | 2015 |
Outcomes of OpenMP Hackathon: OpenMP Application Experiences with the Offloading Mode S Pophale, D Oryspayev, B Chapman, B Pham, C Yang, C Daley, ... Brookhaven National Lab.(BNL), Upton, NY (United States), 2021 | 8 | 2021 |
Introducing Molly: distributed memory parallelization with LLVM M Kruse arXiv preprint arXiv:1409.2088, 2014 | 8 | 2014 |
Loop Transformations using Clang’s abstract syntax tree M Kruse 50th International Conference on Parallel Processing Workshop, 1-7, 2021 | 7 | 2021 |
Commissioning of the control and data acquisition electronics for the CDF silicon vertex detector SM Tkaczyk, KJ Turner, CA Nelson, TM Shaw, TR Wesson, MW Bailey, ... Fermi National Accelerator Lab., Batavia, IL (United States), 1991 | 7 | 1991 |
Design and use of loop-transformation pragmas M Kruse, H Finkel OpenMP: Conquering the Full Hardware Spectrum: 15th International Workshop …, 2019 | 6 | 2019 |