Pencil: A platform-neutral compute intermediate language for accelerator programming R Baghdadi, U Beaugnon, A Cohen, T Grosser, M Kruse, C Reddy, ...
2015 International Conference on Parallel Architecture and Compilation (PACT …, 2015
161 2015 High-performance generalized tensor operations: A compiler-oriented approach R Gareev, T Grosser, M Kruse
ACM Transactions on Architecture and Code Optimization (TACO) 15 (3), 1-27, 2018
42 2018 Autotuning polybench benchmarks with llvm clang/polly loop optimization pragmas using bayesian optimization X Wu, M Kruse, P Balaprakash, H Finkel, P Hovland, V Taylor, M Hall
Concurrency and Computation: Practice and Experience 34 (20), e6683, 2022
32 2022 Reduction drawing: Language constructs and polyhedral compilation for reductions on gpu C Reddy, M Kruse, A Cohen
Proceedings of the 2016 International Conference on Parallel Architectures …, 2016
30 2016 Lattice QCD estimate of the decay rate D Becirevic, M Kruse, F Sanfilippo
arXiv preprint arXiv:1411.6426, 2014
23 2014 Qiral: A high level language for lattice qcd code generation D Barthou, G Grosdidier, M Kruse, O Pene, C Tadonki
arXiv preprint arXiv:1208.4035, 2012
18 2012 A polyhedral compilation framework for loops with dynamic data-dependent bounds J Zhao, M Kruse, A Cohen
Proceedings of the 27th International Conference on Compiler Construction, 14-24, 2018
17 2018 DeLICM: scalar dependence removal at zero memory cost M Kruse, T Grosser
Proceedings of the 2018 International Symposium on Code Generation and …, 2018
15 2018 Outcomes of openMP hackathon: openMP application experiences with the offloading model (part II) B Chapman, B Pham, C Yang, C Daley, C Bertoni, D Kulkarni, ...
OpenMP: Enabling Massive Node-Level Parallelism: 17th International Workshop …, 2021
11 2021 Autotuning search space for loop transformations M Kruse, H Finkel, X Wu
2020 IEEE/ACM 6th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM …, 2020
11 2020 User-directed loop-transformations in Clang M Kruse, H Finkel
2018 IEEE/ACM 5th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM …, 2018
9 2018 A proposal for loop-transformation pragmas M Kruse, H Finkel
Evolving OpenMP for Evolving Architectures: 14th International Workshop on …, 2018
9 2018 Introducing Molly: distributed memory parallelization with LLVM M Kruse
arXiv preprint arXiv:1409.2088, 2014
8 2014 ytopt: Autotuning scientific applications for energy efficiency at large scales X Wu, P Balaprakash, M Kruse, J Koo, B Videau, P Hovland, V Taylor, ...
arXiv preprint arXiv:2303.16245, 2023
7 2023 Customized Monte Carlo tree search for LLVM/Polly's composable loop optimization transformations J Koo, P Balaprakash, M Kruse, X Wu, P Hovland, M Hall
2021 International Workshop on Performance Modeling, Benchmarking and …, 2021
7 2021 Outcomes of OpenMP Hackathon: OpenMP Application Experiences with the Offloading Mode S Pophale, D Oryspayev, B Chapman, B Pham, C Yang, C Daley, ...
Brookhaven National Lab.(BNL), Upton, NY (United States), 2021
6 2021 Lattice QCD estimate of the η c (2S)→ J/ψγ decay rate D Bečirević, M Kruse, F Sanfilippo
Journal of High Energy Physics 2015 (5), 1-19, 2015
6 2015 Loop Transformations using Clang’s abstract syntax tree M Kruse
50th International Conference on Parallel Processing Workshop, 1-7, 2021
5 2021 Design and use of loop-transformation pragmas M Kruse, H Finkel
OpenMP: Conquering the Full Hardware Spectrum: 15th International Workshop …, 2019
5 2019 atJIT: A just-in-time autotuning compiler for C++ K Farvardin, H Finkel, M Kruse, J Reppy
LLVM Developers Meeting Technical Talk, 2018
5 2018