Suraj Kumar
Title
Cited by
Year
Optimized association rule mining using genetic algorithm
M Anandhavalli, SK Sudhanshu, A Kumar, MK Ghose
Advances in Information Mining 1 (2), 01-04, 2009
Cited by: 70 · Year: 2009
Are static schedules so bad? A case study on Cholesky factorization
E Agullo, O Beaumont, L Eyraud-Dubois, S Kumar
2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2016
Cited by: 64 · Year: 2016
From NWChem to NWChemEx: Evolving with the computational chemistry landscape
K Kowalski, R Bair, NP Bauman, JS Boschen, EJ Bylaska, J Daily, ...
Chemical reviews 121 (8), 4962-4998, 2021
Cited by: 63 · Year: 2021
Bridging the gap between performance and bounds of Cholesky factorization on heterogeneous platforms
E Agullo, O Beaumont, L Eyraud-Dubois, J Herrmann, S Kumar, ...
2015 IEEE International Parallel and Distributed Processing Symposium …, 2015
Cited by: 25 · Year: 2015
Approximation proofs of a fast and efficient list scheduling algorithm for task-based runtime systems on multicores and GPUs
O Beaumont, L Eyraud-Dubois, S Kumar
2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2017
Cited by: 20 · Year: 2017
Scheduling of linear algebra kernels on multiple heterogeneous resources
O Beaumont, T Cojean, L Eyraud-Dubois, A Guermouche, S Kumar
2016 IEEE 23rd International Conference on High Performance Computing (HiPC …, 2016
Cited by: 11 · Year: 2016
Fast approximation algorithms for task‐based runtime systems
O Beaumont, L Eyraud‐Dubois, S Kumar
Concurrency and Computation: Practice and Experience 30 (17), e4502, 2018
Cited by: 10 · Year: 2018
Scheduling of dense linear algebra kernels on heterogeneous resources
S Kumar
Université de Bordeaux, 2017
Cited by: 8 · Year: 2017
Parallel tensor train through hierarchical decomposition
L Grigori, S Kumar
Cited by: 5 · Year: 2021
Analysis of a list scheduling algorithm for task graphs on two types of resources
L Eyraud-Dubois, S Kumar
2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2020
Cited by: 5 · Year: 2020
Brief Announcement: Tight Memory-Independent Parallel Matrix Multiplication Communication Lower Bounds
H Al Daas, G Ballard, L Grigori, S Kumar, K Rouse
Proceedings of the 34th ACM Symposium on Parallelism in Algorithms and …, 2022
Cited by: 4 · Year: 2022
Performance optimizations for TTI RTM on GPU based hybrid architectures
A Narang, S Kumar, AS Das, M Perrone, D Wade, K Bendiksen, V Slatten, ...
Biennial International Conference & Exposition, 2013
Cited by: 4 · Year: 2013
NWChemEx–computational chemistry for the exascale era
H van Dam, E Apra, R Bair, J Boschen, E Bylaska, W De Jong, T Dunning, ...
Bulletin of the American Physical Society 65, 2020
Cited by: 3 · Year: 2020
Communication lower bounds and optimal algorithms for multiple tensor-times-matrix computation
H Al Daas, G Ballard, L Grigori, S Kumar, K Rouse
SIAM Journal on Matrix Analysis and Applications 45 (1), 450-477, 2024
Cited by: 2 · Year: 2024
Parallel Memory-Independent Communication Bounds for SYRK
H Al Daas, G Ballard, L Grigori, S Kumar, K Rouse
Proceedings of the 35th ACM Symposium on Parallelism in Algorithms and …, 2023
Cited by: 1 · Year: 2023
Maximizing TTI RTM throughput for CPU+ GPU
A Narang, S Kumar, J Soman, M Perrone, D Wade, K Bendiksen, ...
European Association of Geoscientists and Engineers Conference, 2013
Cited by: 1 · Year: 2013
Maximizing TTI RTM Throughput for CPU+ GPU
D Wade, A Narang, S Kumar, J Soman, M Perrone, K Bendiksen, V Slåtten, ...
75th EAGE Conference & Exhibition incorporating SPE EUROPEC 2013, cp-348-00751, 2013
Cited by: 1 · Year: 2013
Communication Lower Bounds and Optimal Algorithms for Symmetric Matrix Computations
H Al Daas, G Ballard, L Grigori, S Kumar, K Rouse, M Verite
arXiv preprint arXiv:2409.11304, 2024
Year: 2024
Performance Models for Data Transfers: A Case Study with Molecular Chemistry Kernels
S Kumar, L Eyraud-Dubois, S Krishnamoorthy
Proceedings of the 48th International Conference on Parallel Processing, 1-10, 2019
Year: 2019