Suraj Kumar

Cited by

	All	Since 2020
Citations	307	156
h-index	8	6
i10-index	7	2

20102011201220132014201520162017201820192020202120222023202420251 1 7 17 12 18 12 19 36 27 24 31 20 39 34 8

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Lionel Eyraud-DuboisVerified email at inria.fr
Olivier BeaumontINRIA Bordeaux Sud-OuestVerified email at labri.fr
Laura GrigoriEPFL and PSI, SwitzerlandVerified email at epfl.ch
Grey BallardWake Forest UniversityVerified email at wfu.edu
Hussam Al DaasComputational Mathematics Theme, Scientific Computing Department, STFCVerified email at stfc.ac.uk
Ankur NarangSigmoidstarVerified email at sigmoidstar.com
Michael PerroneGoogleVerified email at google.com
Samuel ThibaultProfesseur, Université de BordeauxVerified email at labri.fr
Loris MarchalLIP, CNRS, ENS LyonVerified email at ens-lyon.fr
Julien HerrmannCentre National de la Recherche Scientifique (CNRS - IRIT - APO)Verified email at irit.fr
Terry CojeanEvidenVerified email at eviden.com
Sriram KrishnamoorthyGoogleVerified email at google.com

Suraj Kumar

Inria and ENS Lyon

Verified email at inria.fr - Homepage

Tensor Computations Communication Costs Parallel Computing Scheduling Runtime Systems


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Optimized association rule mining using genetic algorithm M Anandhavalli, SK Sudhanshu, A Kumar, MK Ghose Advances in Information Mining 1 (2), 01-04, 2009	69	2009
From NWChem to NWChemEx: Evolving with the computational chemistry landscape K Kowalski, R Bair, NP Bauman, JS Boschen, EJ Bylaska, J Daily, ... Chemical reviews 121 (8), 4962-4998, 2021	67	2021
Are static schedules so bad? a case study on cholesky factorization E Agullo, O Beaumont, L Eyraud-Dubois, S Kumar 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2016	64	2016
Bridging the gap between performance and bounds of cholesky factorization on heterogeneous platforms E Agullo, O Beaumont, L Eyraud-Dubois, J Herrmann, S Kumar, ... 2015 IEEE International Parallel and Distributed Processing Symposium …, 2015	26	2015
Approximation proofs of a fast and efficient list scheduling algorithm for task-based runtime systems on multicores and gpus O Beaumont, L Eyraud-Dubois, S Kumar 2017 IEEE international parallel and distributed processing symposium (IPDPS …, 2017	21	2017
Scheduling of linear algebra kernels on multiple heterogeneous resources O Beaumont, T Cojean, L Eyraud-Dubois, A Guermouche, S Kumar 2016 IEEE 23rd International Conference on High Performance Computing (HiPC …, 2016	12	2016
Fast approximation algorithms for task‐based runtime systems O Beaumont, L Eyraud‐Dubois, S Kumar Concurrency and Computation: Practice and Experience 30 (17), e4502, 2018	10	2018
Scheduling of dense linear algebra kernels on heterogeneous resources S Kumar Université de Bordeaux, 2017	8	2017
Brief Announcement: Tight Memory-Independent Parallel Matrix Multiplication Communication Lower Bounds H Al Daas, G Ballard, L Grigori, S Kumar, K Rouse Proceedings of the 34th ACM Symposium on Parallelism in Algorithms and …, 2022	7	2022
Analysis of a list scheduling algorithm for task graphs on two types of resources L Eyraud-Dubois, S Kumar 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2020	6	2020
Parallel tensor train through hierarchical decomposition L Grigori, S Kumar	5	2021
Performance optimizations for TTI RTM on GPU based hybrid architectures A Narang, S Kumar, AS Das, M Perrone, D Wade, K Bendiksen, V Slatten, ... Biennial International Conference & Exposition, 2013	4	2013
NWChemEx–computational chemistry for the exascale era H van Dam, E Apra, R Bair, J Boschen, E Bylaska, W De Jong, T Dunning, ... Bulletin of the American Physical Society 65, 2020	3	2020
Communication lower bounds and optimal algorithms for multiple tensor-times-matrix computation H Al Daas, G Ballard, L Grigori, S Kumar, K Rouse SIAM Journal on Matrix Analysis and Applications 45 (1), 450-477, 2024	2	2024
Parallel Memory-Independent Communication Bounds for SYRK H Al Daas, G Ballard, L Grigori, S Kumar, K Rouse Proceedings of the 35th ACM Symposium on Parallelism in Algorithms and …, 2023	1	2023
Maximizing TTI RTM throughput for CPU+ GPU A Narang, S Kumar, J Soman, M Perrone, D Wade, K Bendiksen, ... European Association of Geoscientists and Engineers Conference, 2013	1	2013
Maximizing TTI RTM Throughput for CPU+ GPU D Wade, A Narang, S Kumar, J Soman, M Perrone, K Bendiksen, V Slĺtten, ... 75th EAGE Conference & Exhibition incorporating SPE EUROPEC 2013, cp-348-00751, 2013	1	2013
Communication Lower Bounds and Optimal Algorithms for Symmetric Matrix Computations HA Daas, G Ballard, L Grigori, S Kumar, K Rouse, M Verite arXiv preprint arXiv:2409.11304, 2024		2024
Performance Models for Data Transfers: A Case Study with Molecular Chemistry Kernels S Kumar, L Eyraud-Dubois, S Krishnamoorthy Proceedings of the 48th International Conference on Parallel Processing, 1-10, 2019		2019

The system can't perform the operation now. Try again later.

Articles 1–19

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors