Conditioning deep generative raw audio models for structured automatic music R Manzelli, V Thakkar, A Siahkamari, B Kulis arXiv preprint arXiv:1806.09905, 2018 | 56 | 2018 |
Flashattention-3: Fast and accurate attention with asynchrony and low-precision J Shah, G Bikshandi, Y Zhang, V Thakkar, P Ramani, T Dao arXiv preprint arXiv:2407.08608, 2024 | 42 | 2024 |
An end to end model for automatic music generation: Combining deep raw and symbolic audio networks R Manzelli, V Thakkar, A Siahkamari, B Kulis Proceedings of the musical metacreation workshop at 9th international …, 2018 | 31 | 2018 |
CUTLASS V Thakkar, P Ramani, C Cecka, A Shivam, H Lu, E Yan, J Kosaian, ... github, 2023 | 23 | 2023 |
Scalable knowledge graph analytics at 136 petaflop/s R Kannan, P Sao, H Lu, D Herrmannova, V Thakkar, R Patton, R Vuduc, ... SC20: International Conference for High Performance Computing, Networking …, 2020 | 11 | 2020 |
Flashattention-3: Fast and accurate attention with asynchrony and low-precision, 2024 J Shah, G Bikshandi, Y Zhang, V Thakkar, P Ramani, T Dao URL https://arxiv. org/abs/2407.08608, 0 | 6 | |
fvdb: A deep-learning framework for sparse, large scale, and high performance spatial intelligence F Williams, J Huang, J Swartz, G Klar, V Thakkar, M Cong, X Ren, R Li, ... ACM Transactions on Graphics (TOG) 43 (4), 1-15, 2024 | 5 | 2024 |
Scalable All-pairs Shortest Paths for Huge Graphs on Multi-GPU Clusters P Sao, H Lu, R Kannan, V Thakkar, R Vuduc, T Potok Proceedings of the 30th International Symposium on High-Performance Parallel …, 2021 | 5 | 2021 |
Exaflops biomedical knowledge graph analytics R Kannan, P Sao, H Lu, J Kurzak, G Schenk, Y Shi, SH Lim, S Israni, ... SC22: International Conference for High Performance Computing, Networking …, 2022 | 4 | 2022 |
CUTLASS, January 2023 V Thakkar, P Ramani, C Cecka, A Shivam, H Lu, E Yan, J Kosaian, ... URL https://github. com/NVIDIA/cutlass, 0 | 4 | |
Knowledge graph analytics kernels in high performance computing R Kannan, PK Sao, H Lu, D Herrmannova, V Thakkar, RM Patton, ... US Patent App. 17/389,862, 2022 | 2 | 2022 |
Dense semiring linear algebra on modern cuda hardware V Thakkar, R Kannan, P Sao, H Lu, D Herrmannova, R Patton, R Vuduc, ... SIAM Computational Sciences and Engineering. SIAM, 2021 | 2 | 2021 |
Scalable knowledge-graph analytics at 136 petaflop/s D Herrmannova, R Kannan, PK Sao, H Lu, RM Patton, TE Potok, ... Oak Ridge National Lab.(ORNL), Oak Ridge, TN (United States). Oak Ridge …, 2020 | 1 | 2020 |
Application programming interface to synchronize matrix multiply-accumulate memory transactions HC Edwards, K Perelygin, M Tyrlik, GRHC Shekhara, BKY Atukuri, ... US Patent App. 18/072,053, 2024 | | 2024 |
Application programming interface to indicate operations to be performed by corresponding streaming multiprocessors HC Edwards, K Perelygin, M Tyrlik, GRHC Shekhara, BKY Atukuri, ... US Patent App. 18/072,300, 2024 | | 2024 |
Application programming interface to indicate matrix multiply-accumulate HC Edwards, K Perelygin, M Tyrlik, GRHC Shekhara, BKY Atukuri, ... US Patent App. 18/072,060, 2024 | | 2024 |
Application programming interface to wait on matrix multiply-accumulate HC Edwards, K Perelygin, M Tyrlik, GRHC Shekhara, BKY Atukuri, ... US Patent App. 18/072,081, 2024 | | 2024 |
Critique of “MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization” by SCC Team From Georgia Tech N Prindle, A Kazmi, A Jain, A Chen, M Sorkin, S Agarwal, R Vuduc, ... IEEE Transactions on Parallel and Distributed Systems 33 (9), 2035-2038, 2022 | | 2022 |
Scalable All-pairs Shortest Paths for Huge Graphs on Multi-GPU Clusters R Kannan, V Thakkar, R Vuduc, T Potok HPDC'21: Proceedings of the 30th International Symposium on High-Performance …, 2020 | | 2020 |