Follow
Shigang Li
Title
Cited by
Cited by
Year
Deep learning for post-processing ensemble weather forecasts
P Grönquist, C Yao, T Ben-Nun, N Dryden, P Dueben, S Li, T Hoefler
Philosophical Transactions of the Royal Society A 379 (2194), 20200092, 2021
1782021
Data Movement Is All You Need: A Case Study on Optimizing Transformers
A Ivanov, N Dryden, T Ben-Nun, S Li, T Hoefler
Proceedings of Machine Learning and Systems 3, 2021
1362021
NUMA-aware shared-memory collective communication for MPI
S Li, T Hoefler, M Snir
Proceedings of the 22nd international symposium on High-performance parallel …, 2013
1202013
Parallel processing systems for big data: a survey
Y Zhang, T Cao, S Li, X Tian, L Yuan, H Jia, AV Vasilakos
Proceedings of the IEEE 104 (11), 2114-2136, 2016
1152016
Chimera: efficiently training large-scale neural networks with bidirectional pipelines
S Li, T Hoefler
Proceedings of the International Conference for High Performance Computing …, 2021
1122021
CAS‐ESM 2: Description and climate simulation performance of the Chinese Academy of Sciences (CAS) Earth System Model (ESM) version 2
H Zhang, M Zhang, J Jin, K Fei, D Ji, C Wu, J Zhu, J He, Z Chai, J Xie, ...
Journal of Advances in Modeling Earth Systems, e2020MS002210, 2020
802020
Taming unbalanced training workloads in deep learning with partial collective operations
S Li, T Ben-Nun, SD Girolamo, D Alistarh, T Hoefler
Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of …, 2020
622020
Asynchronous Decentralized SGD with Quantized and Local Updates
G Nadiradze, A Sabour, P Davies, S Li, D Alistarh
Advances in Neural Information Processing Systems 34, 2021
542021
Intra-hour Photovoltaic Generation Forecasting based on Multi-source Data and Deep Learning Methods
T Yao, J Wang, H Wu, P Zhang, S Li, K Xu, X Liu, X Chi
IEEE Transactions on Sustainable Energy, 2021
472021
Flare: flexible in-network allreduce
D De Sensi, S Di Girolamo, S Ashkboos, S Li, T Hoefler
Proceedings of the International Conference for High Performance Computing …, 2021
452021
Near-optimal sparse allreduce for distributed deep learning
S Li, T Hoefler
Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of …, 2022
442022
Improved MPI collectives for MPI processes in shared address spaces
S Li, T Hoefler, C Hu, M Snir
Cluster Computing 17 (4), 1139-1155, 2014
342014
Cache-oblivious MPI all-to-all communications based on Morton order
S Li, Y Zhang, T Hoefler
IEEE Transactions on Parallel and Distributed Systems, 2018
312018
A photovoltaic power output dataset: Multi-source photovoltaic power output dataset with Python toolkit
T Yao, J Wang, H Wu, P Zhang, S Li, Y Wang, X Chi, M Shi
Solar Energy 230, 122-130, 2021
302021
Efficient quantized sparse matrix operations on tensor cores
S Li, K Osawa, T Hoefler
SC22: International Conference for High Performance Computing, Networking …, 2022
292022
Kernel optimization for short-range molecular dynamics
C Hu, X Wang, J Li, X He, S Li, Y Feng, S Yang, H Bai
Computer Physics Communications, 2016
212016
Efficient parallel optimizations of a high-performance SIFT on GPUs
Z Li, H Jia, Y Zhang, S Liu, S Li, X Wang, H Zhang
Journal of Parallel and Distributed Computing, 2018
202018
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms
D Cheng, S Li, Z Hanping, F Xia, Y Zhang
IEEE Transactions on Parallel and Distributed Systems, 2021
18*2021
Massively Scaling the Metal Microscopic Damage Simulation on Sunway TaihuLight Supercomputer
S Li, B Wu, Y Zhang, X Wang, J Li, C Hu, J Wang, Y Feng, N Nie
Proceedings of the 47th International Conference on Parallel Processing, 47, 2018
182018
Hammingmesh: A network topology for large-scale deep learning
T Hoefler, T Bonato, D De Sensi, S Di Girolamo, S Li, M Heddes, J Belk, ...
SC22: International Conference for High Performance Computing, Networking …, 2022
172022
The system can't perform the operation now. Try again later.
Articles 1–20