Zhewei Yao
Zhewei Yao
Verified email at berkeley.edu - Homepage
Title
Cited by
Cited by
Year
Hessian-based analysis of large batch training and robustness to adversaries
Z Yao, A Gholami, Q Lei, K Keutzer, MW Mahoney
Advances in Neural Information Processing Systems, 4949-4959, 2018
382018
Inexact non-convex Newton-type methods
Z Yao, P Xu, F Roosta-Khorasani, MW Mahoney
arXiv preprint arXiv:1802.06925, 2018
232018
Q-bert: Hessian based ultra low precision quantization of bert
S Shen, Z Dong, J Ye, L Ma, Z Yao, A Gholami, MW Mahoney, K Keutzer
AAAI 2020, 2019
202019
Hawq: Hessian aware quantization of neural networks with mixed-precision
Z Dong, Z Yao, A Gholami, MW Mahoney, K Keutzer
Proceedings of the IEEE International Conference on Computer Vision, 293-302, 2019
202019
On the computational inefficiency of large batch sizes for stochastic gradient descent
N Golmant, N Vemuri, Z Yao, V Feinberg, A Gholami, K Rothauge, ...
arXiv preprint arXiv:1811.12941, 2018
172018
Shallow learning for fluid flow reconstruction with limited sensors and limited data
NB Erichson, L Mathelin, Z Yao, SL Brunton, MW Mahoney, JN Kutz
arXiv preprint arXiv:1902.07358, 2019
162019
A TV-Gaussian prior for infinite-dimensional Bayesian inverse problems and its numerical implementations
Z Yao, Z Hu, J Li
Inverse Problems 32 (7), 075006, 2016
162016
Large batch size training of neural networks with adversarial training and second-order information
Z Yao, A Gholami, K Keutzer, M Mahoney
arXiv preprint arXiv:1810.01021, 2018
152018
ANODEV2: A Coupled Neural ODE Framework
T Zhang, Z Yao, A Gholami, JE Gonzalez, K Keutzer, MW Mahoney, ...
Advances in Neural Information Processing Systems, 5152-5162, 2019
9*2019
Trust region based adversarial attack on neural networks
Z Yao, A Gholami, P Xu, K Keutzer, MW Mahoney
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019
72019
On an adaptive preconditioned Crank–Nicolson MCMC algorithm for infinite dimensional Bayesian inference
Z Hu, Z Yao, J Li
Journal of Computational Physics 332, 492-503, 2017
72017
JumpReLU: A Retrofit Defense Strategy for Adversarial Attacks
NB Erichson, Z Yao, MW Mahoney
ICPRAM 2020, 2019
52019
Inefficiency of K-FAC for large batch size training
L Ma, G Montague, J Ye, Z Yao, A Gholami, K Keutzer, MW Mahoney
AAAI 2020, 2019
42019
ZeroQ: A Novel Zero Shot Quantization Framework
Y Cai, Z Yao, Z Dong, A Gholami, MW Mahoney, K Keutzer
CVPR 2020, 2020
32020
PyHessian: Neural Networks Through the Lens of the Hessian
Z Yao, A Gholami, K Keutzer, M Mahoney
arXiv preprint arXiv:1912.07145, 2019
32019
HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks
Z Dong, Z Yao, Y Cai, D Arfeen, A Gholami, MW Mahoney, K Keutzer
Advances in Neural Information Processing Systems, Beyond First order Method …, 2019
3*2019
Rethinking Batch Normalization in Transformers
S Shen, Z Yao, A Gholami, M Mahoney, K Keutzer
arXiv preprint arXiv:2003.07845, 2020
12020
Parameter Re-Initialization through Cyclical Batch Size Schedules
N Mu, Z Yao, A Gholami, K Keutzer, M Mahoney
Advances in Neural Information Processing Systems MLSYS Workshop, 2018
12018
Residual Networks as Nonlinear Systems: Stability Analysis using Linearization
K Rothauge, Z Yao, Z Hu, MW Mahoney
arXiv preprint arXiv:1905.13386, 2019
2019
A hybrid adaptive MCMC algorithm in function spaces
Q Zhou, Z Hu, Z Yao, J Li
SIAM/ASA Journal on Uncertainty Quantification 5 (1), 621-639, 2017
2017
The system can't perform the operation now. Try again later.
Articles 1–20