Heterogeneous system coherence for integrated CPU-GPU systems J Power, A Basu, J Gu, S Puthoor, BM Beckmann, MD Hill, SK Reinhardt, ... Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013 | 209 | 2013 |
PPEP: Online performance, power, and energy prediction framework and DVFS space exploration B Su, J Gu, L Shen, W Huang, JL Greathouse, Z Wang 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture, 445-457, 2014 | 114 | 2014 |
WADE: Writeback-aware dynamic cache management for NVM-based main memory system Z Wang, S Shan, T Cao, J Gu, Y Xu, S Mu, Y Xie, DA Jiménez ACM Transactions on Architecture and Code Optimization (TACO) 10 (4), 1-21, 2013 | 59 | 2013 |
Implementing a leading loads performance predictor on commodity processors B Su, JL Greathouse, J Gu, M Boyer, L Shen, Z Wang 2014 USENIX Annual Technical Conference (USENIX ATC 14), 2014 | 47 | 2014 |
OpenCL Caffe: Accelerating and enabling a cross platform machine learning framework J Gu, Y Liu, Y Gao, M Zhu Proceedings of the 4th International Workshop on OpenCL, 1-5, 2016 | 40 | 2016 |
Implementation and evaluation of deep neural networks (DNN) on mainstream heterogeneous systems J Gu, M Zhu, Z Zhou, F Zhang, Z Lin, Q Zhang, M Breternitz Proceedings of 5th Asia-Pacific Workshop on Systems, 1-7, 2014 | 37 | 2014 |
A hybrid GPU + FPGA system design for autonomous driving cars C Hao, A Sarwari, Z Jin, H Abu-Haimed, D Sew, Y Li, X Liu, B Wu, D Fu, ... 2019 IEEE International Workshop on Signal Processing Systems (SiPS), 121-126, 2019 | 29 | 2019 |
NAIS: Neural architecture and implementation search and its applications in autonomous driving C Hao, Y Chen, X Liu, A Sarwari, D Sew, A Dhar, B Wu, D Fu, J Xiong, ... 2019 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 1-8, 2019 | 24 | 2019 |
Self-supervised learning of depth and ego-motion with differentiable bundle adjustment Y Shi, J Zhu, Y Fang, K Lien, J Gu arXiv preprint arXiv:1909.13163, 2019 | 12 | 2019 |
Moving data between caches in a heterogeneous processor system J Gu, BM Beckmann, Y Xie US Patent 9,652,390, 2017 | 11 | 2017 |
Optimizing a parallel video encoder with message passing and a shared memory architecture J Gu, Y Sun Tsinghua Science and Technology 16 (4), 393-398, 2011 | 9 | 2011 |
Structure-attentioned memory network for monocular depth estimation J Zhu, Y Shi, M Ren, Y Fang, KC Lien, J Gu arXiv preprint arXiv:1909.04594, 2019 | 5 | 2019 |
MOPED: Orchestrating interprocess message data on CMPs J Gu, SS Lumetta, R Kumar, Y Sun 2011 IEEE 17th International Symposium on High Performance Computer …, 2011 | 4 | 2011 |
iCHAT: inter-cache hardware-assistant data transfer for heterogeneous chip multiprocessors J Gu, BM Beckmann, T Cao, Y Hu 2014 9th IEEE International Conference on Networking, Architecture, and …, 2014 | 2 | 2014 |
Accelerating data movement on future chip multi-processors J Gu, R Kumar, SS Lumetta, Y Sun Proceedings of the Second International Forum on Next-Generation Multicore …, 2010 | 2 | 2010 |
MOPED: Accelerating data communication on future CMPs J Gu, Y Sun, SS Lumetta, R Kumar IEEE Micro 31 (4), 42-50, 2011 | 1 | 2011 |
Enhancing lifetime of non-volatile cache by injecting random replacement policy Z Wang, Y Xie, Y Xu, J Gu, T Cao US Patent 9,792,228, 2017 | | 2017 |
Enhancing lifetime of non-volatile cache by reducing intra-block write variation Z Wang, Y Xie, Y Xu, J Gu, T Cao US Patent 9,767,043, 2017 | | 2017 |
Thermal-aware compiler for parallel instruction execution in processors Y Xie, J Gu US Patent 9,639,359, 2017 | | 2017 |
Method and apparatus related to cache memory Z Wang, J Gu, Y Xu US Patent 9,552,301, 2017 | | 2017 |