A network-centric hardware/algorithm co-design to accelerate distributed training of deep neural networks Y Li, J Park, M Alian, Y Yuan, Z Qu, P Pan, R Wang, A Schwing, ... 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture …, 2018 | 99 | 2018 |

DOTA: detect and omit weak attentions for scalable transformer acceleration Z Qu, L Liu, F Tu, Z Chen, Y Ding, Y Xie Proceedings of the 27th ACM International Conference on Architectural …, 2022 | 50 | 2022 |

H2learn: High-efficiency learning accelerator for high-accuracy spiking neural networks L Liang, Z Qu, Z Chen, F Tu, Y Wu, L Deng, G Li, P Li, Y Xie IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2021 | 26 | 2021 |

DUET: Boosting Deep Neural Network Efficiency on Dual-Module Architecture L Liu, Z Qu, L Deng, F Tu, S Li, X Hu, Z Gu, Y Ding, Y Xie 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020 | 26 | 2020 |

Efficient tensor core-based GPU kernels for structured sparsity under reduced precision Z Chen, Z Qu, L Liu, Y Ding, Y Xie Proceedings of the International Conference for High Performance Computing …, 2021 | 24 | 2021 |

Dynamic Sparse Attention for Scalable Transformer Acceleration L Liu, Z Qu, Z Chen, F Tu, Y Ding, Y Xie IEEE Transactions on Computers, 2022 | 22* | 2022 |

INSPIRE: __in__-__s__torage __p__rivate __i__nformation __re__trieval via protocol and architecture co-designJ Lin, L Liang, Z Qu, I Ahmad, L Liu, F Tu, T Gupta, Y Ding, Y Xie Proceedings of the 49th Annual International Symposium on Computer …, 2022 | 18 | 2022 |

Improving Streaming Graph Processing Performance using Input Knowledge A Basak, Z Qu, J Lin, AR Alameldeen, Z Chishti, Y Ding, Y Xie MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021 | 15 | 2021 |

Dynamic n: M fine-grained structured sparse attention mechanism Z Chen, Z Qu, Y Quan, L Liu, Y Ding, Y Xie Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and …, 2023 | 11 | 2023 |

ENMC: Extreme Near-Memory Classification via Approximate Screening L Liu, J Lin, Z Qu, Y Ding, Y Xie MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021 | 9 | 2021 |

Hardware-Enabled Efficient Data Processing with Tensor-Train Decomposition Z Qu, L Deng, B Wang, H Chen, J Lin, L Liang, G Li, Z Zhang, Y Xie IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2021 | 6 | 2021 |

ASP-SIFT: Using analog signal processing architecture to accelerate keypoint detection of SIFT algorithm Z Fan, Z Liu, Z Qu, F Qiao, Q Wei, X Liu, Y Sun, S Xu, H Yang IEEE Transactions on Very Large Scale Integration (VLSI) Systems 28 (1), 198-211, 2019 | 4 | 2019 |

SPG: Structure-Private Graph Database via SqueezePIR L Liang, J Lin, Z Qu, I Ahmad, F Tu, T Gupta, Y Ding, Y Xie Proceedings of the VLDB Endowment 16 (7), 1615-1628, 2023 | 3 | 2023 |

Tensor train decomposition for solving large-scale linear equations H Chen, L Deng, Z Qu, L Liang, T Yan, Y Xie, G Li Neurocomputing 464, 203-217, 2021 | 2 | 2021 |

TT-GNN: Efficient On-Chip Graph Neural Network Training via Embedding Reformation and Hardware Optimization Z Qu, D Niu, S Li, H Zheng, Y Xie Proceedings of the 56th Annual IEEE/ACM International Symposium on …, 2023 | | 2023 |

Addressing Data Explosion Issue in Emerging Deep Learning Applications Z Qu University of California, Santa Barbara, 2023 | | 2023 |

DFSSATTEN: Dynamic Fine-grained Structured Sparse Attention Mechanism Z Chen, L Liu, Y Quan, Z Qu, Y Ding, Y Xie | | 2021 |

Efficient Processing of Sparse Tensor Decomposition via Unified Abstraction and PE-interactive Architecture B Wang, L Deng, Z Qu, S Li, Z Zhang, Y Xie IEEE Transactions on Computers 71 (2), 266-281, 2021 | | 2021 |