zhang pengyuan
Verified email at hccl.ioa.ac.cn
Title
Cited by
Year
Transformer-based online CTC/attention end-to-end speech recognition architecture
H Miao, G Cheng, C Gao, P Zhang, Y Yan
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by 123 (2020)
Using neural network front-ends on far field multiple microphones based speech recognition
Y Liu, P Zhang, T Hain
2014 IEEE international conference on acoustics, speech and signal …, 2014
Cited by 107 (2014)
Integrating the data augmentation scheme with various classifiers for acoustic scene modeling
H Chen, Z Liu, Z Liu, P Zhang, Y Yan
arXiv preprint arXiv:1907.06639, 2019
Cited by 83 (2019)
DPT-FSNet: Dual-path transformer based full-band and sub-band fusion network for speech enhancement
F Dang, H Chen, P Zhang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 74 (2022)
The effect of silence and dual-band fusion in anti-spoofing system
Y Zhang, W Wang, P Zhang
Proc. Interspeech, 2021
Cited by 62 (2021)
Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition.
H Miao, G Cheng, P Zhang, T Li, Y Yan
Interspeech, 2623-2627, 2019
Cited by 58 (2019)
Online hybrid CTC/attention end-to-end automatic speech recognition architecture
H Miao, G Cheng, P Zhang, Y Yan
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1452-1465, 2020
Cited by 46 (2020)
Semi-supervised DNN training in meeting recognition
P Zhang, Y Liu, T Hain
2014 IEEE Spoken Language Technology Workshop (SLT), 141-146, 2014
Cited by 38 (2014)
Self-attention based prosodic boundary prediction for Chinese speech synthesis
C Lu, P Zhang, Y Yan
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by 31 (2019)
Attention-Based LSTM with Multi-Task Learning for Distant Speech Recognition.
Y Zhang, P Zhang, Y Yan
Interspeech, 3857-3861, 2017
Cited by 31 (2017)
Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling.
H Chen, P Zhang, H Bai, Q Yuan, X Bao, Y Yan
Interspeech, 3304-3308, 2018
Cited by 28 (2018)
Open source MagicData-RAMC: A rich annotated Mandarin conversational (RAMC) speech dataset
Z Yang, Y Chen, L Luo, R Yang, L Ye, G Cheng, J Xu, Y Jin, Q Zhang, ...
arXiv preprint arXiv:2203.16844, 2022
Cited by 26 (2022)
Improving CTC-based speech recognition via knowledge transferring from pre-trained language models
K Deng, S Cao, Y Zhang, L Ma, G Cheng, J Xu, P Zhang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 24 (2022)
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models
K Deng, Z Yang, S Watanabe, Y Higuchi, G Cheng, P Zhang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 24 (2022)
Multi-accent adaptation based on gate mechanism
H Zhu, L Wang, P Zhang, Y Yan
arXiv preprint arXiv:2011.02774, 2020
Cited by 22 (2020)
An audio scene classification framework with embedded filters and a DCT-based temporal module
H Chen, P Zhang, Y Yan
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by 20 (2019)
A fast fuzzy keyword spotting algorithm based on syllable confusion network
J Shao, Q Zhao, P Zhang, Z Liu, Y Yan
eps 2 (q1), q3, 2007
Cited by 20 (2007)
Pre-training transformer decoder for end-to-end ASR model with unpaired text data
C Gao, G Cheng, R Yang, H Zhu, P Zhang, Y Yan
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by 17 (2021)
Keyword spotting based on phoneme confusion matrix
P Zhang, J Shao, J Han, Z Liu, Y Yan
Proc. of ISCSLP 2, 408-419, 2006
Cited by 17 (2006)
Incorporating Cross-Speaker Style Transfer for Multi-Language Text-to-Speech.
Z Shang, Z Huang, H Zhang, P Zhang, Y Yan
Interspeech, 1619-1623, 2021
Cited by 15 (2021)