NWPU-ASLP system for the VoicePrivacy 2022 Challenge J Yao, Q Wang, L Zhang, P Guo, Y Liang, L Xie arXiv preprint arXiv:2209.11969, 2022 | 11 | 2022 |
Expressive-VC: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 9 | 2023 |
PromptVC: Flexible stylistic voice conversion in latent space driven by natural language prompts J Yao, Y Yang, Y Lei, Z Ning, Y Hu, Y Pan, J Yin, H Zhou, H Lu, L Xie ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 7 | 2024 |
Exploring the power of cross-contextual large language model in mimic emotion prediction G Yi, Y Yang, Y Pan, Y Cao, J Yao, X Lv, C Fan, Z Lv, J Tao, S Liang, H Lu Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and …, 2023 | 7 | 2023 |
Distinguishable speaker anonymization based on formant and fundamental frequency scaling J Yao, Q Wang, Y Lei, P Guo, L Xie, N Wang, J Liu ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 7 | 2023 |
Preserving background sound in noise-robust voice conversion via multi-task learning J Yao, Y Lei, Q Wang, P Guo, Z Ning, L Xie, H Li, J Liu, D Xie ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 7 | 2023 |
GEmo-CLAP: Gender-attribute-enhanced contrastive language-audio pretraining for speech emotion recognition Y Pan, Y Hu, Y Yang, J Yao, W Fei, L Ma, H Lu arXiv preprint arXiv:2306.07848, 2023 | 6 | 2023 |
DualVC: Dual-mode voice conversion using intra-model knowledge distillation and hybrid predictive coding Z Ning, Y Jiang, P Zhu, J Yao, S Wang, L Xie, M Bi arXiv preprint arXiv:2305.12425, 2023 | 4 | 2023 |
The NPU-ASLP System for Deepfake Algorithm Recognition in ADD 2023 Challenge Z Wang, Q Wang, J Yao, L Xie DADA@IJCAI, 64-69, 2023 | 3 | 2023 |
DualVC 2: Dynamic masked convolution for unified streaming and non-streaming voice conversion Z Ning, Y Jiang, P Zhu, S Wang, J Yao, L Xie, M Bi ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 2 | 2024 |
UniSyn: an end-to-end unified model for text-to-speech and singing voice synthesis Y Lei, S Yang, X Wang, Q Xie, J Yao, L Xie, D Su Proceedings of the AAAI Conference on Artificial Intelligence 37 (11), 13025 …, 2023 | 2 | 2023 |
GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Accurate Speech Emotion Recognition Y Pan, Y Hu, Y Yang, W Fei, J Yao, H Lu, L Ma, J Zhao ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation Y Lv, J Yao, P Chen, H Zhou, H Lu, L Xie 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 1 | 2023 |
Timbre-reserved Adversarial Attack in Speaker Identification Q Wang, J Yao, L Zhang, P Guo, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 1 | 2023 |
DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion Z Ning, S Wang, P Zhu, Z Wang, J Yao, L Xie, M Bi arXiv preprint arXiv:2406.07846, 2024 | | 2024 |
Distinctive and Natural Speaker Anonymization via Singular Value Transformation-Assisted Matrix J Yao, Q Wang, P Guo, Z Ning, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 2944-2956, 2024 | | 2024 |
Towards Out-of-Distribution Detection in Vocoder Recognition via Latent Feature Reconstruction R Du, J Yao, Q Kong, Y Cao arXiv preprint arXiv:2406.02233, 2024 | | 2024 |
Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification Q Wang, J Yao, Z Wang, P Guo, L Xie arXiv preprint arXiv:2305.19020, 2023 | | 2023 |
A Reward Shaping Method based on Meta-LSTM for Continuous Control of Robot J Yao, X Li, D Huang Proceedings of the 2020 4th International Conference on Computer Science and …, 2020 | | 2020 |