Promptvc: Flexible stylistic voice conversion in latent space driven by natural language prompts J Yao, Y Yang, Y Lei, Z Ning, Y Hu, Y Pan, J Yin, H Zhou, H Lu, L Xie ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 15 | 2024 |
NWPU-ASLP system for the voiceprivacy 2022 challenge J Yao, Q Wang, L Zhang, P Guo, Y Liang, L Xie arXiv preprint arXiv:2209.11969, 2022 | 15 | 2022 |
Expressive-vc: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 12 | 2023 |
Distinguishable speaker anonymization based on formant and fundamental frequency scaling J Yao, Q Wang, Y Lei, P Guo, L Xie, N Wang, J Liu ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 11 | 2023 |
Dualvc: Dual-mode voice conversion using intra-model knowledge distillation and hybrid predictive coding Z Ning, Y Jiang, P Zhu, J Yao, S Wang, L Xie, M Bi arXiv preprint arXiv:2305.12425, 2023 | 10 | 2023 |
Exploring the power of cross-contextual large language model in mimic emotion prediction G Yi, Y Yang, Y Pan, Y Cao, J Yao, X Lv, C Fan, Z Lv, J Tao, S Liang, H Lu Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and …, 2023 | 8 | 2023 |
UniSyn: an end-to-end unified model for text-to-speech and singing voice synthesis Y Lei, S Yang, X Wang, Q Xie, J Yao, L Xie, D Su Proceedings of the AAAI Conference on Artificial Intelligence 37 (11), 13025 …, 2023 | 8 | 2023 |
Gemo-clap: Gender-attribute-enhanced contrastive language-audio pretraining for speech emotion recognition Y Pan, Y Hu, Y Yang, J Yao, W Fei, L Ma, H Lu arXiv preprint arXiv:2306.07848, 2023 | 7 | 2023 |
Preserving background sound in noise-robust voice conversion via multi-task learning J Yao, Y Lei, Q Wang, P Guo, Z Ning, L Xie, H Li, J Liu, D Xie ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 7 | 2023 |
GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Accurate Speech Emotion Recognition Y Pan, Y Hu, Y Yang, W Fei, J Yao, H Lu, L Ma, J Zhao ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 6 | 2024 |
The NPU-ASLP System for Deepfake Algorithm Recognition in ADD 2023 Challenge Z Wang, Q Wang, J Yao, L Xie DADA@IJCAI, 64-69, 2023 | 6 | 2023 |
Salt: Distinguishable Speaker Anonymization Through Latent Space Transformation Y Lv, J Yao, P Chen, H Zhou, H Lu, L Xie 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 5 | 2023 |
Musa: Multi-lingual speaker anonymization via serial disentanglement J Yao, Q Wang, P Guo, Z Ning, Y Yang, Y Pan, L Xie arXiv preprint arXiv:2407.11629, 2024 | 4 | 2024 |
Distinctive and Natural Speaker Anonymization via Singular Value Transformation-Assisted Matrix J Yao, Q Wang, P Guo, Z Ning, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 2944-2956, 2024 | 4 | 2024 |
Dualvc 2: Dynamic masked convolution for unified streaming and non-streaming voice conversion Z Ning, Y Jiang, P Zhu, S Wang, J Yao, L Xie, M Bi ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
Timbre-reserved Adversarial Attack in Speaker Identification Q Wang, J Yao, L Zhang, P Guo, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 2 | 2023 |
High Quality and Similarity One-Shot Voice Conversion Using End-to-End Model R Du, J Yao Proceedings of the 2022 6th International Conference on Computer Science and …, 2022 | 2 | 2022 |
NTU-NPU System for Voice Privacy 2024 Challenge N Kuzmin, HT Luong, J Yao, L Xie, KA Lee, ES Chng arXiv preprint arXiv:2410.02371, 2024 | 1 | 2024 |
Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling Y Yang, Y Pan, J Yao, X Zhang, J Ye, H Zhou, L Xie, L Ma, J Zhao arXiv preprint arXiv:2410.01350, 2024 | 1 | 2024 |
Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models S Chen, Y Feng, L He, T He, W He, Y Hu, B Lin, Y Lin, Y Pan, P Tan, ... arXiv preprint arXiv:2409.12139, 2024 | 1 | 2024 |