Follow
Jixun Yao
Title
Cited by
Cited by
Year
Promptvc: Flexible stylistic voice conversion in latent space driven by natural language prompts
J Yao, Y Yang, Y Lei, Z Ning, Y Hu, Y Pan, J Yin, H Zhou, H Lu, L Xie
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
262024
Expressive-vc: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features
Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
202023
NWPU-ASLP system for the voiceprivacy 2022 challenge
J Yao, Q Wang, L Zhang, P Guo, Y Liang, L Xie
arXiv preprint arXiv:2209.11969, 2022
182022
GEmo-CLAP: Gender-attribute-enhanced contrastive language-audio pretraining for accurate speech emotion recognition
Y Pan, Y Hu, Y Yang, W Fei, J Yao, H Lu, L Ma, J Zhao
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
16*2024
Distinguishable speaker anonymization based on formant and fundamental frequency scaling
J Yao, Q Wang, Y Lei, P Guo, L Xie, N Wang, J Liu
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
142023
Dualvc: Dual-mode voice conversion using intra-model knowledge distillation and hybrid predictive coding
Z Ning, Y Jiang, P Zhu, J Yao, S Wang, L Xie, M Bi
arXiv preprint arXiv:2305.12425, 2023
142023
Preserving background sound in noise-robust voice conversion via multi-task learning
J Yao, Y Lei, Q Wang, P Guo, Z Ning, L Xie, H Li, J Liu, D Xie
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
122023
Exploring the power of cross-contextual large language model in mimic emotion prediction
G Yi, Y Yang, Y Pan, Y Cao, J Yao, X Lv, C Fan, Z Lv, J Tao, S Liang, H Lu
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and …, 2023
102023
Unisyn: an end-to-end unified model for text-to-speech and singing voice synthesis
Y Lei, S Yang, X Wang, Q Xie, J Yao, L Xie, D Su
Proceedings of the AAAI Conference on Artificial Intelligence 37 (11), 13025 …, 2023
92023
Dualvc 2: Dynamic masked convolution for unified streaming and non-streaming voice conversion
Z Ning, Y Jiang, P Zhu, S Wang, J Yao, L Xie, M Bi
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
72024
Salt: Distinguishable speaker anonymization through latent space transformation
Y Lv, J Yao, P Chen, H Zhou, H Lu, L Xie
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
72023
The NPU-ASLP System for Deepfake Algorithm Recognition in ADD 2023 Challenge.
Z Wang, Q Wang, J Yao, L Xie
DADA@ IJCAI, 64-69, 2023
72023
Takin: A cohort of superior quality zero-shot speech generation models
S Chen, Y Feng, L He, T He, W He, Y Hu, B Lin, Y Lin, Y Pan, P Tan, ...
arXiv preprint arXiv:2409.12139, 2024
62024
MUSA: Multi-lingual speaker anonymization via serial disentanglement
J Yao, Q Wang, P Guo, Z Ning, Y Yang, Y Pan, L Xie
arXiv preprint arXiv:2407.11629, 2024
52024
Distinctive and natural speaker anonymization via singular value transformation-assisted matrix
J Yao, Q Wang, P Guo, Z Ning, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 2944-2956, 2024
52024
Msac: Multiple speech attribute control method for reliable speech emotion recognition
Y Pan, Y Yang, Y Huang, J Yao, J Yin, Y Hu, H Lu, L Ma, J Zhao
arXiv preprint arXiv:2308.04025, 2023
52023
Timbre-reserved adversarial attack in speaker identification
Q Wang, J Yao, L Zhang, P Guo, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 3848-3858, 2023
42023
Takin-vc: Zero-shot voice conversion via jointly hybrid content and memory-augmented context-aware timbre modeling
Y Yang, Y Pan, J Yao, X Zhang, J Ye, H Zhou, L Xie, L Ma, J Zhao
arXiv preprint arXiv:2410.01350, 2024
32024
Dualvc 3: Leveraging language model generated pseudo context for end-to-end low latency streaming voice conversion
Z Ning, S Wang, P Zhu, Z Wang, J Yao, L Xie, M Bi
arXiv preprint arXiv:2406.07846, 2024
32024
Stablevc: Style controllable zero-shot voice conversion with conditional flow matching
J Yao, Y Yang, Y Pan, Z Ning, J Ye, H Zhou, L Xie
arXiv preprint arXiv:2412.04724, 2024
22024
The system can't perform the operation now. Try again later.
Articles 1–20