Follow
Jixun Yao
Title
Cited by
Cited by
Year
Promptvc: Flexible stylistic voice conversion in latent space driven by natural language prompts
J Yao, Y Yang, Y Lei, Z Ning, Y Hu, Y Pan, J Yin, H Zhou, H Lu, L Xie
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
152024
NWPU-ASLP system for the voiceprivacy 2022 challenge
J Yao, Q Wang, L Zhang, P Guo, Y Liang, L Xie
arXiv preprint arXiv:2209.11969, 2022
152022
Expressive-vc: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features
Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
122023
Distinguishable speaker anonymization based on formant and fundamental frequency scaling
J Yao, Q Wang, Y Lei, P Guo, L Xie, N Wang, J Liu
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
112023
Dualvc: Dual-mode voice conversion using intra-model knowledge distillation and hybrid predictive coding
Z Ning, Y Jiang, P Zhu, J Yao, S Wang, L Xie, M Bi
arXiv preprint arXiv:2305.12425, 2023
102023
Exploring the power of cross-contextual large language model in mimic emotion prediction
G Yi, Y Yang, Y Pan, Y Cao, J Yao, X Lv, C Fan, Z Lv, J Tao, S Liang, H Lu
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and …, 2023
82023
UniSyn: an end-to-end unified model for text-to-speech and singing voice synthesis
Y Lei, S Yang, X Wang, Q Xie, J Yao, L Xie, D Su
Proceedings of the AAAI Conference on Artificial Intelligence 37 (11), 13025 …, 2023
82023
Gemo-clap: Gender-attribute-enhanced contrastive language-audio pretraining for speech emotion recognition
Y Pan, Y Hu, Y Yang, J Yao, W Fei, L Ma, H Lu
arXiv preprint arXiv:2306.07848, 2023
72023
Preserving background sound in noise-robust voice conversion via multi-task learning
J Yao, Y Lei, Q Wang, P Guo, Z Ning, L Xie, H Li, J Liu, D Xie
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
72023
GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Accurate Speech Emotion Recognition
Y Pan, Y Hu, Y Yang, W Fei, J Yao, H Lu, L Ma, J Zhao
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
62024
The NPU-ASLP System for Deepfake Algorithm Recognition in ADD 2023 Challenge.
Z Wang, Q Wang, J Yao, L Xie
DADA@ IJCAI, 64-69, 2023
62023
Salt: Distinguishable Speaker Anonymization Through Latent Space Transformation
Y Lv, J Yao, P Chen, H Zhou, H Lu, L Xie
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
52023
Musa: Multi-lingual speaker anonymization via serial disentanglement
J Yao, Q Wang, P Guo, Z Ning, Y Yang, Y Pan, L Xie
arXiv preprint arXiv:2407.11629, 2024
42024
Distinctive and Natural Speaker Anonymization via Singular Value Transformation-Assisted Matrix
J Yao, Q Wang, P Guo, Z Ning, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 2944-2956, 2024
42024
Dualvc 2: Dynamic masked convolution for unified streaming and non-streaming voice conversion
Z Ning, Y Jiang, P Zhu, S Wang, J Yao, L Xie, M Bi
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
42024
Timbre-reserved Adversarial Attack in Speaker Identification
Q Wang, J Yao, L Zhang, P Guo, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
22023
High Quality and Similarity One-Shot Voice Conversion Using End-to-End Model
R Du, J Yao
Proceedings of the 2022 6th International Conference on Computer Science and …, 2022
22022
NTU-NPU System for Voice Privacy 2024 Challenge
N Kuzmin, HT Luong, J Yao, L Xie, KA Lee, ES Chng
arXiv preprint arXiv:2410.02371, 2024
12024
Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling
Y Yang, Y Pan, J Yao, X Zhang, J Ye, H Zhou, L Xie, L Ma, J Zhao
arXiv preprint arXiv:2410.01350, 2024
12024
Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models
S Chen, Y Feng, L He, T He, W He, Y Hu, B Lin, Y Lin, Y Pan, P Tan, ...
arXiv preprint arXiv:2409.12139, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–20