Jixun Yao

Cited by

	All	Since 2020
Citations	208	208
h-index	9	9
i10-index	8	8

Citations per year: 2022: 1, 2023: 24, 2024: 146, 2025: 37

Public access

2 articles available
0 articles not available

Based on funding mandates

Co-authors

Lei Xie, Northwestern Polytechnical University, verified email at nwpu.edu.cn
Ziqian Ning, Northwestern Polytechnical University, verified email at mail.nwpu.edu.cn
Yu Pan, Kyushu University, verified email at s.kyushu-u.ac.jp
Yuguang Yang, Ximalaya Inc., verified email at connect.hku.hk
Pengcheng Guo, Northwestern Polytechnical University, verified email at nwpu-aslp.org
Qing Wang, Northwestern Polytechnical University, verified email at nwpu-aslp.org
Yi Lei, verified email at nwpu.edu.cn

Jixun Yao

Northwestern Polytechnical University

Verified email at mail.nwpu.edu.cn

Voice Conversion, Speech Synthesis


Title	Cited by	Year
Promptvc: Flexible stylistic voice conversion in latent space driven by natural language prompts. J Yao, Y Yang, Y Lei, Z Ning, Y Hu, Y Pan, J Yin, H Zhou, H Lu, L Xie. ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	26	2024
Expressive-vc: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features. Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi. ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	20	2023
NWPU-ASLP system for the voiceprivacy 2022 challenge. J Yao, Q Wang, L Zhang, P Guo, Y Liang, L Xie. arXiv preprint arXiv:2209.11969, 2022	18	2022
GEmo-CLAP: Gender-attribute-enhanced contrastive language-audio pretraining for accurate speech emotion recognition. Y Pan, Y Hu, Y Yang, W Fei, J Yao, H Lu, L Ma, J Zhao. ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	16*	2024
Distinguishable speaker anonymization based on formant and fundamental frequency scaling. J Yao, Q Wang, Y Lei, P Guo, L Xie, N Wang, J Liu. ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	14	2023
Dualvc: Dual-mode voice conversion using intra-model knowledge distillation and hybrid predictive coding. Z Ning, Y Jiang, P Zhu, J Yao, S Wang, L Xie, M Bi. arXiv preprint arXiv:2305.12425, 2023	14	2023
Preserving background sound in noise-robust voice conversion via multi-task learning. J Yao, Y Lei, Q Wang, P Guo, Z Ning, L Xie, H Li, J Liu, D Xie. ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	12	2023
Exploring the power of cross-contextual large language model in mimic emotion prediction. G Yi, Y Yang, Y Pan, Y Cao, J Yao, X Lv, C Fan, Z Lv, J Tao, S Liang, H Lu. Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and …, 2023	10	2023
Unisyn: an end-to-end unified model for text-to-speech and singing voice synthesis. Y Lei, S Yang, X Wang, Q Xie, J Yao, L Xie, D Su. Proceedings of the AAAI Conference on Artificial Intelligence 37 (11), 13025 …, 2023	9	2023
Dualvc 2: Dynamic masked convolution for unified streaming and non-streaming voice conversion. Z Ning, Y Jiang, P Zhu, S Wang, J Yao, L Xie, M Bi. ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	7	2024
Salt: Distinguishable speaker anonymization through latent space transformation. Y Lv, J Yao, P Chen, H Zhou, H Lu, L Xie. 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	7	2023
The NPU-ASLP System for Deepfake Algorithm Recognition in ADD 2023 Challenge. Z Wang, Q Wang, J Yao, L Xie. DADA@ IJCAI, 64-69, 2023	7	2023
Takin: A cohort of superior quality zero-shot speech generation models. S Chen, Y Feng, L He, T He, W He, Y Hu, B Lin, Y Lin, Y Pan, P Tan, ... arXiv preprint arXiv:2409.12139, 2024	6	2024
MUSA: Multi-lingual speaker anonymization via serial disentanglement. J Yao, Q Wang, P Guo, Z Ning, Y Yang, Y Pan, L Xie. arXiv preprint arXiv:2407.11629, 2024	5	2024
Distinctive and natural speaker anonymization via singular value transformation-assisted matrix. J Yao, Q Wang, P Guo, Z Ning, L Xie. IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 2944-2956, 2024	5	2024
Msac: Multiple speech attribute control method for reliable speech emotion recognition. Y Pan, Y Yang, Y Huang, J Yao, J Yin, Y Hu, H Lu, L Ma, J Zhao. arXiv preprint arXiv:2308.04025, 2023	5	2023
Timbre-reserved adversarial attack in speaker identification. Q Wang, J Yao, L Zhang, P Guo, L Xie. IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 3848-3858, 2023	4	2023
Takin-vc: Zero-shot voice conversion via jointly hybrid content and memory-augmented context-aware timbre modeling. Y Yang, Y Pan, J Yao, X Zhang, J Ye, H Zhou, L Xie, L Ma, J Zhao. arXiv preprint arXiv:2410.01350, 2024	3	2024
Dualvc 3: Leveraging language model generated pseudo context for end-to-end low latency streaming voice conversion. Z Ning, S Wang, P Zhu, Z Wang, J Yao, L Xie, M Bi. arXiv preprint arXiv:2406.07846, 2024	3	2024
Stablevc: Style controllable zero-shot voice conversion with conditional flow matching. J Yao, Y Yang, Y Pan, Z Ning, J Ye, H Zhou, L Xie. arXiv preprint arXiv:2412.04724, 2024	2	2024

Articles 1–20
