Ye Jia

Cited by

	All	Since 2019
Citations	5376	5313
h-index	21	21
i10-index	26	26

1600

800

400

1200

201820192020202120222023202445 301 645 1025 1355 1566 409

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Yonghui WuGoogle BrainVerified email at google.com
Yu ZhangOpenAIVerified email at csail.mit.edu
Ron J WeissGoogleVerified email at google.com
Jonathan ShenGoogleVerified email at google.com
Heiga ZenPrincipal Scientist (Director), Google DeepMindVerified email at google.com
Zhifeng ChenGoogle Inc.Verified email at google.com
Quan WangSenior Staff Software Engineer @ Google; Instructor @ Udemy; Textbook Author; IEEE Senior MemberVerified email at google.com
Ignacio Lopez MorenoGoogle IncVerified email at google.com
Patrick NguyenResearch Scientist, Google, Inc.Verified email at google.com
Melvin JohnsonResearcher, GoogleVerified email at stanford.edu
RJ Skerry-RyanGoogle, Inc.Verified email at alum.mit.edu
Rob ClarkGoogleVerified email at google.com
Yuxuan WangByteDanceVerified email at cse.ohio-state.edu
Bhuvana RamabhadranManager, GoogleVerified email at google.com
Andrew RosenbergGoogleVerified email at google.com
Yuan CaoGoogle DeepMindVerified email at google.com
Ankur BapnaSoftware Engineer, Google DeepmindVerified email at google.com
Chung-Cheng ChiuAppleVerified email at apple.com
Michelle Tadmor (Ramanovich)GoogleVerified email at google.com
Wolfgang MachereyGoogle ResearchVerified email at google.com

Ye Jia

Meta

Verified email at google.com - Homepage

Speech synthesis Speech translation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis Y Jia, Y Zhang, RJ Weiss, Q Wang, J Shen, F Ren, Z Chen, P Nguyen, ... Advances in Neural Information Processing Systems, 2018	883	2018
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis Y Wang, D Stanton, Y Zhang, RJS Ryan, E Battenberg, J Shor, Y Xiao, ... International conference on machine learning, 5180-5189, 2018	882	2018
Libritts: A corpus derived from librispeech for text-to-speech H Zen, V Dang, R Clark, Y Zhang, RJ Weiss, Y Jia, Z Chen, Y Wu arXiv preprint arXiv:1904.02882, 2019	701	2019
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ... arXiv preprint arXiv:1810.04826, 2018	392	2018
ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ... Computer Speech & Language, 101114, 2020	303	2020
Hierarchical generative modeling for controllable speech synthesis WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ... arXiv preprint arXiv:1810.07217, 2018	266	2018
Improved noisy student training for automatic speech recognition DS Park, Y Zhang, Y Jia, W Han, CC Chiu, B Li, Y Wu, QV Le arXiv preprint arXiv:2005.09629, 2020	235	2020
Direct speech-to-speech translation with a sequence-to-sequence model Y Jia, RJ Weiss, F Biadsy, W Macherey, M Johnson, Z Chen, Y Wu Proc. Interspeech 2019, 1123--1127, 2019	210	2019
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019	199	2019
Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice cloning Y Zhang, RJ Weiss, H Zen, Y Wu, Z Chen, RJ Skerry-Ryan, Y Jia, ... arXiv preprint arXiv:1907.04448, 2019	178	2019
Leveraging weakly supervised data to improve end-to-end speech-to-text translation Y Jia, M Johnson, W Macherey, RJ Weiss, Y Cao, CC Chiu, N Ari, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	171	2019
Parrotron: An end-to-end speech-to-speech conversion model and its applications to hearing-impaired speech and speech separation F Biadsy, RJ Weiss, PJ Moreno, D Kanevsky, Y Jia arXiv preprint arXiv:1904.04169, 2019	121	2019
Speech recognition with augmented synthesized speech A Rosenberg, Y Zhang, B Ramabhadran, Y Jia, P Moreno, Y Wu, Z Wu 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019	118	2019
Parallel tacotron: Non-autoregressive and controllable tts I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	113	2021
mslam: Massively multilingual joint pre-training for speech and text A Bapna, C Cherry, Y Zhang, Y Jia, M Johnson, Y Cheng, S Khanuja, ... arXiv preprint arXiv:2202.01374, 2022	90	2022
Non-attentive tacotron: Robust and controllable neural tts synthesis including unsupervised duration modeling J Shen, Y Jia, M Chrzanowski, Y Zhang, I Elias, H Zen, Y Wu arXiv preprint arXiv:2010.04301, 2020	88	2020
PnG BERT: Augmented BERT on phonemes and graphemes for neural TTS Y Jia, H Zen, J Shen, Y Zhang, Y Wu Proc. Interspeech 2021, 151--155, 2021	77	2021
Translatotron 2: High-quality direct speech-to-speech translation with voice preservation Y Jia, MT Ramanovich, T Remez, R Pomerantz International Conference on Machine Learning, 10120-10134, 2022	71*	2022
SLAM: A unified encoder for speech and language modeling via speech-text joint pre-training A Bapna, Y Chung, N Wu, A Gulati, Y Jia, JH Clark, M Johnson, J Riesa, ... arXiv preprint arXiv:2110.10329, 2021	71	2021
Parallel Tacotron 2: A non-autoregressive neural TTS model with differentiable duration modeling I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Skerry-Ryan, Y Wu arXiv preprint arXiv:2103.14574, 2021	58	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors