Universal neural vocoding with parallel wavenet Y Jiao, A Gabryś, G Tinchev, B Putrycz, D Korzekwa, V Klimkov ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 57 | 2021 |
Interpretable deep learning model for the detection and reconstruction of dysarthric speech D Korzekwa, R Barra-Chicote, B Kostek, T Drugman, M Lajszczak Interspeech 2019, 2019 | 41 | 2019 |
Computer-assisted pronunciation training—Speech synthesis is almost all you need D Korzekwa, J Lorenzo-Trueba, T Drugman, B Kostek Speech Communication 142, 22-33, 2022 | 39 | 2022 |
Non-autoregressive TTS with explicit duration modelling for low-resource highly expressive speech R Shah, K Pokora, A Ezzerg, V Klimkov, G Huybrechts, B Putrycz, ... arXiv preprint arXiv:2106.12896, 2021 | 31 | 2021 |
LLM pruning and distillation in practice: The Minitron approach ST Sreenivas, S Muralidharan, R Joshi, M Chochowski, ... arXiv preprint arXiv:2408.11796, 2024 | 25 | 2024 |
Text-free non-parallel many-to-many voice conversion using normalising flow T Merritt, A Ezzerg, P Biliński, M Proszewska, K Pokora, R Barra-Chicote, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 22 | 2022 |
Comprehensive evaluation of statistical speech waveform synthesis T Merritt, B Putrycz, A Nadolski, T Ye, D Korzekwa, W Dolecki, T Drugman, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 325-331, 2018 | 22 | 2018 |
Creating New Voices using Normalizing Flows P Bilinski, T Merritt, A Ezzerg, K Pokora, S Cygert, K Yanagisawa, ... Interspeech 2022, 2022 | 21 | 2022 |
Weakly-supervised word-level pronunciation error detection in non-native English speech D Korzekwa, J Lorenzo-Trueba, T Drugman, S Calamaro, B Kostek arXiv preprint arXiv:2106.03494, 2021 | 21 | 2021 |
Mispronunciation detection in non-native (L2) English with uncertainty modeling D Korzekwa, J Lorenzo-Trueba, S Zaporowski, S Calamaro, T Drugman, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 20 | 2021 |
Detection of lexical stress errors in non-native (L2) English with data augmentation and attention D Korzekwa, R Barra-Chicote, S Zaporowski, G Beringer, ... arXiv preprint arXiv:2012.14788, 2020 | 14 | 2020 |
Varying speaking styles with neural text-to-speech T Wood, T Merritt Alexa Blogs, Nov 19, 2018 | 12 | 2018 |
L2-GEN: A Neural Phoneme Paraphrasing Approach to L2 Speech Synthesis for Mispronunciation Diagnosis DY Zhang, A Ganesan, S Campbell, D Korzekwa Interspeech 2022, 2022 | 10 | 2022 |
Enhancing audio quality for expressive neural text-to-speech A Ezzerg, A Gabrys, B Putrycz, D Korzekwa, D Saez-Trigueros, ... arXiv preprint arXiv:2108.06270, 2021 | 9 | 2021 |
Text-to-speech (TTS) processing AF Nadolski, D Korzekwa, TE Merritt, M Nicolis, B Putrycz, RB Chicote, ... US Patent 10,699,695, 2020 | 6 | 2020 |
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech G Zhang, T Merritt, MS Ribeiro, B Tura-Vecino, K Yanagisawa, K Pokora, ... arXiv preprint arXiv:2307.16679, 2023 | 5 | 2023 |
AE-Flow: Autoencoder Normalizing Flow J Mosiński, P Biliński, T Merritt, A Ezzerg, D Korzekwa ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 5 | 2023 |
On granularity of prosodic representations in expressive text-to-speech M Babiański, K Pokora, R Shah, R Sienkiewicz, D Korzekwa, V Klimkov 2022 IEEE Spoken Language Technology Workshop (SLT), 892-899, 2023 | 5 | 2023 |
Remap, warp and attend: Non-parallel many-to-many accent conversion with normalizing flows A Ezzerg, T Merritt, K Yanagisawa, P Bilinski, M Proszewska, K Pokora, ... 2022 IEEE Spoken Language Technology Workshop (SLT), 984-990, 2023 | 4 | 2023 |
Constructing a dataset of speech recordings with lombard effect D Weber, S Zaporowski, D Korzekwa 2020 Signal Processing: Algorithms, Architectures, Arrangements, and …, 2020 | 4 | 2020 |