Towards accountability for machine learning datasets: Practices from software engineering and infrastructure B Hutchinson, A Smart, A Hanna, E Denton, C Greer, O Kjartansson, ... Proceedings of the 2021 ACM Conference on Fairness, Accountability, and …, 2021 | 337 | 2021 |
Data cards: Purposeful and transparent dataset documentation for responsible ai M Pushkarna, A Zaldivar, O Kjartansson Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022 | 184 | 2022 |
Open-source multi-speaker speech corpora for building Gujarati, Kannada, Malayalam, Marathi, Tamil and Telugu speech synthesis systems F He, SHC Chu, O Kjartansson, C Rivera, A Katanova, A Gutkin, ... Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020 | 87 | 2020 |
Open-source multi-speaker corpora of the english accents in the british isles I Demirsahin, O Kjartansson, A Gutkin, C Rivera Proceedings of the twelfth language resources and evaluation conference …, 2020 | 78 | 2020 |
Crowd-Sourced Speech Corpora for Javanese, Sundanese, Sinhala, Nepali, and Bangladeshi Bengali. O Kjartansson, S Sarin, K Pipatsrisawat, M Jansche, L Ha SLTU, 52-55, 2018 | 78 | 2018 |
A Step-by-Step Process for Building TTS Voices Using Open Source Data and Frameworks for Bangla, Javanese, Khmer, Nepali, Sinhala, and Sundanese. K Sodimana, P De Silva, S Sarin, O Kjartansson, M Jansche, ... SLTU, 66-70, 2018 | 46 | 2018 |
Crowdsourcing Latin American Spanish for low-resource text-to-speech A Guevara-Rukoz, I Demirsahin, F He, SHC Chu, S Sarin, K Pipatsrisawat, ... Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020 | 44 | 2020 |
Rapid development of TTS corpora for four South African languages DR Van Niekerk, C van Heerden, N Kleynhans, O Kjartansson, ... Interspeech 2017, 2017 | 40 | 2017 |
Developing an open-source corpus of yoruba speech A Gutkin, I Demirsahin, O Kjartansson, CE Rivera, K Túbòsún | 34 | 2020 |
Open-source high quality speech datasets for Basque, Catalan and Galician O Kjartansson, A Gutkin, A Butryna, I Demirsahin, C Rivera Proceedings of the 1st Joint Workshop on Spoken Language Technologies for …, 2020 | 33 | 2020 |
Almannaromur: An open icelandic speech corpus J Guðnason, O Kjartansson, J Jóhannsson, E Carstensdóttir, ... Spoken Language Technologies for Under-Resourced Languages, 2012 | 26 | 2012 |
Burmese speech corpus, finite-state text normalization and pronunciation grammars with an application to text-to-speech YM Oo, T Wattanavekin, C Li, P De Silva, S Sarin, K Pipatsrisawat, ... Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020 | 20 | 2020 |
Building open Javanese and Sundanese corpora for multilingual text-to-speech JAE Wibawa, S Sarin, C Li, K Pipatsrisawat, K Sodimana, O Kjartansson, ... Proceedings of the Eleventh International Conference on Language Resources …, 2018 | 15 | 2018 |
Building statistical parametric multi-speaker synthesis for bangladeshi bangla A Gutkin, L Ha, M Jansche, O Kjartansson, K Pipatsrisawat, R Sproat Procedia Computer Science 81, 194-200, 2016 | 15 | 2016 |
Google crowdsourced speech corpora and related open-source resources for low-resource languages and dialects: an overview A Butryna, SHC Chu, I Demirsahin, A Gutkin, L Ha, F He, M Jansche, ... arXiv preprint arXiv:2010.06778, 2020 | 14 | 2020 |
Málrómur J Guðnason, O Kjartansson, J Jóhannsson, E Carstensdóttir, ... The Árni Magnússon Institute for Icelandic Studies, 2014 | | 2014 |