VoxCeleb: a large-scale speaker identification dataset A Nagrani, JS Chung, A Zisserman Interspeech, 2017 | 787 | 2017 |
VoxCeleb2: Deep Speaker Recognition JS Chung, A Nagrani, A Zisserman Interspeech, 2018 | 638 | 2018 |
Lip reading sentences in the wild JS Chung, A Senior, O Vinyals, A Zisserman 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 3444 …, 2017 | 423 | 2017 |
Lip reading in the wild JS Chung, A Zisserman Asian Conference on Computer Vision, 87-103, 2016 | 278 | 2016 |
Deep audio-visual speech recognition T Afouras, JS Chung, A Senior, O Vinyals, A Zisserman IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018 | 197 | 2018 |
Out of time: automated lip sync in the wild JS Chung, A Zisserman Asian conference on computer vision, 251-263, 2016 | 175 | 2016 |
Utterance-level Aggregation For Speaker Recognition In The Wild W Xie, A Nagrani, JS Chung, A Zisserman ICASSP, 2019 | 151 | 2019 |
The Conversation: Deep Audio-Visual Speech Enhancement T Afouras, JS Chung, A Zisserman Interspeech, 2018 | 151 | 2018 |
VoxCeleb: Large-scale Speaker Verification in the Wild A Nagrani, JS Chung, W Xie, A Zisserman Computer Speech & Language, 101027, 2019 | 110 | 2019 |
You said that? JS Chung, A Jamaludin, A Zisserman BMVC, 2017 | 84 | 2017 |
Lip Reading in Profile JS Chung, A Zisserman BMVC, 2017 | 71 | 2017 |
LRS3-TED: a large-scale dataset for visual speech recognition T Afouras, JS Chung, A Zisserman arXiv preprint arXiv:1809.00496, 2018 | 58 | 2018 |
In defence of metric learning for speaker recognition JS Chung, J Huh, S Mun, M Lee, HS Heo, S Choe, C Ham, S Jung, ... Interspeech, 2020 | 56 | 2020 |
Deep Lip Reading: a comparison of models and an online application T Afouras, JS Chung, A Zisserman Interspeech, 2018 | 47 | 2018 |
Perfect match: Improved cross-modal embeddings for audio-visual synchronisation SW Chung, JS Chung, HG Kang ICASSP, 2019 | 39 | 2019 |
Learning to lip read words by watching videos JS Chung, A Zisserman Computer Vision and Image Understanding 173, 76-85, 2018 | 39 | 2018 |
You said that?: Synthesising talking faces from audio A Jamaludin, JS Chung, A Zisserman International Journal of Computer Vision 127 (11), 1767-1779, 2019 | 32 | 2019 |
Disentangled Speech Embeddings using Cross-modal Self-supervision A Nagrani, JS Chung, S Albanie, A Zisserman ICASSP, 2020 | 26 | 2020 |
My lips are concealed: Audio-visual speech enhancement through obstructions T Afouras, JS Chung, A Zisserman Interspeech, 2019 | 25 | 2019 |
Voxsrc 2019: The first voxceleb speaker recognition challenge JS Chung, A Nagrani, E Coto, W Xie, M McLaren, DA Reynolds, ... arXiv preprint arXiv:1912.02522, 2019 | 22 | 2019 |