Ammar Abbas
Amazon Research, Cambridge, UK
A geometric approach to obtain a bird's eye view from an image
S Ammar Abbas, A Zisserman
Proceedings of the IEEE/CVF international conference on computer vision …, 2019
CAMP: a two-stage approach to modelling prosody in context
Z Hodari, A Moinet, S Karlapati, J Lorenzo-Trueba, T Merritt, A Joly, ...
ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021
Prosodic representation learning and contextual sampling for neural text-to-speech
S Karlapati, A Abbas, Z Hodari, A Moinet, A Joly, P Karanasou, ...
ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021
Simple and effective multi-sentence TTS with expressive and coherent prosody
P Makarov, A Abbas, M Łajszczak, A Joly, S Karlapati, A Moinet, ...
arXiv preprint arXiv:2206.14643, 2022
CopyCat2: A single model for multi-speaker TTS and many-to-many fine-grained prosody transfer
S Karlapati, P Karanasou, M Łajszczak, A Abbas, A Moinet, P Makarov, ...
arXiv preprint arXiv:2206.13443, 2022
Recovering Homography from Camera Captured Documents using Convolutional Neural Networks
SA Abbas, S Hussain
arXiv preprint arXiv:1709.03524, 2017
A learned conditional prior for the VAE acoustic space of a TTS system
P Karanasou, S Karlapati, A Moinet, A Joly, A Abbas, S Slangen, ...
arXiv preprint arXiv:2106.10229, 2021
Expressive, variable, and controllable duration modelling in TTS
A Abbas, T Merritt, A Moinet, S Karlapati, E Muszynska, S Slangen, E Gatti, ...
arXiv preprint arXiv:2206.14165, 2022
eCat: An end-to-end model for multi-speaker TTS & many-to-many fine-grained prosody transfer
A Abbas, S Karlapati, B Schnell, P Karanasou, MG Moya, A Nagaraj, ...
arXiv preprint arXiv:2306.11327, 2023
Controllable Emphasis with zero data for text-to-speech
A Joly, M Nicolis, E Peterova, A Lombardi, A Abbas, A van Korlaar, ...
12th Speech Synthesis Workshop (SSW) 2023, 2023
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
G Zhang, T Merritt, MS Ribeiro, B Tura-Vecino, K Yanagisawa, K Pokora, ...
arXiv preprint arXiv:2307.16679, 2023
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech
A Abbas, B Bollepalli, A Moinet, A Joly, P Karanasou, P Makarov, ...
arXiv preprint arXiv:2106.15649, 2021