Segui
Mike Seltzer
Titolo
Citata da
Citata da
Anno
A study on data augmentation of reverberant speech for robust speech recognition
T Ko, V Peddinti, D Povey, ML Seltzer, S Khudanpur
2017 IEEE international conference on acoustics, speech and signal …, 2017
11332017
Recent advances in deep learning for speech research at Microsoft
L Deng, J Li, JT Huang, K Yao, D Yu, F Seide, M Seltzer, G Zweig, X He, ...
2013 IEEE international conference on acoustics, speech and signal …, 2013
10592013
The Microsoft 2017 conversational speech recognition system
W Xiong, L Wu, F Alleva, J Droppo, X Huang, A Stolcke
2018 IEEE international conference on acoustics, speech and signal …, 2018
9732018
An investigation of deep neural networks for noise robust speech recognition
ML Seltzer, D Yu, Y Wang
2013 IEEE international conference on acoustics, speech and signal …, 2013
8072013
The llama 3 herd of models
A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ...
arXiv preprint arXiv:2407.21783, 2024
7742024
Achieving human parity in conversational speech recognition
W Xiong, J Droppo, X Huang, F Seide, M Seltzer, A Stolcke, D Yu, ...
arXiv preprint arXiv:1610.05256, 2016
7222016
Binary coding of speech spectrograms using a deep auto-encoder
L Deng, ML Seltzer, D Yu, A Acero, A Mohamed, G Hinton
Eleventh annual conference of the international speech communication association, 2010
4982010
An introduction to computational networks and the computational network toolkit
D Yu, A Eversole, M Seltzer, K Yao, Z Huang, B Guenter, O Kuchaiev, ...
Microsoft Technical Report MSR-TR-2014–112, 2014
4752014
Improved bottleneck features using pretrained deep neural networks
D Yu, ML Seltzer
Twelfth annual conference of the international speech communication association, 2011
3902011
Feature learning in deep neural networks-studies on speech recognition tasks
D Yu, ML Seltzer, J Li, JT Huang, F Seide
arXiv preprint arXiv:1301.3605, 2013
3232013
Multi-task learning in deep neural networks for improved phoneme recognition
ML Seltzer, J Droppo
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
2942013
Crowdmos: An approach for crowdsourcing mean opinion score studies
F Ribeiro, D Florêncio, C Zhang, M Seltzer
2011 IEEE international conference on acoustics, speech and signal …, 2011
2912011
Reconstruction of missing features for robust speech recognition
B Raj, ML Seltzer, RM Stern
Speech communication 43 (4), 275-296, 2004
2822004
Augmenting speech recognition with depth imaging
J Kapur, I Tashev, M Seltzer, SE Hodges
US Patent App. 13/662,293, 2014
2722014
Transformer-based acoustic modeling for hybrid speech recognition
Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2682020
Toward human parity in conversational speech recognition
W Xiong, J Droppo, X Huang, F Seide, ML Seltzer, A Stolcke, D Yu, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (12 …, 2017
2592017
Deep beamforming networks for multi-channel speech recognition
X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ...
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
2242016
A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
ML Seltzer, B Raj, RM Stern
Speech Communication 43 (4), 379-393, 2004
2152004
Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network
J Xue, J Li, D Yu, M Seltzer, Y Gong
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
2002014
Likelihood-maximizing beamforming for robust hands-free speech recognition
ML Seltzer, B Raj, RM Stern
IEEE Transactions on speech and audio processing 12 (5), 489-498, 2004
1902004
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–20