You are here

Publications

Export 5 results:
  • BibTex
  • RTF
Author Title Type [ Year(Asc)]
Filters: Author is Oscar Saz  [Clear All Filters]
2016
O. Saz and Hain, T., Acoustic Adaptation to Dynamic Background Conditions with Asynchronous Transformations, Computer Speech and Language, 2016.
M. Doulaty, Saz, O., Ng, R. W. M., and Hain, T., Automatic Genre and Show Identification of Broadcast Media, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, California, USA, 2016.
S. Deena, Hasan, M., Doulaty, M., Saz, O., and Hain, T., Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, California, USA, 2016.
J. Olcoz, Saz, O., and Hain, T., Error correction in lightly supervised alignment of broadcast subtitles, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, CA, 2016.
R. W. M. Ng, Nicolao, M., Saz, O., Hasan, M., Chettri, B., Doulaty, M., Lee, T., and Hain, T., Sheffield LRE 2015 System Description}, in {Odyssey: The Speaker and Language Recognition Workshop (Submitted)}, 2016.
T. Hain, Christian, J., Saz, O., Deena, S., Hasan, M., Ng, R. W. M., Milner, R., Doulaty, M., and Liu, Y., “webASR 2 - Improved cloud based speech technology”, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, CA, 2016.
2015
R. Milner, Saz, O., Deena, S., Doulaty, M., Ng, R., and Hain, T., The 2015 Sheffield System for Longitudinal Diarisation of Broadcast Media, in {Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)}, Scottsdale, AZ, 2015.
O. Saz, Doulaty, M., Deena, S., Milner, R., Ng, R., Hasan, M., Liu, Y., and Hain, T., The 2015 Sheffield System for Transcription of Multi–Genre Broadcast Media, in {Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)}, Scottsdale, AZ, 2015.
M. Doulaty, Saz, O., and Hain, T., “Data-selective Transfer Learning for Multi-Domain Speech Recognition”, in Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 2015.
M. Doulaty, Saz, O., Ng, R. W. M., and Hain, T., “Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation”, in Proc. of ASRU, Arizona, USA, 2015.
P. Bell, Gales, M., Hain, T., Kilgour, J., Lanchantin, P., Liu, A., McParland, A., Renals, S., Saz, O., Wester, M., and Woodland, P., “The MGB Challenge: Evaluating Multi-genre Broadcast Media Recognition”, in {Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)}, Scottsdale, AZ, 2015.
M. Doulaty, Saz, O., and Hain, T., “Unsupervised Domain Discovery using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition”, in Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 2015.
2014
O. Saz, Doulaty, M., and Hain, T., “Background-Tracking Acoustic Features for Genre Identification of Broadcast Shows”, in Proceedings of the 2014 Spoken Language Technology (SLT) Workshop, South Lake Tahoe NV, USA, 2014, pp. 118–123.
O. Saz and Hain, T., “Using Contextual Information in Joint Factor Eigenspace MLLR for Speech Recognition in Diverse Scenarios”, in {Proceedings of the 2014 International Conference on Acoustic, Speech and Signal Processing (ICASSP)}, Florence, Italy, 2014, pp. 6314–6318.
2013
O. Saz and Hain, T., “Asynchronous Factorisation of Speaker and Background with Feature Transforms in Speech Recognition”, in {Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech)}, Lyon, France, 2013, pp. 1238–1242.
P. Lanchantin, Bell, P. J., Gales, M. J. F., Hain, T., Liu, X., Long, Y., Quinnell, J., Renals, S., Saz, O., Seigel, M. S., Swietojanski, P., and Woodland, P. C., “Automatic Transcription of Multi-Genre Media Archives”, in {Proceedings of the First Workshop on Speech, Language and Audio in Multimedia}, Marseille, France, 2013, pp. 26–31.