Publications

Export 5 results:

BibTex
RTF

Author Title Type [ Year

]

Filters: Author is Oscar Saz [Clear All Filters]

2016

O. Saz and Hain, T., “Acoustic Adaptation to Dynamic Background Conditions with Asynchronous Transformations”, Computer Speech and Language, 2016.

M. Doulaty, Saz, O., Ng, R. W. M., and Hain, T., “Automatic Genre and Show Identification of Broadcast Media”, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, California, USA, 2016.

S. Deena, Hasan, M., Doulaty, M., Saz, O., and Hain, T., “Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition”, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, California, USA, 2016.

J. Olcoz, Saz, O., and Hain, T., “Error correction in lightly supervised alignment of broadcast subtitles”, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, CA, 2016.

R. W. M. Ng, Nicolao, M., Saz, O., Hasan, M., Chettri, B., Doulaty, M., Lee, T., and Hain, T., “Sheffield LRE 2015 System Description}”, in {Odyssey: The Speaker and Language Recognition Workshop (Submitted)}, 2016.

T. Hain, Christian, J., Saz, O., Deena, S., Hasan, M., Ng, R. W. M., Milner, R., Doulaty, M., and Liu, Y., “webASR 2 - Improved cloud based speech technology”, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, CA, 2016.

Google Scholar
BibTex
RTF

2015

R. Milner, Saz, O., Deena, S., Doulaty, M., Ng, R., and Hain, T., “The 2015 Sheffield System for Longitudinal Diarisation of Broadcast Media”, in {Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)}, Scottsdale, AZ, 2015.

O. Saz, Doulaty, M., Deena, S., Milner, R., Ng, R., Hasan, M., Liu, Y., and Hain, T., “The 2015 Sheffield System for Transcription of Multi–Genre Broadcast Media”, in {Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)}, Scottsdale, AZ, 2015.

M. Doulaty, Saz, O., and Hain, T., “Data-selective Transfer Learning for Multi-Domain Speech Recognition”, in Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 2015.

M. Doulaty, Saz, O., Ng, R. W. M., and Hain, T., “Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation”, in Proc. of ASRU, Arizona, USA, 2015.

Google Scholar
BibTex
RTF

P. Bell, Gales, M., Hain, T., Kilgour, J., Lanchantin, P., Liu, A., McParland, A., Renals, S., Saz, O., Wester, M., and Woodland, P., “The MGB Challenge: Evaluating Multi-genre Broadcast Media Recognition”, in {Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)}, Scottsdale, AZ, 2015.

Google Scholar
BibTex
RTF

M. Doulaty, Saz, O., and Hain, T., “Unsupervised Domain Discovery using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition”, in Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 2015.

Google Scholar
BibTex
RTF

2014

O. Saz, Doulaty, M., and Hain, T., “Background-Tracking Acoustic Features for Genre Identification of Broadcast Shows”, in Proceedings of the 2014 Spoken Language Technology (SLT) Workshop, South Lake Tahoe NV, USA, 2014, pp. 118–123.

Google Scholar
BibTex
RTF

O. Saz and Hain, T., “Using Contextual Information in Joint Factor Eigenspace MLLR for Speech Recognition in Diverse Scenarios”, in {Proceedings of the 2014 International Conference on Acoustic, Speech and Signal Processing (ICASSP)}, Florence, Italy, 2014, pp. 6314–6318.

Google Scholar
BibTex
RTF

2013

O. Saz and Hain, T., “Asynchronous Factorisation of Speaker and Background with Feature Transforms in Speech Recognition”, in {Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech)}, Lyon, France, 2013, pp. 1238–1242.

Google Scholar
BibTex
RTF

P. Lanchantin, Bell, P. J., Gales, M. J. F., Hain, T., Liu, X., Long, Y., Quinnell, J., Renals, S., Saz, O., Seigel, M. S., Swietojanski, P., and Woodland, P. C., “Automatic Transcription of Multi-Genre Media Archives”, in {Proceedings of the First Workshop on Speech, Language and Audio in Multimedia}, Marseille, France, 2013, pp. 26–31.

Google Scholar
BibTex
RTF

Main menu

Navigation

tags

You are here

Main menu

Search form

Navigation

User login

tags

You are here

Publications