You are here

Publications

Export 5 results:

BibTex
RTF

Author Title Type [ Year

]

2016

O. Saz and Hain, T., “Acoustic Adaptation to Dynamic Background Conditions with Asynchronous Transformations”, Computer Speech and Language, 2016.

M. Doulaty, Saz, O., Ng, R. W. M., and Hain, T., “Automatic Genre and Show Identification of Broadcast Media”, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, California, USA, 2016.

and Hain, T., “Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition”, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, California, USA, 2016.

S. Deena, Hasan, M., Doulaty, M., Saz, O., and Hain, T., “Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition”, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, California, USA, 2016.

R. W. M. Ng, Chettri, B., and Hain, T., “Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting”, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, CA, 2016.

T. Merritt, Clark, R. A. J., Wu, Z., Yamagishi, J., and King, S., “Deep neural network-guided unit selection synthesis”, in Proc. ICASSP, 2016.

P. Swietojanski and Renals, S., “Differentiable Pooling for Unsupervised Acoustic Model Adaptation”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. PP, pp. 1-1, 2016.

C. Zhang and Woodland, P. C., “DNN Speaker Adaptation using Parameterised Sigmoid and ReLU Hidden Activation Functions”, in Proc. ICASSP'16, Shanghai, China, 2016.

R. Milner and Hain, T., “DNN-based speaker clustering for speaker diarisation”, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, CA, 2016.

J. Olcoz, Saz, O., and Hain, T., “Error correction in lightly supervised alignment of broadcast subtitles”, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, CA, 2016.

M. Wester, Watts, O., and Henter, G. Eje, “Evaluating comprehension of natural and synthetic conversational speech”, in Speech Prosody, Boston, MA, 2016.

M. Nicolao, Christensen, H., Cunningham, S., Green, P., and Hain, T., “A framework for collecting realistic recordings of dysarthric speech - the homeService corpus”, in The International Conference on Language Resources and Evaluation - LREC 2016, Portorož, SLO, 2016.

O. Watts, Henter, G. Eje, Merritt, T., Wu, Z., and King, S., “From HMMs to DNNs: where do the improvements come from?”, in Proc. ICASSP, Shanghai, China, 2016, vol. 41.

R. W. M. Ng, Shah, K., Specia, L., and Hain, T., “Groupwise learning for ASR k-best list reranking in spoken language translation”, in Proceedings of the 2016 International Conference on Acoustic, Speech and Signal Processing (ICASSP), Shanghai, China, 2016.

T. Yoshimura, Henter, G. Eje, Watts, O., Wester, M., Yamagishi, J., and Tokuda, K., “A hierarchical predictor of synthetic speech naturalness using neural networks”, in Proc. Interspeech, San Francisco, CA, 2016.

L. Wang, Zhang, C., Woodland, P. C., Gales, M. J. F., Karanasou, P., Lanchantin, P., Liu, X., and Qian, Y., “Improved DNN-based Segmentation for Multi-genre Broadcast Audio”, in Proc. ICASSP'16, Shanghai, China, 2016.

I. Casanueva, Hain, T., and Green, P., “Improving generalisation to new speakers in spoken dialogue state tracking”, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, CA, 2016.

Y. Qian, Tan, T., Yu, D., and Zhang, Y., “Integrated adaptation with multi-factor joint-learning for far-field speech recognition”, in Proc. ICASSP’16, Shanghai, China, 2016.

Y. Qian, Tan, T., and Yu, D., “An investigation into using parallel data for far-field speech recognition”, in Proc. ICASSP’16, Shanghai, China, 2016.

P. Swietojanski, Li, J., and Renals, S., “Learning Hidden Unit Contributions for Unsupervised Acoustic Model Adaptation”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, pp. 1450-1463, 2016.

and Yu, K., “Multi-task joint-learning of deep neural networks for robust speech recognition”, in Proc. ASRU’15, Scottsdale, Arizona, USA, 2016.

G. Eje Henter, Ronanki, S., Watts, O., Wester, M., Wu, Z., and King, S., “Robust TTS duration modelling using DNNs”, in Proc. ICASSP, Shanghai, China, 2016, vol. 41.

P. Swietojanski and Renals, S., “SAT-LHUC: Speaker Adaptive Training for Learning Hidden Unit Contributions”, in Proc. IEEE ICASSP, Shanghai, China, 2016.

L. Lu, Kong, L., Dyer, C., Smith, N. A., and Renals, S., “Segmental Recurrent Neural Networks for End-to-end Speech Recognition”, in Proc. INTERSPEECH, 2016.

R. Milner and Hain, T., “Segment-oriented evaluation of speaker diarisation performance”, in Proceedings of the 2016 International Conference on Acoustic, Speech and Signal Processing (ICASSP), Shanghai, China, 2016.

Pages