Publications

2016

T. Merritt, Clark, R. A. J., Wu, Z., Yamagishi, J., and King, S., “Deep neural network-guided unit selection synthesis”, in Proc. ICASSP, 2016.

O. Watts, Henter, G. Eje, Merritt, T., Wu, Z., and King, S., “From HMMs to DNNs: where do the improvements come from?”, in Proc. ICASSP, Shanghai, China, 2016, vol. 41.

G. Eje Henter, Ronanki, S., Watts, O., Wester, M., Wu, Z., and King, S., “Robust TTS duration modelling using DNNs”, in Proc. ICASSP, Shanghai, China, 2016, vol. 41.

S. Ronanki, Henter, G. Eje, Wu, Z., and King, S., “A template-based approach for speech synthesis intonation generation using LSTMs”, in Proc. Interspeech, San Francisco, CA, 2016.

Google Scholar
BibTex
RTF

R. Dall, Brognaux, S., Richmond, K., Valentini-Botinhao, C., Henter, G. Eje, Hirschberg, J., Yamagishi, J., and King, S., “Testing the consistency assumption: Pronunciation variant forced alignment in read and spontaneous speech synthesis”, in Proc. ICASSP, Shanghai, China, 2016, vol. 41.

2015

T. Merritt, Latorre, J., and King, S., “Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech”, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brisbane, 2015.

T. Merritt, Yamagishi, J., Wu, Z., Watts, O., and King, S., “Deep neural network context embeddings for model selection in rich-context HMM synthesis”, in Proc. Interspeech, Dresden, Germany, 2015, pp. 2207–2211.

M. Tomalin, Wester, M., Dall, R., Byrne, B., and King, S., “A Lattice-based Approach to Automatic Filled Pause Insertion”, in Proc. of DiSS 2015, Edinburgh, 2015.

Google Scholar
BibTex
RTF

2014

T. Merritt, Raitio, T., and King, S., “Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis”, in Proc. Interspeech, Singapore, 2014, pp. 1509–1513.

Google Scholar
BibTex
RTF

G. Eje Henter, Merritt, T., Shannon, M., Mayo, C., and King, S., “Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech”, in Proceedings of Interspeech, Singapore, 2014.

Google Scholar
BibTex
RTF

2013

H. Lu, King, S., and Watts, O., “Combining a Vector Space Representation of Linguistic Context with a Deep Neural Network for Text-To-Speech Synthesis”, in 8th ISCA Workshop on Speech Synthesis, Barcelona, Spain, 2013, pp. 281–285.

Google Scholar
BibTex
RTF

T. Merritt and King, S., “Investigating the shortcomings of HMM synthesis”, in 8th ISCA Workshop on Speech Synthesis, Barcelona, Spain, 2013, pp. 185–190.

Google Scholar
BibTex
RTF

A. Stan, Bell, P., Yamagishi, J., and King, S., “Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data”, in Proc. Interspeech, Lyon, France, 2013.

Google Scholar
BibTex
RTF

C. Veaux, Yamagishi, J., and King, S., “Towards Personalized Synthesized Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction”, in SLPAT 2013, 4th Workshop on Speech and Language Processing for Assistive Technologies, 2013, pp. 107–111.

Google Scholar
BibTex
RTF

C. Valentini-Botinhao, Wester, M., Yamagishi, J., and King, S., “Using neighbourhood density and selective SNR boosting to increase the intelligibility of synthetic speech in noise”, in 8th ISCA Workshop on Speech Synthesis, Barcelona, Spain, 2013, pp. 133–138.

Google Scholar
BibTex
RTF

2012

J. Yamagishi, Veaux, C., King, S., and Renals, S., “Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction”, Acoustical Science and Technology, vol. 33, pp. 1-5, 2012.

Google Scholar
BibTex
RTF

Main menu

Navigation

tags

You are here

Main menu

Search form

Navigation

User login

tags

You are here

Publications