Publications

Filters: Author is King, Simon
2016
T. Merritt, Clark, R. A. J., Wu, Z., Yamagishi, J., and King, S., “Deep neural network-guided unit selection synthesis”, in Proc. ICASSP, 2016.
O. Watts, Henter, G. Eje, Merritt, T., Wu, Z., and King, S., “From HMMs to DNNs: where do the improvements come from?”, in Proc. ICASSP, Shanghai, China, 2016, vol. 41.
G. Eje Henter, Ronanki, S., Watts, O., Wester, M., Wu, Z., and King, S., “Robust TTS duration modelling using DNNs”, in Proc. ICASSP, Shanghai, China, 2016, vol. 41.
S. Ronanki, Henter, G. Eje, Wu, Z., and King, S., “A template-based approach for speech synthesis intonation generation using LSTMs”, in Proc. Interspeech, San Francisco, CA, 2016.
R. Dall, Brognaux, S., Richmond, K., Valentini-Botinhao, C., Henter, G. Eje, Hirschberg, J., Yamagishi, J., and King, S., “Testing the consistency assumption: Pronunciation variant forced alignment in read and spontaneous speech synthesis”, in Proc. ICASSP, Shanghai, China, 2016, vol. 41.
2015
T. Merritt, Latorre, J., and King, S., “Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech”, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brisbane, 2015.
T. Merritt, Yamagishi, J., Wu, Z., Watts, O., and King, S., “Deep neural network context embeddings for model selection in rich-context HMM synthesis”, in Proc. Interspeech, Dresden, Germany, 2015, pp. 2207–2211.
M. Tomalin, Wester, M., Dall, R., Byrne, B., and King, S., “A Lattice-based Approach to Automatic Filled Pause Insertion”, in Proc. of DiSS 2015, Edinburgh, 2015.
2014
T. Merritt, Raitio, T., and King, S., “Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis”, in Proc. Interspeech, Singapore, 2014, pp. 1509–1513.
G. Eje Henter, Merritt, T., Shannon, M., Mayo, C., and King, S., “Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech”, in Proceedings of Interspeech, Singapore, 2014.
2013
H. Lu, King, S., and Watts, O., “Combining a Vector Space Representation of Linguistic Context with a Deep Neural Network for Text-To-Speech Synthesis”, in 8th ISCA Workshop on Speech Synthesis, Barcelona, Spain, 2013, pp. 281–285.
T. Merritt and King, S., “Investigating the shortcomings of HMM synthesis”, in 8th ISCA Workshop on Speech Synthesis, Barcelona, Spain, 2013, pp. 185–190.
A. Stan, Bell, P., Yamagishi, J., and King, S., “Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data”, in Proc. Interspeech, Lyon, France, 2013.
C. Veaux, Yamagishi, J., and King, S., “Towards Personalized Synthesized Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction”, in SLPAT 2013, 4th Workshop on Speech and Language Processing for Assistive Technologies, 2013, pp. 107–111.
C. Valentini-Botinhao, Wester, M., Yamagishi, J., and King, S., “Using neighbourhood density and selective SNR boosting to increase the intelligibility of synthetic speech in noise”, in 8th ISCA Workshop on Speech Synthesis, Barcelona, Spain, 2013, pp. 133–138.
2012
J. Yamagishi, Veaux, C., King, S., and Renals, S., “Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction”, Acoustical Science and Technology, vol. 33, pp. 1–5, 2012.