Publications 
from the Multimodal Speech Technology Group


Öhman, T. (2000) "Vision in Speech Technology. Automatic measurements of visual speech and audio-visual intelligibility of synthetic and natural faces." Licentiate Thesis, TMH, KTH, 2000.

Gustafson, J., Lindberg, N. and Lundeberg, M. (1999). "The August spoken dialogue system" to be published in Proceedings from Eurospeech'99, Budapest, Hungary.

Lundeberg, M. and Beskow, J. (1999). "Developing a 3D-agent for the August dialogue system" In Proceedings from AVSP'99, Santa Cruz, USA. [HTML] [postscript]

Agelfors, E., Beskow, J., Dahlquist, M., Granström, B., Lundeberg, M., Salvi, G., Spens, K-E & Öhman, T. (1999). "Synthetic visual speech driven from auditory speech" In Proceedings from AVSP'99, Santa Cruz, USA.

Massaro, D.W., Beskow, J., Cohen, M.M., Fry C.L., Rodriquez, T. (1999). "Picture My Voice: Audio to Visual Speech Synthesis using Artificial Neural Networks" In Proceedings from AVSP'99, Santa Cruz, USA.

Öhman, T. and Lundeberg, M. (1999). "Differences in speechreading a synthetic and a natural face" In Proceedings from ICPhS'99, San Francisco, USA.

Granström, B., House, D. and Lundeberg, M. (1999). "Prosodic cues in multimodal speech perception" In Proceedings from ICPhS'99, San Francisco, USA.

Gustafson, J., Lundeberg, M. and Liljencrants, J. (1999). "Experiences from the development of August - a multimodal spoken dialogue system" In Proceedings from IDMS '99, Germany.

Granström, B., House, D. and Lundeberg, M. (1999)."Visual Prominence in Multimodal Speech Perception" in Proceedings from Fonetik 99, Gothenburg, Sweden.

Agelfors, E., Beskow, J., Dahlquist, M., Granström, B., Lundeberg, M., Salvi, G., Spens, K-E & Öhman, T. (1999). "Two methods for Visual Parameter Extraction in the Teleface Project" in Proceedings from Fonetik 99, Gothenburg, Sweden.

Bengtsson, B., Burgoon, J.K., Cederberg, C., Bonito, J., Lundeberg, M. (1999). "The Impact of Antropomorphic Interfaces on Influence, Understanding, and Credibility". In Proceedings of the 23nd Hawaii International Conference on System Sciences - 1999, Maui, USA.

Agelfors, E., Beskow, J., Dahlquist, M., Granström, B., Lundeberg, M., Spens, K-E & Öhman, T. (1998). "Synthetic faces as a lipreading support". To appear in: Proceedings of ICSLP'98, Sydney, Australia. [HTML][postscript]

Agelfors, E., Beskow, J., Dahlquist, M., Granström, B., Lundeberg, M., Spens, K-E & Öhman, T. (1998). "Teleface - the use of a synthetic face for the hard of hearing". To appear in: Proceedings of IVTTA'98, Turin, Italia. []

Öhman, T. (1998). "An audio-visual speech database and automatic measurements of visual speech". In TMH-QPSR 1/1998, Stockholm, Sweden. [postscript][HTML]

Agelfors, E., Beskow, J., Dahlquist, M., Granström, B., Lundeberg, M., Spens, K-E, & Öhman, T. (1998). The synthetic face from a hearing impaired view. In Proceedings of Fonetik 98, Stockholm, Sweden.

Beskow, J. (1998). A Tool for Teaching and Development of Parametric Speech Synthesis. In Proceedings of Fonetik 98, Stockholm, Sweden.

Engwall, O. (1998). A 3D vocal tract model for articulatory and visual speech synthesis. In Proceedings of Fonetik 98, Stockholm, Sweden.

Beskow, J.,Dahlquist, M., Granström, B., Lundeberg, M., Spens, K-E & Öhman, T. (1997). "The Teleface project - Multimodal Speech Communication for the Hearing Impaired". In Proceedings of Eurospeech '97, Rhodos, Greece. [postscript][gzipped postscript] More results presented at Eurospeech'97 [html]

Beskow, J. (1997): "Animation of Talking Agents", In Proceedings of AVSP'97, ESCA Workshop on Audio-Visual Speech Processing, Rhodes, Greece, September 1997. [postscript][gzipped postscript]

Beskow, Elenius & MacGlashan (1997): Olga - A dialogue system with an animated talking agent. In Proceedings of EUROSPEECH'97, Rhodes, Greece. [postscript][gzipped postscript]

Beskow, J., & McGlashan, S. (1997): Olga - A Conversational Agent with Gestures. In Proceedings of the IJCAI'97 workshop on Animated Interface Agents - Making them Intelligent, Nagoya, Japan, August 1997, japan97olga.ps, japan97olga.ps.gz

Beskow, J.,Dahlquist, M., Granström, B., Lundeberg, M., Spens, K-E. & Öhman, T. (1997): "The Teleface project - disability, feasibility and intelligibility" In Proceedings of Fonetik -97, Swedish Phonetics Conference, Umeå, Sweden [html][postscript][gzipped postscript]

Öhman, T. (1997). "Measuring visual speech" In Proceedings of Fonetik 97, Umeå, Sweden.

Beskow, J., Elenius, K. & MacGlashan, S. (1997): Olga - A dialogue system with an animated talking agent. In Proceedings of Fonetik 97, Umeå, Sweden. [html][postscript][gzipped postscript]

Lundeberg, M. (1997). "Multimodal talkommunikation - Utveckling av testmiljö" (in swedish). Master of science thesis, Department of Speech Communication and Music Accoustics, KTH, Stockholm, Sweden. [ Swedish and English Abstract in HTML]

Beskow, J. (1996): "Talking Heads - communication, articulation and animation" In TMH-QPSR 2/1996, Proceedings of Fonetik '96, Swedish Phonetics Conference, Nässlingen, Sweden, 29-31 May 1996  [postscript][gzipped postscript]

Benoît, C., Beskow, J., Cohen, M.M., Granström, B., Le Goff, B. & Massaro, D.W. (1995) "Text-to-audio-visual speech synthesis over the world" Advanced topics for Speech Mapping, Speech Maps Workshop, Grenoble.

Beskow, J. (1995): "Rule-based Visual Speech Synthesis" In Proceedings of Eurospeech '95, Madrid, Spain.  [postscript][gzipped postscript]

Beskow, J. (1995): "Regelstyrd Visuell Talsyntes" (in swedish) Master of science thesis, Department of Speech Communication and Music Accoustics, KTH, Stockholm, Sweden.


Back to the Multimodal Speech Synthesis page