Contact





TMH / Publications

TMH Publications by author

Note: this list may be incomplete.

Author:

2016

Arnela, M., Blandin, R., Dabbaghchian, S., Guasch, O., Alías, F., Pelorson, X., Van Hirtum, A., & Engwall, O. (2016). Influence of lips on the production of vowels based on finite element simulations and experiments. Journal of the Acoustical Society of America, 139(5), 2852–2859. [abstract]

Arnela, M., Dabbaghchian, S., Blandin, R., Guasch, O., Engwall, O., Van Hirtum, A., & Pelorson, X. (2016). Influence of vocal tract geometry simplifications on the numerical simulation of vowel sounds. Journal of the Acoustical Society of America, 140(3), 1707-1718. [abstract]

Dabbaghchian, S., Arnela, M., Engwall, O., Guasch, O., Stavness, I., & Badin, P. (2016). Using a Biomechanical Model and Articulatory Data for the Numerical Production of Vowels. In Proceedings of Interspeech 2016. San Fransisco. [abstract] [pdf]

Wedenborn, A., Wik, P., Engwall, O., & Beskow, J. (2016). The effect of a physical robot on vocabulary learning. In Proceedings of the International Workshop on Spoken Dialogue Systems (IWSDS 2016),. Saariselkä, Finland. [pdf]

2015

Dabbaghchian, S., Arnela, M., & Engwall, O. (2015). Simplification of Vocal Tract Shapes with Different Levels of Detail. In Proceedings of 18th International Congress of Phonetic Sciences. Glasgow. [abstract]

2013

Koniaris, C., Salvi, G., & Engwall, O. (2013). On Mispronunciation Analysis of Individual Foreign Speakers Using Auditory Periphery Models. Speech Communication, 55(5), 691-706. [abstract] [link]

2012

Ananthakrishnan, G., Engwall, O., & Neiberg, D. (2012). Exploring the Predictability of Non-unique Acoustic-to-Articulatory Mappings. IEEE transactions on Audio, Speech, and Language Processing. [abstract] [link]

Engwall, O. (2012). Datoranimerade talande ansikten. In Adelswärd, V., & Forstorp, P-A. (Eds.), Människans ansikten: Emotion, interaktion och konst. Carlssons Bokförlag. [pdf]

Engwall, O. (2012). Pronunciation analysis by acoustic-to-articulatory feature inversion. In Engwall, O. (Ed.), Proceedings of the International Symposium on Automatic detection of Errors in Pronunciation Training (pp. 79-84). Stockholm. [pdf]

Koniaris, C., Engwall, O., & Salvi, G. (2012). Auditory and Dynamic Modeling Paradigms to Detect L2 Mispronunciations. In Interspeech 2012. Portland, OR, USA. [abstract] [pdf]

Koniaris, C., Engwall, O., & Salvi, G. (2012). On the Benefit of Using Auditory Modeling for Diagnostic Evaluation of Pronunciations. In Inter. Symp. on Auto. Detect. Errors in Pronunc. Training (IS ADEPT), 2012 (pp. 59-64). Stockholm, Sweden. [abstract] [pdf]

2011

Ananthakrishnan, G., & Engwall, O. (2011). Mapping between Acoustic and Articulatory Gestures. Speech Communication, 53(4), 567-589. [abstract] [link]

Ananthakrishnan, G., & Engwall, O. (2011). Resolving Non-uniqueness in the Acoustic-to-Articulatory Mapping. In ICASSP (pp. 4628-4631). Prague, Czech republic.

Ananthakrishnan, G., Wik, P., & Engwall, O. (2011). Detecting confusable phoneme pairs for Swedish language learners depending on their first language. TMH-QPSR, 51(1), 89-92. [pdf]

Ananthakrishnan, G., Wik, P., Engwall, O., & Abdou, S. (2011). Using an Ensemble of Classifiers for Mispronunciation Feedback. In Strik, H., Delmonte, R., & Russel, M. (Eds.), Proceedings of SLaTE. Venice, Italy.

Engwall, O. (2011). Analysis of and feedback on phonetic features in pronunciation training with a virtual teacher. Computer Assisted Language Learning. [abstract] [link]

Engwall, O. (2011). Augmented Reality Talking Heads as a Support for Speech Perception and Production. In Nee, A. (Ed.), Augmented Reality. InTech. [pdf]

Koniaris, C., & Engwall, O. (2011). Perceptual Differentiation Modeling Explains Phoneme Mispronunciation by Non-Native Speakers. In 2011 IEEE Int. Conf. on Acoust., Speech, Sig. Proc. (ICASSP) (pp. 5704-5707). Prague, Czech Republic. [abstract] [link]

Koniaris, C., & Engwall, O. (2011). Phoneme Level Non-Native Pronunciation Analysis by an Auditory Model-based Native Assessment Scheme. In Interspeech 2011 (pp. 1157-1160). Florence, Italy. [NOMINATED FOR BEST STUDENT PAPER AWARD]. [abstract] [html]

2010

Ananthakrishnan, G., Badin, P., Vargas, J. A. V., & Engwall, O. (2010). Predicting Unseen Articulations from Multi-speaker Articulatory Models. In Proc. Interspeech. Makuhari, Japan. [abstract] [pdf]

Engwall, O. (2010). Is there a McGurk effect for tongue reading?. In Proceedings of AVSP. Hakone, Japan. [pdf]

Picard, S., Ananthakrishnan, G., Wik, P., Engwall, O., & Abdou, S. (2010). Detection of Specific Mispronunciations using Audiovisual Features. In International Conference on Auditory-Visual Speech Processing. Kanagawa, Japan. [abstract] [pdf]

2009

Ananthakrishnan, G., Neiberg, D., & Engwall, O. (2009). In search of Non-uniqueness in the Acoustic-to-Articulatory Mapping. In INTERSPEECH 2009 - 10th Annual Conference of the International Speech Communication Association (pp. 2799 – 2802). Brighton, UK. [abstract] [pdf]

Engwall, O., & Wik, P. (2009). Are real tongue movements easier to speech read than synthesized?. In Proceedings of Interspeech. [pdf]

Engwall, O., & Wik, P. (2009). Can you tell if tongue movements are real or synthetic?. In Proceedings of AVSP. [pdf]

Engwall, O., & Wik, P. (2009). Real vs. rule-generated tongue movements as an audio-visual speech perception support. In Proceedings of Fonetik 2009.

Kjellström, H., & Engwall, O. (2009). Audiovisual-to-articulatory inversion. Speech Communication, 51(3), 195-209. [pdf]

2008

Ananthakrishnan, G., & Engwall, O. (2008). Important regions in the articulator trajectory. In Proceedings of International Seminar on Speech Production (pp. 305-308). Strasbourg, France. [pdf]

Beskow, J., Engwall, O., Granström, B., Nordqvist, P., & Wik, P. (2008). Visualization of speech and audio for hearing-impaired persons. Technology and Disability, 20(2), 97-107. [pdf]

Engwall, O. (2008). Bättre tala än texta - talteknologi nu och i framtiden. In Domeij, R. (Ed.), Tekniken bakom språket (pp. 98-118). Stockholm: Norstedts Akademiska Förlag.

Engwall, O. (2008). Can audio-visual instructions help learners improve their articulation? - an ultrasound study of short term changes. In Proceedings of Interspeech 2008 (pp. 2631-2634). Brisbane, Australia. [pdf]

Katsamanis, N., Ananthakrishnan, G., Papandreou, G., Engwall, O., & Maragos, P. (2008). Audiovisual speech inversion by switching dynamical modeling Governed by a Hidden Markov Process. In Proceedings of EUSIPCO. [pdf]

Neiberg, D., Ananthakrishnan, G., & Engwall, O. (2008). The Acoustic to Articulation Mapping: Non-linear or Non-unique?. In INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association (pp. 1485-1488). Brisbane, Australia. [pdf]

Wik, P., & Engwall, O. (2008). Can visualization of internal articulators support speech perception?. In Proceedings of Interspeech 2008 (pp. 2627-2630). Brisbane, Australia. [pdf]

Wik, P., & Engwall, O. (2008). Looking at tongues – can it help in speech perception?. In Proceedings of Fonetik 2008. [pdf]

2007

Engwall, O., & Bälter, O. (2007). Pronunciation feedback from real and virtual language teachers. Journal of Computer Assisted Language Learning, 20(3), 235-262. [pdf]

Kjellström, H., Engwall, O., Abdou, S., & Bälter, O. (2007). Audio-visual phoneme classification for pronunciation training applications. In Proceedings of Interspeech 2007 (pp. 702-705). Antwerpen, Belgium. [pdf]

2006

Engwall, O. (2006). Evaluation of speech inversion using an articulatory classifier. In Yehia, H., Demolin, D., & Laboissière, R. (Eds.), In Proceedings of the Seventh International Seminar on Speech Production (pp. 469-476). Ubatuba, Sao Paolo, Brazil. [pdf]

Engwall, O. (2006). Assessing MRI measurements: Effects of sustenation, gravitation and coarticulation. In Harrington, J., & Tabain, M. (Eds.), Speech production: Models, Phonetic Processes and Techniques (pp. 301-314). New York: Psychology Press. [pdf]

Engwall, O. (2006). Feedback strategies of human and virtual tutors in pronunciation training. TMH-QPSR, 48(1), 011-034. [pdf]

Engwall, O., Bälter, O., Öster, A-M., & Kjellström, H. (2006). Designing the user interface of the computer-based speech training system ARTUR based on early user tests. Journal of Behaviour and Information Technology, 25(4), 353-365. [pdf]

Engwall, O., Bälter, O., Öster, A-M., & Kjellström, H. (2006). Feedback management in the pronunciation training system ARTUR. In Proceedings of CHI 2006 (pp. 231-234). Montreal. [pdf]

Engwall, O., Delvaux, V., & Metens, T. (2006). Interspeaker Variation in the Articulation of French Nasal Vowels. In In Proceedings of the Seventh International Seminar on Speech Production (pp. 3-10). Ubatuba, Sao Paolo, Brazil. [pdf]

Kjellström, H., Engwall, O., & Bälter, O. (2006). Reconstructing Tongue Movements from Audio and Video. In Proc of Interspeech 2006 (pp. 2238–2241). Pittsburgh. [pdf]

2005

Bälter, O., Engwall, O., Öster, A-M., & Kjellström, H. (2005). Wizard-of-Oz Test of ARTUR - a Computer-Based Speech Training System with Articulation Correction. In Proceedings of the Seventh International ACM SIGACCESS Conference on Computers and Accessibility (pp. 36-43). Baltimore. [pdf]

Engwall, O. (2005). Articulatory synthesis using corpus-based estimation of line spectrum pairs. In Proceedings of Interspeech 2005. Lisbon, Portugal. [pdf]

Engwall, O. (2005). Introducing visual cues in acoustic-to-articulatory inversion. In Proceedings of Interspeech 2005. Lisbon, Portugal. [pdf]

Eriksson, E., Bälter, O., Engwall, O., Öster, A-M., & Kjellström, H. (2005). Design Recommendations for a Computer-Based Speech Training System Based on End-User Interviews. In Proceedings of the Tenth International Conference on Speech and Computers (pp. 483-486). Patras, Greece. [pdf]

2004

Engwall, O. (2004). From real-time MRI to 3D tongue movements. In Kim, S. H., & Young, D. H. (Eds.), Proc ICSLP 2004 (pp. 1109-1112). Jeju Island, Korea. [pdf]

Engwall, O. (2004). Speaker adaptation of a three-dimensional tongue model. In Kim, S. H., & Young, D. H. (Eds.), Proc ICSLP 2004 (pp. 465-468). Jeju Island, Korea. [pdf]

Engwall, O., Wik, P., Beskow, J., & Granström, G. (2004). Design strategies for a virtual language tutor. In Kim, S. H., & Young, D. H. (Eds.), Proc ICSLP 2004 (pp. 1693-1696). Jeju Island, Korea. [pdf]

2003

Beskow, J., Engwall, O., & Granström, B. (2003). Resynthesis of Facial and Intraoral Articulation from Simultaneous Measurements. In Solé, M., Recasens, D., & Romero, J. (Eds.), Proceedings of the 15th ICPhS (pp. 431-434). Barcelona, Spain. [pdf]

Beskow, J., Engwall, O., & Granström, B. (2003). Simultaneous measurements of facial and intraoral articulation. In Proc of Fonetik 2003, Umeå University, Dept of Philosophy and Linguistics PHONUM 9 (pp. 57-60). [pdf]

Engwall, O. (2003). A revisit to the application of MRI to the analysis of speech production. Testing our assumptions. In 6th Intl Seminar on Speech Production (pp. 43-48). Sydney. [pdf]

Engwall, O. (2003). Combining MRI, EMA & EPG measurements in a three-dimensional tongue model. Speech Communication, 41, 303-329. [pdf]

Engwall, O., & Beskow, J. (2003). Resynthesis of 3D tongue movements from facial data. In Proc EuroSpeech 2003 (pp. 2261-2264). [pdf]

Engwall, O., & Beskow, J. (2003). The effect of corpus choice on statistical articulatory modeling. In 7th Intl Seminar on Speech Production (pp. 49-54). Sydney. [pdf]

2002

Engwall, O. (2002). Evaluation of a system for concatenative articulatory visual speech synthesis. In Proc of ICSLP 2002 (pp. 665-668). Denver, Colorado, USA. [pdf]

Engwall, O. (2002). Tongue Talking - Studies in Intraoral Speech Synthesis. Doctoral dissertation, KTH. [pdf]

2001

Engwall, O. (2001). Considerations in intraoral visual speech synthesis: Data and modelling. In Proc of 4th Intl Speech Motor Conf (pp. 23-26). Nijmegen. [pdf]

Engwall, O. (2001). Making the tongue model talk: Merging MRI & EMA Measurements. In Proc of Eurospeech 2001 (pp. 261-264). Aalborg. [pdf]

Engwall, O. (2001). Synthesising static vowels and dynamic sounds using a 3D vocal tract model. In Proc of 4th ISCA Tutorial and Research Workshop on Speech Synthesis (pp. 38-41). Perthshire. [pdf]

Engwall, O. (2001). Using linguopalatal contact patterns to tune a 3D tongue model. In Proc of Eurospeech 2001 (pp. 1475-1478). Aalborg. [pdf]

2000

Engwall, O. (2000). A 3D tongue model based on MRI data. In Yuan, B., Huang, T., & Tang, X. (Eds.), Proc of ICSLP 2000, 6th Intl Conf on Spoken Language Processing (pp. 901-904). Beijing. [pdf]

Engwall, O. (2000). Are static MRI representative of dynamic speech? Results from a comparative study using MRI, EPG, and EMA. In Yuan, B., Huang, T., & Tang, X. (Eds.), Proc of ICSLP 2000, 6th Intl Conf on Spoken Language Processing (pp. 17-20). Beijing. [pdf]

Engwall, O. (2000). Dynamical aspects of coarticulation in Swedish fricatives - a combined EMA and EPG study. TMH-QPSR, 41(4), 049-073. [pdf]

Engwall, O. (2000). Replicating three-dimensional toungue shapes synthetically. TMH-QPSR, 41(2-3), 053-064. [pdf]

Engwall, O., & Badin, P. (2000). An MRI study of Swedish fricatives: coarticulatory effects. In Hole, P. (Ed.), Proc of 5th Speech Production Seminar: Models and data (pp. 297-300). Kloster Seeon, Germany. [pdf]

1999

Engwall, O. (1999). Modeling of the vocal tract in three dimensions. In Proc of Eurospeech 99 (pp. 113-116). Budapest. [pdf]

Engwall, O. (1999). Vocal tract modeling in 3D. TMH-QPSR, 40(1-2), 031-038. [pdf]

Engwall, O., & Badin, P. (1999). Collecting and analysing two- and three-dimensional MRI data for Swedish. TMH-QPSR, 40(3-4), 011-038. [pdf]

1998

Engwall, O. (1998). A 3D vocal tract model for articulatory and visual speech synthesis. In Branderud, P., & Traunmüller, H. (Eds.), Proc of Fonetik -98, The Swedish Phonetics Conference (pp. 196-199). Stockholm University.

Engwall, O. (1998). En tredimensionell modell för talsyntes. Master's thesis, KTH, TMH, CTT.







Published by: TMH, Speech, Music and Hearing
Webmaster, webmaster@speech.kth.se

Last updated: Tuesday, 29-May-2012 08:16:22 MEST