TMH Publications by author
Note: this list may be incomplete.
Journal of the Acoustical Society of America, 139(5). [abstract] (2016). Influence of lips on the production of vowels based on finite element simulations and experiments.
Journal of the Acoustical Society of America, 140(3), 1707-1718. (2016). Influence of vocal tract geometry simplifications on the numerical simulation of vowel sounds.
Proceedings of Interspeech 2016. San Fransisco. [abstract] [pdf] (2016). Using a Biomechanical Model and Articulatory Data for the Numerical Production of Vowels. In
Proceedings of the International Workshop on Spoken Dialogue Systems (IWSDS 2016),. [pdf] (2016). The effect of a physical robot on vocabulary learning. In
Proceedings of 18th International Congress of Phonetic Sciences. Glasgow. [abstract] (2015). Simplification of Vocal Tract Shapes with Different Levels of Detail. In
Speech Communication, 55(5), 691-706. [link] (2013). On Mispronunciation Analysis of Individual Foreign Speakers Using Auditory Periphery Models.
IEEE transactions on Audio, Speech, and Language Processing. [abstract] [link] (2012). Exploring the Predictability of Non-unique Acoustic-to-Articulatory Mappings.
Datoranimerade talande ansikten. [pdf] (2012).
Proceedings of the International Symposium on Automatic detection of Errors in Pronunciation Training (pp. 79-84). Stockholm. [pdf] (2012). Pronunciation analysis by acoustic-to-articulatory feature inversion. In Engwall, O. (Ed.),
Interspeech 2012. Portland, OR, USA. [abstract] [pdf] (2012). Auditory and Dynamic Modeling Paradigms to Detect L2 Mispronunciations. In
Inter. Symp. on Auto. Detect. Errors in Pronunc. Training (IS ADEPT), 2012 (pp. 59-64). Stockholm, Sweden. [abstract] [pdf] (2012). On the Benefit of Using Auditory Modeling for Diagnostic Evaluation of Pronunciations. In
Speech Communication, 53(4), 567-589. [link] (2011). Mapping between Acoustic and Articulatory Gestures.
ICASSP (pp. 4628-4631). Prague, Czech republic. (2011). Resolving Non-uniqueness in the Acoustic-to-Articulatory Mapping. In
TMH-QPSR, 51(1), 89-92. [pdf] (2011). Detecting confusable phoneme pairs for Swedish language learners depending on their first language.
Proceedings of SLaTE. Venice, Italy. (2011). Using an Ensemble of Classifiers for Mispronunciation Feedback. In Strik, H., Delmonte, R., & Russel, M. (Eds.),
Computer Assisted Language Learning. [abstract] [link] (2011). Analysis of and feedback on phonetic features in pronunciation training with a virtual teacher.
Augmented Reality. InTech. [pdf] (2011). Augmented Reality Talking Heads as a Support for Speech Perception and Production. In Nee, A. (Ed.),
2011 IEEE Int. Conf. on Acoust., Speech, Sig. Proc. (ICASSP) (pp. 5704-5707). Prague, Czech Republic. [abstract] [link] (2011). Perceptual Differentiation Modeling Explains Phoneme Mispronunciation by Non-Native Speakers. In
Interspeech 2011 (pp. 1157-1160). Florence, Italy. [NOMINATED FOR BEST STUDENT PAPER AWARD]. [abstract] [html] (2011). Phoneme Level Non-Native Pronunciation Analysis by an Auditory Model-based Native Assessment Scheme. In
Proc. Interspeech. Makuhari, Japan. [abstract] [pdf] (2010). Predicting Unseen Articulations from Multi-speaker Articulatory Models. In
Proceedings of AVSP. Hakone, Japan. [pdf] (2010). Is there a McGurk effect for tongue reading?. In
International Conference on Auditory-Visual Speech Processing. Kanagawa, Japan. [abstract] [pdf] (2010). Detection of Specific Mispronunciations using Audiovisual Features. In
INTERSPEECH 2009 - 10th Annual Conference of the International Speech Communication Association. Brighton, UK. [abstract] [pdf] (2009). In search of Non-uniqueness in the Acoustic-to-Articulatory Mapping. In
Proceedings of Interspeech. [pdf] (2009). Are real tongue movements easier to speech read than synthesized?. In
Proceedings of AVSP. [pdf] (2009). Can you tell if tongue movements are real or synthetic?. In
Proceedings of Fonetik 2009. (2009). Real vs. rule-generated tongue movements as an audio-visual speech perception support. In
Speech Communication, 51(3), 195-209. [pdf] (2009). Audiovisual-to-articulatory inversion.
Proceedings of International Seminar on Speech Production (pp. 305-308). Strasbourg, France. [pdf] (2008). Important regions in the articulator trajectory. In
Technology and Disability, 20(2), 97-107. [pdf] (2008). Visualization of speech and audio for hearing-impaired persons.
. Stockholm. (2008).
Proceedings of Interspeech 2008 (pp. 2631-2634). Brisbane, Australia. [pdf] (2008). Can audio-visual instructions help learners improve their articulation? - an ultrasound study of short term changes. In
Proceedings of EUSIPCO. [pdf] (2008). Audiovisual speech inversion by switching dynamical modeling Governed by a Hidden Markov Process. In
INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association (pp. 1485-1488). Brisbane, Australia. [pdf] (2008). The Acoustic to Articulation Mapping: Non-linear or Non-unique?. In
Proceedings of Interspeech 2008 (pp. 2627-2630). Brisbane, Australia. [pdf] (2008). Can visualization of internal articulators support speech perception?. In
Proceedings of Fonetik 2008. [pdf] (2008). . In
Journal of Computer Assisted Language Learning, 20(3), 235-262. [pdf] (2007). Pronunciation feedback from real and virtual language teachers.
Proceedings of Interspeech 2007 (pp. 702-705). Antwerpen, Belgium. [pdf] (2007). Audio-visual phoneme classification for pronunciation training applications. In
In Proceedings of the Seventh International Seminar on Speech Production (pp. 469-476). Ubatuba, Sao Paolo, Brazil. [pdf] (2006). Evaluation of speech inversion using an articulatory classifier. In
Speech production: Models, Phonetic Processes and Techniques (pp. 301-314). New York: Psychology Press. [pdf] (2006). Assessing MRI measurements: Effects of sustenation, gravitation and coarticulation. In Harrington, J., & Tabain, M. (Eds.),
TMH-QPSR, 48(1), 011-034. [pdf] (2006). Feedback strategies of human and virtual tutors in pronunciation training.
Journal of Behaviour and Information Technology, 25(4), 353-365. [pdf] (2006). Designing the user interface of the computer-based speech training system ARTUR based on early user tests.
Proceedings of CHI 2006 (pp. 231-234). Montreal. [pdf] (2006). Feedback management in the pronunciation training system ARTUR. In
In Proceedings of the Seventh International Seminar on Speech Production (pp. 3-10). Ubatuba, Sao Paolo, Brazil. [pdf] (2006). Interspeaker Variation in the Articulation of French Nasal Vowels. In
Proc of Interspeech 2006. Pittsburgh. [pdf] (2006). Reconstructing Tongue Movements from Audio and Video. In
Proceedings of the Seventh International ACM SIGACCESS Conference on Computers and Accessibility (pp. 36-43). Baltimore. [pdf] (2005). Wizard-of-Oz Test of ARTUR - a Computer-Based Speech Training System with Articulation Correction. In
Proceedings of Interspeech 2005. Lisbon, Portugal. [pdf] (2005). Articulatory synthesis using corpus-based estimation of line spectrum pairs. In
Proceedings of Interspeech 2005. Lisbon, Portugal. [pdf] (2005). Introducing visual cues in acoustic-to-articulatory inversion. In
Proceedings of the Tenth International Conference on Speech and Computers (pp. 483-486). Patras, Greece. [pdf] (2005). Design Recommendations for a Computer-Based Speech Training System Based on End-User Interviews. In
Proc ICSLP 2004 (pp. 1109-1112). Jeju Island, Korea. [pdf] (2004). From real-time MRI to 3D tongue movements. In Kim, S. H., & Young, D. H. (Eds.),
Proc ICSLP 2004 (pp. 465-468). Jeju Island, Korea. [pdf] (2004). Speaker adaptation of a three-dimensional tongue model. In Kim, S. H., & Young, D. H. (Eds.),
Proc ICSLP 2004 (pp. 1693-1696). Jeju Island, Korea. [pdf] (2004). Design strategies for a virtual language tutor. In Kim, S. H., & Young, D. H. (Eds.),
Proceedings of the 15th ICPhS (pp. 431-434). Barcelona, Spain. [pdf] (2003). Resynthesis of Facial and Intraoral Articulation from Simultaneous Measurements. In
Simultaneous measurements of facial and intraoral articulation. [pdf] (2003).
6th Intl Seminar on Speech Production (pp. 43-48). Sydney. [pdf] (2003). A revisit to the application of MRI to the analysis of speech production. Testing our assumptions. In
Speech Communication, 41, 303-329. [pdf] (2003). Combining MRI, EMA & EPG measurements in a three-dimensional tongue model.
Proc EuroSpeech 2003 (pp. 2261-2264). [pdf] (2003). Resynthesis of 3D tongue movements from facial data. In
7th Intl Seminar on Speech Production (pp. 49-54). Sydney. [pdf] (2003). The effect of corpus choice on statistical articulatory modeling. In
Proc of ICSLP 2002 (pp. 665-668). Denver, Colorado, USA. [pdf] (2002). Evaluation of a system for concatenative articulatory visual speech synthesis. In
Tongue Talking - Studies in Intraoral Speech Synthesis. Doctoral dissertation, KTH. [pdf] (2002).
Proc of 4th Intl Speech Motor Conf (pp. 23-26). Nijmegen. [pdf] (2001). Considerations in intraoral visual speech synthesis: Data and modelling. In
Proc of Eurospeech 2001 (pp. 261-264). Aalborg. [pdf] (2001). Making the tongue model talk: Merging MRI & EMA Measurements. In
Proc of 4th ISCA Tutorial and Research Workshop on Speech Synthesis (pp. 38-41). Perthshire. [pdf] (2001). Synthesising static vowels and dynamic sounds using a 3D vocal tract model. In
Proc of Eurospeech 2001 (pp. 1475-1478). Aalborg. [pdf] (2001). Using linguopalatal contact patterns to tune a 3D tongue model. In
Proc of ICSLP 2000, 6th Intl Conf on Spoken Language Processing (pp. 901-904). Beijing. [pdf] (2000). A 3D tongue model based on MRI data. In Yuan, B., Huang, T., & Tang, X. (Eds.),
Proc of ICSLP 2000, 6th Intl Conf on Spoken Language Processing (pp. 17-20). Beijing. [pdf] (2000). Are static MRI representative of dynamic speech? Results from a comparative study using MRI, EPG, and EMA. In Yuan, B., Huang, T., & Tang, X. (Eds.),
TMH-QPSR, 41(4), 049-073. [pdf] (2000). Dynamical aspects of coarticulation in Swedish fricatives - a combined EMA and EPG study.
TMH-QPSR, 41(2-3), 053-064. [pdf] (2000). Replicating three-dimensional toungue shapes synthetically.
Proc of 5th Speech Production Seminar: Models and data (pp. 297-300). Kloster Seeon, Germany. [pdf] (2000). An MRI study of Swedish fricatives: coarticulatory effects. In Hole, P. (Ed.),
Proc of Eurospeech 99 (pp. 113-116). Budapest. [pdf] (1999). Modeling of the vocal tract in three dimensions. In
TMH-QPSR, 40(1-2), 031-038. [pdf] (1999). Vocal tract modeling in 3D.
TMH-QPSR, 40(3-4), 011-038. [pdf] (1999). Collecting and analysing two- and three-dimensional MRI data for Swedish.
Proc of Fonetik -98, The Swedish Phonetics Conference (pp. 196-199). Stockholm University. (1998). A 3D vocal tract model for articulatory and visual speech synthesis. In
. Master's thesis, KTH, TMH, CTT.