KTH / TMH / Mattias Heldner's Personal Home Page / Publications

Mattias Heldner's publications

This is a personal webpage. More information.

In press

Heldner, M., & Edlund, J. (in press). Pauses, gaps and overlaps in conversations. Journal of Phonetics.

2009

Edlund, J., Heldner, M., & Pelcé, A. (2009). Prosodic features of very short utterances in dialogue. In M. Vainio, R. Aulanko & O. Aaltonen (Eds.), Nordic Prosody: Proceedings of the Xth Conference, Helsinki 2008 (pp. 57-68). Frankfurt am Main, Germany: Peter Lang.

Heldner, M., Edlund, J., Laskowski, K., & Pelcé, A. (2009). Prosodic features in the vicinity of silences and overlaps. In M. Vainio, R. Aulanko & O. Aaltonen (Eds.), Nordic Prosody: Proceedings of the Xth Conference, Helsinki 2008 (pp. 95-106). Frankfurt am Main, Germany: Peter Lang.

Laskowski, K., Heldner, M., & Edlund, J. (2009). A general-purpose 32 ms prosodic vector for hidden Markov modeling. In Proceedings Interspeech 2009 (pp. 724-727). Brighton, UK.

Laskowski, K., Heldner, M., & Edlund, J. (2009). Exploring the prosody of floor mechanisms in English using the fundamental frequency variation spectrum. In Proceedings of EUSIPCO 2009 (pp. 2539-2543). Glasgow, Scotland.

Edlund, J., Heldner, M., & Hirschberg, J. (2009). Pause and gap length in face-to-face interaction. In Proceedings of Interspeech 2009 (pp. 2779-2782). Brighton, UK.

Beskow, J., Carlson, R., Edlund, J., Granström, B., Heldner, M., Hjalmarsson, A., et al. (2009). Multimodal Interaction Control. In A. Waibel & R. Stiefelhagen (Eds.), Computers in the Human Interaction Loop (pp. 143-158). Berlin/Heidelberg: Springer. doi:10.1007/978-1-84882-054-8_14

2008

Laskowski, K., Wölfel, M., Heldner, M., & Edlund, J. (2008). Computing the fundamental frequency variation spectrum in conversational spoken dialogue systems. Journal of the Acoustical Society of America, 123(5), 3427. doi:10.1121/1.2934193

Laskowski, K., Wölfel, M., Heldner, M., & Edlund, J. (2008). Computing the fundamental frequency variation spectrum in conversational spoken dialogue systems. In Proceedings of the 155th Meeting of the Acoustical Society of America, 5th EAA Forum Acusticum, and 9th SFA Congrés Français d'Acoustique (Acoustics2008) (pp. 3305-3310). Paris, France. (slides)

Edlund, J., Gustafson, J., Heldner, M., & Hjalmarsson, A. (2008). Towards human-like spoken dialogue systems. Speech Communication, 50(8-9), 630-645. doi:10.1016/ j.specom.2008.04.002

Gustafson, J., Heldner, M., & Edlund, J. (2008). Potential benefits of human-like dialogue behaviour in the call routing domain. In Perception in Multimodal Dialogue Systems (pp. 240-251). Berlin/Heidelberg, Germany: Springer. doi:10.1007/978-3-540-69369-7_27

Laskowski, K., Edlund, J., & Heldner, M. (2008). The fundamental frequency variation spectrum. In Proceedings FONETIK 2008 (pp. 29-32). Gothenburg. (slides)

Laskowski, K., Edlund, J., & Heldner, M. (2008). Machine Learning of Prosodic Sequences Using the Fundamental Frequency Variation Spectrum. In Proceedings of the Speech Prosody 2008 Conference (pp. 151-154). Campinas, Brazil: Editora RG/CNPq. (poster)

Laskowski, K., Edlund, J., & Heldner, M. (2008). An instantaneous vector representation of delta pitch for speaker-change prediction in conversational dialogue systems. In Proceedings ICASSP 2008 (pp. 5041-5044). Las Vegas, Nevada, USA. (poster)

2007

Edlund, J., Heldner, M., & Gustafson, J. (2007). Two faces of spoken dialogue systems In M. F. McTear, K. Jokinen, J. Larson, R. López-Cózar & Z. Callejas (Eds.), Interspeech 2006 - ICSLP Satellite Workshop Dialogue on Dialogues: Multidisciplinary Evaluation of Advanced Speech-based Interactive Systems (pp. 51-54). Pittsburgh PA, USA.

Edlund, J., & Heldner, M. (2007). Underpinning /nailon/: automatic estimation of pitch range and speaker relative pitch. In C. Müller (Ed.), Speaker Classifcation II (Vol. LNAI 4441, pp. 229-242). Berlin, Germany: Springer-Verlag. doi:10.1007/978-3-540-74122-0_18

Heldner, M., & Edlund, J. (2007). What turns speech into conversation? A project description. In TMH-QPSR Vol. 50: Proceedings from Fonetik 2007 (pp. 45-48). Stockholm.

Edlund, J., Beskow, J., & Heldner, M. (2007). MushyPeek – an experiment framework for controlled investigation of human-human interaction control behaviour. In TMH-QPSR Vol. 50: Proceedings from Fonetik 2007 (pp. 65-68). Stockholm.

2006

Carlson, R., Edlund, J., Heldner, M., Hjalmarsson, A., House, D., & Skantze, G. (2006). Towards human-like behaviour in spoken dialog systems. In SLTC 2006, Swedish Language Technology Conference. Göteborg.

Edlund, J., & Heldner, M. (2006). /nailon/ – software for online analysis of prosody. In Proceedings of the 9th International Conference on Spoken Language Processing (Interspeech 2006) (pp. 2022-2025). Pittsburgh, PA, USA.

Edlund, J., & Heldner, M. (2006). /nailon/ – online analysis of prosody. In Working Papers 52: Proceedings of Fonetik 2006 (pp. 37-40). Lund, Sweden: Lund University, Centre for Languages & Literature, Dept. of Linguistics & Phonetics.

Heldner, M., & Edlund, J. (2006). Prosodic cues for interaction control in spoken dialogue systems. In Working Papers 52: Proceedings of Fonetik 2006 (pp. 53-56). Lund, Sweden: Lund University, Centre for Languages & Literature, Dept. of Linguistics & Phonetics.

Heldner, M., Edlund, J., & Carlson, R. (2006). Interruption impossible. In M. Horne & G. Bruce (Eds.), Nordic Prosody: Proceedings of the IXth Conference, Lund 2004 (pp. 97-105). Frankfurt am Main: Peter Lang.

Heldner, M., Edlund, J., Lamel, L, Devillers, L, & Rosset, S. (2006). Utilising online prosodic analyses for interaction control. Deliverable D6.11 of the Project CHIL (Computers in the Human Interaction Loop) IP 506909.

2005

Bell, L., Boye, J., Gustafson, J., Heldner, M., Lindström, A., & Wirén, M. (2005). The Swedish NICE Corpus – Spoken dialogues between children and embodied characters in a computer game scenario. In Proceedings of Interspeech 2005. Lisbon, Portugal.

Bell, L., Blasig, R., Boye, J., Buisine, S., Gustafson, J., Heldner, M., et al. (2005). The Fairy-tale world system. In deliverable D7.2-2 of the Project NICE (Natural Interactive Communication for Edutainment) IST-2001-35293, Evaluation of the Second NICE Prototype. Available from http://www.niceproject.com/deliverables/.

Edlund, J., & Heldner, M. (2005). Exploring Prosody in Interaction Control. Phonetica, 62(2-4), 215-226. doi:10.1159/000090099

Edlund, J., Heldner, M., & Gustafson, J. (2005). Utterance segmentation and turn-taking in spoken dialogue systems. In B. Fisseni, H.-C. Schmitz, B. Schröder & P. Wagner (Eds.), Sprachtechnologie, mobile Kommunikation und linguistische Ressourcen (pp. 576-587). Frankfurt am Main, Germany: Peter Lang.

Heldner, M., & Edlund, J. (2005). Implementation and evaluation of algorithms for online prosodic analyses for interaction control. Deliverable D6.7 of the Project CHIL (Computers in the Human Interaction Loop) IP 506909.

2004

Heldner, M., Edlund, J., & Björkenstam, T. (2004). Automatically extracted F0 features as acoustic correlates of prosodic boundaries. In Proceedings of Fonetik 2004 (pp. 52-55). Stockholm: Department of Linguistics, Stockholm University.

Heldner, M., Stiefelhagen, R., Devillers, L., Burger, S., Edlund, J., M., Lamel, Laskowski, K., Pardas, Sjölander, K., L., & Vidrascu, L. (2004). Initial algorithms for a) probabilistic classification of different activities, b) emotion recognition, c) topical segmentation. Deliverable D6.3 of the Project CHIL (Computers in the Human Interaction Loop) IP 506909.

Sjölander, K., & Heldner, M. (2004). Word level precision of the NALIGN automatic segmentation algorithm. In Proceedings of Fonetik 2004 (pp. 116-119). Stockholm: Department of Linguistics, Stockholm University.

2003

Bell, L., Gustafson, J., & Heldner, M. (2003). Prosodic adaptation in human-computer interaction. In Proceedings ICPhS 2003 (pp. 2453-2456). Barcelona.

Heldner, M. (2003). On the reliability of overall intensity and spectral emphasis as acoustic correlates of focal accents in Swedish. Journal of Phonetics, 31(1), 39 - 62. doi:10.1016/S0095-4470(02)00071-2

Heldner, M., & Megyesi, B. (2003). The acoustic and morpho-syntactic context of prosodic boundaries in dialogs. In M. Heldner (Ed.), Proceedings from Fonetik 2003 (pp. 117-120). Umeå: Dept. Philosophy and Linguistics, Umeå University.

Heldner, M., & Megyesi, B. (2003). Exploring the prosody-syntax interface in conversations. In Proceedings ICPhS 2003 (pp. 2501-2504). Barcelona.

2002

Carlson, R., Granström, B., Heldner, M., House, D., Megyesi, B., Strangert, E., et al. (2002). Boundaries and groupings - the structuring of speech in different communicative situations: a description of the GROG project. THM-QPSR, 44, 65-68.

2001

Heldner, M. (2001). Focal accent – f0 movements and beyond. PhD Thesis, Umeå University, Umeå.

Heldner, M. (2001). On the non-linear lengthening of focally accented Swedish words. In W. van Dommelen & T. Fretheim (Eds.), Nordic Prosody: Proceedings of the VIIIth Conference, Trondheim 2000 (pp. 103-112). Frankfurt am Main: Peter Lang.

Heldner, M. (2001). Spectral emphasis as a perceptual cue to prominence. TMH-QPSR, 2/2001, 51-57.

Heldner, M. (2001). Spectral emphasis as an additional source of information in accent detection. In M. Bacchiani, J. Hirschberg, D. Litman & M. Ostendorf (Eds.), Prosody 2001: ISCA Tutorial and Research Workshop on Prosody in Speech Recognition and Understanding (pp. 57-60). Red Bank, NJ: ISCA.

Heldner, M., & Strangert, E. (2001). Temporal effects of focus in Swedish. Journal of Phonetics, 29(3), 329-361. doi:10.1006/jpho.2001.0143

2000

Heldner, M. (2000). Is non-linear lengthening important for the perceived naturalness of focal accented Swedish words? In A. Botinis & N. Torstensson (Eds.), Proceedings Fonetik 2000 (pp. 69-72). Skövde: Department of Languages, University of Skövde.

1999

Heldner, M., Strangert, E., & Deschamps, T. (1999). Focus detection using overall intensity and high frequency emphasis. In Proceedings Fonetik 99 (pp. 73-76). Göteborg: Department of Linguistics, Göteborg University.

Heldner, M., Strangert, E., & Deschamps, T. (1999). A focus detector using overall intensity and high frequency emphasis. In Proceedings ICPhS'99 (Vol. 2, pp. 1491-1493). San Francisco: University of California, Berkeley.

1998

Heldner, M. (1998). Is an F0-rise a necessary or a sufficient cue to perceived focus in Swedish? In S. Werner (Ed.), Nordic Prosody: Proceedings of the VIIth Conference, Joensuu 1996 (pp. 109-125). Frankfurt am Main: Peter Lang.

Heldner, M., & Strangert, E. (1998). On the amount and domain of focal lengthening in Swedish two-syllable words. In Proceedings of FONETIK 98, the eleventh Swedish Phonetics Conference (pp. 154-157). Stockholm.

Strangert, E., & Heldner, M. (1998). On the amount and domain of focal lengthening in Swedish. In ICSLP'98 Proceedings (Vol. 7, pp. 3305-3308). Sydney.

Strangert, E., & Heldner, M. (1998). On the amount and domain of focal lengthening in two-syllable and longer Swedish words. In Proceedings of FONETIK 98, the eleventh Swedish Phonetics Conference (pp. 134-137). Stockholm.

1997

Heldner, M. (1997). The contribution of pitch movements to perceived focus. In PHONUM 4: Fonetik 97 (pp. 109-112). Umeå, Sweden.

Heldner, M. (1997). To what extent is perceived focus determined by F0-cues? In Eurospeech '97 Proceedings (Vol. 2, pp. 875-877). Rhodes, Greece: ESCA.

1996

Heldner, M. (1996). Phonetic correlates of focus accents in Swedish. In Fonetik 96: Papers presented at the Swedish Phonetics Conference (pp. 33-36). Nässlingen, Stockholm.

Swerts, M., Strangert, E., & Heldner, M. (1996). F0 declination in spontaneous and read-aloud speech. In Proceedings ICSLP 96 (Vol. 3, pp. 1501-1504). Philadelphia, USA.

1995

Horne, M., Strangert, E., & Heldner, M. (1995). Prosodic boundary strength in Swedish: Final lengthening and silent interval duration. In Proceedings ICPhS 95 (Vol. 1, pp. 170-173). Stockholm, Sweden.

Strangert, E., & Heldner, M. (1995). Labelling of boundaries and prominences by phonetically experienced and non-experienced transcribers. In PHONUM 3 (pp. 85-109). Umeå: Department of Phonetics, Umeå University.

Strangert, E., & Heldner, M. (1995). The labelling of prominence in Swedish by phonetically experienced transcribers. In Proceedings ICPhS 95 (Vol. 4, pp. 204-207). Stockholm.

1994

Strangert, E., & Heldner, M. (1994). Prosodic labelling and acoustic data. In Working Papers 43: Papers from the Eighth Swedish Phonetics Conference (pp. 120-123). Lund.

Other publications

Heldner, M. (1993). Mustasch på Mona Lisa: Några reflexioner om patafysiken. In I. Söhrman (Ed.), La culture dans la langue (pp. 71-87). Stockholm, Sweden: Almquist & Wiksell.

 







Sidansvarig: Mattias Heldner, KTH Tal, musik och hörsel
Uppdaterad: 2008-04-01