The rhythm of conversation

The Rhythm of conversation project investigates how a set of rhythmic prosodic features contributes to joint interaction control in conversations.

Of particular interest are acoustic descriptions of features related to variations in speech rate (i.e. accelerations and decelerations) and loudness (i.e. increases and decreases), and how these features are used for interactional purposes.

Group: Speech Communication and Technology

Mattias Heldner (Project leader)
Jens Edlund
Kornel Laskowski

Funding: VR (2009-1766)

Duration: 2010 - 2012


Related publications:


Edlund, J., Alexandersson, S., Beskow, J., Gustavsson, L., Heldner, M., Hjalmarsson, A., Kallionen, P., & Marklund, E. (2012). 3rd party observer gaze as a continuous measure of dialogue flow. In Proc. of LREC 2012. Istanbul, Turkey. [abstract] [pdf]

Oertel, C., Wlodarczak, M., Edlund, J., Wagner, P., & Gustafson, J. (2012). Gaze Patterns in Turn-Taking. In Proc. of Interspeech 2012. Portland, Oregon, US. [abstract] [pdf]

Oertel, C., Cummins, F., Edlund, J., Wagner, P., & Campbell, N. (2012). D64: a corpus of richly recorded conversational interaction. Journal of Multimodal User Interfaces. [abstract]

Renklint, E., Cardell, F., Dahlbäck, J., Edlund, J., & Heldner, M. (2012). Conversational gaze in light and darkness. In Proc. of Fonetik 2012 (pp. 59-60). Gothenburg, Sweden. [abstract] [pdf]


Edlund, J. (2011). In search of the conversational homunculus - serving to understand spoken human face-to-face interaction. Doctoral dissertation, KTH. [abstract] [pdf]

Heldner, M. (2011). Detection thresholds for gaps, overlaps and no-gap-no-overlaps. Journal of the Acoustical Society of America, 130(1), 508-513. [abstract] [pdf]

Heldner, M., Edlund, J., Hjalmarsson, A., & Laskowski, K. (2011). Very short utterances and timing in turn-taking. In Proceedings of Interspeech 2011 (pp. 2837-2840). Florence, Italy. [abstract] [pdf]

Landsiedel, C., Edlund, J., Eyben, F., Neiberg, D., & Schuller, B. (2011). Syllabification of conversational speech using bidirectional long-short-term memory neural networks. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on (pp. 5256-5259). Prague, Czech Republic. [abstract] [pdf]

Laskowski, K. (2011). Predicting, detecting and explaining the occurrence of vocal activity in multi-party conversation. Doctoral dissertation, Carnegie Mellon University. [abstract] [pdf]


Beskow, J., Edlund, J., Gustafson, J., Heldner, M., Hjalmarsson, A., & House, D. (2010). Modelling humanlike conversational behaviour. In Proceedings of SLTC 2010. Linköping, Sweden. [pdf]

Beskow, J., Edlund, J., Gustafson, J., Heldner, M., Hjalmarsson, A., & House, D. (2010). Research focus: Interactional aspects of spoken face-to-face communication. In Proc. of Fonetik 2010 (pp. 7-10). Lund, Sweden. [abstract] [pdf]

Edlund, J., Heldner, M., Al Moubayed, S., Gravano, A., & Hirschberg, J. (2010). Very short utterances in conversation. In Proc. of Fonetik 2010 (pp. 11-16). Lund, Sweden. [abstract] [pdf]

Heldner, M., & Edlund, J. (2010). Pauses, gaps and overlaps in conversations. Journal of Phonetics, 38, 555-568. [abstract] [pdf]

Laskowski, K. (2010). Modeling norms of turn-taking in multi-party conversation. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL2010) (pp. 999-1008). Uppsala, Sweden. [pdf]

