Papers of Joakim Gustafson

(2024)

Lameris, H., Székely, É. & Gustafson, J. (2023) "The Role of Creaky Voice in Turn Taking and the Perception of Speaker Stance: Experiments Using Controllable TTS", The 2024 joint conferences on Computational Linguistics and Language Resources and Evaluation ,Turine, Italy (audio samples)

Tånnander, C., Edlund, J. Gustafson, J. (2023) "Revisiting Three Text-to-Speech Synthesis Experiments with a Web-Based Audience Response System" The 2024 joint conferences on Computational Linguistics and Language Resources and Evaluation ,Turine, Italy

(2023)

Ekstedt, E., Wang, S., Székely, É., Gustafson, J. and Skantze, G. (2023) "Automatic Evaluation of Turn-taking Cues in Conversational Speech Synthesis", Proceedings of Interspeech 2023, Dublin, Ireland

Gustafson, J., Székely, É and Beskow, J. (2023) "Generation of speech and facial animation with controllable articulatory effort for amusing conversational characters" Proceedings of 23rd ACM International Conference on Interlligent Virtual Agent (IVA 2023), W�rzburg, Germany pdf (audio and video samples)

Kirkland, A., Wlodarczak, M., Gustafson, J. and Székely, É (2023) "Evaluating the impact of disfluencies on the perception of speaker competence using neural speech synthesis", Proceedings of ICPhS 2023, Prague, Czech Republic.

Kirkland, A., Gustafson, J. and Székely, É (2023) "Pardon my disfluency: The impact of disfluency effects on the perception of speaker competence and confidence", Proceedings of Interspeech 2023, Dublin, Ireland

Kirkland, A., Mehta, S., Lameris, H., Henter, G., Sz�kely, �. and Gustafson, J. (2023) "Stuck in the MOS pit: A critical analysis of MOS test methodology in TTS evaluation", Proceeding of the 12th ISCA Speech Synthesis Workshop (SSW), Grenoble, France

Lameris, H ., Mehta, S., Henter, G., Gustafson, J. and Székely, É (2023) "Prosody-controllable spontaneous TTS with neural HMMs", Proceedings of the ICASSP 2023, Rhodes, Greece. (audio samples)

Lameris, H ., Wlodarczak, M., Gustafson, J. and Székely, É "Neural speech synthesis with controllable creaky voice style", Proceedings of ICPhS 2023, Prague, Czech Republic.

Lameris, H., Gustafson, J. and Székely, É (2023) "Beyond Style: Synthesizing Speech with Pragmatic Functions", Proceedings of Interspeech 2023, Dublin, Ireland audio samples

Lameris, H., Kirkland, A., Gustafson, J. and Székely, É (2023) "Situating Speech Synthesis: Investigating Contextual Factors in the Evaluation of Conversational TTS", Proceeding of the 12th ISCA Speech Synthesis Workshop (SSW), Grenoble, France

Miniotaite, J., Wang, S., Beskow, J., Gustafson, J., Székely, É. and Andre Pereira (2023) "Hey robot, it�s not what you say, it�s how you say it", Porceedings of IEEE RO-MAN 2023, Busan, Korea

Székely, É., Gustafson, J.and Torre, I. (2023) "Prosody-controllable gender-ambiguous speech synthesis: a tool for investigating implicit bias in speech perception", Proceedings of Interspeech 2023, Dublin, Ireland audio samples

Székely, É., Wang, S. and Gustafson, J. (2023) "So-to-Speak: an exploratory platform for investigating the interplay between style and prosody in TTS", Show and Tell Interspeech 2023, Dublin, Ireland

Wang, S., Henter, G., Gustafson, J. and Sz�kely, �. (2023) "A comparative study of self-supervised speech representations in read and spontaneous TTS", Proceedings of CASSP 2023 Satellite Workshop: SASB 2023: Self-Supervision in Audio, Speech and Beyond (audio samples)

Wang, S., Henter, G., Gustafson, J. and Sz�kely, �. (2023) "On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis", Proceeding of the 12th ISCA Speech Synthesis Workshop (SSW), Grenoble, France (audio samples)

(2022)

Kirkland, A., Lameris, H., Gustafson, J. and Székely, É (2022) "Where's the uh, hesitation? The interplay between filled pause location, speech rate and fundamental frequency", Proceedings of the 23rd Annual Conference of the International Speech Communication Association, Interspeech 2022, Incheon, Korea. (Nominated for best paper award) (pdf, audio samples)

Wang, S., Gustafson, J. and Székely, É (2022) "Evaluating Sampling-based Filler Insertion with Spontaneous TTS", Proceedings of 13th Edition of the Language Resources and Evaluation Conference (LREC 2022), Marseille. (pdf , audio samples)

Moell, B., O'Regan, J., Mehta, S., Kirkland, A., Lameris, H., Gustafson, J. and Beskow, J. (2022) "Speech Data Augmentation for Improving Phoneme Transcriptions of Aphasic Speech using wav2vec 2.0 for the PSST Challenge", Proceedings of LREC 2022 workshop RaPID-4: Resources and ProcessIng of linguistic, para-linguistic and extra-linguistic Data from people with various forms of cognitive / psychiatric / developmental impairments, Marseille

(2021)

Gustafson, J., Beskow, J. and Székely, É (2021) "Personality in the mix - investigating the contribution of fillers and speakingstyle to the perception of spontaneous speech synthesis", Proceedings of the 11th Speech Synthesis Workshop (SSW11), Budapest, 2021. (pdf, audio samples)

Jonell, P., Moell, B., H�kansson, K., Henter, G., Kuchurenko, T., Mikheeva, O., Hagman, G., Holleman, J., Kivipelto, M., Kjellstr�m, H., Gustafson, J. and Beskow, J. (2021) "Multimodal capture of patient behaviour for improved detection of early dementia: clinical feasibility and preliminary results", Frontiers in Computer Science, section Human-Media Interaction. (pdf)

Kirkland, A., Wlodarczak, M., Gustafson, J.and Székely, É (2021) "Perception of smiling voice in spontaneous speech synthesis", Proceedings of the 11th Speech Synthesis Workshop (SSW11), Budapest, 2021. (pdf, audio samples)

Kontogiorgos, D., Pereira, A. and Gustafson, J. (2021) "Grounding Behaviours with Conversational Interfaces: Effects of Embodiment and Failures", Journal on Multimodal User Interfaces (pdf)

Kontogiorgos, D. and Gustafson, J. (2021) "Measuring Collaboration Load with Pupillary Responses - Implications for the Design of Instructions in Task-Oriented HRI", Frontiers in Psychology, section Cognitive Science (pdf)

Kontogiorgos, D., Tran, M., Gustafson, J. and Soleymani, M. (2021) "A Systematic Cross-Corpus Analysis of Human Reactions to Robot Conversational Failures", In proceedings of 23rd ACM International Conference on Multimodal Interaction (ICMI 2021), Montreal, Canada. October 18-22nd, 2021. (Nominated for best paper award)

Oertel, C., Jonell, P., Kontogiorgos, D.,Funes, K., Odobez, J-M. and Gustafson, J. (2021) "Towards an Engagement-Aware Attentive Artificial Listener for Multi-Party Interactions", Frontiers in Robotics and AI, section Human-Robot Interaction (pdf)

Wang, S., Alexanderson, A., Gustafson, J., Beskow, J., Henter, G. and Székely, É (2021) "Integrated Speech and Gesture Synthesis", In proceedings of 23rd ACM International Conference on Multimodal Interaction (ICMI 2021), Montreal, Canada. October 18-22nd, 2021 (pdf, video samples))

(2020)

Kontogiorgos, D., Pereira, A., Sahindal, B., van Waveren, S. and Gustafson, J. (2020) "Behavioural Responses to Robot Conversational Failures" Proceedings of the 2020 ACM/IEEE International Conference on Human-Robot Interaction, HRI'20, Cambridge, UK, 2020. (pdf)

Kontogiorgos, D., van Waveren, S., Wallberg, O.,Pereira, A., Leite, I. and Gustafson, J. (2020) "Embodiment Effects in Interactions with Failing Robots" SIGCHI Conference on Human Factors in Computing Systems, CHI �20, April 25�30, 2020, Honolulu, HI, USA, 2020. (pdf)

Kontogiorgos, D., Sibirtseva, E. and Gustafson, J. (2020) "Chinese Whispers: A Multimodal Dataset for Embodied Language Grounding", In Proceedings of the 12th Edition of its Language Resources and Evaluation Conference (LREC 2020), Marsielle, FR. (pdf)

Pereira, A., Oertel, C., Fermoselle, L., Mendelson, J. and Gustafson, J. (2020) "Effects of Different Interaction Contexts when Evaluating Gaze Models in HRI" In proceedings of The 15th Annual ACM/IEEE International Conference on Human Robot Interaction (HRI 2020),Cambridge, UK. (pdf)

Székely, É, Henter, G., Beskow, J. and Gustafson, J. (2020) "Breathing and speech planning in spontaneous speech synthesis In proceedings of the 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), Barcelona, Spain. (pdf, audio samples)

Székely, É, Edlund, J., and Gustafson, J. (2020) "Augmented Prompt Selection for Evaluation of Spontaneous Speech Synthesis", In Proceedings of the 12th Edition of its Language Resources and Evaluation Conference (LREC 2020), Marsielle, FR. (pdf, audio samples)

(2019)

Kontogiorgos, D., Pereira, A. and Gustafson, J. (2019) "The Trade-off between Interaction Time and Social Facilitation with Collaborative Social Robots", In proceedings of the CHI workshop The Challenges of Working on Social Robots that Collaborate with People, May 4, Glasgow, UK.

Kontogiorgos, D., Pereira, A., Andersson, O., Koivisto, M., Gonzalez Rabal, E., Vartiainen, V. and Gustafson, J. (2019) �The Effects of Anthropomorphism and Non-verbal Social Behaviour in Virtual Assistants�, In Proceeings of ACM International Conference on Intelligent Virtual Agents, (IVA 2019) July 2-5, Paris, France. (pdf)

Kontogiorgos, D., Skantze, G. and Gustafson, J. (2019) "The Effects of Embodiment and Social Eye-Gaze in Conversational Agents", In Proceediongs of the 41st Annual Meeting of the Cognitive Science Society (COGSCI 2019), July 24-27, Montreal, Canada

Kontogiorgos, D., Pereira, A. and Gustafson, J. (2019) "Estimating Uncertainty in Task-Oriented Dialogue", In Proceedings of the 21st ACM International Conference on Multimodal Interaction, October 14-18, 2019 Suzhou, Jiangsu, China. (pdf)

Malisz, Z., Henter, G., Valentini-Botinhao, C., Watts, O., Beskow, J. and Gustafson, J. (2019) "Modern speech synthesis for phonetic sciences: a discussion and an evaluation", In proceedings of the 19th International Congress of Phonetic ICPhS 2019, August 5-9, Melbourne Australia. (pdf)

Malisz, Z., Berthelsen, H., Beskow, J. and Gustafson, J. (2019) "PROMIS: a statistical-parametric speech synthesis system with prominence control via a prominence network", In Proceedings of the 10th ISCA Speech Synthesis Workshop, September 20-22, Vienna, Austria (pdf)

Pereira, A., Oertel, C., Fermoselle, L., Mendelson, J. and Gustafson, J. (2019) "Responsive Joint Attention in Human-Robot Interaction", In proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019), November 3-8, Macau, China. (pdf)

Székely, É, Henter, G. and Gustafson, J. (2019) "Casting to corpus: segmenting and selecting spontaneous dialogue for tts with a cnn-lstm speaker-dependent breath detector", In proceedings of the 44th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2019), May 12-17, Brighton, UK. (pdf)

Székely, É, Eje Henter, G., Beskow, J. and Gustafson, J. (2019) "Spontaneous Conversational Speech Synthesis from Found Data", In Proceedings of the 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019), September 15-19, Graz, Austria. (pdf, audio samples)

Székely, É, Eje Henter, G., Beskow, J. and Gustafson, J. (2019) "Off the cuff: Exploring extemporaneous speech delivery with TTS", best demo award at Show & Tell, In Proceedings of the 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019), September 15-19, Graz, Austria.(video)

Székely, É, Eje Henter, G., Beskow, J. and Gustafson, J. (2019) "How to train your fillers: uh and um in spontaneous speech synthesis", In Proceedings of the 10th ISCA Speech Synthesis Workshop, September 20-22, Vienna, Austria (pdf, audio samples)

Skantze, G., Gustafson, J. and Beskow, J. (2019) "Multimodal Conversational Interaction with Robots", in The Handbook of Multimodal-Multisensor Interfaces, Volume 3 : Language Processing, Software, Commercialization, and Emerging Directions, Sharon Oviatt, Bj�rn Schuller, Philip R. Cohen, Daniel Sonntag, Gerasimos Potamianos, Antonio Kr�ger red., : ACM Press, 2019.

T�nnander, C., Fallgren, P., Edlund, J. and Gustafson, J. (2019) "Spot the pleasant people! Navigating the cocktail party buzz", In Proceedings of the 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019), September 15-19, Graz, Austria.

Wagner, P., Beskow, J., Betz, S., Edlund, J., Gustafson, J., Eje Henter, G., Le Maguer, S., Malisz, Z., Székely, É, T�nnander, C. and Vo�e, J. (2019) "Speech Synthesis Evaluation � State-of-the-Art Assessment and Suggestion for a Novel Research Program", In Proceedings of the 10th ISCA Speech Synthesis Workshop, September 20-22, Vienna, Austria pdf)

(2018)

Kragic, D., Gustafson, J., Karaoguz, H., Jensfelt, P., and Krug, R. (2018) �Interactive, collaborative robots: Challenges and opportunities�, in Proceeding of International Joint Conferences on Artificial Intelligence Organization (IJCAI 2018), pp. 18�25, Stockholm, Sweden. pdf)

Sibirtseva, E., Kontogiorgos, D., Nykvist, O., Karaoguz, H., Leite, I., Gustafson, J. and Kragic, D. (2018) �A comparison of visualisation methods for disambiguating verbal requests in human-robot interaction�, in Proc. IEEE Int. Conf. on Robot and Human Interactive Communication (RO-MAN 2018), Nanjing, China pdf)

Kontogiorgos, D., Sibirtseva, E., Pereira, A., Skantze, G. and Gustafson, J. (2018) �Multimodal reference resolution in collaborative assembly tasks�, in ICMI 2018 Satellite workshop Multimodal Analyses enabling Artificial Agents in Human-Machine Interaction, Boulder, USA

Jonell, P., Oertel, C., Kontogiorgos, D., Beskow, J. and Gustafson, J. (2018) "Crowdsourced Multimodal Corpora Collection Tool", In Proceedings of the 11th Edition of the Language Resources and Evaluation Conference (LREC-2018), Miyazaki, Japan.

Kontogiorgos, D., Avramova, V., Alexandersson, S., Jonell, P., Oertel, C., Beskow, J., Skantze, G. and Gustafson, J. (2018) "A Multimodal Corpus for Mutual Gaze and Joint Attention in Multiparty Situated Interaction", In Proceedings of the 11th Edition of the Language Resources and Evaluation Conference (LREC-2018), Miyazaki, Japan. pdf)

Székely, É, Wagner, P. and Gustafson, J. (2018) "The wrylie-board: mapping acoustic space of expressive feedback to attitude markers", Demo paper at IEEE Workshop on Spoken LAnguage Technology (SLT 2018). video

(2017)

Gustafson, J., Edlund, J., Beskow, J., Hedelind, M., Kragic, D., Ljunggren, P., Loutfi, A., Smith, C., Stany, P. and �stlund, B (2017) "Social robotics -a strategice innovation agenda", Vinnova, (pdf)

Jonell, P., Oertel, C., Kontogiorgos, D., Beskow, J. and Gustafson, J., (2017) "Crowd-Powered Design of Virtual Attentive Listeners", In Intelligent Virtual Agents IVA2017, (pdf)

Oertel, C., Jonell, P., Kontogiorgos, D., Mendelson, J., Beskow, J., & Gustafson, J. (2017) "Crowd-sourced design of artificial attentive listeners", In INTERSPEECH: Situated Interaction, Augusti 20-24 Augusti, 2017. (pdf)

Malisz, Z., Berthelsen, H., Beskow, J., & Gustafson, J. (2017) "Controlling prominence realisation in parametric DNN-based speech synthesis", In proccedings of Interspeech 2017, 1079-1083. (pdf)

Székely, É, Mendelson, J., & Gustafson, J. (2017) "Synthesising uncertainty: the interplay of vocal effort and hesitation disfluencies" In proceedings of Interspeech 2017, 804-808.(pdf)

Jonell, P., Mendelson, J., Storskog, T., Hagman, G., Ostberg, P., Leite, I., Kucherenko, T., Mikheeva, O., Akenine, U., Jelic, V., Solomon, A., Beskow, J., Gustafson, J., Kivipelto, M. and Kjellstr�m, H. 2017) "Machine Learning and Social Robotics for Detecting Early Signs of Dementia", arXiv preprint arXiv:1709.01613. (pdf)

(2016)

Andersson, J., Berlin, S., Costa, A., Berthelsen, H., Lindgren, H., Lindberg, N., Beskow, J., Edlund, J., & Gustafson, J. (2016) "WikiSpeech � enabling open source text-to-speech for Wikipedia", In Proceedings of 9th ISCA Speech Synthesis Workshop (pp. 111-117). Sunnyvale, USA. (pdf)

Edlund, J. and Gustafson, J., (2016) "Hidden Resources�Strategies to Acquire and Exploit Potential Spoken Language Resources in National Archives", In Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. (s. 4531-4534). European Language Resources Association (ELRA).(pdf)

Johansson, M., Hori, T., Skantze, G., H�thker, A., & Gustafson, J. (2016) "Making Turn-taking Decisions for an Active Listening Robot for Memory Training", Best Paper Award, In Proceedings of International Conference on Social Robotics (ICSR). Kansas City, MO. (pdf)

Oertel, C., Gustafson, J., & Black, A. (2016) "On Data Driven Parametric Backchannel Synthesis for Expressing Attentiveness in Conversational Agents", In Proceedings of Multimodal Analyses enabling Artificial Agents in Human�-Machine Interaction (MA3HMI), satellite workshop of ICMI 2016. (pdf)

Oertel, C., Gustafson, J., & Black, A. (2016) "Towards Building an Attentive Artificial Listener: On the Perception of Attentiveness in Feedback Utterances", In Proceedings of Interspeech 2016. San Fransisco, USA. (pdf)

Oertel, C., Lopes, J., Yu, Y., Funes, K., Gustafson, J., Black, A., & Odobez, J-M. (2016) "Towards Building an Attentive Artificial Listener: On the Perception of Attentiveness in Audio-Visual Feedback Tokens", In Proceedings of the 18th ACM International Conference on Multimodal Interaction (ICMI 2016). Tokyo, Japan. (pdf)

Potamianos, A., Tzafestas, C., Iosif, E., Kirstein, F., Maragos, P., Dauthenhahn, K., Gustafson, J., �stergaard, J., Kopp, S., Wik, P., Pietquin, O., & Al Moubayed, S. (2016) "BabyRobot - Next Generation Social Robots: Enhancing Communication and Collaboration Development of TD and ASD Children by Developing and Commercially Exploiting the Next Generation of Human-Robot Interaction Technologies", In Proceedings of 2nd Workshop on Evaluating Child-Robot Interaction (CRI) at Human-Robot Interaction (HRI'16). Christchurch, New Zealand. (pdf)

(2015)

Oertel, C., Funes, K., Gustafson, J. and Odobez, J-M (2015) "Deciphering the Silent Participant - On the Use of Audio-Visual Cues for the Classification of Listener Categories", in Group Discussions", In proccedings of ICMI 2015, Seattle, US (pdf)

Edlund, J., T�nnander, C. and Gustafson, J. (2015) "Audience response system-based assessment for analysis-by-synthesis", In proceedings of ICPhS 2015, Glasgov, UK (pdf)

Lopes, J., Salvi, G., Skantze, G., Abad, A., Gustafson, J., Batista, F., Meena, R., and Trancoso, I. (2015) "Detecting Repetitions in Spoken Dialogue Systems Using Phonetic Distances", In Proceedings of Interspeech. Dresden, Germany. (pdf)

Meena, R., Lopes, J., Skantze, G., and Gustafson, J. (2015). "Automatic Detection of Miscommunications in Spoken Dialogue Systems". In proceedings of the 14th Annual Meeting of the Special Interest Group on Discourse and Dialogue - SIGdial. Prague, Czech. (pdf)

(2014)

Al Moubayed, S., Beskow, J., Bollepalli, B., Gustafson, J., Hussen-Abdelaziz, A., Johansson, M., Koutsombogera, M., Lopes, J., Novikova, J., Oertel, C., Skantze, G., Stefanov, K., & Varol, G. (2014) "Human-robot collaborative tutoring using multiparty multimodal spoken dialogue" in Proceedings of HRI'14. Bielefeld, Germany.

Bollepalli, B., Urbain, J., Raitio, T., Gustafson, J., and Cakmak, H. (2014) "A Comparative Evaluation of Vocoding Techniques for HMM-based Laughter Synthesis". in Proceedings of ICASSP 2014. (pdf)

Boye, J., Fredriksson, M., G�tze, J., Gustafson, J., and K�nigsmann, J. (2014) "Walk this way: Spatial grounding for city exploration". In Natural interaction with robots, knowbots and smartphones (pp. 59-67). Springer-Verlag. (pdf)

Edlund, J., Edelstam, F., & Gustafson, J. (2014) "Human pause and resume behaviours for unobtrusive humanlike in-car spoken dialogue systems". In proc. of the EACL Satellite Workshop Dialogue In Motion (DIM-2014). Gothenburg, Sweden. (pdf)

Johansson, M., Skantze, G., & Gustafson, J. ( 2014) "Comparison of human-human and human-robot Turn-taking Behaviour in multi-party Situated interaction". in International Workshop on Understanding and Modeling Multiparty, Multimodal Interactions, at ICMI 2014. Istanbul, Turkey. (pdf)

Meena, R., Boye, J., Skantze, G., & Gustafson, J. (2014) "Crowdsourcing Street-level Geographic Information Using a Spoken Dialogue System". In proceedoings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue - SIGdial. Philadelphia, PA, US. (pdf)

Meena, R., Skantze, G., & Gustafson, J. (2014) "Data-driven Models for Timing Feedback Responses in a Map Task Dialogue System". Computer Speech and Language. (pdf)

Oertel, C., Funes, K., Sheiki, S., Odobez, J-M., and Gustafson, J. 2014) "Who Will Get the Grant ? A Multimodal Corpus for the Analysis of Conversational Behaviours in Group Interviews". In procceings of International Workshop on Understanding and Modeling Multiparty, Multimodal Interactions, at ICMI 2014. Istanbul, Turkey. (pdf)

(2013)

Al Moubayed, S., Edlund, J., & Gustafson, J. (2013) "Analysis of gaze and speech patterns in three-party quiz game interaction". In Interspeech 2013. Lyon, France. (pdf)

Bollepalli, B., Beskow, J., & Gustafson, J. (2013) "Non-Linear Pitch Modification in Voice Conversion using Artificial Neural Networks". In ISCA Workshop on Non-Linear Speech Processing 2013. (pdf)

Edlund, J., Al Moubayed, S., T�nnander, C., & Gustafson, J. (2013) "Audience response system based annotation of speech". In Fonetik 2013. Link�ping, Sweden.

Edlund, J., Al Moubayed, S., T�nnander, C. and Gustafson, J., (2013) "Temporal precision and reliability of audience response system based annotation". In Multimodal Corpora (MMC2013), Beyond Audio and Video, IVA Workshop III; Edinburgh, UK, 29-31 August, 2013. (pdf)

Johansson, M., Skantze, G., & Gustafson, J. (2013) "Head Pose Patterns in Multiparty Human-Robot Team-Building Interactions". In proceedings of the International Conference on Social Robotics - ICSR 2013. Bristol, UK. (pdf)

Meena, R., Skantze, G., & Gustafson, J. (2013) "A Data-driven Model for Timing Feedback in a Map Task Dialogue System". In 14th Annual Meeting of the Special Interest Group on Discourse and Dialogue - SIGDial. Metz, France. (pdf)

Meena, R., Skantze, G., & Gustafson, J. (2013) "Human Evaluation of Conceptual Route Graphs for Interpreting Spoken Route Descriptions". In Proceedings of the 3rd International Workshop on Computational Models of Spatial Language Interpretation and Generation (pp. 30-35). Potsdam. (pdf)

Meena, R., Skantze, G., & Gustafson, J. (2013) "The Map Task Dialogue System: A Test-bed for Modelling Human-Like Dialogue". In 14th Annual Meeting of the Special Interest Group on Discourse and Dialogue - SIGdial (pp. 366-368). Metz, France. (pdf)

Mirnig, N., Weiss, A., Skantze, G., Al Moubayed, S., Gustafson, J., Beskow, J., Granstr�m, B., & Tscheligi, M. (2013) "Face-to-Face with a Robot: What do we actually talk about?". International Journal of Humanoid Robotics, 10(1). (pdf)

Neiberg, D., Salvi, G., & Gustafson, J. (2013) "Semi-supervised methods for exploring the acoustics of simple productive feedback". Speech Communication, 55(3), 451-469. (pdf)

Oertel, C., Salvi, G., G�tze, J., Edlund, J., Gustafson, J., & Heldner, M. (2013) "The KTH Games Corpora: How to Catch a Werewolf". In proceedings of IVA 2013 Workshop Multimodal Corpora: Beyond Audio and Video - MMC 2013 (pdf)

(2012)

Al Moubayed, S., Beskow, J., Granstr�m, B., Gustafson, J., Mirning, N., Skantze, G., & Tscheligi, M. (2012) "Furhat goes to Robotville: a large-scale multiparty human-robot interaction data collection in a public space", in Proceedings of LREC Workshop on Multimodal Corpora. Istanbul, Turkey. (pdf)

Al Moubayed, S., Skantze, G., Beskow, J., Stefanov, K., & Gustafson, J. (2012) "Multimodal Multiparty Social Interaction with the Furhat Head", outstanding demo award, in Proceedings of the 14th ACM International Conference on Multimodal Interaction ICMI. Santa Monica, CA, USA.

Blomberg, M., Skantze, G., Al Moubayed, S., Gustafson, J., Beskow J. and Granstr�m, B. (2012) "Children and adults in dialogue with the robot head Furhat - corpus collection and initial analysis", in Proceedings of WOCCI 2012 (Workshop on Child, Computer and Interaction), Portland, USA (pdf)

Boye, J., Fredriksson, M., G�tze, J., Gustafson, J. and K�nigsmann, J. (2012) "Walk this way: Spatial grounding for city exploration", in Proceedings of IWSDS2013.

Edlund, J., Heldner, M., & Gustafson, J. (2012) "Who am I speaking at? - perceiving the head orientation of speakers from acoustic cues alone". In proceedings of LREC Workshop on Multimodal Corpora 2012. Istanbul, Turkey. (pdf)

Edlund, J., Oertel, C., & Gustafson, J. (2012) "Investigating negotiation for load-time in the GetHomeSafe project", In Proc. of Workshop on Innovation and Applications in Speech Technology (IAST). Dublin, Ireland. (pdf)

Edlund, J., Heldner, M., & Gustafson, J. (2012). "On the effect of the acoustic environment on the accuracy of perception of speaker orientation from auditory cues alone" In proceedings of of Interspeech 2012. Portland, Oregon, US (pdf)

Meena, R., Skantze, G, , & Gustafson, J. (2012). "A Data-driven Approach to Understanding Spoken Route Directions in Human-Robot Dialogue" In Proceeding of Interspeech 2012. Portland, Oregon, US (pdf)

Neiberg, D., & Gustafson, J. (2012). "Towards letting machines humming in the right way - prosodic analysis of six functions of short feedback tokens in English" In Fonetik 2012. G�teborg, Sweden.

Neiberg, D. and Gustafson, J. (2012) "Exploring the implications for feedback of a neurocognitive theory of overlapped speech" in Proceedings of The Interdisciplinary Workshop on Feedback Behaviors, Oregon, USA.

Neiberg, D. and Gustafson, J. (2012) "Cues to perceived functions of acted and spontaneous feedback expressions" in Proceedings of The Interdisciplinary Workshop on Feedback Behaviors, Oregon, USA. (pdf)

Sch�tz, S., Bruce, B., Segerup, M., Beskow, J., Gustafson, J. and Granstr�m, B. (2012) "Regional Varieties of Swedish: Models and Synthesis", In Niebuhr, O. (Ed.) Understanding prosody: the role of context function and communication (pp. 119-134). De Gruyter.

Skantze, G., Al Moubayed, S., Gustafson, J., Beskow, J., & Granstr�m, B. (2012). Furhat at Robotville: A Robot Head Harvesting the Thoughts of the Public through Multi-party Dialogue. in Proceedings of IVA-RCVA. Santa Cruz, CA. (pdf)

Oertel, C., Wlodarczak, M., Edlund, J., Wagner, P., & Gustafson, J. (2012). "Gaze Patterns in Turn-Taking", In Proceedings of Interspeech 2012. Portland, Oregon, US (pdf)

(2011)

Johansson, M., Skantze, G., & Gustafson, J. (2011) "Understanding route directions in human-robot dialogue", in Proceedings of SemDial. Los Angeles, CA, (pdf)

Johnson-Roberson, M., Bohg, J., Skantze, G., Gustafson, J., Carlson, R., Rasolzadeh, B., & Kragic, D. (2011) "Enhanced Visual Scene Understanding through Human-Robot Dialog", In Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems, (pdf)

Neiberg, D., & Gustafson, J. (2011) "A Dual Channel Coupled Decoder for Fillers and Feedback", In proceedings of Interspeech 2011. (pdf)

Neiberg, D., Ananthakrishnan, G., & Gustafson, J. (2011). "Tracking pitch contours using minimum jerk trajectories", In proceedings of Interspeech 2011. (pdf)

Neiberg, D., & Gustafson, J. (2011) "Predicting Speaker Changes and Listener Responses With And Without Eye-contact", In proceedings of Interspeech 2011. (pdf)

(2010)

Neiberg, D., & Gustafson, J. (2010) "The Prosody of Swedish Conversational Grunts", In Proceedings of Interspeech 2010, Special Session on Social Signals in proceedings of Interspeech 2010. (pdf)

Gustafson, J., & Neiberg, D. (2010) "Prosodic cues to engagement in non-lexical response tokens in Swedish", In Proceedings of the DiSS-LPSS Joint Workshop 2010. (pdf)

Neiberg, D., & Gustafson, J. (2010). "Modeling Conversational Interaction Using Coupled Markov Chains", In Proceedings of the DiSS-LPSS Joint Workshop 2010. (pdf)

Sch�tz, S., Beskow, J., Bruce, G., Granstr�m, B. & Gustafson, J. (2010) "Simulating Intonation in Regional Varieties of Swedish", Speech Prosody 2010, Chicago, USA. (Pdf)

Beskow, J., Edlund, J., Granstr�m, B., Gustafson, J. & House, D. (2010) "Face-to-face interaction and the KTH Cooking Show". In Esposito, Anna, Campbell, Nick, Vogel, Carl, Hussain, Amir, & Nijholt, Anton (Eds.), Development of Multimodal Interfaces: Active Listening and Synchrony (pp. 157 - 168). Berlin / Heidelberg: Springer.

Edlund, J. & Gustafson, J. (2010) "Ask the experts: Part I: Elicitation", in Allwood, J. & Juel Henrichsen, P (Eds.), Linguistic Theory and Raw Sound, Copenhagen Studies in Linguistics, spring 2010.

Edlund, J. & Gustafson, J. (2010) "Ask the experts: Part II: Analysis", in Allwood, J. & Juel Henrichsen, P (Eds.), Linguistic Theory and Raw Sound, Copenhagen Studies in Linguistics, spring 2010.

Johnson-Roberson, M., Bohg, J., Kragic, D., Skantze, G., Gustafson, J., & Carlson, R. (2010) "Enhanced Visual Scene Understanding through Human-Robot Dialog", In Proceedings of AAAI 2010 Fall Symposium: Dialog with Robots. Arlington, VA.

Beskow, J., Edlund, J., Gustafson, J., Heldner, M., Hjalmarsson, A. & House, D. (2010) "Research focus: Interactional aspects of spoken face-to-face communication", Fonetik 2010, Lund, Sweden. (pdf)

Sch�tz, S., Beskow, J., Bruce, G., Granstr�m, B., Gustafson, J. & Segerup, M. (2010) "Simulating Intonation in Regional Varieties of Swedish", Fonetik 2010, Lund, Sweden. (pdf)

Beskow, J., Edlund, J., Gustafson, J., Heldner, M., Hjalmarsson, A., & House, D. (2010) "Modelling humanlike conversational behaviour", Short paper in proceedings of SLTC 2010. Link�ping, Sweden.

Gustafson, J., & Neiberg, D. (2010) "Directing conversation using the prosody of mm and mhm", Short paper in proceedings of SLTC 2010. Link�ping, Sweden.

Edlund, J., Gustafson, J., & Beskow, J. (2010) "Cocktail - a demonstration of massively multi-component audio environments for illustration and analysis", Short paper in proceedings of SLTC 2010. Link�ping, Sweden.

(2009)

Gustafson, J. & Merkes, M. (2009) "Eliciting interactional phenomena in human-human dialogues" In Proceedings of SigDial 2009. London, UK. (pdf)

Skantze, G. & Gustafson, J. (2009) "Attention and Interaction Control in a Human-Human-Computer Dialogue Setting" In Proceedings of SigDial 2009. London, UK. (pdf)

Beskow, J., Edlund, J., Granstr�m, B., Gustafson, J., Skantze, G., & Tobiasson, H. (in press). "The MonAMI Reminder: a spoken dialogue system for face-to-face interaction". In Proceedings of Interspeech 2009. Brighton, U.K. (pdf)

Skantze, G. & Gustafson, J., "Multimodal interaction control in the NonAMI Reminder", Demo description in proceedings of DiaHolmia 09.

Beskow, J., & Gustafson, J., "Experiments with synthesis of Swedish dialects", Poster abstract in proceedings of Fonetik 2009.

(2008)

Edlund, J., Gustafson, J., Heldner, M., & Hjalmarsson, A. (2008) "Towards more human-like spoken dialogue systems" Journal of Speech Communication, Special Issue on Evaluating new methods and models for advanced speech-based interactive systems draft pdf (final version avaialable at www.sciencedirect.com)

Strangert, E., & Gustafson, J. (2008) "Subject ratings, acoustic measurements and synthesis of good-speaker characteristics" In Proceedings of Interspeech 2008. Brisbane, Australia. (pdf)

Beskow, J., Edlund, J., Granstr�m, B., Gustafson, J., & Skantze, G. (2008) "Innovative interfaces in MonAMI: the KTH Reminder" In Proceedings of the 4th IEEE Workshop on Perception and Interactive Technologies for Speech-Based Systems. Kloster Irsee, Germany. (pdf)

Gustafson, J., & Edlund, J. (2008) "expros: a toolkit for exploratory experimentation with prosody in customized diphone voices" In Proceedings of the 4th IEEE Workshop on Perception and Interactive Technologies for Speech-Based Systems. Kloster Irsee, Germany. (pdf)

Gustafson, J., Heldner, M., & Edlund, J. (2008) "Potential benefits of human-like dialogue behaviour in the call routing domain" In Proceedings of the 4th IEEE Workshop on Perception and Interactive Technologies for Speech-Based Systems. Kloster Irsee, Germany. (pdf)

Beskow, J., Edlund, J., Granstr�m, B., Gustafson, J., Jonsson, O., & Skantze, G. (2008) "Speech technology in the European project MonAMI" In Proceedings of FONETIK 2008 (pp. 33-36). Gothenburg, Sweden.

Gustafson, J., & Edlund, J. (2008) "EXPROS: Tools for exploratory experimentation with prosody" In Proceedings of FONETIK 2008 (pp. 17-20). Gothenburg, Sweden. (pdf)

Strangert, E., & Gustafson, J. (2008) "Improving speaker skill in a resynthesis experiment" In Proceedings of FONETIK 2008. Gothenburg, Sweden

(2007)

Bell, L & Gustafson, J (2007) "Children?s convergence in referring expressions to graphical objects in a speech-enabled computer game", Proceedings of Interspeech, Antwerp, Belgium, (pdf).

Edlund, J., Heldner, M., & Gustafson, J. (2007) "Two faces of spoken dialogue systems" In M. F. McTear, K. Jokinen, J. Larson, R. L�pez-C�zar & Z. Callejas (Eds.), Interspeech 2006 - ICSLP Satellite Workshop Dialogue on Dialogues: Multidisciplinary Evaluation of Advanced Speech-based Interactive Systems (pp. 51-54). Pittsburgh PA, USA. (pdf).

(2006)

Boye, J., Gustafson, J. & Wiren, M. (2006) "Robust spoken language understanding in a computer game", Journal of Speech Communication, special issue on spoken language understanding, Volume 48, Issues 3-4, March-April 2006, Pages 335-353.( abstract, pdf))

(2005)

Boye, J., & Gustafson, J. (2005) "How to do dialogue in a fairy-tale world", Proceedings of the sixth SIGdial Workshop on Discourse and Dialogue, Lisabon, 2005 pdf.

Bell, L., Boye, J., Gustafson, J., Heldner, M., Lindstr�m, A. & Wiren, M. (2005) "The Swedish NICE Corpus ? Spoken dialogues between children and embodied characters in a computer game scenario", proceedings of Interspeech05, Lisabon, pdf.

Gustafson, J., Boye, J., Fredriksson, M., Johannesson, L., & K�nigsmann, J., "Providing computer game characters with conversational abilities," in Proceedings of Intelligent Virtual Agent (IVA05). Kos, Greece. pdf

Edlund, J, Heldner, M, & Gustafson, J (2005): Utterance segmentation and turn-taking in spoken dialogue systems. In Fisseni, B, et.al. (eds) Computer Studies in Language and Speech, Vol. 8, pp. 576-587, Frankfurt am Main, Germany, Peter Lang. pdf

(2004)

Boye, J., Mats Wiren, M., & Gustafson, J. (2004) "Contextual Reasoning in Multimodal Dialogue Systems: Two Case Studies", Proceedings of The 8th Workshop on the Semantics and Pragmatics of Dialogue Catalogue'04 , Barcelona, July 19-21, 2004. pdf

Gustafson, J & Sjölander, K (2004) "Voice creation for conversational fairy-tale characters", Proceedings of the 5th ISCA Speech Synthesis Workshop, Carnegie Mellon University 14-16 juni 2004. pdf

Gustafson, J., Bell, L., Boye, J., Lindström, A. & Wiren, M (2004) "The NICE Fairy-tale Game System", proceedings of SIGdial 04, Boston. (abstract), pdf)

Deliverables from the EU funded project NICE (www.niceproject.com)

(2003)

Bell, L. & Gustafson, J. (2003) "Child and Adult Speaker Adaptation during Error Resolution in a Publicly Available Spoken Dialogue System", Proceedings of Eurospeech 03, Geneve, Schweiz. (pdf)

Bell, L., Gustafson, J. & Heldner, M. (2003) "Prosodic adaptation in human?computer interaction", Proceedings of ICPhS 03, Bercelona, Spain. (pdf)

(2002)

Gustafson, J. (2002) "Developing Multimodal Spoken Dialogue Systems - Empirical Studies of Spoken Human-Computer Interactions", PhD thesis, KTH, Stockholm. (pdf)

Gustafson, J & Sjölander, K (2002) "Voice Transformations For Improving Children's Speech Recognition In A Publicly Available Dialogue System", Proceedings of ICSLP02, Colorado USA. (pdf)

Gustafson, J, Bell, L, Boye, J, Edlund, J & Wiren, M (2002) "Constraint Manipulation And Visualization In A Multimodal Dialogue System", Proceedings of the ISCA Workshop Multi-Modal Dialogue in Mobile Environments Kloster Irsee, Germany (pdf)

(2001)

Bell, L, Boye, J, & Gustafson, J (2001) "Real-time Handling of Fragmented Utterances", Proceedings of NAACL 2001.(pdf)

(2000)

Lindberg, N, & Gustafson, J (2000) "Example based shallow semantic analysis in the August spoken dialogue system", STL-QPSR 1/00 (html)

Bell, L, Boye, J, Gustafson, J, & Wiren, M (2000) "Modality Convergence in a Multimodal Dialogue System", Proceedings of Götalog 2000, Fourth Workshop on the Semantics and Pragmatics of Dialogue, pages 29-34. (html or pdf)

Gustafson, J, & Bell, L (2000) "Speech Technology on Trial: Experiences from the August System", Journal of Natural Language Engineering: Special issue on Best Practice in Spoken Dialogue Systems. (abstract, pdf, postscript or zipped postscript)

Gustafson, J, Bell, L, Beskow, J, Boye, J, Carlson, R, Edlund, J, Granström, B, House, D and Wirén M (2000) "AdApt ? a multimodal conversational dialogue system in an apartment domain", Proceedings of ICSLP 00 (html or pdf)

Bell, L & Gustafson, J (2000) "Positive and Negative User Feedback in a Spoken Dialogue Corpus", Proceedings of ICSLP 00 (html or pdf)

Bell, L, Eklund, R & Gustafson, J (2000) "A Comparison of Disfluency Distribution in a Unimodal and a Multimodal Speech Interface", Proceedings of ICSLP 00 (html, pdf or postscript)

(1999)

Gustafson, J, Sjölander, K,  Beskow, J, Granström, B & Carlson, R (1999) "Creating web-based exercises for spoken language technology",  tutoriaÎ session in proceedings of IDS'99 (html or pdf)

Gustafson, J, Lundeberg, M & Liljencrants, J (1999) "Experiences from the development of August - a multimodal spoken dialogue system", in proceedings of IDS'99 (html or pdf)

Bell, L & Gustafson, J (1999) "Utterance types in the August System", in proceedings of IDS'99, (html or pdf)

Bell, L & Gustafson, J (1999) "Repetition and its phonetic realizations: investigating a Swedish database of spontaneous computer directed speech", in proceedings of ICPhS' 99, (html or pdf)

Bell, L & Gustafson, J (1999) "Interaction with an animated agent in a spoken dialogue system",
in proceedings of Eurospeech '99 (html or pdf)

Gustafson, J, Lindberg, N, & Lundeberg, M (1999) "The August spoken dialogue system", in proceedings of Eurospeech '99. (html or pdf)

Sjölander, K,  Gustafson, J, Beskow, J, Granström, B & Carlson, R (1999) "Web-based educational tools for speech technology", in proceedings of Matisse 99, (abstract)

(1998)

Gustafson, J., Elmberg, P., Carlson R. & Jönsson, A (1998) "An Educational Dialogue System With a User Controllable Dialogue Manager", ICSLP98, paper in html

 Sjölander, K., Beskow, J., Gustafson, J., Lewin, E., Carlson, R., & Granström, B. (1998) "Web-based Educational Tools for Speech Technology", ICSLP98, paper in html

 Carlson, R., Granström, B., Gustafson, J., Levin, E. & Sjölander, K. (1998) "Hands-on Speech Technology on the Web", the conferance ELSNET in Wonderland (pdf) HTML version

(1997)

Gustafson, J., Larsson, A., Carlson, R. & Hellman, K. (1997): "How do System Questions Influence Lexical Choices in User Answers?", Eurospeech '97. Abstract (paper - 4 pages, Postscript 336 kb, pdf or HTML)

Sjölander, K. & Gustafson, J. (1997): "An Integrated System for Teaching Spoken Dialogue Systems Technology", Eurospeech '97. Abstract (paper - 4 pages, Postscript, gzip 600 kb , pdf or HTML )

(1996)

Gustafson, J. (1996):"A Swedish Name Pronunciation System", Licenciate thesis, TMH, KTH, 1996 Pdf

(1995)

Gustafson, J. (1995):"Using Two-level Morphology To Transcribe Swedish Names ", Eurospeech '95, in Madrid, 1995 Abstract (paper - 4 pages, Postscript 166kb), (pdf)

Bertenstam, J. Blomberg, M., Carlson, R., Elenius, K, Granström, B., Gustafson, J., Hunnicutt, S., Högberg, J., Lindell, R., Neovius, L., de Serpa-Leitao, A., Nord, L. & Ström, N. (1995) "The Waxholm Application Data-Base", in proceedings of Eurospeech '95, in Madrid, 1995

The Onomastica Consortium* (1995). "The Onomastica interlanguage pronunciation lexicon", In Eurospeech, Madrid, Spain (*Anderson, O., Boves, L., Dalsgaard, P., Darsinos, V., Granstr�m, B., Gustafson, J., van den Heuvel, H., Jack, M., Kokkinakis, G., Konst, E., Logothetis, M., Mascarenhas, I., Mengel, A., Molbaek, P., Ottensen, G., Pardo, J., Pirrelli, V., Schmidt, M., Sutherland, A., Trancoso, I., Valverde, F., Viana, C., & Yvon, F.)

Gustafson, J. (1995). "Transcribing names with foreign origin in the ONOMASTICA project", in proceedings of ICPhS'95 in Stockholm, August 13-19 Abstract (paper - 4 pages, Pdf)

Carlson, R., Hunnicutt, S. & Gustafson, J.(1995):" Dialogue management in the Waxholm system", In Proceedings of Spoken Dialogue Systems, Vigsø(paper - 4 pages, Pdf 109kb)

Bertenstam, J. Blomberg, M., Carlson, R., Elenius, K, Granström, B., Gustafson, J., Hunnicutt, S., Högberg, J., Lindell, R., Neovius, L., de Serpa-Leitao, A., Nord, L. & Ström, N. (1995):" The Waxholm system - a progress report", In Proceedings of Spoken Dialogue Systems, Vigsø(Pdf)

Bertenstam, J. Blomberg, M., Carlson, R., Elenius, K, Granström, B., Gustafson, J., Hunnicutt, S., Högberg, J., Lindell, R., Neovius, L., de Serpa-Leitao, A., Nord, L. & Ström, N. (1995):" Spoken dialogue data collection in the Waxholm project", STL-QPSR 1/1995, pp. 50-73.(pdf)

(1994)

Gustafson, J. (1994). "ONOMASTICA - Creating a multi-lingual dictionary of European names", In FONETIK ´94, Papers from the 8th Swedish Phonetics Conference, May 24-26, Lund, Sweden, pp. 66-69.1-94(Pdf)

(1993)

Blomberg, M., Carlson, R., Elenius, K, Granström, B., Gustafson, J., Hunnicutt, S., Lindell, R., & Neovius, L. (1993):" An experimental dialogue system: WAXHOLM,", Eurospeech '93, Berlin, Sept 21-23, 1993, Vol. 3, pp. 1867-1870

Blomberg, M., Carlson, R., Elenius, K, Granström, B., Gustafson, J., Hunnicutt, S., Lindell, R., Neovius, L. & Nord, L. (1993):" An experimental dialogue system: WAXHOLM,", STL-QPSR 2-3/1993, pp. 15-20.(pdf)

(1992)

Gustafson J. (1992), "Databashantering som del av ett talförståelsesystem", Master of Science Thesis, Department of Speech Communication and Music Acoustics, KTH, Stockholm.