Research
The Department of Speech, Music and Hearing has a unifying research theme in communication and interaction between humans via speech and music. The department is engaged in a diverse set of multi-disciplinary research activities, commonly classified into speech communication, speech technology, sound and music computing, auditory perception and second language acquisition, to mention the largest areas.
Currently active projects at TMH:
 | BioASU - Biologically inspired statistical methods for flexible automatic speech understanding Funding: VR The project will develop machine learning methods for speech understanding that more closely resemble the biological approach to learning. [more] |  | CALST - Computer-Assisted Listening and Speaking Tutor Funding: NTNU + Norgesuniversitetet The project aims at developing a computer program that will be used to train Norwegian as a second language. [more] |  | FonaDyn - Phonatory Dynamics and States Funding: VR The voice has several non-linear and context-dependent mechanisms that can give rise to distinct phonatory states. We submit that much of the observed variability in objective voice metrics results from the influence of such states, and will attempt to account for some of them, using a state-based analysis paradigm.
[more] |  | GetHomeSafe - Extended Multimodal Search and Communication Systems for Safe In-Car Application Funding: EU The aim of the proposed project is to develop a system for safe information access and communication while driving. [more] |  | ISHT - Interior sound design of high-speed trains Funding: KK-stiftelsen (The Knowledge Foundation) The main research question in this project is: "How can we develop design methods and acoustic artefacts in order to improve the sound environment of the high-speed trains of the future?" [more] |  | IURO - Interactive Urban Robot Funding: EU The goal of IURO project is to develop a robot that can engage in information-gathering face-to-face interactions in multi-user settings. [more] |  | Ljudparken/The Soundpark - Using modern smartphones to create interactive listening experiences for hearing impaired Funding: PTS - Post och Telestyrelsen The aim of the project is to create interactive listening experiences for persons with hearing impairments through smartphone applications. Interactive listening applications will provide better conditions for partaking in commonly available audio-based entertainment, but also offer new possibilities for active heartraining. [more] |  | MASSIVE - Large-scale massively multimodal modelling of non-verbal behaviour in spontaneous dialogue Funding: VR The aim is to provide a large-scale kinematic database based on motion capture of human conversational behaviour, as well as to build statistical models of multimodal non-verbal behaviour in dialogue. [more] |  | SAMPROS - Prosody in conversation Funding: RJ (Bank of Sweden Tercentenary Foundation) The project investigates how people talking to each other jointly decide who should speak when, and the role of prosody in making these joint decisions. [more] |  | SAMRYTM - The rhythm of conversation Funding: VR The project Rhythm of conversation investigates how a set of rhythmic prosodic features contributes to the joint interaction control in conversations. [more] |  | SAMSYNT - Introducing interactional phenomena in speech synthesis Funding: VR The project will develop and verify ways of including interactional phenomena in speech synthesis, resulting in well-described and tested methods for synthesizing these phenomena in such a way that they can be employed to recreate human interactional behaviour. [more] |  | SAVIR - Situated Audio Visual Interaction with Robots Funding: SRA/KTH The projects investigate how a robot can improve its visual scene understanding by engaging in spoken dialogue with a human.
[more] | | SEMIR - Bridging the semantic gap in Music Information Retrieval: Modelling perceptual-based features in music audio Funding: VR This project aims at deveoping new computer tools for characterizing, and indexing music audio. [more] |  | SOM - The sound of motion: Providing sound feedback to human movements Funding: VR The main aim of this project is the development of theories, models and tools for representing human movements by means of sound. This work is part of growing research fields known as data sonification, embodied music cognition and mediation technology. [more] |  | Song - Sundberg's Voice Science Funding: KTH CSC - TMH Sundberg Kulning - Hard rock - Twang - Belting - Chest/Falsetto - Whisper - High pitch singing - Text intelligibility - MRI
[more] | | SVP Voice - Detailed multiphysics simulation of human voice production with neural control - a feasibility study Funding: KTH CSC This project is a feasibility study in which we examine whether it is possible to make a unified-domain numerical simulation of human voice production that covers the mechanical, fluid and acoustic phenomena involved; and also attempts to control the simulation using representations of (simulated) muscle activation.
[more] |  | TIVOLI - Sign learning via game-based interaction Funding: PTS - Post och Telestyrelsen TIVOLI aims to create learning application for sign language signs, in the form of a computer game featuring sign recogntion via webcam and a signing avatar. The target group is children with communication disorders. [more] |  | VariQ - Intonational variation in questions in Swedish Funding: VR This project investigates questions in dialogue. What is a question, and what makes it into one? [more] |  | VoxLog - VoxLog: portable voice analyzer Funding: NUTEK/VINNOVA A new wearable voice+noise dosimeter has been developed. The project will assess the validity, usability and commercial potential of the new device.
[more] |
|
|