Contact





Research

The Department of Speech, Music and Hearing has a unifying research theme in communication and interaction between humans via speech and music. The department is engaged in a diverse set of multi-disciplinary research activities, commonly classified into speech communication, speech technology, sound and music computing, auditory perception and second language acquisition, to mention the largest areas.

Currently active projects at TMH:

BioASU - Biologically inspired statistical methods for flexible automatic speech understanding
Funding: VR

The project will develop machine learning methods for speech understanding that more closely resemble the biological approach to learning. [more]

CALST - Computer-Assisted Listening and Speaking Tutor
Funding: NTNU + Norgesuniversitetet

The project aims at developing a computer program that will be used to train Norwegian as a second language. [more]

FonaDyn - Phonatory Dynamics and States
Funding: VR

The voice has several non-linear and context-dependent mechanisms that can give rise to distinct phonatory states. We submit that much of the observed variability in objective voice metrics results from the influence of such states, and will attempt to account for some of them, using a state-based analysis paradigm. [more]

GetHomeSafe - Extended Multimodal Search and Communication Systems for Safe In-Car Application
Funding: EU

The aim of the proposed project is to develop a system for safe information access and communication while driving. [more]

ISHT - Interior sound design of high-speed trains
Funding: KK-stiftelsen (The Knowledge Foundation)

The main research question in this project is: "How can we develop design methods and acoustic artefacts in order to improve the sound environment of the high-speed trains of the future?" [more]

IURO - Interactive Urban Robot
Funding: EU

The goal of IURO project is to develop a robot that can engage in information-gathering face-to-face interactions in multi-user settings. [more]

Ljudparken/The Soundpark - Using modern smartphones to create interactive listening experiences for hearing impaired
Funding: PTS - Post och Telestyrelsen

The aim of the project is to create interactive listening experiences for persons with hearing impairments through smartphone applications. Interactive listening applications will provide better conditions for partaking in commonly available audio-based entertainment, but also offer new possibilities for active heartraining. [more]

MASSIVE - Large-scale massively multimodal modelling of non-verbal behaviour in spontaneous dialogue
Funding: VR

The aim is to provide a large-scale kinematic database based on motion capture of human conversational behaviour, as well as to build statistical models of multimodal non-verbal behaviour in dialogue. [more]

SAMPROS - Prosody in conversation
Funding: RJ (Bank of Sweden Tercentenary Foundation)

The project investigates how people talking to each other jointly decide who should speak when, and the role of prosody in making these joint decisions. [more]

SAMRYTM - The rhythm of conversation
Funding: VR

The project Rhythm of conversation investigates how a set of rhythmic prosodic features contributes to the joint interaction control in conversations. [more]

SAMSYNT - Introducing interactional phenomena in speech synthesis
Funding: VR

The project will develop and verify ways of including interactional phenomena in speech synthesis, resulting in well-described and tested methods for synthesizing these phenomena in such a way that they can be employed to recreate human interactional behaviour. [more]

SAVIR - Situated Audio Visual Interaction with Robots
Funding: SRA/KTH

The projects investigate how a robot can improve its visual scene understanding by engaging in spoken dialogue with a human. [more]

SEMIR - Bridging the semantic gap in Music Information Retrieval: Modelling perceptual-based features in music audio
Funding: VR

This project aims at deveoping new computer tools for characterizing, and indexing music audio. [more]

SOM - The sound of motion: Providing sound feedback to human movements
Funding: VR

The main aim of this project is the development of theories, models and tools for representing human movements by means of sound. This work is part of growing research fields known as data sonification, embodied music cognition and mediation technology. [more]

Song - Sundberg's Voice Science
Funding: KTH CSC - TMH Sundberg

Kulning - Hard rock - Twang - Belting - Chest/Falsetto - Whisper - High pitch singing - Text intelligibility - MRI [more]

SVP Voice - Detailed multiphysics simulation of human voice production with neural control - a feasibility study
Funding: KTH CSC

This project is a feasibility study in which we examine whether it is possible to make a unified-domain numerical simulation of human voice production that covers the mechanical, fluid and acoustic phenomena involved; and also attempts to control the simulation using representations of (simulated) muscle activation. [more]

TIVOLI - Sign learning via game-based interaction
Funding: PTS - Post och Telestyrelsen

TIVOLI aims to create learning application for sign language signs, in the form of a computer game featuring sign recogntion via webcam and a signing avatar. The target group is children with communication disorders. [more]

VariQ - Intonational variation in questions in Swedish
Funding: VR

This project investigates questions in dialogue. What is a question, and what makes it into one? [more]

VoxLog - VoxLog: portable voice analyzer
Funding: NUTEK/VINNOVA

A new wearable voice+noise dosimeter has been developed. The project will assess the validity, usability and commercial potential of the new device. [more]







Published by: TMH, Speech, Music and Hearing
Webmaster, webmaster@speech.kth.se

Last updated: Tuesday, 01-Nov-2011 10:47:14 MET