Annual Report 1999
Table of Contents
TMH Series of Seminars
January
How do we actually make our spectrograms?
Svante Granqvist, Inger Karlsson, Johan Liljencrants, Kåre
Sjölander & Sten Ternström
From air to music - Acoustical, physiological and perceptual aspects
of reed wind instrument playing and vocal - ventricular fold phonation
Leonardo Fuks (Defence of a DSc thesis)
Male – female: identification of voices in SpeechDat
Niklas Torstensson, Högskolan i Skövde (MSc project
work)
User aspects of speaker verification for voice dialing
Marcus Hoverby (MSc project work)
Bandwidth extension of speech
Niklas Enbom Skövde (MSc project work)
February
Grammatical analysis based on n-grams
Alice Carlberger
Spoken dialogue with a travel booking system
Johan Boye and Mats Wirén, Telia Research
Analysis and simulation of ensemble sound - an MSc project
Daniel Kahlin
A computational memory and processing model for prosody
Janet Cahn, MIT Media Laboratory
March
Quantization noise modeling in predictive speech coding
Søren Vang Andersen (MSc thesis)
Research at K2-lab (Knowledge and Communication) with a focus on cognition,
learning, teaching and IT.
Robert Ramberg, DSV
Speech recognition at the MIT Lab for Computer Science
Nikko Ström
April
Generation of segment durations in a text-to-speech system - an MSc project
carried out at Toshiba R&D Center
Mattias Ånstrand (MSc project work)
Multi-microphone conference systems - changes in speech intelligibility
as a function of the source-microphone distance in a room
Jakob Holm (MSc project work)
Speaker verification in Per – an access control system
Erik Armerén (MSc project work)
May
Acoustic assessment of pathological voice characteristics and its relation
to voice generation and perception
Mattias Fröhlich and Dirk Michaelis, Göttingen University,
Germany
Multi-microphone conference systems - changes in speech intelligibility
as a function of the source-microphone distance in a room
Jakob Holm (MSc project work)
The future of computing: A linguistic user interface
Michael Picheny, IBM T. J. Watson Research Center, Yorktown Heights,
NY
June
DynaLite – dynamic recognition with StarLite
Victor Sandin (MSc project work)
What do nonlinear hearing aids do to the sound?
Arne Leijon and Peter Nordqvist
Text-to-speech synthesis of Farsi
Keyhan Hadjari (MSc project work)
Speech recognition based on linear prediction parameters
Laurent Feurer (MSc project work)
PER - An automatic receptionist: System construction and dialogue design
Martin Nilsson (MSc project work)
September
Swedish vowel quantity in L2 speakers – evaluation by manual and automatic
methods
Julien Roland (MSc project work)
Low bit high quality audio coding with combined harmonic and wavelet
representations
Christophe Delachanal (MSc project work)
Workshop reports from the summer's meetings, including IDS-99, AVSP 99,
ILP-99, ICML, LLL
October
Self-learning methods in language processing
Henrik Boström, Martin Eineborg and Nikolaj Lindberg
Implementation and evaluation of a method for measuring
perceived sound quality in real time - an MSc project at Teracom AB
Carl Skedinger (MSc project work)
Modeling the phase of the pitch cycle in harmonic coding of speech
Andrei Jefremov (MSc project work)
November
Speech file formats: .wav and .smp and .ra and whatever they are all called
Johan Liljencrants, Svante Granqvist and Sten Ternström
All-pole spectral envelope modeling of speech
Manohar N. Murthi - new post-doc at TMH
Speech transmission via the internet
Niklas Enbom, Intertex Data AB
An awful lot of programs: computer programs for sound and voice analysis.
Svante Granqvist
Automatically distinguishing between speech and music – a literature study
Stefan Karnebäck
Methods for speaker adaptation of HMMs – a comparison
Ylva Gustafsson (MSc project work)
Communication research at the University of Wollongong
Professor Joe F Chicharo, Wollongong Univ, NSW, Australia
December
Formant frequencies of Swedish vowels – dialectal variation in the SpeechDat database
André Heinonen (MSc project work)
Structural, computational, and perceptual aspects of sound synthesis
by physical modeling with application to the synthesis of piano music
Davide Rocchesso, Verona & Giovanni De Poli, Padova
How to invent a musical instrument
Sten Ternström (Docent lecture)
A new system for analysis and prediction of F0 in speech
Gunnar Fant, Anita Kruckenberg and Johan Liljencrants
Data-driven semantic grouping
Joakim Laksov (MSc project work)
Special SPEECH Seminars
Bullet course, April
Chin-Hui Lee, Lucent Technologies, USA
Three lectures on automatic speech and speaker recognition:
- Robust speech recognition -- overview of statistical pattern recognition
approach and the implied robustness problems and solutions.
- Speaker and speech verification -- overview of statistical pattern verification
approach and applications to speaker and speech verification.
- A detection approach to speech recognition and understanding -- a paradigm
to combine feature-based detection and state-of-the-art recognition to open up new possibilities.
Bullet course, May
Anne Cutler, Max Planck Institute for Psycholinguistics, Nijmegen,
Holland
Three lectures on human recognition of spoken words
- The psycholinguistic approach
- From input to lexicon
- Language-specificity
Bullet course, June
Gerard Chollet, ENST, Paris, France
- Speech analysis, coding, synthesis and recognition: Time-frequency representations,
wavelets, Temporal decomposition, Time-dependent models, Analysis by synthesis, H+N, HMMs, Markov fields,…
- ALISP (Automatic Language Independent Speech Processing): Learning from
examples, Segmental models, Speaker normalisation, Very Low bit rate coding, Multilingual speech recognition…
- Applications: Identity verification, decision fusion, Interactive Voice
Servers, 'Majordome', Multicasting….
Pre-Speech-Coding-Workshop, June
The adaptive multi-rate speech coder
Roar Hagen (Ericsson)
Reverse water-filling in predictive encoding of speech
Søren V Andersen (KTH)
Auditory modeling for speech coding
Gernot Kubin, Vienna Technical University
Making waveform coded WI possible
Ian Burnett, University of Wollongong
Spectrum Analysis: Alternative Linear Prediction Techniques
Nicola Chong, University of Wollongong
Optimized error correction of MELP speech parameters via maximum a posteriori
(MAP) techniques
Douglas Rahikka (NSA)
Speech services in third generation wireless systems
Eric Ekudden (Ericsson)
Interoperable global communications, a government perspective
John Collura (NSA)
Discussions and KTH demos
MiLaSS -
7th European Summer School on Language and Speech Communication,
July 12-23
Multimodal speech perception: A paradigm
for speech science (Plenary)
Dominic W. Massaro
Multimodality of meaning in speech and gesture (Parallel)
David McNeill
Multimodality in language and speech systems
- from theory to design support tool (Parallel)
Niels Ole Bernsen
Intelligent multimedia presentation systems (Parallel)
Elisabeth André and Björn Granström
Architectures for integrated multimodal
input-output systems and the humanoid interface (Parallel)
Kristinn Thórisson
Face-to-face communication including different
modalities (Plenary)
Jens Allwood
Multimodal interaction and people with disabilities (Parallel)
Alistair Edwards
Developing intelligent multimedia applications (Parallel)
Paul Mc Kevitt
Multimodal dialogue systems & audio-visual synthesis (Parallel)
Hands-on sessions, the KTH group
Bullet course, October
Sharon Oviatt, Professor and Co-Director, Center for Human-Computer
Communication, Dept of Computer Science, Oregon Graduate Institute of Science & Technology
- Modeling hyperarticulate speech to interactive systems
- Mutual disambiguation of recognition errors in a multimodal architecture
- Designing and evaluating conversational interfaces with animated characters
TMH General Seminar, November
What characterizes human sound communication?
Our field of research
Gunnar Fant
Grouping and differentiation in musical interpretation
Johan Sundberg
Simulating emotions in music with structure-marking rules
Anders Friberg and Roberto Bresin
Perception of accentuation and phrasing in speech
David House
Visual feedback and motor control in speech
Anne-Marie Öster
Phase perception in speech
Harald Pobloth
Speech technology and human communication
Rolf Carlson
Special seminar, November
Microsoft’s tap and talk interface research
X D Huang, Sr Researcher and Manager, Speech Technology Group
TMH Seminars in MUSIC Acoustics
January
From air to music - Acoustical, physiological and perceptual aspects
of reed wind instrument playing and vocal - ventricular fold phonation
Leonardo Fuks (Defence of a DSc thesis)
On the effect of phase relations for the perception of pitch and timbre
of bass tones
Alex Galembo, Anders Askenfelt and Lola Cuddy
February
Analysis and simulation of ensemble sound
Daniel Kahlin (MSc project work)
The riddle of the violin solved? The main resonance in a new interpretation
Erik Jansson and Benedykt Niewczyk
March
Breathing habits in singers.
Monica Thomasson
April
Summary of literature in biomechanics, including Biomechanics and
motor control of human movement, by David A Winter
Sofia Dahl
October
Old Estonian folksongs: Tradition, structure and acoustics
Jaan Ross
November
3D sounds in virtual reality.
Peter Lundén, Interactive Institute
An awful lot of programs: computer programs for sound and voice analysis.
Joint seminar with TMH.
Svante Granqvist
December
Current projects at Laboratorio di Informatica Musicale, DIST, Genova,
Italy
http://www.infomus.dist.unige.it/
Antonio Camurri
Study on the adjustment of performance parameters in the education of
the performer
Giovanni Umberto Battel, Conservatorio di Musica "B. Marcello",
Venice
Music and Motion
- at Italienska Kulturinstitutet (the Italian Cultural Institute):
Four years of research on "music and motion"
Johan Sundberg
Simulating body movement in music performance.
Anders Friberg
Analysis of music performance: interpretation of a virtual pianist as
the mean of interpretations of different pianists.
Maestro Giovanni Umberto Battel (invited speaker), Conservatorio
di musica "Benedetto Marcello", Venezia, Italia
Happy or sad? Computer simulation of different emotional states in music
performance.
Roberto Bresin
Recognizing gestures and emotions in interactive systems for music and
dance.
Professor Antonio Camurri (invited speaker), Università
degli Studi di Genova, Dipartimento di Informatica, Sistemistica e Telematica
SensOrg: cyber instrument for real-time music performance.
Professor Tamas Ungvary, Kacor – KTH
Software system for automatic choreography.
Peter Rajka, Kacor – KTH
Real-time simulation of moving sound sources in virtual 3-dimensional
spaces.
Peter Lundén, Kacor – KTH
APEX: an articulatory model of speech production.
Christine Ericsdotter Bresin, Johan Stark, TMH - KTH & Stockholm
University, Department of Linguistics, Phonetics
Temporal and kinematic analysis of percussionists.
Sofia Dahl, TMH – KTH
The essential guide to boring sounds - or, why is it so hard to make
interesting sounds with synthesizers?
Sten Ternström, TMH – KTH
Concert
Maestro Giovanni Umberto Battel, pianoforte
Conservatorio di musica "Benedetto Marcello", Venezia, Italia
Åsa Unander-Scharin, in collaboration with Kacor – KTH
Peter Rajka, Kacor – KTH
Virtuosi Virtuali: tre studi per ballerini virtuali
Choreography and digital video: Peter Rajka
Music: Domenico Scarlatti
Interpretation of MIDI-data: Roberto Bresin, Anders Friberg, Julieta Gleiser
Peter Lundén, Kacor – KTH
Professor Tamas Ungvary, SensOrg, & Georg Weidinger (Austria),
piano, Kacor – KTH
Music Acoustics Group at KTH: http://www.speech.kth.se/music
Kacor Group at KTH: http://www.speech.kth.se/kacor
Seminars on the Singing Voice
February
The voice source in modal and falsetto registers in bass, baritone, tenor
and countertenor
Carl Høgset, countertenor singer, singing teacher
and choir director from Oslo, and Johan Sundberg
March
Velum opening in vowels in opera singers
Peer Birch, singing teacher,
Det Jyske Musikkonservatorium, Aarhus;
Bodil Gümoes,
singing teacher, Det Kgl Danske Musikkonservatorium, Copenhagen;
Svend Prytz, phoniatrician,
Bispebjergshospitalet, Copenhagen
On singing teacher education in Denmark
Peer Birch & Bodil Gümoes
April
Perceptual voice assessment – experiences, design and evaluation
of analysis methods
Eva B Holmberg
Demonstration and comparison of computer programs tailored
for perceptual voice analysis
Svante Granqvist
Other participants: Jan Dahlqvist, Britta Hammarberg, Anita McAllister,
Maria Södersten, Joakim Westerlund
May
Signals of expressivity: do singers and speakers use the same ones?
The centre frequency and strength of the singer's formant
Johan Sundberg
November
Singing, but in a different way.
Jaan Ross, professor of musicology in Tartu, Estonia
Muscle memory for pitch: relevance of auditory feedback to pitch control.
Dirk Mürbe, Dresden
Voice treatment without voice treatment 1.
The naprapaths Marie Andersson and Ulrika Bergman, and Sten Ternström
December
On the shifting of F1 in the art of singing
Christine Ericsdotter, Johan Sundberg & Johan Stark
Current projects at Audiovisual Institute (IUA), Pompeu Fabra University,
Barcelona, Spain
http://www.iua.upf.es
Xavier Serra
Structural, computational and perceptual aspects of sound synthesis by
physical modeling with application to the synthesis of piano
http://www.sci.univr.it/~rocchesso/talks/Stockholm1999
Davide Rocchesso, Verona, and Giovanni De Poli, Padova
KTH Voice Research Centre (VRC) Seminar
International symposium, October
Real-Time Biofeedback In Voice Therapy And Pedagogy
Principles of biofeedback
David Krebs
Stroboscopy
Janina Casper & Laryngograph Ltd (Adrian Fourcin)
Electromyography
Katherine Verdolini & Medtronic Symetics (Magnus Skytt)
Real-time spectrum display
Anne-Maria Laukkanen & Nyvalla DSP (Sten Ternström)
Phonetography
Maria Södersten & Nyvalla DSP (Sten Ternström)
Advanced phonetography
Peter Pabon
The new Glottal Enterprises Oronasal System
Martin Rothenberg
Real-time displays and singing (the SINGAD system)
David Howard
Real-time spectrogram display
Katherine Verdolini & Laryngograph Ltd (Adrian Fourcin)
Palatography
Anita & Bob McAllister & SNORS+
Electrolaryngography
Eva Carlsson & Laryngograph Ltd (Adrian Fourcin)
Larynx height
Anne-Maria Laukkanen & Glottal Enterprises (Martin Rothenberg)
Inverse filtering
Johan Sundberg & Glottal Enterprises (Martin Rothenberg)
Discussion: Potentials and limitations of real-time biofeedback in voice
therapy and training