Resources for Automatic Speech Recognition

KTH Royal Institute of Technology,
School of Computer Science and Communication,
Dept. for Speech Music and Hearing

This page collects resources for Automatic Speech Recognition (ASR) that the group has developed and made freely available to other researchers.

PI: Giampiero Salvi

Main contributors: Giampiero Salvi, Niklas Vanhainen

The WaveSurfer Automatic Speech Recognition Plugin

The plugin is described in the following paper:

Salvi, G., & Vanhainen, N. (2014). The WaveSurfer Automatic Speech Recognition Plugin. In Proceedings of LREC. Reykjavik, Iceland. [pdf]

Installation instructions:

Preliminaries (all platforms): Installation Linux/Mac OS X: Installation MS Windows:

HTK based WaveSurfer Automatic Speech Recognition Plugin

This is an earlier version of the plugin above. The difference is that this version is based on HVite (HTK). This plugin is still used for educational purposes in the DT2112 Speech Technology course at KTH.

ASR models for Swedish (Version 0.1)

This is a collection of acoustic and language models that were used in the paper:

Vanhainen, N., & Salvi, G. (2014). Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish. In Proceedings of LREC. Reykjavik, Iceland. [pdf]

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 Unported License.