Graduate School of Language Technology

Nordic Graduate School of Language Technology

Speech and Speaker Recognition (2007)

Back to Speech and Speaker Recognition home page

Additional papers

A selection of papers and other publications will be used as additional reading material for each subtopic.

Most papers can be found on the web. Some of the external links require login. Some papers will be printed and distributed.

Speech Recognition

Lawrence R. Rabiner (1989) A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition, Proceedings of the IEEE, vol 77, no. 2, pp. 257-286. http://www.caip.rutgers.edu/%7Elrr/Reprints/tutorial on hmm and applications.pdf

S. Young (1996). "Large Vocabulary Continuous Speech Recognition." IEEE Signal Processing Magazine 13(5): 45-57. http://mi.eng.cam.ac.uk/~sjy/papers/youn96.ps.gz

Ronald Rosenfeld (2000) Two decades of Statistical Language Modeling: Where Do We Go From Here? Proceedings of the IEEE, 88(8), (pdf)

Ingunn Amdal, Eric Fossler-Lussier (2003) "Pronunciation variation modeling in automatic speech recognition", Telektronikk, vol. 99, no. 2 http://www.telenor.com/telektronikk/volumes/pdf/2.2003/Side_70-82.pdf

R.P. Lippman (1997) Speech recognition by machines and humans, Speech Communication vol 22 no 1, pp 1-15 (pdf)

M Mohri, F Pereira, M Riley (2000) Weighted finite state transducers in speech recognition, ISCA ITRW ASR2000, Paris http://www.cs.nyu.edu/~mohri/postscript/asr2000.ps

Speaker Recognition

Gish, H. and Schmidt, M. (1994): "Text-independent speaker identification", IEEE Signal Processing Magazine Oct. 94, pp. 18-32 (pdf)

S. Furui (1997): "Recent Advances in Speaker Recognition", Pattern Recognition Letters, vol 18, pp 859-872. (pdf)

Douglas A. Reynolds, Thomas F. Quatieri, Robert B. Dunn (2000): "Speaker verification using adapted Gaussian mixture models", Digital Signal Processing, vol. 10, no. 1-3, Jan-July 2000 (pdf)

Bimbot, F., Bonastre, J.-F., Fredouille, C., Gravier, G., Magrin-Chagnolleau, I., Meignier, S., Merlin, T., Ortega-García, J., Petrovska-Delacrétaz, D., and Reynolds, D. (2004): "A Tutorial on Text-Independent Speaker Verification", EURASIP Journal on Applied Signal Processing, Hindawi Publishing Corporation Vol. 2004, no 4, pp 432-451 (pdf)

Melin, H. (2006) Automatic speaker verification on site and by telephone: methods, applications and assessment, Doctoral Thesis, Dept of Speech, Music and Hearing, School of Computer Science and Communication, KTH, Stockholm. (Full report pdf, 332 pages) Selections: (ASV system 18 p) (Corpora 24 p), (PER system 12 p) (PER experiments 34p)
Nakasone, H. and Beck, S. D., “Forensic automatic speaker recognition,”, 2001: A Speaker Odyssey—The Speaker Recognition
Workshop, pp. 139–142, Crete, Greece, June 2001. (ISCA Archive pdf)
Mirghafori, N., Hébert, M. (2004) "Parameterization of the score threshold for a text-dependent adaptive speaker verification system", Proc. of ICASSP 2004, pp 361-364.
Hébert, M., Mirghafori, N. (2004) "Desperately seeking impostors: Data-mining for competitive impostor testing in a text-dependent speaker verification system, Proc. of ICASSP 2004, pp 365-368.
Hébert, M., Foies, D. (2005) "T-norm for Text-Dependent Commercial Speaker Verification Applications: Effect of Lexical Mismatch", Proc. of ICASSP 2005, pp 729-732.
Teunen, R., Shahshahani, B., Heck, L. (2000) "A model-based transformational approach to robust speaker recognition, Proc of ICSLP 2000.
Mirghafori, N., Heck, L. (2002) "An adaptive speaker verification system with speaker dependent a priori decision thresholds", Proc of ICSLP 2002, pp 589-592. (ISCA Archive pdf)
Doddington, G. (2001) "Speaker Recognition based on Idiolectal Differences between Speakers", Proc of Eurospeech 2001, pp 2521-2524 (ISCA Archive pdf).
Zetterholm, E., Blomberg, M., & Elenius, D. (2004). A comparison between human perception and a speaker verification system score of a voice imitation. In Proc of Tenth Australian International Conference on Speech Science & Technology (pp. 393-397). Macquarie Univ, Sydney, Australia. [pdf]

Some more links

Speech Technology Magazine's NewsBlast http://www.speechtechmag.com/eletter/archives/

CTT - Selection of conferences/workshops http://www.speech.kth.se/conferences/