Nordic Graduate School of
Language Technology
Back
to Speech and Speaker Recognition home page
A selection of papers and other publications will be used as additional reading material for each subtopic.
Most papers can be found on the web. Some of the external links require login. Some papers will be printed and distributed.
Speech
Recognition
Lawrence
R. Rabiner (1989) A Tutorial on Hidden
Markov Models and
Selected Applications in Speech Recognition, Proceedings of the IEEE,
vol 77, no. 2, pp. 257-286. http://www.caip.rutgers.edu/%7Elrr/Reprints/tutorial
on hmm and applications.pdf
S. Young
(1996). "Large Vocabulary Continuous Speech
Recognition." IEEE Signal Processing Magazine 13(5):
45-57. http://mi.eng.cam.ac.uk/~sjy/papers/youn96.ps.gz
Ronald
Rosenfeld (2000) Two decades of
Statistical
Language Modeling: Where Do We Go From
Here? Proceedings
of the IEEE, 88(8), (pdf)
Ingunn Amdal,
Eric Fossler-Lussier
(2003) "Pronunciation variation modeling
in
automatic speech recognition", Telektronikk,
vol. 99, no. 2 http://www.telenor.com/telektronikk/volumes/pdf/2.2003/Side_70-82.pdf
R.P. Lippman (1997) Speech recognition by machines
and humans, Speech
Communication vol 22 no 1, pp 1-15 (pdf)
M Mohri,
F Pereira, M Riley (2000) Weighted finite state
transducers in speech recognition, ISCA ITRW ASR2000,
Speaker
Recognition
Gish, H.
and Schmidt, M. (1994): "Text-independent speaker identification", IEEE
Signal Processing Magazine Oct. 94, pp. 18-32 (pdf)
S. Furui
(1997): "Recent Advances in Speaker
Recognition", Pattern Recognition Letters, vol
18, pp 859-872. (pdf)
Douglas A.
Reynolds, Thomas F. Quatieri, Robert B.
Dunn (2000):
"Speaker verification using adapted Gaussian mixture models", Digital
Signal Processing, vol. 10, no. 1-3, Jan-July 2000 (pdf)
Bimbot, F., Bonastre,
J.-F., Fredouille,
C., Gravier, G., Magrin-Chagnolleau,
I., Meignier, S., Merlin, T., Ortega-García, J., Petrovska-Delacrétaz,
D., and Reynolds, D. (2004): "A Tutorial on Text-Independent Speaker
Verification", EURASIP Journal on Applied Signal Processing,
Hindawi Publishing Corporation Vol. 2004,
no 4, pp 432-451 (pdf)
Melin, H. (2006) Automatic
speaker verification on site and by telephone: methods, applications
and assessment, Doctoral Thesis, Dept of Speech, Music and
Hearing, School of Computer Science and Communication, KTH, Stockholm. (Full
report pdf, 332 pages) Selections: (ASV
system 18 p) (Corpora
24 p), (PER
system 12 p) (PER
experiments 34p)
Nakasone, H. and Beck, S. D., “Forensic automatic speaker
recognition,”, 2001: A Speaker
Odyssey—The Speaker Recognition
Workshop, pp. 139–142, Crete,
Greece, June 2001. (ISCA
Archive pdf)
Mirghafori, N., Hébert, M. (2004) "Parameterization of the score
threshold for a text-dependent adaptive speaker verification system",
Proc. of ICASSP 2004, pp 361-364.
Hébert, M., Mirghafori, N. (2004) "Desperately seeking impostors:
Data-mining for competitive impostor testing in a text-dependent
speaker verification system, Proc. of ICASSP 2004, pp 365-368.
Hébert, M., Foies, D. (2005) "T-norm for Text-Dependent Commercial
Speaker Verification Applications: Effect of Lexical Mismatch", Proc.
of ICASSP 2005, pp 729-732.
Teunen, R., Shahshahani, B., Heck, L. (2000) "A model-based
transformational approach to robust speaker recognition, Proc of ICSLP
2000.
Mirghafori, N., Heck, L. (2002) "An adaptive speaker verification
system with speaker dependent a priori decision thresholds", Proc of
ICSLP 2002, pp 589-592. (ISCA Archive pdf)
Doddington, G. (2001) "Speaker Recognition based on Idiolectal
Differences between Speakers", Proc of Eurospeech 2001, pp 2521-2524
(ISCA Archive pdf).
Zetterholm, E., Blomberg, M., & Elenius, D. (2004). A comparison
between human perception and a speaker verification system score of a
voice imitation. In Proc of Tenth Australian International Conference
on Speech Science & Technology (pp. 393-397). Macquarie Univ,
Sydney, Australia. [pdf]
Speech Technology Magazine's NewsBlast http://www.speechtechmag.com/eletter/archives/
CTT - Selection of conferences/workshops http://www.speech.kth.se/conferences/