Software:Kaldi
Developer(s) | Daniel Povey and others |
---|---|
Stable release | Revision 3122
/ October 2013 |
Repository | https://github.com/kaldi-asr/kaldi |
Written in | C++ |
Operating system | Unix systems (Linux, BSD, OSX 10.{8,9} etc.), Windows (via Cygwin) |
Type | Speech recognition |
License | Apache License v.2.0[1] |
Website | kaldi-asr |
Kaldi is an open-source speech recognition toolkit written in C++ for speech recognition and signal processing, freely available under the Apache License v2.0.
Kaldi aims to provide software that is flexible and extensible,[2] and is intended for use by automatic speech recognition (ASR) researchers for building a recognition system.
It supports linear transforms, MMI, boosted MMI and MCE discriminative training, feature-space discriminative training, and deep neural networks.[3]
Kaldi is capable of generating features like mfcc, fbank, fMLLR, etc. Hence in recent deep neural network research, a popular usage of Kaldi is to pre-process raw waveform into acoustic feature for end-to-end neural models.
Kaldi has been incorporated as part of the CHiME Speech Separation and Recognition Challenge over several successive events.[4][5][6] The software was initially developed as part of a 2009 workshop at Johns Hopkins University.[7]
Kaldi is named after the legendary Ethiopian goat herder Kaldi who was said to have discovered the coffee plant.[8]
See also
References
- ↑ "Kaldi: Legal stuff". http://kaldi-asr.org/doc/legal.html.
- ↑ "Kaldi: About the Kaldi project". http://kaldi-asr.org/doc/about.html.
- ↑ "Kaldi: Deep Neural Networks in Kaldi". http://kaldi-asr.org/doc/dnn.html.
- ↑ "The 4th CHiME Speech Separation and Recognition Challenge". http://spandh.dcs.shef.ac.uk/chime_challenge/software.html. Retrieved 15 February 2017.
- ↑ "The 3rd CHiME Speech Separation and Recognition Challenge". http://spandh.dcs.shef.ac.uk/chime_challenge/chime2015/software.html. Retrieved 15 February 2017.
- ↑ Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, et al.. The second 'CHiME' Speech Separation and Recognition Challenge: Datasets, tasks and baselines. ICASSP - 38th International Conference on Acoustics, Speech, and Signal Processing - 2013, May 2013, Vancouver, Canada. pp.126-130, 2013.
- ↑ "History of the Kaldi project". http://kaldi-asr.org/doc/history.html. Retrieved 26 July 2017.
- ↑ "Kaldi: About the Kaldi project". https://kaldi-asr.org/doc/about.html.
External links
- No URL found. Please specify a URL here or add one to Wikidata.
- Kaldi – The official GitHub project
- How to start with Kaldi and Speech Recognition - A guide regarding the different parts of the system
- Kaldi paper - The Kaldi Speech Recognition Toolkit
- VOSK – open source and commercial models from Alpha Cephei on Kaldi foundations