Kaldi (software) - Misplaced Pages

Open-source speech recognition software toolkit

This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these messages)

The topic of this article may not meet Misplaced Pages's notability guidelines for products and services. Please help to demonstrate the notability of the topic by citing reliable secondary sources that are independent of the topic and provide significant coverage of it beyond a mere trivial mention. If notability cannot be shown, the article is likely to be merged, redirected, or deleted.
Find sources: "Kaldi" software – news · newspapers · books · scholar · JSTOR (October 2022) (Learn how and when to remove this message)

This article relies excessively on references to primary sources. Please improve this article by adding secondary or tertiary sources.
Find sources: "Kaldi" software – news · newspapers · books · scholar · JSTOR (October 2022) (Learn how and when to remove this message)

(Learn how and when to remove this message)

Kaldi
Developer(s)	Daniel Povey and others

Stable release	Revision 3122 / October 2013; 11 years ago (2013-10)

Repository	https://github.com/kaldi-asr/kaldi
Written in	C++
Operating system	Unix systems (Linux, BSD, OSX 10.{8,9} etc.), Windows (via Cygwin)
Type	Speech recognition
License	Apache License v.2.0
Website	kaldi-asr.org

Kaldi is an open-source speech recognition toolkit written in C++ for speech recognition and signal processing, freely available under the Apache License v2.0.

Kaldi aims to provide software that is flexible and extensible, and is intended for use by automatic speech recognition (ASR) researchers for building a recognition system.

It supports linear transforms, MMI, boosted MMI and MCE discriminative training, feature-space discriminative training, and deep neural networks.

Kaldi is capable of generating features like mfcc, fbank, fMLLR, etc. Hence in recent deep neural network research, a popular usage of Kaldi is to pre-process raw waveform into acoustic feature for end-to-end neural models.

Kaldi has been incorporated as part of the CHiME Speech Separation and Recognition Challenge over several successive events. The software was initially developed as part of a 2009 workshop at Johns Hopkins University.

Kaldi is named after the legendary Ethiopian goat herder Kaldi who was said to have discovered the coffee plant.

References

"Kaldi: Legal stuff". kaldi-asr.org.
"Kaldi: About the Kaldi project". kaldi-asr.org.
"Kaldi: Deep Neural Networks in Kaldi". kaldi-asr.org.
"The 4th CHiME Speech Separation and Recognition Challenge". Archived from the original on 16 February 2017. Retrieved 15 February 2017.
"The 3rd CHiME Speech Separation and Recognition Challenge". Retrieved 15 February 2017.
Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, et al.. The second 'CHiME' Speech Separation and Recognition Challenge: Datasets, tasks and baselines. ICASSP - 38th International Conference on Acoustics, Speech, and Signal Processing - 2013, May 2013, Vancouver, Canada. pp.126-130, 2013.
"History of the Kaldi project". Retrieved 26 July 2017.
"Kaldi: About the Kaldi project".

External links

Official website
Kaldi – The official GitHub project
Kaldi paper - The Kaldi Speech Recognition Toolkit
VOSK – open source and commercial models from Alpha Cephei on Kaldi foundations

This computational linguistics-related article is a stub. You can help Misplaced Pages by expanding it.

Categories:

See also

References

External links