Getting started

Features

Next-gen Kaldi currently supports speech recognition (ASR), speech synthesis (TTS), keyword spotting (KWS), voice activity detection (VAD), speaker identification, spoken language identification, and more. Some tasks have both training and deployment pipelines, while others have deployment support only, built on excellent third-party open-source libraries. The details are as follows:

| Task | Training | Deployment | Docs |
|---|---|---|---|
| Speech recognition (ASR) | Training | Deployment | |
| Speech synthesis (TTS) | Training | Deployment | |
| Keyword spotting (KWS) | Training | Deployment | |
| Voice activity detection (VAD) | | Deployment | |
| Speaker identification | | Deployment | |
| Spoken language identification | | Deployment | |
