Skip to content

Keyword spotting models

Currently, we offer two basic models in Chinese and English, both supporting the Pytorch and onnxruntime frameworks. The Pytorch model is mainly used for fine-tuning, while the onnx model is mainly used for deployment. You can first use onnx models to test the performance of the target keywords. If the expected results are not achieved, then consider fine-tuning based on the basic model we provide.

Language Framework Download link Usage Description
Chinese Pytorch github Training and Fine-tuning This model is trained on Wenetspeech L (10,000 hours), with a model parameter of about 3.3M. The modeling units are pinyin (initials and finals), and can be used as a basic model for fine-tuning.
Chinese onnxruntime github Deployment docs This model is exported from the model above,could be used for deployment on sherpa-onnx
English Pytorch github Training and Fine-tuning This model is trained on Gigaspeech XL (10,000 hours), with a model parameter of about 3.3M. The modeling units are BPEs, and can be used as a basic model for fine-tuning.
English onnxruntime github Deployment docs This model is exported from the model above,could be used for deployment on sherpa-onnx

Comments