Keyword spotting models

Currently, we offer two base models, one for Chinese and one for English, each available for both the PyTorch and onnxruntime frameworks. The PyTorch models are mainly intended for fine-tuning, while the ONNX models are mainly intended for deployment. You can first use the ONNX models to test performance on your target keywords. If they do not achieve the expected results, consider fine-tuning from the base models we provide.

Language	Framework	Download link	Usage	Description
Chinese	PyTorch	github	Training and Fine-tuning	This model is trained on WenetSpeech L (10,000 hours) and has about 3.3M parameters. The modeling units are pinyin (initials and finals). It can be used as a base model for fine-tuning.
Chinese	onnxruntime	github	Deployment docs	This model is exported from the Chinese PyTorch model above and can be used for deployment with sherpa-onnx.
English	PyTorch	github	Training and Fine-tuning	This model is trained on GigaSpeech XL (10,000 hours) and has about 3.3M parameters. The modeling units are BPE units. It can be used as a base model for fine-tuning.
English	onnxruntime	github	Deployment docs	This model is exported from the English PyTorch model above and can be used for deployment with sherpa-onnx.
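
To make the "pinyin (initials and finals)" modeling units of the Chinese model more concrete, here is a minimal sketch of how a keyword written in toneless pinyin could be split into such units. It is only an illustration: the initial list (which treats `y` and `w` as initials), the function names `split_syllable` and `keyword_to_units`, and the toneless input are all assumptions made for this example. The token set shipped with the model may differ (for example, it may carry tone marks), so check the model's token file before relying on this.

```python
# Illustrative only: splits toneless pinyin syllables into initial + final.
# The initial list below is an assumption for this sketch, not the model's
# actual token table. Two-letter initials come first so "zh" wins over "z".
INITIALS = [
    "zh", "ch", "sh",
    "b", "p", "m", "f", "d", "t", "n", "l",
    "g", "k", "h", "j", "q", "x", "r", "z", "c", "s",
    "y", "w",  # assumption: treat y/w as initials
]


def split_syllable(syllable: str) -> list[str]:
    """Split one pinyin syllable into [initial, final], or [final] if no initial."""
    for initial in INITIALS:
        # Require a non-empty remainder so a bare final is never emptied.
        if syllable.startswith(initial) and len(syllable) > len(initial):
            return [initial, syllable[len(initial):]]
    return [syllable]  # zero-initial syllable such as "an" or "er"


def keyword_to_units(keyword: str) -> list[str]:
    """Convert a space-separated pinyin keyword into a flat list of units."""
    units: list[str] = []
    for syllable in keyword.lower().split():
        units.extend(split_syllable(syllable))
    return units


if __name__ == "__main__":
    print(keyword_to_units("ni hao"))
    print(keyword_to_units("zhong guo"))
```

A keyword is then represented as a sequence of these units rather than as whole characters, which is what lets the same base model be tested against arbitrary new keywords before any fine-tuning.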
