Keyword spotting models¶
Currently, we offer two basic models in Chinese and English, both supporting the Pytorch and onnxruntime frameworks. The Pytorch model is mainly used for fine-tuning, while the onnx model is mainly used for deployment. You can first use onnx models to test the performance of the target keywords. If the expected results are not achieved, then consider fine-tuning based on the basic model we provide.
Language | Framework | Download link | Usage | Description |
---|---|---|---|---|
Chinese | Pytorch | github | Training and Fine-tuning | This model is trained on Wenetspeech L (10,000 hours), with a model parameter of about 3.3M. The modeling units are pinyin (initials and finals), and can be used as a basic model for fine-tuning. |
Chinese | onnxruntime | github | Deployment docs | This model is exported from the model above,could be used for deployment on sherpa-onnx |
English | Pytorch | github | Training and Fine-tuning | This model is trained on Gigaspeech XL (10,000 hours), with a model parameter of about 3.3M. The modeling units are BPEs, and can be used as a basic model for fine-tuning. |
English | onnxruntime | github | Deployment docs | This model is exported from the model above,could be used for deployment on sherpa-onnx |