Confirmed users
58
edits
Andrenatal (talk | contribs) |
Andrenatal (talk | contribs) |
||
Line 24: | Line 24: | ||
* Decoder | * Decoder | ||
** Third-party licensing is extremely costly (usual unit is millions) and lead to an unwanted dependency. Write a decoder from scratch is tough, and requires highly specialized and difficult to find engineers. | ** Third-party licensing is extremely costly (usual unit is millions) and lead to an unwanted dependency. Write a decoder from scratch is tough, and requires highly specialized and difficult to find engineers. | ||
The good news are that exists great open source toolkits that we can use and enhance. I am a long time supportert and contributor of CMU Sphinx that have a number of quality models on different languages openly available. Plus pocketsphinx can run very fast and accurate when well tuned for both FSG and LVSCR language models. | |||
For LVSCR we can also consider Julius and benchmark it since he has great proved results. | |||
* Automatic retrain | * Automatic retrain |