The stt module converts normalized speech frontend frame data into a text transcription.
| Mode | Input | Output | Description |
|---|---|---|---|
nf
|
w
|
Accepts normalized feature frames and performs speech-to-text. Produces a word label stream. |
Example:
w <- stt (_, nf)
| Diag | Lang* |
| DiagFile | LatFile |
| DiagFileLabel | LatScale |
| DnnScale | LatWordFile |
| EnableConfidence | Mode |
| FrameDupl | ModeValue |
*You must set the value of this configuration parameter by setting the corresponding action parameter.
|
|