# Recognize Speech with Whisper

Transcribes speech from an audio file into text. Supported format: mp3. Maximum file size: 25 MB.
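
The two input constraints above (mp3 only, at most 25 MB) can be checked before the file reaches the block. The following is a minimal sketch, not part of the product: `check_audio_file` and `MAX_AUDIO_BYTES` are hypothetical names, and reading "25 MB" as 25 × 1024 × 1024 bytes is an assumption, since the page does not say how the limit is counted.

```python
from pathlib import Path

# Assumption: "25 MB" is read as 25 * 1024 * 1024 bytes; the block may count differently.
MAX_AUDIO_BYTES = 25 * 1024 * 1024


def check_audio_file(path: str) -> list[str]:
    """Return a list of problems; an empty list means the file looks acceptable."""
    problems = []
    file = Path(path)

    # Only the extension is checked here, not the actual audio encoding.
    if file.suffix.lower() != ".mp3":
        problems.append(f"unsupported format '{file.suffix}': only mp3 is supported")

    if not file.is_file():
        problems.append(f"file not found: {file}")
    elif file.stat().st_size > MAX_AUDIO_BYTES:
        size = file.stat().st_size
        problems.append(f"file is {size} bytes, over the {MAX_AUDIO_BYTES}-byte limit")

    return problems
```

Returning a list of problems rather than raising on the first one lets a calling script report every issue with the file in a single pass.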

<table data-header-hidden><thead><tr><th width="282.24993896484375" valign="top"></th><th width="323.56671142578125" valign="top"></th></tr></thead><tbody><tr><td valign="top">Audio File</td><td valign="top">[Text] The path and name of the input audio file. Supported format: mp3. Maximum file size: 25 MB.</td></tr><tr><td valign="top">Model</td><td valign="top">Select the model to use for speech recognition.</td></tr><tr><td valign="top">Audio Language</td><td valign="top"><p>[Text] Optional. The language of the audio as an ISO 639-1 code. Specifying it improves recognition accuracy and processing speed.</p><p>For example:</p><ul><li>"en" - English;</li><li>"ru" - Russian.</li></ul></td></tr><tr><td valign="top">Prompt</td><td valign="top">[Text] Optional. A prompt for the model. The prompt must be in the same language as the audio.</td></tr><tr><td valign="top">Temperature</td><td valign="top"><p>[Number] Sampling temperature from 0 to 1. Higher values, such as <code>0.8</code>, make the output more random, while lower values, such as <code>0.2</code>, make it more focused and deterministic.</p><p>If set to <code>0</code>, the model uses log probability to automatically raise the temperature until certain thresholds are reached.</p></td></tr><tr><td valign="top">Timeout</td><td valign="top">[Number] Maximum time to wait for a response, in seconds.</td></tr><tr><td valign="top">Result</td><td valign="top">[Text] The recognized text.</td></tr><tr><td valign="top">Error Handling Level</td><td valign="top"><p>Select the error handling level. Possible values:</p><ul><li>"Default" - default;</li><li>"Ignore" - errors are ignored;</li><li>"Handle" - errors are handled.</li></ul><p>If "Default" is selected, the value from the "Start" block of this diagram is used.</p></td></tr><tr><td valign="top">Message Level</td><td valign="top"><p>Select the level of messages that blocks output during operation. Possible values:</p><ul><li>"Default" - default;</li><li>"Release" - output is disabled;</li><li>"Debug" - basic information is output;</li><li>"Detailed" - detailed information is output.</li></ul><p>If "Default" is selected, the value from the "Start" block of this diagram is used.</p></td></tr><tr><td valign="top">Error Message</td><td valign="top">[Text] Detailed information about the error if the block fails to execute correctly.</td></tr></tbody></table>

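The value ranges in the table (a two-letter ISO 639-1 language code, a temperature from 0 to 1, a timeout in seconds) can be sanity-checked before the block runs. This is a rough sketch under stated assumptions: `check_parameters` is a hypothetical helper, the default values are placeholders rather than the block's real defaults, and the language check only tests the two-letter shape of the code, not whether the code is actually assigned or supported by the model.

```python
import re

# Assumption: a language value is accepted when it is two lowercase ASCII letters,
# which is the shape of an ISO 639-1 code. Assigned codes are not looked up.
_ISO_639_1_SHAPE = re.compile(r"[a-z]{2}")


def check_parameters(language: str = "", temperature: float = 0.0,
                     timeout: float = 60.0) -> list[str]:
    """Return a list of problems; an empty list means the values are in range."""
    problems = []

    # The language is optional, so an empty string is allowed.
    if language and not _ISO_639_1_SHAPE.fullmatch(language):
        problems.append(f"language '{language}' is not a two-letter ISO 639-1 code")

    # The table gives the sampling temperature range as 0 to 1.
    if not 0 <= temperature <= 1:
        problems.append(f"temperature {temperature} is outside the range 0 to 1")

    # The timeout is a wait time in seconds; a non-positive value is treated as an error here.
    if timeout <= 0:
        problems.append(f"timeout {timeout} must be a positive number of seconds")

    return problems
```

For example, `check_parameters("en", 0.2, 30)` returns an empty list, while `check_parameters("english", 0.2, 30)` reports the malformed language code.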
