Whisper Settings

Configure speech-to-text functionality using OpenAI's Whisper model.

Enable Whisper

Master toggle for voice input features.

When enabled:

  • A voice input button appears in the chat interface
  • The keyboard shortcut (default: Cmd/Ctrl+Shift+V) toggles voice input
  • Speech is transcribed locally using the Whisper model

Model Selection

Selected Model

Choose which downloaded Whisper model to use for transcription.

TIP

You must download at least one model before you can use voice input.

Language

Select the language for transcription:

  • Auto Detect: Automatically detect the spoken language
  • Or choose a specific language for better accuracy

Supported languages include: English, Chinese, Japanese, Korean, German, French, Spanish, Portuguese, Russian, Italian, and many more.

Available Models

Download Whisper models based on your needs:

| Model | Size | Speed | Accuracy | Best For |
|---|---|---|---|---|
| Large V3 Turbo (Recommended) | 1.6 GB | Fast | High | Best overall choice |
| Tiny | 75 MB | Fastest | Low | Quick tests, limited storage |
| Tiny (English) | 75 MB | Fastest | Low | English only, fastest |
| Base | 142 MB | Very Fast | Medium | Daily use, balanced |
| Base (English) | 142 MB | Very Fast | Medium | English only |
| Small | 466 MB | Fast | Good | Most users |
| Small (English) | 466 MB | Fast | Good | English only |
| Medium | 1.5 GB | Medium | High | High accuracy needs |
| Medium (English) | 1.5 GB | Medium | High | English only |
| Large V3 | 3.1 GB | Slow | Highest | Best accuracy |
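
As a rough illustration of how the table above can guide a choice, the sketch below picks the most accurate multilingual model that fits a given storage budget. The sizes and accuracy ratings are copied from the table. The `MODELS` list, the `ACCURACY_RANK` ordering, and the `pick_model` function are hypothetical names used only for illustration and are not part of Alma.

```python
# Sizes (in MB) and accuracy labels copied from the Available Models table.
# Treating 1 GB as 1000 MB is an assumption made for this sketch.
MODELS = [
    ("Large V3 Turbo", 1600, "High"),
    ("Tiny", 75, "Low"),
    ("Base", 142, "Medium"),
    ("Small", 466, "Good"),
    ("Medium", 1500, "High"),
    ("Large V3", 3100, "Highest"),
]

# Hypothetical ordering of the accuracy labels, lowest to highest.
ACCURACY_RANK = {"Low": 0, "Medium": 1, "Good": 2, "High": 3, "Highest": 4}


def pick_model(free_mb):
    """Return the name of the most accurate model that fits, or None."""
    fitting = [m for m in MODELS if m[1] <= free_mb]
    if not fitting:
        return None
    # max() returns the first of equally ranked entries, so the
    # recommended Large V3 Turbo wins over Medium when both fit.
    best = max(fitting, key=lambda m: ACCURACY_RANK[m[2]])
    return best[0]
```

For example, `pick_model(500)` selects Small, while a budget of 2000 MB selects Large V3 Turbo. Ranking by accuracy alone ignores speed, so on slower machines a smaller model may still be the better choice.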

English-Only Models

Models with the "(English)" suffix are optimized for English and may provide better accuracy for English speech, but they cannot transcribe other languages.

Managing Models

Downloading a Model

  1. Find the model you want in the Available Models list
  2. Click the Download button
  3. Wait for the download to complete
  4. The model will show a checkmark when ready

Download progress shows:

  • Percentage complete
  • Download speed
  • Estimated time remaining
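
The three progress values relate to each other in a simple way. The sketch below shows one common way to derive them from the bytes received so far and the elapsed time. It is an assumption about how such figures are typically computed, not a description of Alma's implementation, and `download_progress` is a hypothetical name.

```python
def download_progress(bytes_done, bytes_total, elapsed_seconds):
    """Return (percent, bytes_per_second, seconds_remaining)."""
    percent = 100.0 * bytes_done / bytes_total
    # Average speed since the download started.
    speed = bytes_done / elapsed_seconds if elapsed_seconds > 0 else 0.0
    remaining = bytes_total - bytes_done
    # Without a measured speed there is no meaningful estimate.
    eta = remaining / speed if speed > 0 else None
    return percent, speed, eta
```

For the 466 MB Small model, having received half of the file after 10 seconds gives 50 percent complete, an average of 23.3 MB per second, and about 10 seconds remaining.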

Deleting a Model

  1. Click the delete (trash) icon next to a downloaded model
  2. Confirm the deletion

WARNING

If you delete the currently selected model, Alma will switch to another available model or disable Whisper if no models remain.
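
The fallback described in the warning can be sketched as follows. The function name, its arguments, and the rule for choosing the replacement model (here, simply the first remaining one) are assumptions for illustration. The documentation does not specify which model Alma switches to.

```python
def delete_model(downloaded, selected, to_delete):
    """Return (remaining_models, new_selected, disable_whisper)."""
    remaining = [m for m in downloaded if m != to_delete]
    if to_delete != selected:
        # The selected model is untouched, so nothing else changes.
        return remaining, selected, False
    if remaining:
        # Assumption: fall back to the first remaining model.
        return remaining, remaining[0], False
    # No models are left, so Whisper has to be disabled.
    return remaining, None, True
```

Deleting the only downloaded model returns an empty list, no selection, and a flag indicating that Whisper should be disabled.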

Usage

Once configured:

  1. Click the microphone icon in the chat input, or use the keyboard shortcut
  2. Speak clearly into your microphone
  3. Your speech will be transcribed and inserted into the input field
  4. Edit if needed, then send your message