Whisper speech-to-text that runs on all Hailo accelerators (Hailo-8, Hailo-8L, Hailo-10H).
Record from your microphone and get real-time transcription, or transcribe audio files.
Unlike simple_whisper_chat (H10-only, uses hailo_platform.genai.Speech2Text), this app uses the low-level HailoRT InferModel API with separate encoder/decoder HEFs — compatible with all Hailo devices.
- Hailo-8, Hailo-8L, or Hailo-10H accelerator
- Python 3.10+, HailoRT 4.20+
ffmpegandlibportaudio2:sudo apt install ffmpeg libportaudio2
pip install -e ".[speech-rec]"Models (HEF files and decoder assets) are managed by the repo's central resource system
(resources_config.yaml) and auto-downloaded on first run via resolve_hef_paths().
No manual download step is needed.
Live microphone recording (interactive loop):
python -m hailo_apps.python.standalone_apps.speech_recognition.speech_recognitionPress Enter to start recording, Enter again to stop. Press 'q' to quit.
Transcribe an audio file:
python -m hailo_apps.python.standalone_apps.speech_recognition.speech_recognition \
--audio /path/to/audio.wav| Flag | Description |
|---|---|
--audio PATH |
Transcribe a file instead of recording |
--arch {hailo8,hailo8l,hailo10h} |
Target architecture (auto-detected if omitted) |
--variant {base,tiny,tiny.en} |
Whisper variant (default: base) |
--duration N |
Max recording length in seconds (default: 10) |
--list-models |
List available models and exit |
| Variant | Hailo-8 | Hailo-8L | Hailo-10H |
|---|---|---|---|
base |
✓ | ✓ | ✓ |
tiny |
✓ | ✓ | ✓ |
tiny.en |
— | — | ✓ |