Skip to content

Latest commit

 

History

History
455 lines (349 loc) · 18.7 KB

File metadata and controls

455 lines (349 loc) · 18.7 KB

Alternative Voice Transcription Tools

Transparency wins. Here's an honest comparison of FluidVoice with other voice transcription tools.

1. VoiceInk

Key Features:

  • 100% offline processing (Apple Neural Engine)
  • 99% accuracy claim
  • 100+ languages
  • Personal dictionary training
  • Intelligent app/context detection
  • Voice assistant mode
  • Configurable keyboard shortcuts

Installation:

  • Download from website
  • Homebrew: brew install --cask voiceink
  • Build from source

Differentiation vs FluidVoice:

  • More mature: 103 releases, 18 contributors
  • Better distribution: Pre-built binaries, Homebrew support
  • Voice assistant mode: Extra feature
  • Apple Silicon only: FluidVoice supports Intel too
  • $39.99 cost: FluidVoice is free
  • ⚠️ GPL license: More restrictive than MIT

2. Wispr Flow

  • Website: https://wisprflow.ai/
  • Platform: macOS + Windows + iPhone
  • Pricing: Free (2k words/week) + $15/mo Pro ($12/mo annual)
  • License: Proprietary (Closed Source)
  • Technology: Whisper-based (likely cloud)
  • Compliance: HIPAA-ready, SOC 2 Type II (Enterprise)

Key Features:

  • 100+ languages with auto-detection
  • AI-powered auto-editing (removes filler words)
  • Tone adjustment per application
  • Personal dictionary learning
  • Snippet library for repetitive text
  • Command mode for voice editing (Pro)
  • 220 wpm claimed speed (4x typing)
  • Works in all apps

Free Tier (2,000 words/week):

  • Lightning-fast voice typing
  • 100+ languages
  • Privacy mode
  • iPhone app

Pro Tier ($15/month, $12/month annual):

  • Unlimited words
  • Command mode editing
  • Personalized writing style
  • 2-week free trial

Special Pricing:

  • Students: $6/month (50% off with .edu)
  • Non-profits: $8/month (annual)
  • Enterprise: $24/user/month (SSO, team context)

Differentiation vs FluidVoice:

  • Cross-platform: Windows + iPhone support
  • AI editing: Auto-removes filler, adjusts tone
  • Polished UX: Commercial product quality
  • Command mode: Voice-based editing
  • Easy install: No build required
  • Subscription: $180/year for Pro
  • Cloud-based: Likely not fully offline
  • Word limits: 2k/week on free tier
  • Closed source: No transparency

3. VoiceTypr

  • Website: https://voicetypr.com/
  • Platform: macOS 13+ + Windows 10+ (Intel + Apple Silicon)
  • Pricing: $25 one-time (1 device) / $40 (2 devices)
  • License: Proprietary (Closed Source)
  • Technology: Local AI (Whisper-based), built with Tauri
  • Free Trial: 3 days unlimited, no credit card

Key Features:

  • 100+ languages
  • 100% offline/local processing
  • 99% accuracy claim
  • Works in any app
  • Audio file transcription (MP3, WAV, M4A)
  • Smart formatting modes
  • Toggle or push-to-talk
  • Global hotkey
  • Productivity tracking
  • Cross-platform (macOS + Windows)

Pricing:

  • Pro: $25 one-time (1 device) - was $50
  • Plus: $40 one-time (2 devices) - was $80
  • Lifetime access + all future updates
  • 3-day free trial (unlimited, no CC)

Differentiation vs FluidVoice:

  • Windows support: Cross-platform
  • File transcription: MP3/WAV/M4A support
  • Easy install: Pre-built binaries
  • Productivity tracking: Usage analytics
  • One-time purchase: No subscriptions
  • 100% offline: Like FluidVoice
  • $25-40 cost: FluidVoice is free
  • Closed source: No transparency
  • ⚠️ Limited devices: 1-2 device activations

4. Better Dictation

  • Website: https://betterdictation.com/
  • Platform: macOS (M1+ only, Apple Silicon only)
  • Pricing: $39-149 lifetime + optional $2/mo Pro features
  • License: Proprietary (Closed Source)
  • Technology: OpenAI Whisper on Apple Neural Engine
  • Free Trial: 14-day refund policy

Key Features:

  • 100+ languages
  • Fully offline processing
  • Automatic punctuation
  • Automatic language detection
  • Push-to-talk functionality
  • Works in any application
  • Handles multiple accents

Pricing Tiers:

  • Basic: $39 lifetime (1 device)
  • Flex: $49 lifetime + $2/mo (3 devices, 3 months Pro free)
  • Studio: $149 lifetime + $2/mo/device (10 devices, account manager)
  • Enterprise: Custom pricing (unlimited devices)

Pro Features ($2/month):

  • Stammer correction
  • Automatic formatting
  • Grammar improvement
  • Post-processing with OpenAI prompts

Differentiation vs FluidVoice:

  • Easy install: Pre-built binaries
  • Grammar/formatting: AI post-processing (Pro)
  • Multi-device: Up to 10 devices (Studio)
  • 100% offline: Basic transcription local
  • $39+ cost: FluidVoice is free
  • Apple Silicon only: No Intel support
  • Closed source: No transparency
  • ⚠️ Pro features subscription: $2/mo for advanced features
  • ⚠️ Windows coming soon: Not yet available

5. Monologue

  • Website: https://www.monologue.to/
  • Platform: macOS
  • Pricing: $10/mo standalone OR $30/mo Every bundle (1k words free trial)
  • License: Proprietary (Closed Source)
  • Technology: Whisper-based (cloud processing)
  • Launch: September 2025 (very new)

Key Features:

  • 100+ languages with auto-switching
  • Smart formatting per app context
  • Personal dictionary (auto-learns names/acronyms)
  • Deep context (screen awareness with permission)
  • Flexible modes (email, docs, notes, code)
  • Custom workflow design
  • Privacy: No audio/transcripts saved, screenshots deleted immediately

Pricing:

  • Free Trial: 1,000 words
  • Standalone: $10/month
  • Every Bundle: $30/month (includes Cora, Spiral, Sparkle AI apps + newsletter)

Differentiation vs FluidVoice:

  • Smart formatting: Context-aware output per app
  • Deep context: Screen awareness for better results
  • Custom workflows: Pre-built modes + customization
  • Easy install: Pre-built binary
  • Auto-learning dictionary: Names/acronyms learned automatically
  • $10-30/mo subscription: $120-360/year
  • Cloud-based: Not fully offline
  • Closed source: No transparency
  • Very new: Launched September 2025
  • ⚠️ Word limit on free: 1k words trial only

6. Spokenly

  • Website: https://spokenly.app/
  • Platform: macOS 13+ + iPhone
  • Pricing: Free (local models) OR BYOK (free) OR Pro $7.99/mo
  • License: Proprietary (Closed Source)
  • Technology: Whisper (local + cloud options)
  • Download: 7MB, quick install

Key Features:

  • 100+ languages with auto-detection
  • Real-time transcription
  • AI text processing (grammar, formatting, context)
  • Agent Mode (voice commands)
  • Local-only mode (blocks network)
  • Works in any text input
  • No account required
  • iPhone support

Pricing Tiers:

  • Local Models: Free, unlimited, 100% offline (all Whisper sizes)
  • BYOK (Bring Your Own Keys): Free, use OpenAI/Deepgram/Groq APIs
  • Spokenly Pro: $7.99/month, premium cloud models, no API keys needed

Privacy:

  • Local mode: Audio never leaves device
  • Cloud mode: Audio processed and immediately deleted

Differentiation vs FluidVoice:

  • Free offline option: Unlimited local models
  • Agent Mode: Voice commands feature
  • iPhone support: Cross-device
  • BYOK option: Use own API keys
  • Easy install: 7MB download, pre-built
  • Flexible pricing: Free local OR pay for cloud
  • Closed source: No transparency
  • ⚠️ Pro subscription: $96/year for cloud features
  • Better than FluidVoice?: Free tier is actually competitive

7. SuperWhisper

  • Website: https://superwhisper.com/
  • Platform: macOS 13+ (best on Apple Silicon) + iPhone
  • Pricing: Free tier + $8.49/month Pro
  • License: Proprietary (Closed Source)
  • Technology: Whisper-based (cloud + local)

Key Features:

  • 100+ languages
  • Custom vocabulary support
  • Offline-first design
  • Language translation to English
  • Meeting recording
  • Audio/video file transcription
  • Works in any app
  • Personal AI API key support (Pro)

Free Tier:

  • Voice-to-text in any app
  • Meeting recording
  • Unlimited small models
  • Custom prompt control

Pro Tier ($8.49/month):

  • Personal AI API keys
  • Unlimited AI model usage
  • Language translation
  • File transcription
  • Priority support

Trial: 15 minutes Pro trial + 30-day money-back guarantee

Differentiation vs FluidVoice:

  • iPhone support: Cross-platform
  • Polished UX: Professional commercial product
  • File transcription: Extra feature
  • Translation: Language translation built-in
  • Subscription model: $102/year recurring
  • Closed source: No transparency
  • Cloud dependency: Intel Macs require cloud
  • ⚠️ Free tier limited: Small models only

Feature Comparison Matrix

Feature FluidVoice VoiceInk VoiceTypr Better Dictation Monologue Spokenly Wispr Flow SuperWhisper
Pricing Free $40 one-time $25-40 one-time $39-149 + $2/mo Pro $10-30/mo Free local + $7.99 Pro Free (2k/wk) + $15/mo Free + $8.49/mo
License MIT GPL v3.0 Proprietary Proprietary Proprietary Proprietary Proprietary Proprietary
Platform macOS 14+ (AS only) macOS 14+ (AS only) macOS 13+ + Win 10+ macOS (M1+ only) macOS macOS 13+ + iPhone macOS + Win + iPhone macOS 13+ + iPhone
Local Processing ✅ 100% offline ✅ 100% offline ✅ 100% offline ✅ 100% offline ⚠️ Cloud ✅ Free local option ⚠️ Cloud-based ✅ Local + ☁️ Cloud
Multilingual ✅ 25 languages ✅ 100+ ✅ 100+ ✅ 100+ ✅ 100+ ✅ 100+ ✅ 100+ ✅ 100+
Custom Vocabulary ✅ JSONC config ✅ Personal dict ✅ Auto-learns ✅ Learns vocab ✅ Custom vocab
Hotkey Support ✅ Configurable ✅ Configurable ✅ Global ✅ Push-to-talk
Privacy ✅ 100% offline ✅ 100% offline ✅ 100% offline ✅ 100% offline ⚠️ Cloud, no saves ✅ Local mode blocks ⚠️ Cloud ⚠️ Offline-first
Open Source ✅ MIT ✅ GPL v3.0 ❌ Closed ❌ Closed ❌ Closed ❌ Closed ❌ Closed ❌ Closed
Pre-built Binaries ❌ Build source ✅ Download+Brew ✅ Download ✅ Download ✅ Download ✅ 7MB download ✅ Download ✅ Download
Installation ⚠️ Code signing ✅ Simple ✅ Simple ✅ Simple ✅ Simple ✅ Quick install ✅ Simple ✅ Simple
Agent/Voice Cmds ✅ Voice assistant ✅ Agent Mode
Smart Formatting ✅ Per-app context ✅ AI formatting ✅ Auto-edit, tone
Deep Context ✅ Screen awareness
Custom Workflows ✅ Pre-built modes
AI Editing ✅ Pro: Grammar ✅ Smart format ✅ Grammar/context ✅ Filler removal
BYOK (Own API) ✅ Free tier ✅ Pro tier
File Transcription ✅ MP3/WAV/M4A ✅ Pro only
Translation ✅ Pro only
Meeting Recording ✅ Free tier
Productivity Track
Intel Mac Support ⚠️ Cloud only
Windows Support ⚠️ Coming soon
Word Limits ∞ Unlimited ∞ Unlimited ∞ Unlimited ∞ Unlimited ⚠️ 1k trial ∞ Free unlimited local ⚠️ 2k/wk free ∞ Free unlimited
Device Limit ∞ Unlimited ∞ Unlimited 1-2 devices 1-10 devices Per account Per account Per account Per account
Initial Cost $0 $40 $25-40 $39-149 $0 (1k trial) $0 (local free) $0 $0
Annual Recurring $0 $0 $0 $24 (Pro) $120-360 $96 (Pro) $144-180 $102

Comparison Summary

FluidVoice Unique Advantages

1. Truly Free & Open

  • $0 forever vs VoiceInk $39.99 or SuperWhisper $102/year
  • MIT license vs GPL (VoiceInk) or closed source (SuperWhisper)
  • No feature paywalls - everything included

2. Maximum Privacy

  • 100% offline on Apple Silicon
  • Open source auditable - verify no telemetry yourself
  • No cloud fallback - your audio never leaves the machine

3. Developer-Friendly

  • JSONC vocabulary config - comments, version control friendly
  • MIT license - use in commercial projects freely
  • Hackable codebase - customize anything you want

FluidVoice Weaknesses

1. Distribution Gap

  • No pre-built binaries - must build from source
  • No Homebrew - manual installation only
  • Code signing required - 5-minute setup hurdle
  • 📊 Impact: Limits audience to technical users

2. Feature Gaps vs Competitors

  • No file transcription (SuperWhisper has it)
  • No voice assistant mode (VoiceInk has it)
  • No translation (SuperWhisper has it)
  • No meeting recording (SuperWhisper has it)
  • Only 25 languages vs 100+ (but covers major ones)

3. Polish & Maturity

  • ⚠️ Less polished UX than commercial products
  • ⚠️ Fewer releases than VoiceInk (103 releases)
  • ⚠️ Smaller community (fewer contributors)

Market Positioning

Who Should Choose FluidVoice?

Primary Users:

  • 🎯 Privacy-first developers - need auditable, 100% offline solution
  • 🎯 Open source advocates - want MIT-licensed, hackable software
  • 🎯 Apple Silicon users - M1/M2/M3 Macs only (Parakeet requires MLX)
  • 🎯 Budget-conscious - $0 forever vs $40-$100/year competitors
  • 🎯 Technical users - comfortable building from source

Example Personas:

  • Security researcher who can't send audio to cloud
  • Developer wanting to customize/extend the tool
  • Apple Silicon Mac user who wants local transcription
  • Student/hobbyist on tight budget
  • Open source contributor wanting to hack on voice tools

Who Should Choose Competitors?

VoiceInk ($39.99 one-time):

  • Apple Silicon users wanting pre-built binary
  • Users who need voice assistant mode
  • GPL-compatible projects

SuperWhisper (Free/$8.49/mo):

  • iPhone users needing cross-platform
  • Users needing file transcription
  • Users wanting translation features
  • Non-technical users wanting polished UX
  • Teams needing meeting recording

Strategic Recommendations

Short-Term (Improve Competitiveness)

  1. Homebrew formula - Dramatically easier installation
  2. Pre-built binaries - Reach non-technical users (requires $99/year or CI signing)
  3. Better documentation - Match commercial polish
  4. Language expansion - Add more Parakeet languages

Medium-Term (Feature Parity)

  1. File transcription - Transcribe existing audio files
  2. Meeting recording - Record system audio + transcribe
  3. Translation - Add language translation (via MLX model?)
  4. More hotkey modes - Match VoiceInk configurability

Long-Term (Differentiation)

  1. Developer integrations - VS Code, terminal, IDE plugins
  2. CLI tool - Scriptable transcription for automation
  3. API/daemon mode - Let other apps use transcription
  4. Custom model support - Allow users to bring their own Parakeet/Whisper models

Last updated: 2025-10-02