Show HN: Free alternative to Wispr Flow, Superwhisper, and Monologue

Cloud vs local models & “deep context”

  • FreeFlow uses Groq-hosted models for fast transcription and an LLM “deep context” step: capturing a screenshot of the active window and asking an LLM to infer names/terms and fix spelling.
  • Several commenters like the idea but see screenshotting + cloud LLM as overkill, suggesting using accessibility APIs to pull nearby text instead, processed by a small local LLM.
  • Others argue local-only pipelines (e.g., Parakeet, Whisper variants) now feel fast enough and avoid vendor lock‑in and privacy concerns.

Alternative tools mentioned

  • Strong enthusiasm for Handy (cross‑platform, Parakeet support, optional LLM post‑processing, context capture PR in progress), Hex (Mac‑only, CoreML/Neural Engine, very fast), MacWhisper, VoiceInk, Whispering, Whistle, MacWhisper, Murmure, OpenSuperWhisper, Jabber, Auriscribe, hyprwhspr, and others.
  • Multiple Android/iOS options surface: FUTO keyboard/voice, Whisper+, VoiceFlow, Utter, Talkie/ottex-based flows, Whisper Memos.
  • Some say SuperWhisper already offers a free Parakeet‑based local mode comparable to what FreeFlow targets.

Performance, latency, and hardware

  • One camp says local is “too slow” once you add post‑processing; others report sub‑second transcriptions on M1/M2 Macs or any GPU using Parakeet or Whisper large‑v3‑turbo.
  • Parakeet (especially v3) is repeatedly praised for speed, accuracy, and multilingual handling, even on mid‑range phones.
  • Concerns raised about future Groq pricing/free‑tier changes versus the stability of fully local solutions.

UX, workflows, and features

  • People care a lot about hotkeys (Caps Lock, Scroll Lock, right Option, Stream Deck buttons, foot pedals), push‑to‑talk vs toggle, and live/streaming transcription.
  • Some want preserved audio alongside transcripts; FreeFlow currently stores WAVs tied to history entries but auto‑deletes them (noted as easily changeable).
  • Post‑processing to transform raw transcripts into structured, editor‑ready text (for notes, books, coding agents) is seen as the next frontier.

DIY & meta discussion

  • Several describe rolling their own STT scripts around faster‑whisper, sox/arecord, and clipboard tools, valuing instant adoption of new models.
  • One commenter criticizes “free alternative to X” marketing as copycat; others respond that many devs simply reinvented tools they didn’t realize already existed.
  • A minority questions the appeal of speech‑to‑text at all; others point to accessibility, injury recovery, long-form dictation, and hands‑free coding as compelling uses.