Show HN: Dictly – Local, real‑time voice‑to‑text for macOS (sub‑100ms, no cloud)

Share This Post

TL;DR: I built a native macOS dictation app that transcribes locally and instantly.
Text appears as you speak (measured ~100 ms first‑character latency).
No accounts, no servers, no tracking.

Links:
• Website: https://dictly.app
• Mac App Store: https://apps.apple.com/de/app/dictly-no-keys-just-clarity/id…
• Free download; optional Pro tier (pipelines, unlimited history, etc.)

What it does

Real‑time transcription — streaming text while you talk, not after you stop.
Quick‑Capture Overlay (macOS) — global hotkey, drop text into any app/field.
Custom Pipelines — local post‑processing steps for cleanup, punctuation, or house style.
Dictionary Profiles — teach domain terms (names, brands, code tokens, etc.).
Local Analytics — see time saved vs typing (computed on device, never sent anywhere).

Why I built it

I wanted dictation that felt as immediate as typing and was trustworthy. Most tools stream audio to a server; I wanted something that never leaves the machine.

How it’s built (high‑level)

Swift + Apple speech/ML frameworks.
Streaming audio capture → on‑device recognition → local pipeline → paste into the active app.
Works with Wi‑Fi off; there are no network requests in the transcription path.

What’s different vs built‑ins

Always on‑device + streaming with a global overlay that works in any app.
Extensible, deterministic cleanup via pipelines (not a black‑box cloud).
Per‑project dictionaries to tame jargon and proper nouns.

Numbers (early)

Latency: ~100 ms (first visible characters from speech onset) in typical conditions on modern Macs.
Privacy: zero telemetry; no account; no background syncing. Everything stays local.

Trade‑offs (calling them out up front)

Accuracy depends on mic and environment (no surprise).
For weird proper nouns/jargon, you’ll want a dictionary profile.
Heavy background noise will degrade results (pipelines can only do so much).

What I’m looking for from HN

Performance impressions on different hardware.
Failure cases (accents, acronyms, coding, meetings).
Pipeline ideas you’d actually use (e.g., Markdown formatting, code‑block guards, style rules).
Integration wishes: CLI? Shortcut actions? Editor‑specific helpers?

I’m a solo dev.
Happy to answer pointed questions and ship fixes fast.
If you spot hand‑wavy claims, call them out.


Comments URL: https://news.ycombinator.com/item?id=45707339

Points: 1

# Comments: 0

Source: dictly.app

Subscribe To Our Newsletter

Get updates and learn from the best

More To Explore

Windows Securitym Hackers Feeds

Touch, Our Most Complex Sense, Is a Landscape of Cellular Sensors

Article URL: https://www.quantamagazine.org/touch-our-most-complex-sense-is-a-landscape-of-cellular-sensors-20250416/ Comments URL: https://news.ycombinator.com/item?id=45739822 Points: 1 # Comments: 0 Source: www.quantamagazine.org

Do You Want To Boost Your Business?

drop us a line and keep in touch

We are here to help

One of our technicians will be with you shortly.