Two voice-typing features in Windows, and what each actually is
Windows in 2026 ships two separate voice-typing features. Most articles online treat them as one product. They are not.
Voice Typing (activated by Win+H) is a cloud-based dictation tool. Microsoft documents this explicitly: "To use voice typing, you'll need to be connected to the internet". On every dictation session, your microphone audio is sent to Microsoft's Azure Speech services for transcription. The text comes back, gets pasted into the focused text field, and your audio (according to Microsoft) is de-identified and not stored without consent — but it has left your device.
Voice Access is the newer feature, added in Windows 11 22H2 (October 2022). It is a broader accessibility tool that includes dictation but also lets you control the OS by voice — open apps, click buttons, scroll, navigate. The dictation portion of Voice Access runs on-device, offline. It does not exist on Windows 10. On Windows 10 you instead get the older "Windows Speech Recognition" tool, which is a separate, much older feature.
The practical difference: Voice Typing supports more languages but always needs internet. Voice Access runs offline but supports fewer languages. Neither one does both.
The language coverage gap is the real story
Voice Access — the offline option — ships with only 11 distinct locales: six English variants (US, UK, India, New Zealand, Canada, Australia), two Spanish (Spain, Mexico), German, French (France and Canada), Italian, Japanese, Simplified Chinese, and Traditional Chinese (Taiwan). That is it.
What is missing: Russian, Portuguese (both Brazil and Portugal), Polish, Dutch, Swedish, Danish, Norwegian, Finnish, Czech, Hungarian, Greek, Turkish, Hindi, Arabic, Korean, Thai, Vietnamese, and dozens more. When asked about Swedish on Microsoft's own Q&A forum, the official response confirms gaps are "by design" with no roadmap commitment.
Voice Typing — the cloud option — supports more (~43 languages including Portuguese, Korean, Thai, Turkish, Vietnamese, Hindi). But it sends your audio to Microsoft on every dictation. For anyone whose dictation contains client information, medical notes, source code, or anything sensitive, "cloud STT with no offline option" is the wrong architecture.
SnailText runs Whisper locally. Whisper is multilingual by design — the same model that handles English handles 100+ languages including all the ones Microsoft's offline option does not. Russian dictation works on SnailText. Portuguese dictation works on SnailText. Polish, Dutch, Czech, all on the same install. No language packs to download. No cloud detour.
The Win+H pause timeout — most-cited complaint
Windows Voice Typing has an uncustomizable silence timeout that ends the dictation session after roughly 5 to 10 seconds of pause. The exact number is not in Microsoft's marketing copy, but it is the subject of multiple user threads — including a long-running Microsoft Q&A thread and a Windows Forum thread asking how to prevent it. The answer in both: you cannot.
For composing an email longer than two paragraphs, this means re-activating Win+H two or three times in a single message. For thinking-while-dictating workflows — research notes, treatment plans, brief drafts where pauses for thought are normal — the cutoff makes the tool feel like it is fighting you.
SnailText runs as long as you hold the hotkey down, or until you press it again to stop. There is no silence timeout. A five-minute brain-dump dictates as one session.
"Fluid Dictation" is hardware-gated — most PCs do not qualify
Microsoft's 2025 marketing push for Voice Typing centered on "Fluid Dictation" — a polish layer that adds auto-punctuation, removes filler words (um, uh), and corrects grammar in real time. The reviews of this feature are positive when it works.
Microsoft's own documentation states Fluid Dictation requires a Copilot+ PC — meaning a dedicated NPU (Neural Processing Unit) in your hardware. Snapdragon X (Surface laptops 2024+), Intel Core Ultra with NPU, or AMD Ryzen AI. And it is English-only.
For 2026, the install base of Copilot+ PCs is still small. A standard Windows 11 PC bought in 2022 or 2023, without an NPU, gets the older, rawer Voice Typing experience — no auto-grammar fix, no filler removal, no real-time polish. The 2025 marketing applied to perhaps 5-10% of the Windows install base.
How SnailText fills the Windows voice-to-text gaps
Local processing. SnailText runs the Whisper speech model on your PC — CPU on older machines, Vulkan on AMD and Intel iGPUs, CUDA on NVIDIA GPUs. Audio is captured into a RAM buffer, processed by the model, transcribed text is pasted at your cursor, audio is discarded. See how it works for the architecture diagram. Verifiable in your network monitor — no outbound traffic during dictation.
Any language Whisper supports. 100+ languages on the same install, no language packs to download. Russian works the same as English. Portuguese works the same as French. No locale-specific gaps — see also our offline dictation page for the architectural argument.
No timeout. Press the hotkey, talk for as long as you want — five seconds or five minutes — press again to stop. The transcript is one block.
Configurable hotkey. Default is Ctrl+Shift+Space; reassign to anything that does not conflict with your other shortcuts. No Win+H lock-in.
Works in any app. SnailText pastes into the focused text field, same way Ctrl+V does. Slack, Chrome textareas, VS Code, Cursor, terminal emulators, browser-based EHRs, web forms, Anki — anywhere a keyboard works, dictation works. No "Voice typing functionality is limited on this app" warnings like the documented Anki experience.
Free to start. The compact Whisper Base model handles everyday English dictation; Pro adds larger Whisper models and 25+ European languages via Parakeet TDT. If you want the cross-platform story, see voice to text on Mac. For the broader "free" angle (no signup, no time limit), see free voice to text.
How to set up voice to text on Windows in 60 seconds
1. Download the SnailText installer from snailtext.app/download/windows/.
2. Run the installer. Windows SmartScreen may prompt because SnailText is not yet Authenticode-signed by a Microsoft-recognized certificate authority — click "More info" → "Run anyway". Authenticode certification is in progress.
3. On first launch, SnailText downloads the default Whisper model (Base, around 80 MB) and loads it.
4. Set your global hotkey in Settings. Default is Ctrl+Shift+Space.
5. Open any app — Slack, Chrome, Word, Notion, your IDE. Press the hotkey. Talk. Press it again. Your transcribed text appears at the cursor.