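Dictate text by voice using your microphone and see it transcribed in real time. The tool uses the browser's native Web Speech API, so there is nothing to install and no account to create. Depending on the browser, audio is processed on-device or sent to a remote speech service.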
Speech-to-text (or voice typing) converts spoken audio to written text. Modern browsers (Chrome, Edge, Safari) include a Web Speech API that does this on-device or via remote speech servers (Google's, in Chrome), depending on the browser and platform. No external SaaS account is needed.
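As a rough illustration of how a page might check whether the Web Speech API is present, here is a minimal sketch. The helper name `getRecognitionCtor` and the local `RecognitionCtor` type are assumptions made for this example; only the global names `SpeechRecognition` and `webkitSpeechRecognition` come from the API itself.

```typescript
// Minimal constructor shape, declared locally because TypeScript's DOM
// typings do not always include the Web Speech API.
type RecognitionCtor = new () => unknown;

// Hypothetical helper: returns the recognition constructor if the given
// scope exposes one, trying the standard name first and the
// WebKit-prefixed name second. Returns null when neither is a function.
function getRecognitionCtor(scope: Record<string, unknown>): RecognitionCtor | null {
  const candidate = scope["SpeechRecognition"] ?? scope["webkitSpeechRecognition"];
  return typeof candidate === "function" ? (candidate as RecognitionCtor) : null;
}

// In a browser, globalThis is the window object.
const ctor = getRecognitionCtor(globalThis as unknown as Record<string, unknown>);
console.log(ctor ? "Speech recognition is available." : "Speech recognition is not supported here.");
```

Passing the scope in as a parameter, rather than reading `window` directly, keeps the check easy to exercise outside a browser.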
Click Start Recording and grant microphone permission when prompted. Speak, and text appears in the output area in real time. Click Stop when done. The output is editable, so you can fix recognition errors by clicking and typing.
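The start, speak, stop flow above could be wired up roughly as follows. This is a sketch under assumptions, not the tool's actual code: the interfaces are minimal local stand-ins for the Web Speech API objects, and `splitTranscript` and `startDictation` are hypothetical names. The properties used (`continuous`, `interimResults`, `onresult`, `isFinal`, `transcript`) and the `start()` and `stop()` methods are part of the API.

```typescript
// Local, minimal typings for the parts of the Web Speech API used below.
interface Alternative { transcript: string }
interface Result { isFinal: boolean; 0: Alternative }
interface ResultEvent { results: ArrayLike<Result> }
interface Recognition {
  continuous: boolean;
  interimResults: boolean;
  onresult: ((event: ResultEvent) => void) | null;
  start(): void;
  stop(): void;
}

// Split the recognizer's result list into settled text and text that may
// still change as the speaker continues.
function splitTranscript(results: ArrayLike<Result>): { finalText: string; interimText: string } {
  let finalText = "";
  let interimText = "";
  for (const result of Array.from(results)) {
    const text = result[0].transcript; // first alternative of this result
    if (result.isFinal) finalText += text;
    else interimText += text;
  }
  return { finalText, interimText };
}

// Configure a recognizer, start listening, and return a function that
// stops it (for example, to call from a Stop button).
function startDictation(recognition: Recognition, onText: (text: string) => void): () => void {
  recognition.continuous = true;     // keep listening across pauses
  recognition.interimResults = true; // deliver partial results while speaking
  recognition.onresult = (event) => {
    const { finalText, interimText } = splitTranscript(event.results);
    onText(finalText + interimText);
  };
  recognition.start();
  return () => recognition.stop();
}
```

In a browser, the `recognition` argument would be an instance created from `SpeechRecognition` (or `webkitSpeechRecognition`), and `onText` would write into the output area.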
Use it for quick voice notes while driving (use a hands-free setup), for drafting emails when you're tired of typing, for transcribing brief audio recordings played back near the microphone, and for accessibility if typing is painful or slow.
Speak clearly and at a moderate pace. Pause briefly between sentences. Add punctuation by saying it ('comma', 'period', 'new line'), where the browser's recognizer supports spoken punctuation. Background noise reduces accuracy, so record in a quiet room. For long-form transcription, break the dictation into 30-second segments.
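Whether spoken punctuation is converted automatically depends on the recognizer. Where it is not, a tool could post-process the transcript itself. The sketch below shows one possible approach; `SPOKEN_PUNCTUATION` and `applySpokenPunctuation` are hypothetical names, and the rule list is deliberately small.

```typescript
// Hypothetical rule table: each spoken phrase and the symbol that replaces
// it. "new line" is handled first so surrounding spaces are absorbed.
const SPOKEN_PUNCTUATION: Array<[RegExp, string]> = [
  [/\s*\bnew line\b\s*/gi, "\n"],
  [/\s*\bcomma\b/gi, ","],
  [/\s*\bperiod\b/gi, "."],
];

// Replace spoken punctuation words in a transcript with their symbols.
function applySpokenPunctuation(transcript: string): string {
  return SPOKEN_PUNCTUATION.reduce(
    (text, [pattern, symbol]) => text.replace(pattern, symbol),
    transcript,
  );
}

console.log(applySpokenPunctuation("hello comma world period new line next"));
```

A simple word match like this also rewrites legitimate uses of the words (for example "a long period of time"), so a real tool would need a way to override or undo the replacement.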
Browser support is limited: the Web Speech API is best supported in Chrome, Edge, and Safari, and Firefox support is experimental.
On Chrome, audio is sent to Google's speech servers. On Safari, recognition is mostly on-device. Edge uses a mix of the two. Check your browser's privacy settings.
To dictate in another language, change the language code in browser settings or extend the tool to expose a language picker.
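If the tool were extended with a language picker, the selected language could be applied through the recognizer's `lang` property, which takes a BCP 47 language tag. The following is a minimal sketch; the `LANGUAGES` table, the `applyLanguage` helper, and the `en-US` fallback are assumptions for illustration, and which tags a given browser actually recognizes varies.

```typescript
// Only the property this example needs from a recognizer object.
interface LanguageConfigurable { lang: string }

// Hypothetical picker options: display label to BCP 47 language tag.
const LANGUAGES: Record<string, string> = {
  "English (US)": "en-US",
  "Spanish (Spain)": "es-ES",
  "German": "de-DE",
};

// Set the recognizer's language from a picker label, falling back to a
// default tag when the label is unknown. Returns the tag that was applied.
function applyLanguage(recognition: LanguageConfigurable, label: string, fallback = "en-US"): string {
  const tag = LANGUAGES[label] ?? fallback;
  recognition.lang = tag;
  return tag;
}
```

The language should be set before recognition starts, so that the choice is in place for the session.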