🎤 Speech To Text

Dictate text by voice using your microphone — the result is transcribed in real time. Uses the browser's native Web Speech API, no external services, no upload.

What is the speech to text?

Speech-to-text (or voice typing) converts spoken audio to written text. Modern browsers (Chrome, Edge, Safari) include a Web Speech API that does this on-device or via Google's speech servers depending on platform. No external SaaS account needed.

How does this speech to text work?

Click Start Recording. Grant microphone permission when prompted. Speak. Text appears in the output area in real time. Click Stop when done. The output is editable — fix recognition errors by clicking and typing.

When should you use this tool?

Use it for quick voice notes while driving (use a hands-free setup), for drafting emails when you're tired of typing, for transcribing brief audio recordings, and for accessibility if typing is painful or slow.

Tips & best practices

Speak clearly and at moderate pace. Pause briefly between sentences. Add punctuation by saying it ('comma', 'period', 'new line'). Background noise reduces accuracy — record in a quiet room. For long-form transcription, break into 30-second segments.

Frequently asked questions

Does it work in Firefox?

Limited — Web Speech API is best supported in Chrome, Edge, and Safari. Firefox support is experimental.

Is my voice uploaded?

On Chrome, audio is sent to Google's speech servers. On Safari, it's mostly on-device. On Edge, a mix. Check your browser's privacy settings.

Does it support Hindi?

Yes — change the language code in browser settings or extend the tool to expose a language picker.

Related tools

Explore more media & ocr on the tool hub — or jump straight to the Image To Text Converter, OCR (Optical Character Recognition), JPG To Word.