Skip to content
Processing locally — files never leave your device

Text to Speech

Type or paste text and hear it spoken with your browser's built-in voices. Choose voice, rate, and pitch. Many languages supported.

How to use Text to Speech

  1. Type or paste the text you want read aloud into the text box.
  2. Choose a voice from the list — the options come from the voices installed on your operating system, and the language is shown next to each name.
  3. Adjust the rate (speaking speed) and pitch sliders to taste.
  4. Click "Speak" to hear it. Use "Stop" to cut playback short at any time.
  5. Fine-tune the text, voice, rate, or pitch and play again until it sounds right.

Reading text aloud with your browser's built-in voices

This tool speaks any text you give it using the voices already on your computer or phone. It is handy for proofreading by ear, learning pronunciation, creating quick voiceovers, or making content accessible to people who prefer to listen. There is nothing to install — the speech comes from your operating system through the browser.

How browser text-to-speech works

The tool uses the Web Speech Synthesis API (speechSynthesis and SpeechSynthesisUtterance). When you click Speak, it wraps your text in an utterance, applies your chosen voice, rate, and pitch, and hands it to the browser, which renders the audio through your system's speech engine. The list of voices is read from getVoices() — and because that list can load asynchronously, the tool refreshes it when the browser signals new voices are ready.

Why the available voices vary

The voices are not part of this website — they belong to your operating system. That is why the dropdown looks different on Windows, macOS, iOS, and Android, and even between browsers on the same machine. You can add more by installing additional language packs or accessibility voices in your system settings; they will appear here automatically the next time you load the page.

Browser support

Speech synthesis is widely supported — it works in Chrome, Edge, Safari, and Firefox on desktop and mobile — provided your OS has at least one installed voice. It is far more universally available than the reverse direction (speech recognition), which is limited to certain browsers. If you hear nothing, confirm your system has a TTS voice and that your output volume is up.

Saving the output

The API plays audio but does not give you a file to download. If you need an audio file, play the speech while capturing it with a recorder — the SnapTools Audio Recorder can grab the sound, and the Audio Trimmer can tidy the ends. For studying or proofreading, though, live playback is usually all you need.

Tips for natural-sounding speech

  • Match voice to language. Use an en-US voice for English text, an es-ES voice for Spanish, and so on, for correct pronunciation.
  • Keep pitch near default. Extreme pitch settings sound robotic; small adjustments are usually enough.
  • Use punctuation. Commas and periods create natural pauses, making the reading easier to follow.
  • Split long text. Shorter passages avoid browser length limits and are easier to replay.

Related media tools

  • Audio Recorder — record the speech as it plays to turn this voiceover into a saved file.
  • Audio Trimmer — tidy the start and end of a captured voiceover before you use it.
  • Speech to Text — the mirror tool, going from spoken words back to editable text.

Frequently asked questions

Where do the voices come from?
They are the text-to-speech voices installed on your operating system, exposed to the browser. macOS and Windows ship many out of the box; on iOS and Android you can add more languages and voices in the system accessibility or language settings, and they then appear in the list here.
Why does the voice list differ between my devices and browsers?
Because the voices belong to the operating system, not the website, the available options depend on your OS, its installed language packs, and the browser. Chrome on Windows, Safari on a Mac, and Chrome on Android will each show a different set. This is expected — pick the best voice available on the device you are using.
Can I download the spoken audio as an MP3 or WAV?
Not directly. The Web Speech Synthesis API plays audio aloud but does not expose a downloadable file. To capture it, play the speech while running a recorder — our Audio Recorder can record your system or microphone output, which you can then trim and save.
Is my text sent to a server?
The speech synthesis runs through your browser and operating system. SnapTools does not upload or store your text — it stays in the page. Note that some system voices are cloud-backed by the OS vendor; offline/local voices keep everything on-device.
Which browsers support text to speech?
The Web Speech Synthesis API works in Chrome, Edge, Safari, and Firefox on both desktop and mobile, as long as the operating system has at least one voice installed. If you hear nothing, check that your OS has a TTS voice and that your volume is up.
What do the rate and pitch controls do?
Rate sets the speaking speed (0.5× is slow and deliberate, 2× is fast); pitch shifts the voice higher or lower. Lower rates aid comprehension for studying or proofreading, while a natural pitch around the default tends to sound most human.
Can it read other languages?
Yes — any language for which you have a voice installed. The dropdown shows each voice's language code (for example en-US, es-ES, ja-JP). For best pronunciation, match the voice's language to the language of your text.
Why did it stop partway through a long passage?
Some browsers limit how long a single utterance runs. If long text cuts off, break it into shorter paragraphs and play them in turn, which also makes it easier to re-listen to a specific section.

More tools you might find useful in the same flow.

Built by Muhammad Tahir · About