# Text to Speech > Convert text to spoken audio using browser speech synthesis with voice, speed, and pitch controls **Category:** Utility **Keywords:** text to speech, tts, speech, voice, read aloud, synthesis, accessibility, audio, narrate **URL:** https://complete.tools/text-to-speech ## How it works The tool uses the **Web Speech Synthesis API**, a standard feature built into modern browsers. When you press Speak, your browser converts the text into audio using a speech synthesis engine on your device. **Estimated duration** is calculated using: ``` Duration = (Word Count / Words Per Minute) x 60 seconds ``` The base rate is approximately 150 words per minute at 1.0x speed. Adjusting the rate slider scales this proportionally. ## Voice options Available voices depend on your operating system and browser: - **Windows**: Typically includes Microsoft voices in multiple languages - **macOS / iOS**: Offers a wide selection of high-quality voices through the system - **Chrome**: May provide additional Google voices alongside system voices - **Android**: Uses Google TTS voices when available Voices marked **[Remote]** are processed through your browser's cloud services rather than locally. Local voices work offline. ## Controls explained - **Rate** (0.5x to 2.0x): Controls speaking speed. 1.0x is normal conversational pace. Lower values slow the speech down; higher values speed it up. - **Pitch** (0.5 to 2.0): Adjusts the vocal tone. 1.0 is the voice's natural pitch. Lower values produce a deeper sound; higher values make it higher. - **Speak**: Begins reading the text from the start. - **Pause / Resume**: Temporarily halts and continues playback. - **Stop**: Ends playback entirely and resets to the beginning. ## Who should use this - **Proofreaders and writers**: Hearing text read aloud helps catch awkward phrasing, typos, and rhythm issues that are easy to miss when reading silently. - **Language learners**: Listen to pronunciation in different voices and languages while following along with the text. - **Accessibility needs**: Anyone who benefits from audio versions of text content, whether due to visual impairment, reading difficulties, or personal preference. - **Content creators**: Preview how scripts, presentations, or social media posts sound before recording or publishing. - **Multitaskers**: Listen to articles or notes while doing other tasks. ## Tips for best results 1. **Break long text into sections** for easier listening and more reliable playback. 2. **Try different voices** to find one that sounds clear and natural for your content. 3. **Slow the rate to 0.7x or 0.8x** when listening to dense or technical material. 4. **Add punctuation** for more natural pauses. Commas create short pauses; periods create longer ones. 5. **Use line breaks** between paragraphs to add natural breathing room to the speech. ## Browser compatibility The Web Speech Synthesis API is supported in all major modern browsers: - Chrome (desktop and Android) - Safari (macOS and iOS) - Edge - Firefox Note: The number and quality of available voices varies by browser and operating system. Chrome on desktop typically offers the largest selection. ## FAQs ** **Q:** Is my text sent to a server?** **A:** No. All speech synthesis happens locally in your browser. Your text never leaves your device (unless you select a remote voice, which is handled by your browser, not this tool). ** **Q:** Why do I see different voices on different devices?** **A:** Voices are provided by your operating system and browser. macOS has Apple voices, Windows has Microsoft voices, and Chrome may add Google voices on top of those. ** **Q:** Can I save the audio as a file?** **A:** The Web Speech API does not support audio export. For saving speech as audio files, you would need a dedicated text-to-speech service. ** **Q:** Why does playback stop mid-sentence on long text?** **A:** Some browsers have a timeout on speech synthesis for long passages. If this happens, try breaking your text into shorter sections. ** **Q:** Does this work offline?** **A:** Yes, as long as you use a local voice (not marked [Remote]). Local voices are processed entirely on your device. --- *Generated from [complete.tools/text-to-speech](https://complete.tools/text-to-speech)*