How can I download audio from Google’s sample text-to-speech?

I would like to be able to save the audio from the sample from Google’s cloud-based text-to-speed service. Is it possible to do this?

7 Answers

If you're going to do a lot of this, you should really use the API that they offer, but here's a quick step by step if you just want to download a single sample of Google's speech synthesis.

Go to the page in Google Chrome.
Open the Developer Tools (by pushing F12)
Go to the "Network" tab.
Enter the text you want to get audio of.
Click the "SPEAK IT" button.
Watch the "Network" tab populate with a couple of entries.
Right-click the entry that starts with data:audio/wav;base64, and click "Open in new tab".
In the new tab, right-click the audio player, and click "Save video as..."
Choose where you want to save the resulting .wav file.

Note: This will get you a (marginally) cleaner copy of the audio than recording the Stereo Mix from your sound card.

Edit: @3D1T0R 's answer is simpler and will most likely result in a higher quality file.

I would just use an audio recording program, such as Audacity, to record the "Stereo Mix" of your computer. You haven't specified an OS but assuming Windows:

First go to the Sound applet in the Control Panel, go to the Recording tab, right-click Stereo Mix and select Enable. (If you don't see Stereo Mix, check Show disabled devices)
Then download/open Audacity, and in the Recording Device dropdown box, select Stereo Mix. Then just hit Record, and anything you hear playing out of your speakers will be recorded to a sound file.

As predicted in this comment, the accepted answer is now broken. The basic approach still works though, except you have to save the proxy.json and then decode the base64-encoded audio:

cat proxy.json | jq '.audioContent' -r | base64 -d > your-audio.wav

I wrote a browser extension specifically for this purpose:

It uses the WebExtensions API to sniff HTTP responses and saves the wav audio when matched.

Here are the current steps

Go to
Open the Developer Tools (by F12)
Go to the "Network" tab.
Enter the text you want to get audio of.
Click the "SPEAK IT" button.
Click the “I’m not a robot” checkbox
Watch the "Network" tab populate with a couple of entries.
Right-click the entry that starts with proxy?url=
In the preview to right, click "copy" in the line for “audiocontent”
Save this as a text file (base64.txt). Remove the quotes (") from beginning and end This contains the base64 encoded audio
In Ubuntu, decode base64 to wav file with the following command:
cat base64.txt | base64 --decode > audio.mp3

screenshot

basically the same way. Inspect json response, take audiocontent base64 and decode to wav.

To make it easier for you, I made a chrome extension named audiotts. Just click SPEAK IT button and waiting the audio player come in. You can download the audio using download menu of the audio player.

feel free to check that out 🤟

In this video is it very well explained in Hindi/Urdu that how you can Covert Google TTS and Download as mp3:

Velvet Star Monitor

How can I download audio from Google’s sample text-to-speech?

7 Answers

Your Answer

Sign up or log in

Post as a guest

Similar Journal

Do lure modules and incense cause new pokemon to spawn near you, or existing pokemon to be drawn to you?

What is the strongest fixed location equipment you can obtain at Level 1?

What's the best strategy to keep the chaos low?

Where do you find slime chunks?