You can now convert your voice into any other using a new tool from ElevenLabs. The AI platform says the conversion keeps any emotion and intonations expressed in the original recording and carries it to the new voice.
The quality of synthetic speech has improved dramatically in recent months, as has the speed of training an AI model on an entirely new voice. ElevenLabs lets you clone your own voice from a minute of audio, or create one by describing how it should sound.
Previously this was only available for text-to-speech conversion, which often lost some of the hidden meaning present in natural spoken language. It also struggled to process unknown words such as company or product names, or unusual personal names.
With its new voice-to-voice model ElevenLabs promises that you can, “say it how you want it and transform your voice into another character, with full control over emotions, timing, and delivery.”
How to use Voice-to-Voice technology
To test these claims, I had ChatGPT write a short radio play scene featuring three distinct characters. Then using my broken, flu-raddled voice, I recorded all the parts into ElevenLabs, selecting a different synthetic voice for each.
I didn’t speak particularly clearly at any point, put on faux and terrible American accents, used slang terms, and left dramatic pauses. For the most part, it did well with the emotion and intention of the phrasing — but it did struggle to convert a few letters and even whole words.
If you want to try creating your own virtual voices the process is fairly straightforward and there are a large number of options available even on the free plan.
Cloning a voice requires a premium plan and you are limited to 5,000 letters, spaces, and characters per month without paying — so if you have aspirations of making a radio play, you may want to consider an upgrade.
To get the most out of the voice AI tools why not invest in a new microphone? This black friday you can get $45 off the Logitech Blue Yeti microphone or $40 of the Razer Seiren V2 at Amazon.
1. Register for an ElevenLabs account
You can use Google to sign up or register directly with ElevenLabs. The process is straightforward and you’ll be registered with the default, free plan from the start. Simply click Sign Up in the top-left corner of the screen. When registered you will be taken straight to the voice synthesis page.
2. Switch to speech-to-speech
It defaults to text-to-speech, where you enter words you want to use and it says it out loud. To access the new speech-to-speech tool just click that button on the top row.
3. Select a voice you want to use
There are dozens of pre-loaded voices, and it will remember any you’ve used recently. You can also find more voices by clicking on Voice Library. But to get things moving the list of voices available by default should be enough. You can press the play icon next to the voice to preview it, or just click on the name to have it selected.
4. Fine-tune the voice in settings
You can fine-tune the way the voice reacts and sounds including exaggerating the style, making it closer to the original voice, and adding a greater or lesser degree of variability. To access this click on voice settings.
5. To upload or record directly - that is the question
Next, you have to decide whether to just record directly into ElevenLabs, as I did with the short radio play scene, or to upload a pre-recorded piece of audio. Uploading a clip could be useful if you've been in a studio, or want to mess with a friend by changing their voice.
To upload a clip simply click the play button with the + in the top-right corner. Or to record simply click the record audio button.
6. Recording the audio requires another click
If you decide to record directly with ElevenLabs it'll change to a microphone in a circle. To start recording simply click the microphone icon. It will change to a stop icon and you click that to stop recording.
7. Playback, delete or get on with it
Once you’ve finished the recording it will allow you to play back what you’ve recorded. To do this just click the play icon. This will be in your own voice. If you don’t like it click delete and it will take you back to step 5, where you can record or upload a clip.
8. Let's change the voice
If you’re happy with the recording simply click the generate button. It takes anything up to a couple of minutes depending on the length of the recording.
9. Play, download and use the audio
Once the new voice has been generated it will automatically start playing. A new menu will appear at the bottom of the screen with playback controls. You can also download the generated audio from this menu by clicking the upward arrow icon on the right.