These researchers have found a new way to turn text into speech.
It’s called VALL-E, and it’s a computer program that uses a special kind of language to make speech sound more natural. They trained it using a lot of speech data, 60,000 hours worth, which is way more than other similar programs. With VALL-E, you can make speech that sounds like a specific person, even if the program hasn’t heard them before. Tests show that it sounds better and more like the person than other similar programs.
Plus, it can keep the emotions and background sounds of the person’s speech.
Check it out here: https://valle-demo.github.io/