VALL-E
Short description: Speech synthesis software
Developer(s) | Microsoft |
---|---|
Platform | Cloud computing platforms |
Website | https://www.microsoft.com/en-us/research/project/vall-e-x/ |
VALL-E is a generative artificial intelligence system for speech synthesis developed by Microsoft Research and announced on January 5, 2023.[1] It can "recreate any voice from a three-second sample clip".[2] It was trained on 60,000 hours of English-language speech from LibriLight, an audio library released by Meta.[3]
See also
- Amazon Polly
- Audio deepfake
- Comparison of speech synthesizers
- Deep learning speech synthesis
- Natural language generation
- Speechify
- Voice phishing
- Zero-shot learning
References
- Dominguez, Daniel (January 27, 2023). "Microsoft Unveils VALL-E, a Game-Changing TTS Language Model". https://www.infoq.com/news/2023/01/microsoft-text-to-speech-valle/.
- Morrison, Ryan (January 10, 2023). "Microsoft's new VALL-E AI can clone your voice from a three-second audio clip". https://techmonitor.ai/technology/ai-and-automation/vall-e-synthetic-voice-ai-microsoft.
- Wodecki, Ben (January 11, 2023). "Microsoft’s VALL-E Generates Speech From Just 3 Seconds of Audio". https://aibusiness.com/microsoft/microsoft-s-vall-e-generates-speech-from-just-3-seconds-of-audio.
Original source: https://en.wikipedia.org/wiki/VALL-E.