Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
Here are some cherry-picked outputs from Word2Wave, each labeled with the text prompt that produced it.
“Rain”
“Rave”
“Ring”
“Shot”
“Typing”
“Psytrance”
“Videogame”
“Siren”
“Aluminium”
“Alien playing a metallic flute”
“Crazy cool ambient siren”
“Splash of water”
“Evil robot laughing”
“Phone notification”
“Yamaha”
“Weapon”
“Stab”
“Flatulence”
“Voice”
“Shout”
“Shuffle”