Get a Voice to Rouse the Masses

November 28, 2010
0
Categories:
- Articles: Beginner

IF YOU want to command respect, having a strong, authoritative voice helps. But if you sound weak and weedy, forget voice-training lessons: there’ll soon be away to “evolve” your voice so that it sounds the way you want it to when you speak through a mike.

For some, having the right voice can make the difference between success and failure. Britain’s ex-Prime Minister Margaret Thatcher famously had voice training to lower the pitch of her voice, making her sound more authoritative.

Effective public speaking requires a voice that’s both expressive and dear, says Yuji Sato at Hosei University in Tokyo. So to help people develop it, he’s devised a voice processing system that makes people sound more “joyful, calm or manly”–even the women.

His system uses a genetic algorithm to “evolve” an improved voice, he told the Genetic and Evolutionary Computing conference in New York last week. Starting with sample sentences, the algorithm analyses the voice signal to work out which aspects of it need to be enhanced or suppressed to produce the required effect.

To do this, it randomly creates a series of “voice chromosomes” representing ways in which the voice could be modified. Each chromosome is made up of three genes, corresponding to changes in voice pitch, volume and speed. Individual chromosomes are applied to the waveform of the recorded speech, and the results played to observers who rank them according to the required criteria.

After the group has ranked the new voices, the system takes the most successful voice chromosomes and swaps genes between them to produce a new generation of chromosomes. In addition, the software can randomly tweak certain genes, producing potentially interesting variations in the calmness, manliness or joyfulness of the voice.

Within a few generations, Sato’s system has been able to create speech chromosomes that turn a feeble voice into a clear, rich authoritative one.

Unfortunately, Sato’s system can’t yet reproduce the converted voice in real time, but he’s working on it. However, even the current version should be good enough to make recorded TV soundtracks and computer game narratives more interesting.

by Ian Sample