Text To Speech

by minimal

In-game text to speech; preview, global chat and speaker blueprint generation modes.

Tweaks
5 years ago
0.15 - 0.17
15

i Better speech synthesis alghorithm

6 years ago

The quality of speech is horrible. I can't understand what is said even if I wrote it a moment ago. I'm writing specifically about the male voice. Maybe a part of the problem is that I'm not a native speaker but the HL1 VOX is pretty distorted and I can understand it just fine.
I know that this is not trivial problem to solve. I even suspect that this is too hard. What I'm suggesting is that the whole idea of synthesizing voice from phonemes is a dead end. That is a big challenge to a team of specialists, let alone a moder.

I would suggest to do something similar to the HL1 VOX but with bigger library of words, starting from names of all items in the game so you can build sentences like 'Coal supply in north power plants is too low.'

6 years ago
(updated 6 years ago)

Hi,

Yeah, I agree that the quality of the phoneme voice is pretty low, it was originally only meant to be a holdover until I could get hold of some diphone voices (for comparison the phoneme voice has 39 unique sounds, a diphone voice has around 1600) which is much clearer and higher quality. However, it's proven quite difficult to get hold of such voices.

For context, this article (http://www.mperfect.net/ttSpeech/) was the inspiration for the TTS mod, in it he was able to convert a voice file encoded using LPC (Linear predictive coding) to it's constituent diphones. This is where progress stopped, the linked article does not supply code for this process and my attempts at modifying the LPC source code of the mentioned speech engine (FreeTTS) have failed. But long story short; since I'm rather busy lately, I doubt I would make any progress towards that any time soon, so I agree that using phonemes is likely a dead end.

I like the idea of having a bunch of common, high quality words, however I'm concerned about the much larger file size, but it might be okay with careful word selection.

I'll definitely look into adding a few of the voices you suggested to begin with, it would make the mod much more useful, it's more or less a novelty/proof of concept at the moment.

Thank you for your constructive feedback, I appreciate it

New response