Howto:Customizing Flite TTS Voices

From FlightGear wiki
Revision as of 16:38, 31 October 2017 by Hooray (talk | contribs) (→‎Objective)
Jump to navigation Jump to search
This article is a stub. You can help the wiki by expanding it.

Motivation

What we'd like to have is exemplified by https://www.youtube.com/watch?v=qcvTHQxBcLw (after 28 seconds or so there's a voice exchange). There's little static, but the voices sound flat and perhaps a bit metallic. I'm not sure what exactly was in the Shuttle vid you have seen - I have no simulation of static in, it used to be that the aircraft com instruments produced some when a transmitting station was overflown (frequency match but distance too far), but this should be gone now as the Shuttle doesn't use a standard aircraft com stack. Only the minority of voice callouts done by the system are intended to simulate 'real' communication with mission control, many are advisory messages, limit warnings or failure notices. There is filtering done, so we know for every message what it is. It would be cool to get a list of real messages and record them with the right distortion - but unfortunately some of them need to be assembled dynamically (like when you get burn parameters transmitted). There's still a siable chunk of standard messages and callouts which could be pre-recorded.[1]


Background

Thorsten stated that he'd very much like to have improvements in the sound, especially in the group of real voice com, but doen't really know how to get there all the way.[2]

He is planning on compiling a list of real Shuttle phraseology - if we do a recording job, let's do it properly. He would like to improve the sound immersion very much, let's see how this goes.[3]

Objective

we're going to implement the solution that's better (in this case, leads to better immersion). Just like it has been done in practically any other area where native FG solutions were not good enough (failure modeling, interaction with navaids, thruster flames, co-orbiting objects...)[4]

Status

right now speech is dynamic in that sentences are assembled through text and converted to audio. So really either then filter that output (I'm talking about an audio filter, just so we're clear), or pre-record words or phrases which can then be manipulated and triggered by the system as needed. [5]


Related

References
  1. Thorsten  (Oct 27th, 2017).  Re: Space Shuttle .
  2. Thorsten  (Oct 27th, 2017).  Re: Space Shuttle .
  3. Thorsten  (Oct 27th, 2017).  Re: Space Shuttle .
  4. Thorsten  (Oct 30th, 2017).  Re: Space Shuttle .
  5. Lydiot  (Oct 27th, 2017).  Re: Space Shuttle .