Suggestion: Make Radio Com sound a bit more Radio like

sunrrrise

It will happens.
I mean, if we like to go forward and consider more about AWACS comms’ JTAC etc … there is not ither solution since current available frags do not cover our needs any more. This is also why CeckIn freq is not yet really implemented in 3.34 … And why ATC and AI calls are not 100% accurate in terms. Tweaking exisitng frags has a its limits.

OK, I will understand and appreciate expansion in area of communication (more speeches), but I really want to have current voices with Falcon 4.0 origin kept in future releases.

Khronik

People want to keep 8 bit 22khz radio voices? LMAO…

Are you guys also running 640x480 in 3D for that “classic” experience?

Dee-Jay

@sunrrrise:

OK, I will understand and appreciate expansion in area of communication (more speeches), but I really want to have current voices with Falcon 4.0 origin kept in future releases.

Imagine if we mix TTS an real voice frag! … for a same pilot … and differences on each words!? … The day we will switch into full TTS, it will just be impossible to keep old voices. Unless you can fine original actors and ask them to records new frags/sentences with the same tone they did 20y ago …

For instance : you guys want the L16 … how to make AIs saying “Hook” “Zap” “Markpoint” “Track Number” … etc … if the frags doesn’t exist!
Do you understand why we can’t (existing frags vs non existing frags)?

beppe_goodoldrebel

There are some good TTS nowadays ( mostly payware ), the problem is, if you want to do a “good” job, you have to work on a single sentence , doing a mass porting will make things worst.

I use this for DCS voiceovers. They work very well but they are not perfect.

http://www.fromtexttospeech.com/

Khronik

@beppe_goodoldrebel:

There are some good TTS nowadays ( mostly payware ), the problem is, if you want to do a “good” job, you have to work on a single sentence , doing a mass porting will make things worst.

I use this for DCS voiceovers. They work very well but they are not perfect.

http://www.fromtexttospeech.com/

From quick review most strings were said with the correct pronunciation with payware TTS.

All voices share the same text strings so it only has to be edited once per string in question which is do-able as most are ok.

Many are just single words, letters and numbers, not everything is a phrase.

The proper solution for BMS would be to integrate a TTS engine into the code, and just read text strings externally. Recording TTS to a WAV clip is kinda a hack to get them in BMS.

Boxer

@Khronik:

Recording TTS to a WAV clip is kinda a hack to get them in BMS.

A superficial look at the problem would suggest this is correct.

However, practically all of the AI speech is parameterized such that what you hear is assembled from pieces parts rather than recorded as a sentence in it’s entirety. There is a good reason why the eval/frag system that Microprose chose has lasted so long…replacing it with something better is quite tricky.

TTS engines are only recently starting to get to the level of quality that would make literal on-the-fly assembly of strings for pronunciation in real time viable (for things other than ATIS that use stilted TTS in real life also; in certain places anyway). And as things stand today most of the alternatives that look most promising are commercial which puts them effectively out of reach for BMS to use.

Khronik

@Boxer:

A superficial look at the problem would suggest this is correct.

However, practically all of the AI speech is parameterized such that what you hear is assembled from pieces parts rather than recorded as a sentence in it’s entirety. There is a good reason why the eval/frag system that Microprose chose has lasted so long…replacing it with something better is quite tricky.

TTS engines are only recently starting to get to the level of quality that would make literal on-the-fly assembly of strings for pronunciation in real time viable (for things other than ATIS that use stilted TTS in real life also; in certain places anyway). And as things stand today most of the alternatives that look most promising are commercial which puts them effectively out of reach for BMS to use.

Have you read my replies in this thread?

I’m well aware of how the voice is strung together, as I needed that info to make the application for this task.

What I was trying to say before you made a quick assumption is that having TTS as wav files makes no sense pragmatically.

Wasn’t saying the current system is bad, it makes sense for recorded wav voices.

Boxer

@Khronik:

Have you read my replies in this thread?

Yes.

What I was trying to say before you made a quick assumption is that having TTS as wav files makes no sense pragmatically.

I know what you are trying to say and I disagree. That happens sometimes when reasonable people debate

Khronik

@Boxer:

Yes.

I know what you are trying to say and I disagree. That happens sometimes when reasonable people debate

Well we agree to disagree, at least I do

Might also be an option to do real voice recordings but not use so many variants, ie. 2 male 2 female (or people as Justin Trudeau would say)

As mentioned, BMS will probably never implement a payware TTS engine (understandably) so using WAV clips is the obvious answer.

To reply directly to the OPs initial question, I’ve already attempted to apply compression and some “radio” efx (bandpass, static, noise) globally to all wav clips and the end result didn’t sound any better in game. If the engine was set to use at least 16bit / 44khz samples then these effects would be much more audible if applied directly to the source.

beppe_goodoldrebel

@beppe_goodoldrebel:

Sure! As for now I have a generic background noise with open and close click , I think IVC effects are more close to what we hear in some RL videos but they are good anyway.

My 2 cents, I think only flight radio calls needs some effects. Maybe even AWACS and tanker… ATC is good as it is now.

This are 2 examples, I might review my thoughts but I think it can be done using IVC effects.
https://www.dropbox.com/s/necirko2f0q24yh/examples.rar?dl=0

IVC effects are in wav files so I’ve used them ( can be found in IVC folder )

HQ-noise-48kHz.wav
mic_in.wav
mic_out.wav
opencircuit.wav

I’ve added this effects to original files with AUDACITY ( can be done massively to all files )
BMS just link wav files to each other to create phrases so , to obtain a similar method to IVC , it can be done like that

ch 1 - (open mic wav) + wav 1 + wav 2 + wav 3 + wav 4 + ( close mic wav )
ch 2 - –----------------------Open circuit wav ------------------------------------
ch 3 - ------------------------HQ-noise-48kHz.wav--------------------------------

the only problem is to repeat background waves and cut them when original phrase is ended - can it be done?

Khronik

@beppe_goodoldrebel:

This are 2 examples, I might review my thoughts but I think it can be done using IVC effects.
https://www.dropbox.com/s/necirko2f0q24yh/examples.rar?dl=0

IVC effects are in wav files so I’ve used them ( can be found in IVC folder )

HQ-noise-48kHz.wav
mic_in.wav
mic_out.wav
opencircuit.wav

I’ve added this effects to original files with AUDACITY ( can be done massively to all files )

BMS just link wav files to each other to create phrases so , to obtain a similar method to IVC , it can be done like that

ch 1 - (open mic wav) + wav 1 + wav 2 + wav 3 + wav 4 + ( close mic wav )
ch 2 - –----------------------Open circuit wav ------------------------------------
ch 3 - ------------------------HQ-noise-48kHz.wav--------------------------------

the only problem is to repeat background waves and cut them when original phrase is ended - can it be done?

Adding clicks or radio noise at start and end isn’t possible because some fragments are single words and numbers.

ie. Raven 6 2 would be 3 samples one for “Raven”, one for “six” and one for “two”. You would have open noises between words and numbers in a single phrase which doesn’t work.

Only the code would be able to insert fx before/after each phrase because it’s aware of the order of what clips would be triggered.

Also, listen to the clips you provided in-game. They sound different at 8bit/22khz, which is the format the SoundBlaster Pro 2 was able to process in 1991.

What can be done with bulk processing is background noise, and EQ filtering. Just remember to zero cross the beginning and end of each clip if not you will get pops and clicks when the clips are triggered back in sim.

sunrrrise

@Khronik:

People want to keep 8 bit 22khz radio voices?

They are perfectly fine for me. More, I think those speeches are the best ever made for any (combat) flight simulator since 1997.

And speaking of nostalgia - GFX should be always better and better, but SFX brings memories. At least my brain works in that way.

Dee-Jay

“mic click” are not always heared on the radio IRL

gza036

@Khronik:

People want to keep 8 bit 22khz radio voices? LMAO…

Are you guys also running 640x480 in 3D for that “classic” experience?

show something better, because TTS sounds like garbage LOL…

Blu3wolf

Depends on the TTS in question.

beppe_goodoldrebel

@Khronik:

Adding clicks or radio noise at start and end isn’t possible because some fragments are single words and numbers.

ie. Raven 6 2 would be 3 samples one for “Raven”, one for “six” and one for “two”. You would have open noises between words and numbers in a single phrase which doesn’t work.

Only the code would be able to insert fx before/after each phrase because it’s aware of the order of what clips would be triggered.

That’s what I meant. Adding mic open and mic close as “new words” in the code.

l3crusader

FYI the code does insert a little sample before and after each sentence. It is subtle but it is there.

beppe_goodoldrebel

@l3crusader:

FYI the code does insert a little sample before and after each sentence. It is subtle but it is there.

which are the related strings or wave files?

Dee-Jay

@beppe_goodoldrebel:

This are 2 examples, I might review my thoughts but I think it can be done using IVC effects.
https://www.dropbox.com/s/necirko2f0q24yh/examples.rar?dl=0

IVC effects are in wav files so I’ve used them ( can be found in IVC folder )

HQ-noise-48kHz.wav
mic_in.wav
mic_out.wav
opencircuit.wav

I’ve added this effects to original files with AUDACITY ( can be done massively to all files )

BMS just link wav files to each other to create phrases so , to obtain a similar method to IVC , it can be done like that

ch 1 - (open mic wav) + wav 1 + wav 2 + wav 3 + wav 4 + ( close mic wav )
ch 2 - –----------------------Open circuit wav ------------------------------------
ch 3 - ------------------------HQ-noise-48kHz.wav--------------------------------

the only problem is to repeat background waves and cut them when original phrase is ended - can it be done?

That filter sounds good beppe_goodoldrebel.

I’ve added this effects to original files with AUDACITY ( can be done massively to all files )

Where is that effect located in Audacity?

beppe_goodoldrebel

@Dee-Jay:

That filter sounds good beppe_goodoldrebel.

Where is that effect located in Audacity?

In the Effect menu, apply all in the same order :
-> highpass
-> lowpass
-> amplify

The pic is relative to the macro I made so many effects can be added with just a click.

Note that in that 2 clips I’ve also added the 2 radio noises from IVC.
This cant be done like massively like Khronik said before, the risk is having pauses between words.
They should be added in the code as a parallel loop in the radio transmission and stopped at the end of the close click.

Suggestion: Make Radio Com sound a bit more Radio like

80

10.6k

23.0k

372.0k