turbulence component is missing

Question

turbulence component is missing

belm0 opened this issue 7 years ago · 4 comments

voc has omitted the turbulence (a.k.a. fricative) component of the tract model, where noise is generated at the location where the tract is narrowed.

Without the turbulence component it's impossible to emit fricative consonants like "v", "z", "th", etc.

Refer to PT code Tract.addTurbulenceNoise(), etc.

Answer 1 · 2018-01-31T19:00:43.000Z

Yes, I believe I left this out in order to simplify the model. If it doesn't require too much additional code, I would be amenable to accepting a PR adding it.

Answer 2 · 2018-05-09T07:11:10.000Z

Hi Paul,
I intend to make use of the idea in voc that is the ability to put in some numbers and getting as output the audio of vocal tract.
I see potential to estimate the reverse of this function by making use of recurrent neural network to recognize voice inputs etc.. using much lesser human effort.
For this I will have to generate random numbers and corresponding outputs into an output file that is read by a python script using tensorflow etc.
But as you point out here, that some important component is missing, do you think using voc code might not be as useful?
What do you sugget?

Answer 3 · 2018-05-09T08:40:22.000Z

With the current implementation, you will still get vowel sounds. You just won't be able to produce fricatives. There's still plenty that RNNs can do here with regards to finding suitable tract diameters.

Answer 4 · 2018-05-10T02:44:03.000Z

@vaibhawc you may want to consider what level you want the neural network to control constriction. Voc requires direct diameter control of the entire vocal tract length, while Pink Trombone has high-level constrictions which take care of creating a natural curve around the constriction point as well as restoring the tract gradually when the constriction is removed.

Unless you have some processing constraint requiring a c++ implementation of the synth, you may want to use the original Pink Trombone and e.g. control it from a websocket.

I've refactored Pink Trombone to separate the UI and synth so it's more suitable as a library or verbatim porting, which I hope to share eventually (not able to yet).