
[Question] about intersperse function. #75

Open

chep0k opened this issue Sep 4, 2023 · 2 comments

Comments

@chep0k commented Sep 4, 2023

Hi!
During preprocessing, when add_blank is True in hparams, the intersperse function (here) inserts an index that is out of the vocabulary bounds (item=len(symbols)) between each pair of adjacent tokens.
My first guess was that this token plays the role of a pause between tokens, since no pause token is present in the vocabulary, so during training all pauses shift onto this token.
Then, as its name states, I treated it as a blank token, needed to absorb all the "noise" between adjacent tokens so that the other tokens represent cleaner phonemes. I also thought it might be used to learn the transition from one phoneme to another, which is not part of either adjacent phoneme but a separate segment of its own. But if so, why is it a single shared token for all gaps?
So, what is the real purpose of this blank token?
This question is addressed mainly to the authors, but any guesses are welcome.
Thanks in advance.
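For context, an intersperse helper of this kind is commonly written as a short list manipulation; the sketch below is my reconstruction of the behaviour described above (a blank id placed between, before, and after every token), not necessarily the repository's exact code:

```python
def intersperse(lst, item):
    """Insert `item` between every pair of elements of `lst`,
    and also at the start and end of the result."""
    # Allocate a list filled with the blank item; for n input tokens
    # the output has 2n + 1 slots.
    result = [item] * (len(lst) * 2 + 1)
    # Place the original tokens at the odd positions 1, 3, 5, ...
    result[1::2] = lst
    return result

# Example: token ids [5, 6, 7] with blank id 0
print(intersperse([5, 6, 7], 0))  # [0, 5, 0, 6, 0, 7, 0]
```

Note that the output length is always 2·len(lst) + 1, so the blank also pads both ends of the sequence, not just the gaps between tokens.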

@chnk58hoang

Can someone explain the real purpose of the intersperse function? I'm a little confused by it.

@chep0k
Author

chep0k commented Sep 9, 2024

> Can someone explain the real purpose of the intersperse function? I'm a little confused by it.

For as long as I have been working with Grad-TTS, I have treated the interspersed token (the item argument of that function) as a kind of "space" token: it is inserted between every two adjacent phonemes and denotes the amount of "silence" between them that the model should learn to pronounce. Thus each non-"space" token is filled only with sound immediately relevant to that token, while all pauses, skips, and spaces are delegated to the "space" token. Moreover, with noisy data, irrelevant background buzz can be absorbed into these tokens, purging it from the actual phonemes. Otherwise, that is, if this "space" token were omitted, all the noise and silence would have no choice but to be memorised as part of the actual phoneme tokens, contaminating them.
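One practical consequence of item=len(symbols) is worth noting: the blank id is one past the last valid symbol index, so the model's embedding table must be sized len(symbols) + 1 when add_blank is enabled. A minimal sketch, assuming a hypothetical symbols list (the names here are illustrative, not the repository's actual API):

```python
# Hypothetical toy vocabulary; in the real repo this comes from the
# text/symbols module.
symbols = ["a", "b", "c"]

def intersperse(lst, item):
    """Insert `item` between every pair of elements, and at both ends."""
    result = [item] * (len(lst) * 2 + 1)
    result[1::2] = lst
    return result

# The blank id equals len(symbols), i.e. an index outside the vocabulary,
# so the embedding layer needs len(symbols) + 1 rows to cover it.
blank_id = len(symbols)
sequence = [0, 1, 2]  # e.g. token ids for "a b c"
print(intersperse(sequence, blank_id))  # [3, 0, 3, 1, 3, 2, 3]
```

Because the blank is a single shared id, every gap maps to the same embedding; whatever acoustic content the gaps carry (pauses, transitions, background noise) is pooled into that one learned vector rather than split across per-gap tokens.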
