-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Real syllables being labeled as "noise" (point label = -1) #32
Comments
Hi Ar, Great to hear! In this context, 'Noise' just refers to data that hasn't been assigned to a cluster. Pykanto utilizes UMAP and HDBSCAN for a preliminary classification, which you can then adjust interactively. The number of data points without cluster membership depends on: i) The nature of the data, There's no universal set of parameters that work well for all situations. This is because some datasets may defy assumptions made by each algorithm, and in many cases, discrete population-wide categories might not exist. See:
Also, see this bit from the app notes
If you attach a couple of screenshots of the interactive app I can also try to give you more targeted advice. Hope that helps |
Thank you so much for the speedy response! This helped considerably, deepest thanks! |
Shoot! One more question. Once labeling occurs, is it possible to view/get data from songs as a sequence of these new lables? For instance, if I get syllable/unit clusters, A, B, C, D, E, F, and G, is there a way I could see one of the songs in the dataset as CABGDEF or something like this? Thank you again! Best, Ar K |
Yes - here you go! There are no explicit ways to do this in |
Hi!
First of all, thank you so much for your earlier help!
I have gotten everything to run smoothly up to this point, but as I try to open my dataset in the app post-segmentation, I am finding a large amount of syllables being labelled as noise. Given that I did a careful and pretty thorough job of noise-reduction before introducing my recordings into pykanto, there ought not to be much noise left to disregard. Is there anything I can do about this?
Thank you so much!
Ar K
The text was updated successfully, but these errors were encountered: