Replies: 1 comment
-
You should define your own pyannote database protocol (as you'll see in the README, there is a couple of ways to do this), and leave the train and dev sets empty (which is fine for pyannote). If i'm not mistaken, pyannote doesn't care if audio files in the test set don't contain annotations (although, you obviously won't be able to score the performance of the model). |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
The tutorials give some really nice examples on how to apply a pretrained model to a single test file, but it's not clear to me how to apply a pretrained model to a large set of my own data (parallelized in batches to speed up processing). One of the tutorials (https://github.com/pyannote/pyannote-audio/tree/master/tutorials/pretrained/pipeline) mentions defining one's own protocol and points to the data prep tutorial (https://github.com/pyannote/pyannote-audio/tree/master/tutorials/data_preparation), but that tutorial seems to be mostly related to training. What is required to create a protocol for unannotated test data?
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions