Comparing between two audio #23

Open

leonardltk opened this issue Apr 9, 2020 · 3 comments

@leonardltk

Hi,

According to the README, everything seems to be end-to-end when running the benchmark across the whole covers80 dataset.

Is there a way to simply compare two audio files using any of the algorithms and determine whether they are indeed cover versions of each other?

@furkanyesiler (Owner)

Hi @leonardltk!

The way almost all of these algorithms (including the ones that are not in the repo) work is that they estimate a similarity/dissimilarity score among a collection of songs. These similarity scores are sorted to compute a number of performance metrics common in Information Retrieval tasks, e.g. mean average precision, mean rank, and the number of relevant items in top-1. As a result, the absolute values of these similarity scores do not necessarily mean anything on their own. What we care about more is whether, when we give a query, the algorithm returns a relevant item (in our case, a cover) among the first retrieved results.
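
To make that concrete, here is a minimal, self-contained sketch (toy numbers, not the repo's actual evaluation code) of turning a pairwise distance matrix into the mean rank and top-1 count mentioned above:

    import numpy as np

    # Hypothetical 3-query, 3-item distance matrix (lower = more similar);
    # relevant[q] is the index of the true cover for query q.
    distances = np.array([[0.10, 0.80, 0.75],
                          [0.90, 0.20, 0.85],
                          [0.70, 0.95, 0.60]])
    relevant = [0, 1, 2]

    ranks = []
    for q, rel in enumerate(relevant):
        order = np.argsort(distances[q])  # ascending: best match first
        ranks.append(int(np.where(order == rel)[0][0]) + 1)  # 1-based rank

    print("mean rank:", np.mean(ranks))                 # 1.0 for this toy data
    print("relevant items in top-1:", sum(r == 1 for r in ranks))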

Depending on the algorithm you use, you can inspect the distance (or similarity score) distributions to set a threshold. For example, if the distances of covers lie between 0 and 0.4, and the distances of non-covers lie between 0.3 and 0.9, you can then set a threshold considering whether precision or recall is more important to you. Keep in mind that these distance distributions are likely to differ depending on the algorithm you use.
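
For instance, with a labelled validation set you could sweep candidate thresholds and watch the trade-off directly. A minimal sketch, assuming the hypothetical distance ranges above (lower distance = more similar):

    import numpy as np

    # Hypothetical labelled distances, matching the ranges in the example above
    cover_d = np.random.uniform(0.0, 0.4, 200)      # distances for true cover pairs
    noncover_d = np.random.uniform(0.3, 0.9, 2000)  # distances for non-cover pairs

    for t in np.arange(0.2, 0.65, 0.05):
        tp = np.sum(cover_d <= t)     # covers correctly accepted
        fp = np.sum(noncover_d <= t)  # non-covers wrongly accepted
        fn = np.sum(cover_d > t)      # covers wrongly rejected
        precision = tp / max(tp + fp, 1)
        recall = tp / max(tp + fn, 1)
        print(f"threshold={t:.2f}  precision={precision:.2f}  recall={recall:.2f}")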

I hope this answers your question. Please let me know if you have any further questions!

@leonardltk (Author)

Thanks for your reply!

In that case, how do you suggest I proceed with this problem?

Consider a database of 80 original songs and a collection of 1000 queries. It is unknown how many of these queries are covers of the 80, and there might be none at all.

An approach I'm considering right now is this:
Judging by how the acoss.algorithms.*.similarity() methods are implemented, they appear to be deterministic. So I could fix a threshold value of, let's say, 0.4, and at test time follow how the similarity is computed to decide whether the test file is a cover of the original.

For example, rqa_serra09.py has the following similarity measure:

    def similarity(self, idxs):
        for i,j in zip(idxs[:, 0], idxs[:, 1]):
            query = self.load_features(i)
            reference = self.load_features(j)
            # create instance of cover similarity algorithms from essentia
            crp_algo = ChromaCrossSimilarity(frameStackSize=self.m, 
                                            frameStackStride=self.tau, 
                                            binarizePercentile=self.kappa, 
                                            oti=self.oti)
            alignment_algo = CoverSongSimilarity(alignmentType='serra09', distanceType='symmetric')
            # compute similarity
            csm = crp_algo(query, reference)
            _, score = alignment_algo(csm)
            for key in self.Ds.keys():
                self.Ds[key][i][j] = score

Does this mean I can use a function like the following at test time?

    def test_ij(self, i, j, threshold):
        query = self.load_features(i)
        reference = self.load_features(j)

        # create instance of cover similarity algorithms from essentia
        crp_algo = ChromaCrossSimilarity(frameStackSize=self.m,
                                         frameStackStride=self.tau,
                                         binarizePercentile=self.kappa,
                                         oti=self.oti)
        alignment_algo = CoverSongSimilarity(alignmentType='serra09', distanceType='symmetric')

        # compute similarity and apply the fixed decision threshold:
        # scores at or above it are treated as covers
        csm = crp_algo(query, reference)
        _, score = alignment_algo(csm)

        return score >= threshold

One problem I foresee with using this is that the scores are unnormalised, as I did not take normalize_by_length into account.
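
If it helps, one plausible way to fold that in (an assumption on my part; the actual normalize_by_length in acoss may do something different) would be a small helper like this:

    def normalized_score(score, csm):
        # Assumed normalization: divide the raw alignment score by the
        # query-axis length of the cross-similarity matrix, so that longer
        # tracks do not get larger scores purely from their length. Check
        # acoss's normalize_by_length for its exact behaviour.
        return score / max(csm.shape[0], 1)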

@mayassin commented Oct 15, 2021

Hey @leonardltk! I am currently facing a problem similar to the one you mentioned. How were you able to solve it?
My problem is that, given an audio cover, I am trying to retrieve the best matching song for it if it exists and return null if it does not.
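
What I have in mind is something like this sketch (the function name and the higher-is-more-similar score convention are just assumptions on my part, following test_ij above):

    def best_match_or_none(scores, threshold):
        # scores: dict mapping database song id -> similarity score for one
        # query (higher = more similar, as in test_ij above).
        # Return the id of the best match, or None when even the best score
        # does not clear the threshold, i.e. no cover exists in the database.
        best_id = max(scores, key=scores.get)
        return best_id if scores[best_id] >= threshold else None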
