
Can't reproduce paper #142

Open

dkun7944 opened this issue Sep 23, 2024 · 5 comments

Comments

@dkun7944

I'm trying to replicate the ICASSP 2022 paper result (A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation).

Having some trouble getting the model to converge. I used all the hyperparameters mentioned in the paper, but the loss plateaus at roughly 0.35 for note loss, 0.4 for onset loss, and 0.3 for contour loss. The code has a learning rate scheduler, but that doesn't seem to help – eventually early stopping kicks in.

I've tried:

  • adjusting the learning rate up and down by an order of magnitude
  • training on subsets of the total dataset (only MAESTRO, only GuitarSet, etc.)
  • training on CPU (M1 Max) and GPU (1x A10)
  • training with and without contours

But all yield the same result.

The paper mentions a weighted binary crossentropy:

"Binary cross entropy is used as the loss function for each
output, and the total loss is the sum of the three losses. However,
for Yo, there is a heavy class imbalance that drives models to output
Yo = 0 everywhere. As a countermeasure, we use a class-balanced
cross entropy loss, where the weight for the negative class is 0.05
and the positive is 0.95"

So I also enabled this in the training arguments. I had to fix the weighted_transcription_loss function in models.py since it was outputting the wrong dimension. I'm about 95% sure I got it right:

def weighted_transcription_loss(
    y_true: tf.Tensor, y_pred: tf.Tensor, label_smoothing: float, positive_weight: float = 0.5
) -> tf.Tensor:
    """The transcription loss where the positive and negative true labels are balanced by a weighting factor.

    Args:
        y_true: The true labels.
        y_pred: The predicted labels.
        label_smoothing: Smoothing factor. Squeezes labels towards 0.5.
        positive_weight: Weighting factor for the positive labels.

    Returns:
        The weighted transcription loss.
    """
    # Per-element weights: positive_weight where the label is 1, (1 - positive_weight) elsewhere.
    weights = tf.where(tf.equal(y_true, 1), positive_weight, 1 - positive_weight)
    # Expand the last axis so binary_crossentropy's reduction over it yields a
    # per-element loss, then apply the class weights and average.
    bce = tf.keras.losses.binary_crossentropy(
        tf.expand_dims(y_true, -1), tf.expand_dims(y_pred, -1), label_smoothing=label_smoothing
    )
    return tf.reduce_mean(weights * bce)

But regardless of whether I use weighted or unweighted loss, I get the same result.
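For reference, here's a minimal standalone sketch of the class-balanced cross entropy the paper describes, in plain NumPy (a hypothetical helper for comparison, not the repo's implementation), with the 0.95/0.05 positive/negative weighting applied:

```python
import numpy as np

def class_balanced_bce(y_true, y_pred, positive_weight=0.95, eps=1e-7):
    """Class-balanced binary cross entropy, per the paper's description for Yo."""
    y_pred = np.clip(y_pred, eps, 1 - eps)
    # Per-element binary cross entropy.
    bce = -(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))
    # Positives weighted by positive_weight, negatives by (1 - positive_weight),
    # so the model isn't driven to predict Yo = 0 everywhere.
    weights = np.where(y_true == 1, positive_weight, 1 - positive_weight)
    return float(np.mean(weights * bce))
```

With these weights, a missed positive costs 19x more than an equally confident false positive, which is what's supposed to keep the onset output from collapsing to all zeros.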

Any advice?

@dkun7944
Author

dkun7944 commented Oct 2, 2024

@drubinstein ?

@drubinstein
Contributor

@rabitt

@dkun7944
Author

dkun7944 commented Oct 2, 2024

Just found @bgenchel's TensorBoard screenshot (#136), where the total loss converges at ~0.99. That's basically the result I'm getting, so maybe the code is working as intended? I guess I just expected the loss to go lower.

@bgenchel
Collaborator

bgenchel commented Oct 4, 2024

Hey Daniel, one possible explanation for your results (and mine) is that neither of us is training on the full set of data used in the paper, parts of which are not publicly available. Could you list which of the datasets you're currently using to train?

@dkun7944
Author

dkun7944 commented Oct 4, 2024

@bgenchel I have tried MAESTRO and GuitarSet. I didn't realize the paper used non-public data. What proportion of the training set in the paper is proprietary? My goal is to train on a self-generated synthetic dataset, but I first want to validate that the training code is working properly.

3 participants