Question on MobileNet SSD architechture with focal loss. #5

MAGI003769 · 2018-03-22T13:18:02Z

Hello! First of all thanks for your implementation. You're really awesome.
My question is how to merger the focal loss with SSD architecture as I'm know working on SSD for my project.

Is it correct that we just replace the original softmax loss by focal loss? Or, it is necessary to apply it to location loss as well?
As the strength of focal loss is to solve the class imbalance, should I remove the the hard negative mining operations mentioned in SSD paper? What's your idea when your implementation?

Thanks a lot for your brilliant work and patience to read these questions. Look forward your reply.

ChiefGodMan · 2018-03-24T16:25:12Z

Hi, thanks for your praise.

You just need to replace softmax loss by focal loss of classification.
You can remove hard mining operations, but I choose to change the config file to (num_hard_example=20000, max_neg_per_pos=1000, max_total_detection=20000) instead. Actually you can have a try.

MAGI003769 · 2018-03-29T05:44:21Z

Thanks for your answering.

The output shape of original softmax loss tensor is (batch_size, num_box) which means a scalar value for each box. But, in your implementation, the return of focal loss is just a scalar. Is that means one loss value for each batch??? Doesn't the last line of code tf.reduce_sum() need to specify the axis, maybe -1 ? I think each single sample oriented by loss is a box rather than a batch.

Could you please see this problem at your earliest convenience?

ChiefGodMan · 2018-03-31T16:42:51Z

Oh, my god. The latest version of models has changed the loss function return value. My code is for previous version(maybe before version 1.2). You just need return per_entry_cross_ent variable rather then tf.reduce_sum() result.
Actually, the models has implemented focal loss called 'class SigmoidFocalClassificationLoss(Loss)', you can have a try.

GallonDeng · 2019-02-15T04:35:17Z

hi, @ailias @MAGI003769 , thus I can directly use tensorflow object_detection models api to merge focal loss with SSD. However, I am not sure,
whether it can directly be used for multi-label dataset or not? Should I do some modifications? Thanks

DonghoonPark12 mentioned this issue May 23, 2019

[Question] How to implement focal loss on ssd300 model pierluigiferrari/ssd_keras#248

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question on MobileNet SSD architechture with focal loss. #5

Question on MobileNet SSD architechture with focal loss. #5

MAGI003769 commented Mar 22, 2018

ChiefGodMan commented Mar 24, 2018

MAGI003769 commented Mar 29, 2018

ChiefGodMan commented Mar 31, 2018

GallonDeng commented Feb 15, 2019

Question on MobileNet SSD architechture with focal loss. #5

Question on MobileNet SSD architechture with focal loss. #5

Comments

MAGI003769 commented Mar 22, 2018

ChiefGodMan commented Mar 24, 2018

MAGI003769 commented Mar 29, 2018

ChiefGodMan commented Mar 31, 2018

GallonDeng commented Feb 15, 2019