
About the rich feedback model release #3

Open
srymaker opened this issue Jun 20, 2024 · 9 comments

@srymaker

Thanks for your great work!
Will the rich feedback model be released? I'd love to test the model and apply it to my own tasks!

@leebird
Collaborator

leebird commented Jun 22, 2024

Hello, we don't currently have plans to release the model.

@densechen

@leebird Looking forward to the rich feedback model...

@udrs

udrs commented Jun 25, 2024

Looking forward

@leebird
Collaborator

leebird commented Jun 25, 2024

Thanks for all the interest in our work! Due to company policies (related to productization, etc.), we cannot open-source the model. We have included details on how to reproduce the results in our paper. If you have further questions, please email the corresponding authors, and we'd be happy to help you reproduce the results.

@srymaker
Author

Hello, could you tell me how you trained the reward model, e.g., which layers were frozen and which tuning method was used?

@leebird
Collaborator

leebird commented Jun 29, 2024

Hi @srymaker, we fine-tuned all the layers in the model, including the ViT component. We tried freezing the ViT component, but it didn't work well, especially for the heatmap tasks. Experimental details, including hyperparameters and the optimizer, can be found in Section 9 of the paper.
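
For illustration only, here is a minimal PyTorch sketch of the difference between fine-tuning all layers and the frozen-ViT ablation described above. The `.vit` attribute name and the helper itself are assumptions for this sketch, not the paper's actual code.

```python
import torch.nn as nn

def configure_trainable(model: nn.Module, freeze_vit: bool = False) -> None:
    # Full fine-tuning: every parameter, including the ViT component, is updated.
    for param in model.parameters():
        param.requires_grad = True
    if freeze_vit:
        # Ablation: keep the vision tower fixed
        # (reported above to hurt the heatmap tasks).
        for param in model.vit.parameters():
            param.requires_grad = False
```

Calling `configure_trainable(model)` corresponds to the full fine-tuning setting, while `configure_trainable(model, freeze_vit=True)` corresponds to the frozen-ViT ablation.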

@srymaker
Author

srymaker commented Jul 1, 2024

Thank you for your answer. Does "all layers" refer to the encoder and decoder in T5?

@leebird
Collaborator

leebird commented Jul 7, 2024

@srymaker yes, all the layers are from the ViT and the T5 encoder/decoder. Note that there is a pretraining stage for the ViT and T5 layers on multimodal data, as they were originally pretrained on unimodal data only.
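
For readers trying to picture this setup, here is a rough, unofficial sketch of the data flow implied by that description: a ViT encodes the image into patch features, those features are projected into the T5 embedding space, concatenated with the text-token embeddings, and the T5 encoder/decoder runs on the fused sequence. All module names, dimensions, and the projection layer are assumptions for illustration, not the paper's code.

```python
import torch
import torch.nn as nn

class VitT5Sketch(nn.Module):
    def __init__(self, vit: nn.Module, t5: nn.Module, vit_dim: int, t5_dim: int):
        super().__init__()
        self.vit = vit                          # vision encoder -> patch features
        self.t5 = t5                            # T5 encoder/decoder stack
        self.proj = nn.Linear(vit_dim, t5_dim)  # map ViT features to the T5 embedding width

    def forward(self, pixel_values, text_embeds, decoder_input_ids):
        image_feats = self.vit(pixel_values)    # (B, num_patches, vit_dim)
        image_embeds = self.proj(image_feats)   # (B, num_patches, t5_dim)
        # Prepend the projected image tokens to the text-token embeddings and
        # run the fused sequence through the T5 encoder/decoder.
        fused = torch.cat([image_embeds, text_embeds], dim=1)
        return self.t5(inputs_embeds=fused, decoder_input_ids=decoder_input_ids)
```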

@Nieleilei

> @srymaker yes, all the layers are from the ViT and the T5 encoder/decoder. Note that there is a pretraining stage for the ViT and T5 layers on multimodal data, as they were originally pretrained on unimodal data only.

Hello, for the unimodal pretraining tasks, is only the natural-image captioning task on the WebLI dataset used? What other tasks are included?
