forked from open-mmlab/mmpretrain
-
Notifications
You must be signed in to change notification settings - Fork 0
/
metafile.yml
89 lines (88 loc) · 2.86 KB
/
metafile.yml
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
Collections:
- Name: OFA
Metadata:
Architecture:
- ResNet
- Transformer
Training Data:
- CC12M
- CC3M
- SBU
- COCO
- VG
- VQAv2
- GQA
- RefCOCO
- OpenImages
- Object365
- YFCC100M
- ImageNet-21K
- Pile
Paper:
Title: 'OFA: Unifying Architectures, Tasks, and Modalities Through a Simple
Sequence-to-Sequence Learning Framework'
URL: https://arxiv.org/abs/2202.03052
README: configs/ofa/README.md
Models:
- Name: ofa-base_3rdparty-finetuned_refcoco
Metadata:
FLOPs: null
Parameters: 182238536
In Collection: OFA
Results:
- Task: Visual Grounding
Dataset: RefCOCO
Metrics:
Accuracy (testA): 90.49
Accuracy (testB): 83.63
Weights: https://download.openmmlab.com/mmclassification/v1/ofa/ofa-base_3rdparty_refcoco_20230418-2797d3ab.pth
Config: configs/ofa/ofa-base_finetuned_refcoco.py
Converted From:
Weights: https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/refcoco_base_best.pt
Code: https://github.com/OFA-Sys/OFA
- Name: ofa-base_3rdparty-finetuned_vqa
Metadata:
FLOPs: null
Parameters: 182238536
In Collection: OFA
Results:
- Task: Visual Question Answering
Dataset: VQAv2
Metrics:
Accuracy: 78.00 # Report from the official repo
Weights: https://download.openmmlab.com/mmclassification/v1/ofa/ofa-base_3rdparty_coco-vqa_20230418-f38539a5.pth
Config: configs/ofa/ofa-base_finetuned_vqa.py
Converted From:
Weights: https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/vqa_large_best.pt
Code: https://github.com/OFA-Sys/OFA
- Name: ofa-base_3rdparty-finetuned_caption
Metadata:
FLOPs: null
Parameters: 182238536
In Collection: OFA
Results:
- Task: Image Caption
Dataset: COCO
Metrics:
BLEU-4: 42.64
CIDER: 144.50
Weights: https://download.openmmlab.com/mmclassification/v1/ofa/ofa-base_3rdparty_coco-caption_20230418-de18914e.pth
Config: configs/ofa/ofa-base_finetuned_caption.py
Converted From:
Weights: https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/caption_base_best.pt
Code: https://github.com/OFA-Sys/OFA
- Name: ofa-base_3rdparty-zeroshot_vqa
Metadata:
FLOPs: null
Parameters: 182238536
In Collection: OFA
Results:
- Task: Visual Question Answering
Dataset: VQAv2
Metrics:
Accuracy: 58.32
Weights: https://download.openmmlab.com/mmclassification/v1/ofa/ofa-base_3rdparty_pretrain_20230418-dccfc07f.pth
Config: configs/ofa/ofa-base_zeroshot_vqa.py
Converted From:
Weights: https://ofa-beijing.oss-cn-beijing.aliyuncs.com/checkpoints/ofa_base.pt
Code: https://github.com/OFA-Sys/OFA