Skip to content

Commit

Permalink
clean up
Browse files Browse the repository at this point in the history
  • Loading branch information
Kye committed Oct 2, 2023
1 parent fecb85d commit 529b594
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -63,6 +63,10 @@ preds = v(img) # (1, 1000)
print(preds)
```

# Model Architecture
- Regular vit with new parallel layers, QK(Query/Key)Normalization, and omitted biases.

----
# Dataset Strategy
The paper trains ViT-22B on a version of the JFT dataset that has been extended to around 4 billion images. JFT is a large-scale dataset scraped from the internet, originally containing over 300 million images labeled with a hierarchical taxonomy of 30,000 categories.

Expand Down

0 comments on commit 529b594

Please sign in to comment.