Skip to content

Mixed precision and Tensorflow XLA #322

Discussion options

You must be logged in to vote

Thanks for opening! I think this is a great callout!

We are just progressing to covering modeling workflows in KerasNLP, but I think what you are saying is spot on. We should really strive to have mixed-precision and XLA not just supported, but enabled as the default option wherever possible. They are both huge speedups.

For XLA, I believe that most components where we need support either have it, or there is work happening to support it (e.g. beam search utility).

For mixed precision, we do need to figure out how to instantiate say a BERT network with mixed precision along with checkpointed weights. Looks like @jbischof just opened a bug, so will kick off a comment there. #323

Replies: 3 comments 4 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
4 replies
@bhack
Comment options

@abheesht17
Comment options

@bhack
Comment options

@mattdangerw
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by mattdangerw
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
5 participants