Skip to content

Commit

Permalink
remove exact timings.
Browse files Browse the repository at this point in the history
  • Loading branch information
markus583 committed Sep 22, 2024
1 parent 1e15560 commit b9198d0
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -60,13 +60,13 @@ sat = SaT("sat-3l-sm", onnx_providers=["CUDAExecutionProvider"])
# PyTorch GPU
>>> model = SaT("sat-3l-sm")
>>> model.half().to("cuda")
>>> %timeit list(model.split(texts))
138 ms ± 8.41 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)
>>> list(model.split(texts))
# quite fast already, but...

# onnxruntime GPU
>>> model = SaT("sat-3l-sm", ort_providers=["CUDAExecutionProvider"])
>>> %timeit list(model.split(texts))
198 ms ± 1.36 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)
# ...this should be ~50% faster!
```

If you wish to use LoRA in combination with an ONNX model:
Expand Down

0 comments on commit b9198d0

Please sign in to comment.