Skip to content

Commit

Permalink
Lower TGI IE batch size (#71)
Browse files Browse the repository at this point in the history
* feat(TGI): entrypoint's default batch size set to 2

To avoid memory issues, default batch size is now lowered.

* chore: version bumped to 0.1.3
  • Loading branch information
tengomucho authored Jul 9, 2024
1 parent 77bebf8 commit e09a66b
Show file tree
Hide file tree
Showing 3 changed files with 3 additions and 3 deletions.
2 changes: 1 addition & 1 deletion optimum/tpu/version.py
Original file line number Diff line number Diff line change
Expand Up @@ -15,5 +15,5 @@
from pkg_resources import parse_version


__version__ = "0.1.2"
__version__ = "0.1.3"
VERSION = parse_version(__version__)
2 changes: 1 addition & 1 deletion text-generation-inference/docker/entrypoint.sh
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ ulimit -l 68719476736

# Hugging Face Hub related
if [[ -z "${BATCH_SIZE}" ]]; then
BATCH_SIZE=4
BATCH_SIZE=2
fi
export BATCH_SIZE="${BATCH_SIZE}"

Expand Down
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
from pkg_resources import parse_version


__version__ = "0.1.2"
__version__ = "0.1.3"
VERSION = parse_version(__version__)

0 comments on commit e09a66b

Please sign in to comment.