Commit

Add AliBi to supported features in README_GAUDI.md (HabanaAI#287)
ALiBi was fixed in HabanaAI#254, so it
should be added to supported features in README.
kzawora-intel authored Oct 7, 2024
2 parents db5aed6 + 347f9c7 commit 902f575
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion README_GAUDI.md
@@ -81,14 +81,14 @@ Supported Features
- Inference with [HPU
Graphs](https://docs.habana.ai/en/latest/PyTorch/Inference_on_PyTorch/Inference_Using_HPU_Graphs.html)
for accelerating low-batch latency and throughput
+ - Attention with Linear Biases (ALiBi)
- INC quantization

Unsupported Features
====================

- Beam search
- LoRA adapters
- - Attention with Linear Biases (ALiBi)
- AWQ quantization
- Prefill chunking (mixed-batch inferencing)

2 changes: 1 addition & 1 deletion docs/source/getting_started/gaudi-installation.rst
@@ -76,14 +76,14 @@ Supported Features
- Tensor parallelism support for multi-card inference
- Inference with `HPU Graphs <https://docs.habana.ai/en/latest/PyTorch/Inference_on_PyTorch/Inference_Using_HPU_Graphs.html>`__
for accelerating low-batch latency and throughput
+ - Attention with Linear Biases (ALiBi)
- INC quantization

Unsupported Features
====================

- Beam search
- LoRA adapters
- - Attention with Linear Biases (ALiBi)
- AWQ quantization
- Prefill chunking (mixed-batch inferencing)
