The Llama inference example needs to be updated to maintain parity with transformers==4.36 #14

sol0invictus · 2024-01-24T01:12:05Z

Line 36 in a80091d

LlamaDecoderLayer,

The Llama inference example needs to be updated because, as of transformers==4.36, the LlamaDecoderLayer constructor takes an additional argument, layer_idx.
https://github.com/huggingface/transformers/blob/v4.37.0/src/transformers/models/llama/modeling_llama.py#L754

jluntamazon · 2024-01-24T18:13:52Z

Thank you for the code reference! We reproduced the problem and intend to include a fix for this example in the upcoming release.

As an immediate solution, you can either downgrade the transformers version or update the code to handle the layer_idx argument.
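For the second option, one possible shape of the change is sketched below. It is only an illustration, not the fix that will ship: `make_decoder_layer`, `OldStyleLayer`, and `NewStyleLayer` are hypothetical names, and the two stand-in classes merely mimic a constructor without and with `layer_idx` so the snippet runs without transformers installed. The idea is to check the constructor signature and pass `layer_idx` only when it is accepted, so the same code works on either side of the 4.36 change.

```python
import inspect


def make_decoder_layer(layer_cls, config, layer_idx):
    """Build a decoder layer, passing layer_idx only if the constructor accepts it.

    Hypothetical helper for illustration; not part of transformers or the example repo.
    """
    params = inspect.signature(layer_cls).parameters
    if "layer_idx" in params:
        return layer_cls(config, layer_idx=layer_idx)
    return layer_cls(config)


# Stand-ins (assumptions) that mimic the two constructor shapes.
class OldStyleLayer:
    def __init__(self, config):
        self.config = config
        self.layer_idx = None


class NewStyleLayer:
    def __init__(self, config, layer_idx):
        self.config = config
        self.layer_idx = layer_idx


config = {"num_hidden_layers": 2}  # placeholder for a real model config
old_layers = [make_decoder_layer(OldStyleLayer, config, i) for i in range(2)]
new_layers = [make_decoder_layer(NewStyleLayer, config, i) for i in range(2)]
```

In the real example, `layer_cls` would be the imported `LlamaDecoderLayer` and `layer_idx` the position of the layer in the decoder stack.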

aws-taylor added the documentation and Trn1 labels · Nov 11, 2024
