Some quantization models require loading Hugging Face datasets that include custom code, and by default the HF datasets library runs a time-limited interactive prompt on the command line to confirm whether that remote code should be trusted.
This behavior is not suitable for non-interactive, automated quantization processes.
Information
The official example scripts
My own modified scripts
Tasks
An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
Expected behavior
The quantization process should succeed without trust_remote_code related errors.
actual behavior
The quantization process freezes in load_calib_dataset() in tensorrt_llm/models/convert_utils.py, waiting for the user to confirm whether to enable trust_remote_code.
additional notes
We should modify load_calib_dataset() in one of the following ways:
Remove trust_remote_code from the arguments of load_calib_dataset().
Change the default value of trust_remote_code to False and pass it to datasets.load_dataset().
In either case, trust_remote_code should be set as an argument when calling load_calib_dataset() if the dataset specified by dataset_name_or_dir includes custom code.
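The second option could be sketched roughly as follows. This is a simplified, hypothetical stand-in, not the actual TensorRT-LLM implementation: the function name load_calib_dataset_sketch and the injectable loader parameter are assumptions added for illustration, and it assumes a datasets release whose load_dataset() accepts trust_remote_code.

```python
from typing import Any, Callable, Optional


def load_calib_dataset_sketch(dataset_name_or_dir: str,
                              split: Optional[str] = None,
                              key: Optional[str] = None,
                              trust_remote_code: bool = False,
                              loader: Optional[Callable[..., Any]] = None,
                              **kwargs: Any) -> Any:
    """Hypothetical, simplified variant of load_calib_dataset().

    The flag defaults to False and is always forwarded explicitly, so the
    loader receives a definite answer instead of having to ask the user.
    """
    if loader is None:
        # Imported lazily so the sketch can be exercised without `datasets`.
        from datasets import load_dataset as loader

    dataset = loader(dataset_name_or_dir,
                     split=split,
                     trust_remote_code=trust_remote_code,
                     **kwargs)
    # Return a single column when a key is given, otherwise the whole dataset.
    return dataset if key is None else dataset[key]
```

With this shape, a caller that knows the dataset ships custom code opts in with trust_remote_code=True, and every other caller gets a non-interactive failure rather than a frozen process.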
My suggestion is to add a calib_trust_remote_code argument to the command-line options of quantize.py, and to other scripts in a similar situation.
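A minimal sketch of such a command-line option, using only argparse. The flag name calib_trust_remote_code comes from the suggestion above; the --calib_dataset option shown next to it, its default, and the helper calib_loader_kwargs are illustrative assumptions and may not match the real script.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser holding only the calibration options relevant here."""
    parser = argparse.ArgumentParser(
        description="Sketch of calibration-related CLI options")
    # Illustrative option; the real script may define it differently.
    parser.add_argument("--calib_dataset", type=str, default=None,
                        help="Name or local directory of the calibration dataset.")
    # Proposed flag: off by default, so automated runs never trust remote
    # code unless the operator opts in explicitly.
    parser.add_argument("--calib_trust_remote_code", action="store_true",
                        help="Allow the calibration dataset to run its custom loading code.")
    return parser


def calib_loader_kwargs(args: argparse.Namespace) -> dict:
    """Translate parsed CLI options into keyword arguments for the dataset loader."""
    return {"trust_remote_code": args.calib_trust_remote_code}
```

The resulting dictionary would then be passed through to load_calib_dataset(), so the decision is made once on the command line and never interactively.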
System Info
Who can help?
The argument trust_remote_code is never used in tensorrt_llm.models.convert_utils.load_calib_dataset(). @Tracin
https://github.com/NVIDIA/TensorRT-LLM/blob/v0.15.0/tensorrt_llm/models/convert_utils.py#L284
TensorRT-LLM/examples/quantization/quantize.py
Lines 30 to 50 in 340a1b6