
trust_remote_code argument ignored in load_calib_dataset() #2537

Open
hiroshi-matsuda-rit opened this issue Dec 5, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@hiroshi-matsuda-rit

hiroshi-matsuda-rit commented Dec 5, 2024

System Info

  • CPU: Any
  • GPU: Any
  • TensorRT-LLM main branch (including <= v0.15.0)
  • NVIDIA driver: Any
  • OS: Any

Who can help?

The trust_remote_code argument is never used in tensorrt_llm.models.convert_utils.load_calib_dataset(). @Tracin
https://github.com/NVIDIA/TensorRT-LLM/blob/v0.15.0/tensorrt_llm/models/convert_utils.py#L284

Some quantization models require loading Hugging Face datasets that include custom code, and by default the HF datasets library runs a time-limited interactive prompt on the command line asking whether to trust the remote code.
This behavior is not suitable for non-interactive, automated quantization processes.
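One way to address this is to actually forward the argument to datasets.load_dataset(). A minimal sketch, assuming the rough shape of load_calib_dataset() in v0.15.0; parameter names other than trust_remote_code are illustrative, not the exact TensorRT-LLM signature:

```python
def load_calib_dataset(dataset_name_or_dir,
                       config_name=None,
                       split=None,
                       key=None,
                       trust_remote_code=True,
                       **kwargs):
    # Imported lazily so the sketch is self-contained.
    import datasets

    # The fix: pass trust_remote_code through instead of dropping it,
    # so non-interactive runs never hit the interactive prompt.
    dataset = datasets.load_dataset(dataset_name_or_dir,
                                    name=config_name,
                                    split=split,
                                    trust_remote_code=trust_remote_code,
                                    **kwargs)
    return dataset[key] if key is not None else dataset
```

Callers that run unattended would then pass trust_remote_code explicitly rather than relying on the prompt.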

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

MODEL_DIR=Llama-3.3-70B-Instruct
DTYPE=bfloat16 # or float16
TP=2
PP=1
MAX_SEQ_LEN=2048
SETTINGS=sq-0.5_tp${TP}_pp${PP}_$((MAX_SEQ_LEN / 1024))k
CKPT_DIR=./${MODEL_DIR}.${SETTINGS}.ckpt
ENGINE_DIR=./${MODEL_DIR}.${SETTINGS}
python3 TensorRT-LLM/examples/llama/convert_checkpoint.py \
  --model_dir ${MODEL_DIR} --dtype ${DTYPE} \
  --tp_size ${TP} --pp_size ${PP} \
  --smoothquant 0.5 --per_token --per_channel \
  --output_dir ${CKPT_DIR}

Expected behavior

The quantization process should succeed without trust_remote_code-related errors.

Actual behavior

The quantization process freezes at load_calib_dataset() in tensorrt_llm/models/convert_utils.py, waiting for user input on whether to enable trust_remote_code.

Additional notes

We should modify load_calib_dataset() in one of the following ways:

  1. Remove trust_remote_code from the arguments of load_calib_dataset()
  2. Change the default value of trust_remote_code to False and pass it to datasets.load_dataset()

In either case, trust_remote_code should be passed as an argument when calling load_calib_dataset() if the dataset specified by dataset_name_or_dir includes custom code.
My suggestion is to add a calib_trust_remote_code argument to the command-line options of quantization.py and other scripts in similar situations.

parser.add_argument(
    '--calib_dataset',
    type=str,
    default='cnn_dailymail',
    help="The huggingface dataset name or the local directory of the dataset for calibration."
)
parser.add_argument(
    '--calib_tp_size',
    type=int,
    default=1,
    help="Tensor parallel size for calibration; effective for NeMo checkpoint only."
)
parser.add_argument(
    '--calib_pp_size',
    type=int,
    default=1,
    help="Pipeline parallel size for calibration; effective for NeMo checkpoint only."
)

@hiroshi-matsuda-rit hiroshi-matsuda-rit added the bug Something isn't working label Dec 5, 2024
@hiroshi-matsuda-rit
Author

@Tracin I just described more detailed reproduction steps.
