Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #4029

Annotations

2 errors

unit-tests

failed Jan 22, 2025 in 6h 0m 13s