Skip to content

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #12810

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #12810

Re-run triggered January 21, 2025 19:57
Status Success
Total duration 7m 34s
Artifacts

nv-accelerate-v100.yml

on: pull_request
Fit to window
Zoom out
Zoom in