NVIDIA® GPU Operator helps you provision GPUs in Kubernetes clusters. Using the operator pattern to extend Kubernetes, NVIDIA GPU Operator automatically manages the software components needed to provision GPUs, such as the NVIDIA drivers (to enable CUDA), the Kubernetes device plugin for GPUs, the NVIDIA Container Toolkit, automatic node labelling with GPU Feature Discovery (GFD), DCGM-based monitoring, and others.
NVIDIA® GPU Operator automates GPU setup in Kubernetes clusters.
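Once the operator has deployed the driver, container toolkit, and device plugin, workloads request GPUs through the `nvidia.com/gpu` resource. Below is a minimal, non-authoritative sketch of such a pod; the pod name and CUDA image tag are placeholder examples, not part of this application.

```bash
# Sketch: a pod that requests one GPU via the device plugin managed by the
# GPU Operator. The pod name and image tag are examples only.
cat <<EOF | kubectl apply -f -
apiVersion: v1
kind: Pod
metadata:
  name: cuda-smoke-test
spec:
  restartPolicy: OnFailure
  containers:
  - name: cuda
    image: nvidia/cuda:12.4.1-base-ubuntu22.04
    command: ["nvidia-smi"]
    resources:
      limits:
        nvidia.com/gpu: 1
EOF
# Once the pod completes, `kubectl logs cuda-smoke-test` should show the nvidia-smi output.
```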
- Configure the application:
  - RDMA: select this option to enable GPUDirect RDMA and speed up data exchange between GPUs. Selecting this option is recommended (for the equivalent Helm setting, see the sketch after these steps).
- Click Install.
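The installation form drives the Helm chart referenced in the license note below. As a non-authoritative sketch, a manual installation with the RDMA option enabled would look roughly like the following; the release name and namespace are arbitrary examples, and `driver.rdma.enabled` is assumed to be the chart value that toggles GPUDirect RDMA.

```bash
# Sketch only: manual installation of the NVIDIA GPU Operator chart with
# GPUDirect RDMA enabled. Release name and namespace are examples.
helm repo add nvidia https://helm.ngc.nvidia.com/nvidia
helm repo update
helm install gpu-operator nvidia/gpu-operator \
  --namespace gpu-operator --create-namespace \
  --set driver.rdma.enabled=true
```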
To check that the NVIDIA GPU Operator is working:
- Install kubectl and configure it to work with the created cluster.
- Check that NVIDIA GPU Operator pods are running:

  kubectl get pods -n <namespace>
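Beyond the pod status, you can confirm that the node advertises GPU resources and carries the labels applied by GPU Feature Discovery. This is a sketch of such checks; `<node-name>` is a placeholder for one of the cluster's GPU nodes.

```bash
# GPU capacity and allocatable resources advertised by the device plugin:
kubectl describe node <node-name> | grep "nvidia.com/gpu"
# Node labels added by GPU Feature Discovery (GFD):
kubectl get node <node-name> --show-labels | tr ',' '\n' | grep "nvidia.com"
```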
- Automating management of GPU software components in Kubernetes clusters.
- Scaling GPU deployments in Kubernetes.
By using the application, you agree to the terms and conditions of the Helm chart and NVIDIA GPU Operator licenses.