Skip to content

Latest commit

 

History

History
101 lines (63 loc) · 4.69 KB

README.md

File metadata and controls

101 lines (63 loc) · 4.69 KB

Nebius package for Jupyterhub

Description

JupyterHub is a multi-user server for Jupyter notebooks, providing a collaborative environment for data science, machine learning, and scientific computing. This Kubernetes-compatible application includes PyTorch, a popular deep learning framework, and CUDA support for GPU acceleration. JupyterHub allows organizations to serve computational environments to multiple users, making it ideal for teams, academic courses, and research labs. PyTorch enables efficient tensor computations and dynamic neural networks, while CUDA support leverages NVIDIA GPUs for accelerated computing. You can deploy JupyterHub with PyTorch and CUDA in your Nebius AI Managed Service for Kubernetes clusters.

Short description

Multi-user JupyterHub with PyTorch & CUDA, deployable on Kubernetes.

Use cases

  • Data preprocessing and exploration for machine learning projects
  • Collaborative research in academic and scientific environments
  • Training and fine-tuning deep learning models using PyTorch
  • Accelerated computing tasks using NVIDIA GPUs
  • Interactive coding and visualization for data analysis
  • Educational purposes, such as teaching programming or data science courses
  • Prototyping and experimenting with machine learning algorithms
  • Sharing and reproducing research results in computational sciences
  • Running distributed machine learning workloads across multiple nodes
  • Developing and testing AI applications in a scalable environment

Links

JupyterHub website JupyterHub on GitHub JupyterHub documentation PyTorch website PyTorch documentation NVIDIA CUDA Toolkit

Legal

By using the application, you agree to their terms and conditions: the helm-chart, Jupyter Hub, CUDA Toolkit and PyTorch.

Tutorial

{% note info %}

Each node in the group should have at least 8 GB of RAM.

To use GPU create GPU node group and install NVIDIA® GPU Operator

{% endnote %}

Install and configure the product:

  1. Configure the application:

    1. JupyterHub accessibility: Select a way to access the JupyterHub cluster:

      • ClusterIP for access by port-forwarding.
      • LoadBalancer for access from the Internet by IP or by port-forwarding.

    {% note warning %}

    LoadBalancer uses one IP address from the Number of public IP addresses per cloud quota.

    {% endnote %}

  2. Click Install.

  3. Wait for the application to change its status to Deployed.

Usage

Depending on the way to access the cluster you have chosen while configuring the application, get the IP or use port-forwarding to access the JupyterHub UI:

{% list tabs %}

  • LoadBalancer

    1. Install and configure kubectl, to get the IP using the following command:
    kubectl describe svc proxy-public -n <namespace> | grep "LoadBalancer Ingress:"
    1. Go to http://<ip> in your browser. On the login screen, specify the user as admin and enter a password that will be saved as the admin's password.
  • ClusterIP

    1. Set up port forwarding (you will need to install and configure kubectl for that):

      kubectl --namespace=<namespace> port-forward service/proxy-public 8080:http
    2. Go to http://localhost:8080 in your browser. On the login screen, specify the user as admin and enter a password that will be saved as the admin's password.

{% endlist %}

To create and manage users in JupyterHub:

  1. In the top menu, click FileHub Control Panel.
  2. Click Admin.
  3. Click Add Users.
  4. Enter usernames, one per line. To make the new users admins and allow them to manage users, select the Admin checkbox.
  5. Click Add Users.

When adding users, you will not create passwords for them. A new user creates their password themselves when they first log into JupyterHub.