Changes from main backport to the branch #95

Merged
merged 43 commits into from
Nov 25, 2024

shoguevara
Collaborator

No description provided.

iglunchadze and others added 30 commits October 28, 2024 10:17
…ting README.md for k8s-training and k8s-inference for well known limitations and csi-driver how-to-use guide.
MSP-3313: Add NSYS profiling in GPT3 mlperf implementation
Add configs for H200 nodes to GPT3 training implementation
fix public ip allocation for gpu nodes
Update README for training in Kubernetes after proofreading
…nference

Adding csi-driver-mounted-fs solution to support k8s-inference + upda…
nccl_use_infiniband true and nccl_benchmark_min_threshold 45
@shoguevara shoguevara had a problem deploying to project-e00pjzzrtk1fs3yavy November 25, 2024 14:17 — with GitHub Actions Error
@shoguevara shoguevara merged commit dec39b4 into feature/ipip-tunnel-routing Nov 25, 2024
1 check failed
@shoguevara shoguevara had a problem deploying to project-e00pjzzrtk1fs3yavy December 8, 2024 16:02 — with GitHub Actions Failure
9 participants