You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There are a number of ops-related ToDo items related to the BaJoR-facing zoobot Azure Batch account.
Update pytorch image in zoobot container registry: We want to upgrade to Zoobot 2.0 and use its recommended combo of pytorch + CUDA (torch == 2.1.0+cu121; see repo readme). An image update will also resolve the following pool-level warning from Azure: "This pool's image is nearing its end-of-life date of Tuesday, April 22, 2025 at 19:00:00. After this date, it will not appear as an option when creating new pools. API calls to create or scale pools using the image may continue to function for up to 60 days afterwards."
Select & Update VM type: we had selected Standard_NC6s_v3 as the cheapest single GPU (didn't use the CPUs), but we now need to resolve the following warning: "Support for the NCv3-series virtual machine family will be retired for Azure Batch pools on 30 September 2025 -- You're receiving this notice because you're currently using NCv3-series virtual machine sizes with Azure Batch pools. We'll retire support for NCv3-series virtual machines on 30 September 2025. This includes Standard_NC24rs_v3, Standard_NC6s_v3, Standard_NC12s_v3, and Standard_NC24s_v3. Between now and 30 September 2025, you'll need to either migrate your Batch pools to a newer virtual machine series in the same NC product line, or migrate to a different Batch-supported virtual machine size suitable for your workload."
Upgrade to blobfuse2: based on sporadic MountConfigurationError issues, consider upgrade of the mount-to-blob library we're using to current up-to-date version. See repo and docs.
Note: configuration of blob storage directory mounting (e.g., models and predictions for prediction job) is part of the pool configuration. See this pool configuration notebook for details.
Unlike past issues related to the job preparation tasks (configured as part of job), this error stems from the mount commands set via the pool-level configuration.
The text was updated successfully, but these errors were encountered:
There are a number of ops-related ToDo items related to the BaJoR-facing
zoobot
Azure Batch account.Update pytorch image in zoobot container registry: We want to upgrade to Zoobot 2.0 and use its recommended combo of pytorch + CUDA (torch == 2.1.0+cu121; see repo readme). An image update will also resolve the following pool-level warning from Azure: "This pool's image is nearing its end-of-life date of Tuesday, April 22, 2025 at 19:00:00. After this date, it will not appear as an option when creating new pools. API calls to create or scale pools using the image may continue to function for up to 60 days afterwards."
Select & Update VM type: we had selected Standard_NC6s_v3 as the cheapest single GPU (didn't use the CPUs), but we now need to resolve the following warning: "Support for the NCv3-series virtual machine family will be retired for Azure Batch pools on 30 September 2025 -- You're receiving this notice because you're currently using NCv3-series virtual machine sizes with Azure Batch pools. We'll retire support for NCv3-series virtual machines on 30 September 2025. This includes Standard_NC24rs_v3, Standard_NC6s_v3, Standard_NC12s_v3, and Standard_NC24s_v3. Between now and 30 September 2025, you'll need to either migrate your Batch pools to a newer virtual machine series in the same NC product line, or migrate to a different Batch-supported virtual machine size suitable for your workload."
Upgrade to blobfuse2: based on sporadic
MountConfigurationError
issues, consider upgrade of the mount-to-blob library we're using to current up-to-date version. See repo and docs.models
andpredictions
for prediction job) is part of the pool configuration. See this pool configuration notebook for details.The text was updated successfully, but these errors were encountered: