[CUDA] Refactor GroupNorm and add common vectorize implementation #19158
Azure Pipelines / Linux CPU CI Pipeline (arm64_build Linux_py_Wheels_aarch64)
succeeded on Jan 26, 2024 in 34m 6s
arm64_build Linux_py_Wheels_aarch64 succeeded