v3.12.2
CUDA v3.12.2
Closed issues:
- CUDA.jl cuFFT underperforming against CuPy cuFFT (#1682)
- Error during CUDA test (#1718)
- Kernel error from bad broadcast (should be regular error?) (#1720)
- Freeze into StackOverflow when
JULIA_DEBUG=CUDA
set (#1721) - Use of linear operators in CUDA.jl (#1727)
- Is block-spmm supported? (#1736)
Merged pull requests:
- Allow
copy(::RNG)
(#1719) (@mcabbott) - Update manifest (#1722) (@github-actions[bot])
- Simplify CuError rendering before library initialization. (#1723) (@maleadt)
- Simplify CuError rendering before library initialization (master branch version) (#1724) (@maleadt)
- Make device RNG test more robust. (#1725) (@maleadt)
- Rely on LLVM.jl's typed_ccall for more intrinsics. (#1728) (@maleadt)
- Backports for 3.13 (#1729) (@maleadt)
- Simplify CUBLAS and CUSPARSE wrappers, reducing code generated. (#1730) (@maleadt)
- Add Julia 1.9 CI. (#1731) (@maleadt)
- Use released dependencies. (#1732) (@maleadt)
- Remove NVTX. (#1733) (@maleadt)
- Introduce cuFFT plan cache; switch to auto-managed memory. (#1734) (@maleadt)
- Stop pirating GPUArrays' RNG methods. (#1735) (@maleadt)