Skip to content

Releases: JuliaGPU/GemmKernels.jl

v0.2.0

18 Apr 07:40
26cbc57
Compare
Choose a tag to compare

GemmKernels v0.2.0

Diff since v0.1.0

Merged pull requests:

Closed issues:

  • Errors on small array inputs (#52)
  • Feature request: support for matmul with integer matrices (#64)
  • Feature request: support Matrix{Float32} = Matrix{Float32} × Matrix{Float32} (#75)
  • Remove fragtype_a (#84)
  • Replace GPUifyLoops.@unroll (#86)
  • Use LLVMLoopInfo.jl (#104)
  • Optimizations when alpha or beta is 0 (#110)
  • Transform functions: pass values, not VecElements (#114)
  • Benchmark bot (#115)
  • Questions about usage of registers (#152)
  • A wrong function name parallellise (#177)

v0.1.0

30 Jan 17:49
abd9cba
Compare
Choose a tag to compare

GemmKernels v0.1.0

Closed issues:

  • Unable to add the Package (#48)
  • Migrate GPU CI from GitLab to Buildkite? (#54)
  • Submit code coverage information to Codecov (#55)
  • Warning: Performing scalar operations on GPU arrays when running the test suite (#62)

Merged pull requests: