v4.8.2
github-actions
released this
06 Sep 21:40
·
1122 commits
to refs/heads/master
since this release
API
- dsl: Removed dynamic classes for AbstractFunctions (fixes memory leaks seen by some users) @FabioLuporini (#2190)
- api: Fix symbolic coefficients for cross derivatives @mloubout (#2185)
- api: Allow parametric par-tile as input @FabioLuporini (#2168)
- api: Use subs for origin in the case where index is a function @mloubout (#2120)
- compiler: Add utility function to normalize sympy arguments @FabioLuporini (#2125)
Examples
Compiler
- compiler: Keep -qopenmp by default after icx 2023.2 @georgebisbas (#2164)
- compiler: Add opkwargs property to ArgumentsMap @ccuetom (#2142)
- compiler: Misc compiler fixes and improvements -- part II @FabioLuporini (#2138)
- compiler: Pass operator arguments to downstream operators @ccuetom (#2139)
- compiler: Improve lowering of IndexDerivatives @FabioLuporini (#2112)
- compiler: Misc compiler tweaks and improvements @FabioLuporini (#2136)
- compiler: Avoid generating collapse(1) @FabioLuporini (#2129)
- compiler: Patch pickling of GuardFactor and reconstruction @FabioLuporini (#2126)
- compiler: Introduce gpu-create parameter for buffers initialized on device @ccuetom (#2107)
- compiler: Change tile use in DeviceAcczier to allow multiple tile sizes @gpc1064 (#2095)
- compiler: Add host-*-pin handles; more volatile with pthreads @FabioLuporini (#2116)
- compiler: Support template parameters @FabioLuporini (#2105)
MPI
- mpi: Instrument compute0 core after specialising as ComputeCall @georgebisbas (#2143)
- mpi: Enhance flexibility for custom topologies @georgebisbas (#2134)
- mpi: Packed gathers and scatters @FabioLuporini (#2109)
GPU
Architectures and JIT
- compiler: Enable AVX512 compiler support when available. @ggorman (#2184)
- arch: Correct
march
tomcpu
for ppc @raminammour (#2174) - misc: Add deviceid to configuration and enhance switchconfig @ccuetom (#2175)
- arch: Add ICX support @georgebisbas (#2051)
🐛 Bug Fixes
- dsl: Removed dynamic classes for AbstractFunctions @FabioLuporini (#2190)
- dsl: Prevent aggregation for symbolic coefficients @mloubout (#2182)
- api: Prevent factorization for symbolic coefficients @mloubout (#2179)
- compiler: Prevent Eq dims to be lost if only implicit @mloubout (#2169)
- compiler: Fix non-arithmetic distances @mloubout (#2165)
- compiler: Prevent adding breaking guard to nokey @mloubout (#2160)
- compiler: Add guards to prevent OOB when streaming buffers with ConditionalDimension @ccuetom (#2150)
- compiler: Fix CondDim's factor auto-override @FabioLuporini (#2154)
- compiler: Fix pickling of aliasing SparseFunction @FabioLuporini (#2148)
- compiler: Revert "compiler: Relax WaitLock regions in a ScheduleTree" @FabioLuporini (#2141)
- compiler: Patch pickling of GuardFactor and reconstruction @FabioLuporini (#2126)
- compiler: Fix OpenMP reductions in tandem with linearize=True @FabioLuporini (#2117)
Testing
- misc: fix openmp= deprecation @mloubout (#2186)
- Removing AWS ondemand gh-runners from CI @ggorman (#2155)
Continuous Integration
- CI: pytest setup fix @mloubout (#2177)
- docker: add some tweaks to Nvidia docker @mloubout (#2171)
- CI: Fix asv setup @mloubout (#2167)
- CI: Fix asv devito install @mloubout (#2166)
- ci: Add python 3.11 and minor CI fixing @georgebisbas (#2158)
- docker: revamp base deployment @mloubout (#2162)
- ci: switch to concurrency settings rather than extra action @mloubout (#2119)
Installation
- docker: Add intel advisor to icx image @mloubout (#2180)
- docker: Add some tweaks to nvidia docker @mloubout (#2171)
- docker: Switch to intelpython for icc/icx build @mloubout (#2172)
- docker: Revamp base deployment @mloubout (#2162)
- pip prod(deps): update distributed requirement from <2023.7 to <2023.8 @dependabot (#2161)
- pip prod(deps): update anytree requirement from <=2.8,>=2.4.3 to >=2.4.3,<=2.9.0 @dependabot (#2152)
- pip prod(deps): update distributed requirement from <2023.6 to <2023.7 @dependabot (#2145)
- pip prod(deps): update distributed requirement from <2023.5 to <2023.6 @dependabot (#2127)
- deps: sympy 1.12 compat @mloubout (#2123)
- pip prod(deps): update distributed requirement from <2023.4 to <2023.5 @dependabot (#2118)
- reqs: Move pyrevolve to optionals and introduce testing-only reqs @georgebisbas (#2096)
- reqs: Fix for matplotlib >=3.6.3 @georgebisbas (#2047)
- install: Make mpi4py portable across Intel and AMD @FabioLuporini (#2115)
- install: Overhaul Dockerfile.amd for MPI support @FabioLuporini (#2104)
- pip prod(deps): update distributed requirement from <2023.4 to <2023.5 @dependabot (#2110)
- pip prod(deps): update ipyparallel requirement from <8.6 to <8.7 @dependabot (#2106)
New Contributors
- @gpc1064 made their first contribution in #2095
- @raminammour made their first contribution in #2174
Full Changelog: v4.8.1...v4.8.2