Releases: JuliaGPU/AMDGPU.jl
v0.5.3
AMDGPU v0.5.3
Closed issues:
- AMDGPU.jl master is broken on Julia 1.7 (#372)
- Failure calling upon calling Enzyme `autodiff_deferred` (#444)
- Segmentation fault on `hipStreamDestroy` (#449)
- Setting HIP_VISIBLE_DEVICES to an invalid ID fails in an unhelpful way (#450)
- hipErrorSharedObjectInitFailed (#451)
- Unexpected error: ccall requires compiler when using QR (#461)
Merged pull requests:
- Add `AMDGPU.@sync` macro (#454) (@luraess)
- Add rocSOLVER routines (#456) (@pxl-th)
- Add missing HIP error code (#457) (@pxl-th)
- Add env variable if Navi 2 detected (#458) (@pxl-th)
- Update docs (#459) (@pxl-th)
- Update doc (#460) (@luraess)
- blas: Improve error on missing rocBLAS (#462) (@jpsamaroo)
- rocSPARSE support (#463) (@pxl-th)
- Check libraries are functional once during init (#464) (@pxl-th)
v0.5.2
v0.5.1
AMDGPU v0.5.1
Closed issues:
- Implement Neural Network primitives (#11)
- [Mark/Wait] Use HIP events to do fine-grained sync (#127)
- Implement memory reclaim mechanism similar to CUDA's (#134)
- NNlibAMDGPU.jl ? (#143)
- Deprecation warning `unsafe_length()` (#183)
- Test suite failures due to segfaults on Julia 1.8 (#261)
- HSA memory region query test fail (#275)
- ROCBlas support for gfx1031, 1032, and 1033 (#314)
Merged pull requests:
v0.5.0
AMDGPU v0.5.0
Closed issues:
- Test failures locally on 1.9.0-beta4 -- Radeon 6800XT (#400)
- Update HIP errors codes (#404)
- Optimize `wait!` for HSA kernel launches (#405)
- rocBLAS synchronization issue? (#418)
- First install with `JULIA_AMDGPU_DISABLE_ARTIFACTS` leads to broken config (#424)
- Cannot `unsafe_wrap` a device array if `lock=false` (#436)
Merged pull requests:
- Use HIP as kernel backend instead of HSA (#423) (@pxl-th)
- fix(docs): Wrong symbol in `functional` docs (#431) (@kunzaatko)
- Update to GPUCompiler 0.21 & LLVM 6 (#437) (@pxl-th)
- Fix docs for HIP (#439) (@luraess)
- Run tests on multiple workers again (#441) (@pxl-th)
- Specialize ROCArray on buffer type (#442) (@pxl-th)
v0.4.15
AMDGPU v0.4.15
Merged pull requests:
v0.4.14
AMDGPU v0.4.14
Closed issues:
- Switching to device ≠ 1 hangs on multi-GPU node (#425)
- @ROCDynamicLocalArray: add support for dynamic eltype and expressions for dims (#428)
Merged pull requests:
- Fix host synchronization (#417) (@pxl-th)
- Add device selection in current task by ID (#420) (@luraess)
- Declare compatibility with `LLVM_jll` 15 (#426) (@giordano)
- Remove buggy uses of default_device (#427) (@jpsamaroo)
- at-ROC*LocalArray: Escape arguments (#430) (@jpsamaroo)
v0.4.13
AMDGPU v0.4.13
Merged pull requests:
v0.4.12
v0.4.11
v0.4.10
AMDGPU v0.4.10
Merged pull requests: