Releases: EnzymeAD/Reactant.jl
Releases · EnzymeAD/Reactant.jl
v0.2.9
v0.2.8
Reactant v0.2.8
Merged pull requests:
- Add IFRT C-API bindings (#178) (@mofeing)
- Reactant interpreter overload (#259) (@wsmoses)
- First batch of shortcuts to HLO ops (#267) (@mofeing)
- fix attribute for arrray of bools (#279) (@Pangoraw)
- enable new optimizations (#280) (@Pangoraw)
- gensym in
@code_hlo(#281) (@jumerckx) - test for traced_type where array isn't captured by type param (#282) (@jumerckx)
- fix: don't print "Module" (#283) (@avik-pal)
- docs: nicer documentation setup (#284) (@avik-pal)
- add check for null in
show(::Operation). (#285) (@jumerckx) - build(deps): bump codecov/codecov-action from 4 to 5 (#286) (@dependabot[bot])
- Format code of branch "main" (#289) (@github-actions[bot])
- CompatHelper: add new compat entry for EnzymeCore at version 0.8, (keep existing compat) (#290) (@github-actions[bot])
- chore: specify sources for local dev (#292) (@avik-pal)
- feat: define aos_to_soa (#294) (@avik-pal)
- add fill to TracedRNumber (#295) (@glou-nes)
- Format code of branch "main" (#296) (@github-actions[bot])
- fix: vararg compilation (#299) (@avik-pal)
- ci: add downgrade testing (#300) (@avik-pal)
- Fix tracedarray (#302) (@wsmoses)
- fix
create_result(::Array)for non-vectors (#304) (@mofeing) - implement
collectforTracedRArray(#305) (@mofeing) - fix
YaoBlocks.matfor rotation gates withTracedRNumberparameters (#306) (@mofeing) - use row major when building attributes (#307) (@Pangoraw)
- Format code of branch "main" (#309) (@github-actions[bot])
- Speed up invoke usage (#313) (@wsmoses)
- specify aarch64-linux toolchain for Yggdrasil build (#316) (@mofeing)
- add optional debug info to Ops calls (#317) (@mofeing)
- Implement NNlib.∇conv_data! and NNlib.∇conv_filter! (#318) (@Pangoraw)
- Ops remaining basic numerical test (#320) (@glou-nes)
- fix: preserve IOContext for printing (#322) (@avik-pal)
- fix: #196 (#323) (@glou-nes)
- [JLL] Add conversion from llvm to mlir (#324) (@wsmoses)
Closed issues:
- IFRT tracking issue (#176)
- Abstract interpretation over
ReactantInterpretersometimes skips overriden methods (#189) - crash with parameter less function (#196)
- Mismatch computing
Enzyme.gradientbetween regular array andConcreteRArray(#236) - Add downgrade testing CI (#271)
- Inconsistent result for
iszerofor booleans (#277) - Implement
fillforTracedRNumber(#288) - compilation failure of getindex with a type-assert (#293)
v0.2.7
v0.2.6
Reactant v0.2.6
Merged pull requests:
- feat: functionalities for supporting NeuralOperators.jl (#217) (@avik-pal)
- CompatHelper: add new compat entry for AbstractFFTs in [weakdeps] at version 1, (keep existing compat) (#229) (@github-actions[bot])
- feat: overload
mul!(#230) (@avik-pal) - feat: extend
mul!work for subarrays (#231) (@avik-pal) - fix abs of complex number (#239) (@Pangoraw)
- feat: support setindex with views (#240) (@avik-pal)
- fix: bypass segfault with fill complex (#245) (@avik-pal)
- feat: add support for
clampandclamp!(#247) (@avik-pal) - fix: reduction of complex arrays (#248) (@avik-pal)
- feat: add support for NNlib make causal mask (#249) (@avik-pal)
- feat: partial NNlib.gather support + better indexing support (#252) (@avik-pal)
- Add 32bit CI (#254) (@wsmoses)
- implement
@tracefor (#255) (@Pangoraw) - Format code of branch "main" (#257) (@github-actions[bot])
- feat: more coverage for NNlib functions (#258) (@avik-pal)
- feat: inherit scalar indexing functionality from GPUArraysCore (#268) (@avik-pal)
- Disable verifier in main pass manager pipeline (#269) (@Pangoraw)
- chore: bump ReactantCore version (#270) (@avik-pal)
- fix: add GPU backend as default if available (#272) (@avik-pal)
- Use gensym-ed variable in
@compileand@jit(#274) (@jumerckx)
Closed issues:
YaoBlocks.cunmathas problems during tracing (#174)- Lux example MLP with TrainState API throws JIT session errors (#220)
- Use
GPUArraysCorefor scalar indexing flags (#232) - Reactant fails to compile a function accepting complex parameters (#235)
- Name clash when calling
Reactant.@compileover a function that is namedf(#237) getindexassumes static indexing (#243)mlirDenseElementsAttrSplatGetwith ComplexNumbers lead to a SegFault (#244)- conv! (#251)
- 32-Bit CI (#253)
v0.2.5
Reactant v0.2.5
Merged pull requests:
v0.2.4
Reactant v0.2.4
Merged pull requests:
- Move Julia bindings to MLIR dialects out of JLL (#166) (@mofeing)
- Implement
conj,conj!forTracedRArray(#169) (@mofeing) - feat: allow passing a optimize option to MLIR (#171) (@avik-pal)
- Format code of branch "main" (#175) (@github-actions[bot])
- Regenerate MLIR Dialects (#177) (@github-actions[bot])
- Update Project.toml (#179) (@wsmoses)
- Fix
similaronTracedRArray(#184) (@mofeing) - feat: support conditionals (#185) (@avik-pal)
- Improve GitHub Actions settings (#186) (@giordano)
- build(deps): bump codecov/codecov-action from 1 to 4 (#187) (@dependabot[bot])
- build(deps): bump peter-evans/create-pull-request from 6 to 7 (#188) (@dependabot[bot])
- Add needs primal for forward mode (#192) (@wsmoses)
- fix primitive_type for complex (#193) (@Pangoraw)
- Add jit macro for simplified testing (#194) (@wsmoses)
- complex DenseElementsAttribute (#195) (@Pangoraw)
- Make libMLIR_h automatically regenerate (#197) (@Pangoraw)
- prevent docs warning (#199) (@Pangoraw)
- update libMLIR_h bindings and fix bindings update step n°2 (#202) (@Pangoraw)
- Refactor regenerate-mlir-bindings.yml workflow (#204) (@mofeing)
- feat: support tracing scalars (#205) (@avik-pal)
- Add C-API for constructing Complex Attributes (#208) (@mofeing)
- feat: allow type-casting numbers to tracednumbers (#209) (@avik-pal)
- feat: specialize dispatches for faster concrete array generation (#213) (@avik-pal)
- feat: add
syncto synchonize all result buffers (#216) (@avik-pal) - fix complex attr c-api (#218) (@mofeing)
- fix resource exhaustion bug #190 (#219) (@mofeing)
- Refactor
mlirComplexAttrDoubleGet,mlirComplexAttrDoubleGetChecked(#221) (@mofeing) - feat: allow conversion to numbers (#222) (@avik-pal)
- Regenerate MLIR Bindings (#223) (@github-actions[bot])
- refactor: use
@jitin the tests (#224) (@avik-pal) - feat: make compile and code_hlo macros work with broadcasting (#225) (@avik-pal)
Closed issues:
- Shortcuts for
to_rarray(#138) - Infinte recursion when compiling a function (#168)
ConcreteRArrayonComplexarray zeroes the buffer (#170)XlaRuntimeErrorwhen passing aComplexbuffer to a compiled function (#172)- Forward-mode differentiation is broken (#183)
- Resource exhaustion when compiling many functions (#190)
- Error on regenerating "libMLIR_h.jl" on GitHub runner (#201)
v0.2.3
Reactant v0.2.3
Merged pull requests:
- Export
GetDefaultTargetTriplesymbol (#160) (@mofeing) - feat: implement a separate
TracedRNumber(#161) (@avik-pal) - Generalize
Base._catto non-Val, typedBase._cat_tand implementtyped_hcat,typed_vcat,typed_hvcat,typed_hvncat(#163) (@mofeing) - Bypass identical construction on
TracedRArray(#165) (@mofeing) - test: separate out test groups (#167) (@avik-pal)
Closed issues:
v0.2.2
Reactant v0.2.2
Merged pull requests:
- feat: compiling vision transformers (#130) (@avik-pal)
- feat: robust handling of wrapped arrays of reactant arrays (#133) (@avik-pal)
- Replace
getfieldfortraced_getfieldinset!(#134) (@mofeing) - perf: benchmarking CI (#136) (@avik-pal)
- Add fallback gpu setindex (#137) (@wsmoses)
- Bump dependencies (#143) (@wsmoses)
- Build (#145) (@wsmoses)
- feat: generalize indexing to all wrappers (#146) (@avik-pal)
- Arm64 (#147) (@wsmoses)
- Format code of branch "main" (#153) (@github-actions[bot])
- fix: broadcasted type casting (#156) (@avik-pal)
- Fix vararg (#158) (@wsmoses)
Closed issues:
- Handling exponents (#27)
- Incosistent semantics of element-wise application of array functions compared to Julia (#51)
- NNlib activations cannot be compiled without manual dispatches (#54)
- Linear algebra on
TracedRArray(#131) - scalar broadcasting not functional (#135)
libdevicenot found (#149)- Vararg compilation broken (#151)
- Can I compile in-place functions? (#152)
Optimisers.jlfails to compile (#154)- Type-casting doesn't compile (#155)
- Unbreak Reactant benchmark CI (#157)
v0.2.1
Reactant v0.2.1
Merged pull requests:
- Format code of branch "main" (#101) (@github-actions[bot])
- Compile NNlib.maxpool and NNlib.meanpool (#102) (@Pangoraw)
- feat: handle cat/hcat/vcat (#103) (@avik-pal)
- feat: implementing view/getindex/setindex (#104) (@avik-pal)
- perf: test compile times for neural networks -- VGG16 (#105) (@avik-pal)
- fix: local JLL build script (#108) (@avik-pal)
- NNlib: forward feature group counts to convolution (#109) (@Pangoraw)
- tests: more comprehensive NN testing (#110) (@avik-pal)
- ci(github-actions): update the formatter action (#111) (@avik-pal)
- Format code of branch "main" (#112) (@github-actions[bot])
- Format code of branch "main" (#113) (@github-actions[bot])
- feat: allow building for CPU target on CUDA-enabled machines (#114) (@avik-pal)
- NNlib: handle kernel flip with reverse instead of window_reversal (#115) (@Pangoraw)
- Format code of branch "main" (#116) (@github-actions[bot])
- Configure CompatHelper workflow (#118) (@mofeing)
- chore: testing Enzyme 0.13 (#119) (@avik-pal)
- Fix non-used argument (#123) (@wsmoses)
- Fix typo in
mlirIsNullcall (#124) (@mofeing) - Format code of branch "main" (#125) (@github-actions[bot])
- Refactor
compileand test buffer donation (#126) (@mofeing) - Replace
IdDictforOrderedIdDict(#128) (@mofeing) - CompatHelper: add new compat entry for OrderedCollections at version 1, (keep existing compat) (#129) (@github-actions[bot])
Closed issues:
v0.2.0
Reactant v0.2.0
Merged pull requests:
- Move shape information of
RArrayto field (#58) (@mofeing) - Refactor
traced_typeto allow customization for external types (#59) (@mofeing) - Despecialize some methods to lower excessive codegen (#61) (@mofeing)
- Add benchmarking infrastructure (#65) (@mofeing)
- Tpu (#66) (@wsmoses)
- Refactor compilation (#68) (@mofeing)
- Format code of branch "main" (#69) (@github-actions[bot])
- Format code of branch "main" (#70) (@github-actions[bot])
- ci(buildkite): fix scripts (#71) (@avik-pal)
- Fix
permutedimsonTracedRArray(#72) (@mofeing) - Fix broadcasting on
ConcreteRArray(#73) (@mofeing) - Small fixes on abstract interpreter (#74) (@mofeing)
- Try hermetic cuda (#75) (@wsmoses)
- Implement
create_resultmethod forAbstractDict(#76) (@mofeing) - Implement
onlyforTracedRArray(#77) (@mofeing) - Reuse code cache in abstract interpretation to minimize
compiletime (#78) (@mofeing) - TPU index setup (#81) (@wsmoses)
- Format code of branch "main" (#82) (@github-actions[bot])
- Export
LoadPjrtPlugin,InitializePjrtPlugin,GetCApiClientfunctions (#84) (@mofeing) - Compile NNlib convolution (#87) (@Pangoraw)
- Format code of branch "main" (#89) (@github-actions[bot])
- Cleanup api (#90) (@wsmoses)
- Imrpove FFI (#91) (@wsmoses)
- Format code of branch "main" (#92) (@github-actions[bot])
- Fix libtpu auto download (#93) (@wsmoses)
- CUDA build local (#94) (@wsmoses)
- Format code of branch "main" (#95) (@github-actions[bot])
- Add back support for buffer donation (#96) (@mofeing)
Closed issues: