WIP: Interleave modular transform processing by hjanuschka · Pull Request #796 · libjxl/jxl-rs

hjanuschka · 2026-06-08T12:14:27Z

WIP follow-up to #795.

This tries the deeper path for #782: for safe gridded Modular frames, decode a group and immediately run any modular transforms that became ready, instead of decoding all groups before transform processing. This lets large squeeze images free intermediates during decode.
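The interleaving idea can be illustrated with a small dependency-driven scheduler. This is only a sketch under assumptions, not the jxl-rs implementation: `Step`, `Interleaver`, and `on_decoded` are hypothetical names, and buffers are represented by plain ids rather than pixel data. Each time a decoded buffer arrives, every transform step whose inputs are all available runs immediately, and an input is dropped once its last consumer has run.

```rust
use std::collections::{HashMap, HashSet};

/// Hypothetical transform step: reads some buffers, produces one buffer.
pub struct Step {
    pub inputs: Vec<usize>,
    pub output: usize,
}

/// Hypothetical scheduler that runs steps as soon as their inputs exist.
pub struct Interleaver {
    steps: Vec<Step>,
    done: Vec<bool>,
    /// How many not-yet-run steps still read each buffer.
    uses_left: HashMap<usize, usize>,
    /// Ids of buffers currently held in memory.
    live: HashSet<usize>,
    /// Largest number of buffers that were live at the same time.
    pub peak_live: usize,
}

impl Interleaver {
    pub fn new(steps: Vec<Step>) -> Self {
        let mut uses_left = HashMap::new();
        for s in &steps {
            for &i in &s.inputs {
                *uses_left.entry(i).or_insert(0) += 1;
            }
        }
        let done = vec![false; steps.len()];
        Self { steps, done, uses_left, live: HashSet::new(), peak_live: 0 }
    }

    /// Register a freshly decoded buffer, then run every step that became ready.
    pub fn on_decoded(&mut self, buf: usize) {
        self.live.insert(buf);
        self.peak_live = self.peak_live.max(self.live.len());
        loop {
            // Find one step that has not run and whose inputs are all live.
            let ready = (0..self.steps.len()).find(|&k| {
                !self.done[k] && self.steps[k].inputs.iter().all(|i| self.live.contains(i))
            });
            let Some(k) = ready else { break };
            self.done[k] = true;
            let output = self.steps[k].output;
            let inputs = self.steps[k].inputs.clone();
            self.live.insert(output);
            self.peak_live = self.peak_live.max(self.live.len());
            // Release each input once its last consumer has run.
            for i in inputs {
                let left = self.uses_left.get_mut(&i).expect("input was counted in new()");
                *left -= 1;
                if *left == 0 {
                    self.live.remove(&i);
                }
            }
        }
    }

    pub fn live_count(&self) -> usize {
        self.live.len()
    }
}
```

With four decoded buffers feeding two first-level steps and one final step, feeding the buffers one at a time leaves a single live buffer at the end. A batched variant that first decodes everything and only then runs the steps would hold all four decoded buffers plus the first intermediate at once.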

The optimization is gated on four conditions: the decode is a straight, non-flush Modular decode; no flush has happened yet; the outputs are gridded; and the frame is sufficiently tiled. Flush/progressive and small-frame paths keep the existing batched behavior.
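The gating described above amounts to a conjunction of checks. The following is a minimal sketch with hypothetical names (`DecodeState`, `can_interleave`); the real field names and the actual threshold for "sufficiently tiled" are not given in this PR description, so `min_groups` is an assumed parameter.

```rust
/// Hypothetical snapshot of the decoder state relevant to the gate.
#[derive(Clone, Copy, Debug)]
pub struct DecodeState {
    /// The frame uses Modular encoding.
    pub is_modular: bool,
    /// The current decode call is a flush / progressive render.
    pub flush_requested: bool,
    /// A flush already happened earlier for this frame.
    pub flushed_before: bool,
    /// All outputs of the transform chain are gridded.
    pub outputs_gridded: bool,
    /// Number of groups (tiles) in the frame.
    pub num_groups: usize,
}

/// Returns true when the interleaved path may be used.
/// `min_groups` is an assumed tuning parameter, not a value taken from the PR.
pub fn can_interleave(s: &DecodeState, min_groups: usize) -> bool {
    s.is_modular
        && !s.flush_requested
        && !s.flushed_before
        && s.outputs_gridded
        && s.num_groups >= min_groups
}
```

Any state that fails one of the checks falls back to the batched path, which matches the statement that flush/progressive and small-frame decodes are unchanged.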

Peak RSS on the repro:

| decoder | peak RSS |
|---|---|
| jxl-rs main | 2739 MiB |
| #795 | 1198 MiB |
| this branch | 333 MiB |
| jxl-oxide 0.12.6 | 421 MiB |
| djxl -> PPM | 1657 MiB |

This avoids the global scratch-pool cap from #795 by only skipping center-buffer caching while interleaved modular output is actively feeding the low-memory pipeline. Normal scratch-buffer reuse is left unchanged.
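As a rough illustration of "skip caching only while interleaved output is active", consider a cache with a mode flag. `CenterCache`, `set_interleaved_active`, and `offer` are hypothetical names chosen for this sketch and are not taken from jxl-rs.

```rust
/// Hypothetical cache of center group buffers kept around for reuse.
pub struct CenterCache {
    cached: Vec<Vec<f32>>,
    interleaved_active: bool,
}

impl CenterCache {
    pub fn new() -> Self {
        Self { cached: Vec::new(), interleaved_active: false }
    }

    /// Toggle the low-memory mode used while interleaved output is being fed.
    pub fn set_interleaved_active(&mut self, active: bool) {
        self.interleaved_active = active;
    }

    /// Offer a buffer to the cache. Returns true if it was retained.
    /// While interleaved processing is active the buffer is dropped instead,
    /// so its memory is released right away.
    pub fn offer(&mut self, buf: Vec<f32>) -> bool {
        if self.interleaved_active {
            drop(buf);
            false
        } else {
            self.cached.push(buf);
            true
        }
    }

    pub fn cached_len(&self) -> usize {
        self.cached.len()
    }
}
```

Outside the interleaved mode the cache behaves as before, which is the point of this approach compared with a global cap.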

veluca93 · 2026-06-08T12:38:54Z

I had a different approach in mind that might be simpler and more effective during progressive renders. I will give that some thought and hopefully write something up in the next day or two :-)

Inverse squeeze steps read neighbor grids (next average and previous decoded) that the transform graph counts as buffer uses, but the per-step code never released them, so those intermediate modular buffers stayed allocated for the whole frame. Mark them used on the final render so they are freed once consumed.
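The leak described here is a use-count mismatch: the graph counts a neighbor read as a use, but nothing decrements the count afterwards. A minimal sketch of the bookkeeping, with the hypothetical type `CountedBuffer` standing in for a modular buffer:

```rust
/// Hypothetical buffer that frees its data once every counted use has happened.
pub struct CountedBuffer {
    data: Option<Vec<i32>>,
    remaining_uses: usize,
}

impl CountedBuffer {
    /// `uses` is the number of reads the transform graph expects.
    pub fn new(data: Vec<i32>, uses: usize) -> Self {
        Self { data: Some(data), remaining_uses: uses }
    }

    /// Record one completed read; release the data after the last one.
    pub fn mark_used(&mut self) {
        self.remaining_uses = self.remaining_uses.saturating_sub(1);
        if self.remaining_uses == 0 {
            self.data = None;
        }
    }

    pub fn is_allocated(&self) -> bool {
        self.data.is_some()
    }
}
```

If a buffer is counted for two reads (its own step plus a neighbor read) and only one of them calls `mark_used`, it stays allocated indefinitely. Marking the neighbor read as well lets it be freed.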

veluca93

approving for benchmark purposes ;-)

veluca93 · 2026-06-10T11:24:20Z

Performance Summary (Commit `a42d5b2`)

Machine	Threading	Base MP/s	PR MP/s	Avg Improvement

Detailed per-image results

hjanuschka mentioned this pull request Jun 8, 2026

Reduce memory use for progressive lossless images #795

Open

hjanuschka added 6 commits June 10, 2026 12:38

Avoid storing unused partial renders

ed2c75b

Only retain progressive render snapshots in the CLI when they can be written to an output.

Bound render pipeline scratch buffer pool

0eecd0a

Modular frames never reclaim center group buffers via get_buffer, so the scratch pool grew to a full-frame copy that was retained for the pipeline's lifetime. Cap it to the few buffers sequential rendering can actually reuse.
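A capped pool of this kind might look like the sketch below. `ScratchPool` and its methods are hypothetical, and the cap value is an assumption; the commit message only says "the few buffers sequential rendering can actually reuse".

```rust
/// Hypothetical scratch-buffer pool that retains at most `cap` buffers.
pub struct ScratchPool {
    free: Vec<Vec<f32>>,
    cap: usize,
}

impl ScratchPool {
    pub fn new(cap: usize) -> Self {
        Self { free: Vec::new(), cap }
    }

    /// Take a zeroed buffer of `len` elements, reusing a pooled one if possible.
    pub fn get(&mut self, len: usize) -> Vec<f32> {
        match self.free.pop() {
            Some(mut b) => {
                b.clear();
                b.resize(len, 0.0);
                b
            }
            None => vec![0.0; len],
        }
    }

    /// Return a buffer. Once the pool is full the buffer is dropped instead,
    /// so the pool cannot grow into a full-frame copy.
    pub fn put(&mut self, buf: Vec<f32>) {
        if self.free.len() < self.cap {
            self.free.push(buf);
        }
    }

    pub fn retained(&self) -> usize {
        self.free.len()
    }
}
```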

Interleave modular transform processing

86accce

Run safe gridded modular transforms during decode so large lossless frames can free squeeze intermediates as soon as their dependencies are ready.

Revert "Bound render pipeline scratch buffer pool"

2163e3f

This reverts commit 6489571.

Skip interleaved center buffer caching

a42d5b2

Avoid retaining modular center group buffers while interleaved processing is feeding the low-memory pipeline, without changing the normal scratch-buffer reuse path.

veluca93 force-pushed the experiment/issue-782-interleave-modular branch from 35bbb98 to a42d5b2 Compare June 10, 2026 10:38

veluca93 reviewed Jun 10, 2026

veluca93 approved these changes Jun 10, 2026

hjanuschka wants to merge 6 commits into libjxl:main from hjanuschka:experiment/issue-782-interleave-modular
