Reduce memory use for progressive lossless images by hjanuschka · Pull Request #795 · libjxl/jxl-rs

hjanuschka · 2026-06-08T09:42:56Z

Fixes #782.

The inverse squeeze transforms read two neighbor grids per step (the next average and the previously decoded output), which the transform graph counts as buffer uses, but do_run only released the primary inputs. So those intermediate modular buffers stayed allocated for the whole frame. This releases the neighbor grids on the final render, mirroring how the graph registers them (with a dedup guard for when a coarser average channel maps the neighbor onto the same grid index).
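The release rule described above can be sketched as a small use-counting model. This is only an illustration, not the jxl-rs code: `Grid`, `SqueezeStep`, `BufferSet`, `register`, and `release` are hypothetical names, and the sketch assumes the graph counts one use per distinct grid index a step reads.

```rust
/// Hypothetical stand-in for one intermediate modular buffer.
struct Grid {
    data: Option<Vec<i32>>, // None once the buffer has been freed
    remaining_uses: usize,  // uses registered by the (hypothetical) graph
}

/// Hypothetical description of what one inverse squeeze step reads.
#[derive(Clone, Copy)]
struct SqueezeStep {
    primary: [usize; 2],     // primary inputs (average and residual)
    next_avg: Option<usize>, // neighbor grid: next average
    prev_out: Option<usize>, // neighbor grid: previously decoded output
}

impl SqueezeStep {
    /// Grid indices read by this step. A neighbor is listed once even when
    /// both neighbors resolve to the same grid index (the dedup guard).
    fn reads(&self) -> Vec<usize> {
        let mut v = self.primary.to_vec();
        if let Some(n) = self.next_avg {
            v.push(n);
        }
        if let Some(p) = self.prev_out {
            if self.next_avg != Some(p) {
                v.push(p);
            }
        }
        v
    }
}

struct BufferSet {
    grids: Vec<Grid>,
}

impl BufferSet {
    fn new(count: usize, len: usize) -> Self {
        let grids = (0..count)
            .map(|_| Grid { data: Some(vec![0; len]), remaining_uses: 0 })
            .collect();
        BufferSet { grids }
    }

    /// Graph construction: count one use per grid the step reads.
    fn register(&mut self, step: &SqueezeStep) {
        for i in step.reads() {
            self.grids[i].remaining_uses += 1;
        }
    }

    /// Final render: release exactly what `register` counted, freeing a
    /// grid as soon as its last use is consumed.
    fn release(&mut self, step: &SqueezeStep) {
        for i in step.reads() {
            let g = &mut self.grids[i];
            g.remaining_uses = g
                .remaining_uses
                .checked_sub(1)
                .expect("release without a matching register");
            if g.remaining_uses == 0 {
                g.data = None;
            }
        }
    }

    /// Number of grids that still hold memory.
    fn live(&self) -> usize {
        self.grids.iter().filter(|g| g.data.is_some()).count()
    }
}
```

If `release` skipped the neighbor grids, their counts would never reach zero and the buffers would stay allocated, which is the behavior this PR fixes. Without the guard in `reads`, a neighbor shared by both roles would be released twice and freed while another step still needs it.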

jxl_cli also no longer stores progressive render snapshots when there's no output path to write them to.
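A minimal sketch of that CLI behavior, assuming a hypothetical `ProgressiveSink` that owns the optional output path (none of these names are taken from jxl_cli): a snapshot is only materialized and kept when there is somewhere to write it.

```rust
use std::path::PathBuf;

/// Hypothetical partial render kept for later writing.
struct Snapshot {
    pixels: Vec<u8>,
}

/// Hypothetical holder for progressive snapshots in a CLI.
struct ProgressiveSink {
    output: Option<PathBuf>,
    snapshots: Vec<Snapshot>,
}

impl ProgressiveSink {
    fn new(output: Option<PathBuf>) -> Self {
        ProgressiveSink { output, snapshots: Vec::new() }
    }

    /// Called at every render interval. The closure builds the snapshot
    /// lazily, so nothing is copied when there is no output path.
    fn on_partial_render(&mut self, render: impl FnOnce() -> Snapshot) {
        if self.output.is_some() {
            self.snapshots.push(render());
        }
    }

    /// Bytes currently held by retained snapshots.
    fn retained_bytes(&self) -> usize {
        self.snapshots.iter().map(|s| s.pixels.len()).sum()
    }
}
```

Taking a closure rather than a ready-made `Snapshot` is a design choice of this sketch: it avoids even the temporary allocation in the no-output case.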

Peak RSS on the repro:

| Scenario | Before | After |
| --- | --- | --- |
| decode (`-s`) | 2740 MiB | 1191 MiB |
| `--render-interval 1000` | 4208 MiB | 3626 MiB |
| `--render-interval 250` | 7411 MiB | 5815 MiB |

For full decode that's now below djxl (~1516 MiB) and ~1.5x jxl-oxide (~792 MiB), down from 3.4x.

Only retain progressive render snapshots in the CLI when they can be written to an output.

Inverse squeeze steps read neighbor grids (next average and previous decoded) that the transform graph counts as buffer uses, but the per-step code never released them, so those intermediate modular buffers stayed allocated for the whole frame. Mark them used on the final render so they are freed once consumed.

Modular frames never reclaim center group buffers via get_buffer, so the scratch pool grew to a full-frame copy that was retained for the pipeline's lifetime. Cap it to the few buffers sequential rendering can actually reuse.
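The capping idea can be illustrated with a small bounded pool. This is a sketch under assumptions, not the render pipeline's actual pool: `ScratchPool`, `max_retained`, `get`, and `put` are hypothetical names. Note that this commit was later reverted in this PR at the reviewer's request, so the sketch only shows what the cap was meant to do.

```rust
/// Hypothetical scratch buffer pool that retains at most `max_retained`
/// idle buffers; anything beyond the cap is dropped on return.
struct ScratchPool {
    free: Vec<Vec<f32>>,
    max_retained: usize,
}

impl ScratchPool {
    fn new(max_retained: usize) -> Self {
        ScratchPool { free: Vec::new(), max_retained }
    }

    /// Hand out a zeroed buffer of `len` elements, reusing an idle one
    /// when available.
    fn get(&mut self, len: usize) -> Vec<f32> {
        let mut buf = self.free.pop().unwrap_or_default();
        buf.clear();
        buf.resize(len, 0.0);
        buf
    }

    /// Return a buffer. It is kept only while the pool is below the cap;
    /// otherwise it is dropped here and its memory goes back to the allocator.
    fn put(&mut self, buf: Vec<f32>) {
        if self.free.len() < self.max_retained {
            self.free.push(buf);
        }
    }

    /// Number of idle buffers currently retained.
    fn retained(&self) -> usize {
        self.free.len()
    }
}
```

With an uncapped pool, returning one buffer per group would leave the pool holding roughly a full-frame copy; with the cap, it holds at most `max_retained` buffers regardless of how many are returned.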

veluca93

I intend to revisit how modular transform processing works, but this is a good fix in the meantime.

hjanuschka · 2026-06-08T09:50:17Z

I started working on the modular transforms, but it was just too big, so this PR is a temporary improvement!

veluca93 · 2026-06-08T10:24:31Z

Performance Summary (Commit `58821ab`)

| Machine | Threading | Base MP/s | PR MP/s | Avg Improvement |
| --- | --- | --- | --- | --- |
| desktop | Single | 80.80 | 81.01 | +0.61% ± 0.39% |
| desktop | Multi | 80.82 | 80.82 | +0.49% ± 0.33% |
| framework-desktop | Single | 94.43 | 93.71 | +0.13% ± 0.45% |
| framework-desktop | Multi | 94.79 | 94.65 | +0.20% ± 0.44% |
| pixel7a | Single (Fast) | 28.58 | 28.90 | +0.44% ± 0.40% |
| pixel7a | Single (Mid) | 20.83 | 21.10 | +0.39% ± 0.38% |
| pixel7a | Multi | 29.08 | 29.06 | -0.07% ± 0.42% |

veluca93 · 2026-06-08T12:03:38Z

I think I know where the speed regressions are coming from - can you remove the part to limit the scratch buffers for now?

hjanuschka · 2026-06-08T12:21:42Z

Done. I also opened #796, which has the refactored interleave and completely wins in terms of RSS (could we bench this?).

hjanuschka added 3 commits June 8, 2026 10:44

Avoid storing unused partial renders

0d19aac

Bound render pipeline scratch buffer pool

6489571

veluca93 approved these changes Jun 8, 2026

Revert "Bound render pipeline scratch buffer pool"

4446cea

This reverts commit 6489571.

hjanuschka mentioned this pull request Jun 8, 2026

WIP: Interleave modular transform processing #796

Draft

Merge branch 'main' into fix/issue-782-progressive-memory

58821ab

hjanuschka wants to merge 5 commits into libjxl:main from hjanuschka:fix/issue-782-progressive-memory
