Adaptive (runtime, stats-based) conjunct reordering for FilterExec by adriangb · Pull Request #22698 · apache/datafusion

adriangb · 2026-06-01T13:33:46Z

Which issue does this PR close?

part of [EPIC] Adaptive predicate evaluation #22883
Related to Reorder boolean expressions (including filter predicates) according to evaluation cost / selectivity #11262
complements perf: Reorder predicates in conjuncts via simple heuristic #22343

Rationale for this change

Predicate evaluation order matters: running a selective predicate first lets it
gate the work of the predicates after it. The static cheap/expensive heuristic
(#22343) sorts conjuncts into two cost classes and stable-sorts within each, so
it does nothing to order multiple similarly-expensive predicates; and
BinaryExpr's AND short-circuit only gates on a leftmost selective conjunct.
So a conjunction of several expensive predicates whose selective member is not
written first is evaluated with every predicate scanning ~every row — and
neither mechanism fixes it.

This PR adds runtime, statistics-based conjunct reordering for FilterExec:
it measures each conjunct's selectivity and cost on the rows that actually reach
it and runs the ones that discard the most rows per unit of CPU time first.
Maximising discards-per-second is exactly minimising cost_per_row / (1 - pass_rate),
the classic optimal ordering key for independent conjuncts.

It is off by default (datafusion.execution.adaptive_filter_reordering).

What changes are included in this PR?

Everything lives in a new private module,
datafusion/physical-plan/src/adaptive_filter.rs; FilterExec itself gains one
field, an Option on the stream, and a two-arm match in the poll loop. No plan,
EXPLAIN, statistics, or public-API changes beyond the config flag.

The compact-once evaluation loop: conjuncts are evaluated sequentially,
masks combined with cheap bitwise ANDs, and the working batch is physically
compacted to the survivors once the accumulated mask keeps ≤20% of rows. This
compaction is what makes ordering pay off — a fused BinaryExpr AND chain
evaluates ~every conjunct on ~every row regardless of order. (An earlier
revision of this PR validated adoptions with A/B trials and a
champion/challenger arbiter; benchmarking showed the compact-once loop is
where the win actually comes from, so this PR ships the simple one-shot
cost-model design and leaves A/B validation as possible future work.)
Warm-up and settle: each conjunct's marginal pass rate and per-row cost
are measured for 8 batches; conjuncts are then ranked by discards-per-
nanosecond, and the ranked order is adopted only if its expected cost is
materially (≥5%) cheaper than the written order. If not, the settled path
evaluates the original fused predicate — identical to the feature being off.
Compact-once is used only in service of a reorder.
Cross-stream pooling (AdaptiveFilterShared): all partition streams of
one FilterExec pool their measurements; the first to finish the warm-up
publishes the settled order via an epoch atomic (one relaxed load per batch
for the others). The warm-up is paid roughly once per query, not once per
stream — which is what makes the win materialise when each stream only sees a
handful of batches.
Safety rails: volatile predicates are never reordered; reset_state
gives re-executions (e.g. recursive queries) fresh measurements; predicate
rewrites reset the pooled stats; empty batches don't consume the warm-up;
results are provably order-independent (a conjunction's value doesn't depend
on evaluation order).
Config flag execution.adaptive_filter_reordering (experimental,
default false), plus regenerated configs.md / information_schema.

Known limitations (documented in the module): the statistics are
conditional — each conjunct is measured on the rows that survived the ones
before it — so correlated conjuncts can be misjudged, and the settle is
one-shot with no drift re-measurement. The material-win guard makes adoption
conservative. Drift re-thaw, confidence intervals, and A/B-validated adoption
are explicitly future work layered on this core.

Reuse note (#22237 / #22144): the adaptive parquet filter-placement work
needs the same ideas (per-predicate selectivity/cost accumulators, a
measure → settle → publish pattern) but a different arrangement type and
lifetime. Everything here is deliberately pub(crate): rather than pre-build a
shared abstraction in physical-expr-common from a single consumer, the plan
is to lift the small shared pieces once the second consumer's shape is
concrete. Nothing in this PR commits any public API beyond the config flag.

Are these changes tested?

Yes:

Unit tests for the ranking key, expected-cost model, warm-up boundary
(mask always equals the plain predicate's, before and after settling),
no-reorder-runs-plain-predicate, cross-stream pooling/adoption, empty-batch
and timer-resolution edge cases, and reset_state.
adaptive_filter.slt: the table is stored as 64-row batches so the flag-on
queries cross the warm-up and exercise the settled (reordered) path
end-to-end, including a NULL-producing conjunct; results and EXPLAIN are
identical with the flag on and off.

Are there any user-facing changes?

One new config option, datafusion.execution.adaptive_filter_reordering
(experimental, default false). When enabled, query results never change, but
observable side effects of fallible predicates can — even when no reorder is
adopted — because while measuring (and after a reorder) conjuncts are evaluated
only on rows that survived the conjuncts before them, so an error the fused
predicate would have raised on an already-filtered row (e.g.
b <> 0 AND 1/b > 2 evaluating 1/b on every row) may not occur. Predicates
containing volatile expressions are never reordered.

github-actions · 2026-06-01T13:43:43Z

Thank you for opening this pull request!

Reviewer note: cargo-semver-checks reported the current version number is not SemVer-compatible with the changes in this pull request (compared against the base branch).

Details

     Cloning apache/main
    Building datafusion-common v54.0.0 (current)
       Built [  27.518s] (current)
     Parsing datafusion-common v54.0.0 (current)
      Parsed [   0.051s] (current)
    Building datafusion-common v54.0.0 (baseline)
       Built [  26.123s] (baseline)
     Parsing datafusion-common v54.0.0 (baseline)
      Parsed [   0.051s] (baseline)
    Checking datafusion-common v54.0.0 -> v54.0.0 (no change; assume patch)
     Checked [   0.808s] 223 checks: 222 pass, 1 fail, 0 warn, 30 skip

--- failure constructible_struct_adds_field: externally-constructible struct adds field ---

Description:
A pub struct constructible with a struct literal has a new pub field. Existing struct literals must be updated to include the new field.
        ref: https://doc.rust-lang.org/reference/expressions/struct-expr.html
       impl: https://github.com/obi1kenobi/cargo-semver-checks/tree/v0.48.0/src/lints/constructible_struct_adds_field.ron

Failed in:
  field ExecutionOptions.adaptive_filter_reordering in /home/runner/work/datafusion/datafusion/datafusion/common/src/config.rs:723

     Summary semver requires new major version: 1 major and 0 minor checks failed
    Finished [  55.880s] datafusion-common
    Building datafusion-physical-plan v54.0.0 (current)
       Built [  28.684s] (current)
     Parsing datafusion-physical-plan v54.0.0 (current)
      Parsed [   0.115s] (current)
    Building datafusion-physical-plan v54.0.0 (baseline)
       Built [  28.869s] (baseline)
     Parsing datafusion-physical-plan v54.0.0 (baseline)
      Parsed [   0.112s] (baseline)
    Checking datafusion-physical-plan v54.0.0 -> v54.0.0 (no change; assume patch)
     Checked [   0.792s] 223 checks: 223 pass, 30 skip
     Summary no semver update required
    Finished [  59.911s] datafusion-physical-plan
    Building datafusion-sqllogictest v54.0.0 (current)
       Built [ 149.612s] (current)
     Parsing datafusion-sqllogictest v54.0.0 (current)
      Parsed [   0.019s] (current)
    Building datafusion-sqllogictest v54.0.0 (baseline)
       Built [ 147.482s] (baseline)
     Parsing datafusion-sqllogictest v54.0.0 (baseline)
      Parsed [   0.019s] (baseline)
    Checking datafusion-sqllogictest v54.0.0 -> v54.0.0 (no change; assume patch)
     Checked [   0.097s] 223 checks: 223 pass, 30 skip
     Summary no semver update required
    Finished [ 299.635s] datafusion-sqllogictest

The compact-once loop wins by gating expensive conjuncts behind a selective one, but its per-conjunct bookkeeping (mask AND, true_count, the compaction copy) is pure overhead when there is nothing to gate. On a conjunction of interchangeable predicates — several equally expensive, equally unselective regexps, say — the warm-up settles on the written order (nothing to reorder) yet still paid compact-once on every batch, regressing ~11% vs the plain predicate (predicate_eval neutral_q61). Guard it: compact-once is adopted only when the warm-up actually reorders the conjuncts. When the settled order equals the written order, evaluate the predicate as-is — byte-for-byte the flag-off path, zero overhead. Since every real win reorders (a selective conjunct moves toward the front), this keeps the full win while removing the no-reorder regression. predicate_eval (vs flag off): neutral_q61 +11% -> ~0; wins preserved (costsel_q01 -60%, width -58..-66%, cardinality k16 -31%). A small residual remains on low-cardinality cheap conjunctions that do reorder (k4/k8 ~+3-4%), where compaction's cost is not repaid by gating so few/cheap predicates; the full champion/challenger arbiter regresses these more (~+10%), so a heavier guard is not worth it here. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> Claude-Session: https://claude.ai/code/session_01Lh7i9DyeFWuTFWjogrVNkb

@xudong963

Two points raised by @xudong963 that carried over into the compact-once rewrite of the adaptive `FilterExec` conjunct evaluator: - Replace `.expect("u32 live")` on the live-row index downcast with a let-else returning `internal_err!`, so a broken invariant surfaces as a clean error rather than a panic. - Add a `debug_assert!` documenting that live-row indices are tracked in arrow's `u32` `filter`/`take` index space, making the `num_rows as u32` cast's precondition explicit. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> Claude-Session: https://claude.ai/code/session_01R1mvYrjFyTy2kbBoGrWzT6

@xudong963

@xudong963 noted the pooled adaptive-conjunct measurements live on the `FilterExec` plan node and are reused by every `execute()` call, leaking the learned conjunct order across independent executions. Implement `ExecutionPlan::reset_state` for `FilterExec` — the sanctioned mechanism for exactly this (its trait docs cite `DynamicFilterPhysicalExpr`; `CrossJoinExec`, `HashJoinExec`, and `SortExec` use it for their build-side / dynamic-filter state). It returns a fresh node with a new `AdaptiveFilterShared` (and fresh metrics), so a re-executed plan re-learns from scratch, while preserving the still-valid predicate, input, and cached plan properties. Reordering only ever affects performance, never results, so this closes a perf-staleness gap, not a correctness bug. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> Claude-Session: https://claude.ai/code/session_01R1mvYrjFyTy2kbBoGrWzT6

- An empty batch no longer consumes the warm-up: a run of empty batches would settle the written order on no evidence, permanently disabling adaptation for the stream. - A conjunct evaluated faster than the timer's resolution now clamps its cost to 1ns instead of dropping out of the ranking as unmeasured (which sorted the cheapest conjunct last — backwards). - The u32::MAX row-count guard is now a real internal error instead of a debug_assert; in release the indices would have silently wrapped. Also documents the known limitations of the one-shot, conditional-stats settle (correlated conjuncts, no drift re-measurement) in the module doc. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> Claude-Session: https://claude.ai/code/session_01DgXQmKrKgre4epbcSxUHNo

The previous test stored the table as a single batch, so with WARMUP_BATCHES = 8 the flag-on queries only ever exercised the measuring path. Store 4000 rows as 64-row batches so the warm-up completes and the settled (possibly reordered) path runs end-to-end, and add a query whose conjunct produces NULLs to exercise the null-mask path through real SQL. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> Claude-Session: https://claude.ai/code/session_01DgXQmKrKgre4epbcSxUHNo

The config docs claimed reordering was the only observable difference; in fact side effects of fallible predicates can change even when no reorder is adopted, because while measuring (and after a reorder) conjuncts are evaluated only on rows that survived the conjuncts before them. Say so explicitly, with an example, and note in FilterExec that Clone sharing the pooled measurements is deliberate. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> Claude-Session: https://claude.ai/code/session_01DgXQmKrKgre4epbcSxUHNo

…atch strategy Two review responses: - AdaptiveConjunction::try_new no longer takes an `enabled` bool that short-circuits to None; whether the feature is on is FilterExec's policy, so the flag check moves to FilterExec::execute and try_new answers only the structural question (reorderable, non-volatile conjunction). - The evaluator's per-batch behaviour is now observable: evaluate is a thin wrapper over evaluate_traced, which also reports the BatchStrategy used (Measure / Fused / Reordered). Two scenario tests exercise the input/output contract end to end — batches in, masks + strategy trace out — with per-conjunct costs injected by seeding the shared pool with synthetic measurements (the stand-in for a mocked clock), so which strategy gets adopted is deterministic: warm-up settles on a reorder for cheap-unselective + expensive-selective conjuncts, and on the written fused predicate for interchangeable ones. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> Claude-Session: https://claude.ai/code/session_01DgXQmKrKgre4epbcSxUHNo

adriangb · 2026-07-10T22:21:35Z

@alamb @xudong963 I think this is ready for review. The meat of the complexity is in adaptive_filter.rs, but the only public API impact is the config option. The rest we can change with time.

alamb · 2026-07-12T10:41:30Z

@alamb @xudong963 I think this is ready for review. The meat of the complexity is in adaptive_filter.rs, but the only public API impact is the config option. The rest we can change with time.

I will review this tomorrow morning hopefully (I am off mon / tue but have some time in the mornings)

alamb · 2026-07-13T11:46:19Z

run benchmark tpch10 clickbench_partitioned tpcds

baseline:
ref: a991d38
env:
DATAFUSION_EXECUTION_ADAPTIVE_FILTER_REORDERING: "false"
changed:
ref: a991d38
env:
DATAFUSION_EXECUTION_ADAPTIVE_FILTER_REORDERING: "true"

alamb · 2026-07-13T11:46:22Z

run benchmark tpch10 clickbench_partitioned tpcds

adriangbot · 2026-07-13T11:48:51Z

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4957592543-1025-c9tl6 6.12.85+ #1 SMP Mon May 11 08:17:35 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing lift-selectivity-stats (a991d38) to 63e25c1 (merge-base) diff using: tpch10
Results will be posted here when complete

File an issue against this benchmark runner

adriangbot · 2026-07-13T11:48:51Z

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4957592543-1026-ps8vs 6.12.85+ #1 SMP Mon May 11 08:17:35 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing lift-selectivity-stats (a991d38) to 63e25c1 (merge-base) diff using: clickbench_partitioned
Results will be posted here when complete

File an issue against this benchmark runner

adriangbot · 2026-07-13T11:48:53Z

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4957592543-1027-8gm8z 6.12.85+ #1 SMP Mon May 11 08:17:35 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing lift-selectivity-stats (a991d38) to 63e25c1 (merge-base) diff using: tpcds
Results will be posted here when complete

File an issue against this benchmark runner

adriangbot · 2026-07-13T11:49:12Z

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4957592236-1023-2sgnq 6.12.85+ #1 SMP Mon May 11 08:17:35 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing a991d38 (a991d38) to a991d38 diff using: clickbench_partitioned
Results will be posted here when complete

File an issue against this benchmark runner

adriangbot · 2026-07-13T11:49:15Z

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4957592236-1024-kbzk6 6.12.85+ #1 SMP Mon May 11 08:17:35 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing a991d38 (a991d38) to a991d38 diff using: tpcds
Results will be posted here when complete

File an issue against this benchmark runner

adriangbot · 2026-07-13T11:49:42Z

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4957592236-1022-zjbbz 6.12.85+ #1 SMP Mon May 11 08:17:35 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing a991d38 (a991d38) to a991d38 diff using: tpch10
Results will be posted here when complete

File an issue against this benchmark runner

adriangbot · 2026-07-13T12:06:17Z

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Details

Comparing HEAD and lift-selectivity-stats
--------------------
Benchmark tpch_sf10.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query     ┃                               HEAD ┃             lift-selectivity-stats ┃        Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1  │  311.48 / 313.73 ±1.20 / 314.79 ms │  308.69 / 310.23 ±1.00 / 311.50 ms │     no change │
│ QQuery 2  │  102.77 / 105.57 ±2.93 / 110.44 ms │  100.45 / 102.91 ±1.71 / 104.85 ms │     no change │
│ QQuery 3  │  233.93 / 236.53 ±2.12 / 239.55 ms │  230.65 / 234.27 ±3.45 / 240.62 ms │     no change │
│ QQuery 4  │  112.40 / 114.92 ±2.65 / 119.22 ms │  113.06 / 114.96 ±1.50 / 117.26 ms │     no change │
│ QQuery 5  │  345.58 / 354.43 ±6.41 / 364.67 ms │  353.73 / 357.36 ±4.88 / 366.89 ms │     no change │
│ QQuery 6  │  123.78 / 126.49 ±3.02 / 132.15 ms │  102.82 / 105.87 ±3.13 / 111.59 ms │ +1.19x faster │
│ QQuery 7  │  460.19 / 464.60 ±5.76 / 475.84 ms │  455.82 / 468.52 ±8.42 / 477.47 ms │     no change │
│ QQuery 8  │  379.02 / 386.02 ±5.03 / 393.37 ms │  382.94 / 389.01 ±6.64 / 400.73 ms │     no change │
│ QQuery 9  │  552.90 / 562.71 ±8.01 / 576.68 ms │  557.15 / 562.38 ±5.77 / 571.80 ms │     no change │
│ QQuery 10 │  308.11 / 317.38 ±9.32 / 330.25 ms │  297.07 / 308.79 ±8.84 / 321.26 ms │     no change │
│ QQuery 11 │    81.18 / 91.44 ±9.79 / 103.29 ms │    86.04 / 93.60 ±7.48 / 106.10 ms │     no change │
│ QQuery 12 │  176.57 / 179.36 ±2.57 / 183.77 ms │  120.98 / 123.39 ±2.32 / 127.34 ms │ +1.45x faster │
│ QQuery 13 │  277.97 / 292.86 ±8.52 / 304.37 ms │  290.45 / 295.58 ±4.96 / 304.39 ms │     no change │
│ QQuery 14 │  172.28 / 176.00 ±4.24 / 184.08 ms │  171.30 / 176.41 ±6.97 / 190.24 ms │     no change │
│ QQuery 15 │  302.56 / 305.10 ±2.80 / 310.32 ms │  306.52 / 309.08 ±2.28 / 312.36 ms │     no change │
│ QQuery 16 │     64.85 / 67.99 ±3.05 / 73.80 ms │     63.94 / 69.16 ±7.20 / 83.24 ms │     no change │
│ QQuery 17 │  621.69 / 636.04 ±7.31 / 641.38 ms │ 610.23 / 625.29 ±12.49 / 646.24 ms │     no change │
│ QQuery 18 │ 666.57 / 688.57 ±11.35 / 696.83 ms │  677.67 / 688.46 ±6.07 / 695.27 ms │     no change │
│ QQuery 19 │ 247.36 / 265.35 ±13.69 / 285.93 ms │ 241.83 / 256.93 ±18.34 / 287.63 ms │     no change │
│ QQuery 20 │  291.97 / 300.32 ±6.30 / 308.58 ms │ 279.01 / 294.58 ±10.58 / 306.83 ms │     no change │
│ QQuery 21 │  653.01 / 660.61 ±6.27 / 671.47 ms │  648.90 / 660.14 ±6.39 / 667.18 ms │     no change │
│ QQuery 22 │     59.76 / 62.75 ±2.43 / 67.15 ms │     59.51 / 61.01 ±0.90 / 62.14 ms │     no change │
└───────────┴────────────────────────────────────┴────────────────────────────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃ Benchmark Summary                     ┃           ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ Total Time (HEAD)                     │ 6708.76ms │
│ Total Time (lift-selectivity-stats)   │ 6607.93ms │
│ Average Time (HEAD)                   │  304.94ms │
│ Average Time (lift-selectivity-stats) │  300.36ms │
│ Queries Faster                        │         2 │
│ Queries Slower                        │         0 │
│ Queries with No Change                │        20 │
│ Queries with Failure                  │         0 │
└───────────────────────────────────────┴───────────┘

Resource Usage

tpch10 — base (merge-base)

Metric	Value
Wall time	35.0s
Peak memory	4.5 GiB
Avg memory	1.5 GiB
CPU user	339.7s
CPU sys	19.5s
Peak spill	0 B

tpch10 — branch

Metric	Value
Wall time	35.0s
Peak memory	5.2 GiB
Avg memory	1.6 GiB
CPU user	333.7s
CPU sys	19.6s
Peak spill	0 B

File an issue against this benchmark runner

adriangbot · 2026-07-13T12:06:26Z

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Details

Comparing HEAD and lift-selectivity-stats
--------------------
Benchmark tpcds_sf1.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query     ┃                                   HEAD ┃                lift-selectivity-stats ┃        Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1  │            5.45 / 6.00 ±0.94 / 7.86 ms │           5.56 / 6.08 ±0.89 / 7.85 ms │     no change │
│ QQuery 2  │         80.16 / 80.59 ±0.41 / 81.34 ms │        82.69 / 83.58 ±0.60 / 84.32 ms │     no change │
│ QQuery 3  │         29.75 / 29.90 ±0.21 / 30.31 ms │        29.88 / 30.42 ±0.41 / 31.07 ms │     no change │
│ QQuery 4  │      475.41 / 481.57 ±3.20 / 484.14 ms │     490.20 / 492.98 ±2.53 / 497.09 ms │     no change │
│ QQuery 5  │         51.97 / 52.28 ±0.31 / 52.73 ms │        52.99 / 53.24 ±0.35 / 53.93 ms │     no change │
│ QQuery 6  │         36.49 / 36.77 ±0.31 / 37.30 ms │        36.85 / 37.48 ±0.64 / 38.70 ms │     no change │
│ QQuery 7  │         94.21 / 94.69 ±0.37 / 95.18 ms │        95.37 / 97.02 ±1.21 / 98.35 ms │     no change │
│ QQuery 8  │         36.53 / 37.13 ±0.40 / 37.78 ms │        37.07 / 38.73 ±2.67 / 44.06 ms │     no change │
│ QQuery 9  │         51.83 / 54.01 ±2.16 / 57.74 ms │        52.44 / 56.63 ±3.43 / 62.21 ms │     no change │
│ QQuery 10 │         62.84 / 63.35 ±0.33 / 63.78 ms │        64.11 / 64.44 ±0.38 / 65.16 ms │     no change │
│ QQuery 11 │      293.67 / 297.32 ±2.82 / 300.70 ms │     298.38 / 305.68 ±9.29 / 323.90 ms │     no change │
│ QQuery 12 │         28.12 / 28.54 ±0.34 / 29.05 ms │        29.41 / 29.72 ±0.25 / 30.08 ms │     no change │
│ QQuery 13 │      118.25 / 118.73 ±0.36 / 119.33 ms │     121.37 / 122.04 ±0.59 / 122.97 ms │     no change │
│ QQuery 14 │      409.60 / 414.24 ±3.35 / 418.62 ms │     414.19 / 418.15 ±3.72 / 422.97 ms │     no change │
│ QQuery 15 │         56.75 / 58.70 ±1.96 / 62.40 ms │        57.30 / 57.73 ±0.34 / 58.16 ms │     no change │
│ QQuery 16 │            6.51 / 6.64 ±0.21 / 7.06 ms │           6.52 / 6.69 ±0.24 / 7.15 ms │     no change │
│ QQuery 17 │         80.24 / 81.29 ±0.64 / 82.14 ms │        79.59 / 80.79 ±0.99 / 82.38 ms │     no change │
│ QQuery 18 │      125.68 / 126.25 ±0.78 / 127.77 ms │     123.39 / 126.63 ±4.45 / 135.44 ms │     no change │
│ QQuery 19 │         41.84 / 42.29 ±0.29 / 42.74 ms │        41.87 / 42.19 ±0.28 / 42.68 ms │     no change │
│ QQuery 20 │         35.31 / 35.73 ±0.40 / 36.29 ms │        35.62 / 35.86 ±0.29 / 36.40 ms │     no change │
│ QQuery 21 │         17.37 / 18.13 ±0.96 / 20.03 ms │        17.74 / 17.86 ±0.10 / 18.00 ms │     no change │
│ QQuery 22 │         62.41 / 63.33 ±0.70 / 64.54 ms │        63.45 / 63.72 ±0.26 / 64.07 ms │     no change │
│ QQuery 23 │      343.88 / 347.26 ±4.21 / 354.71 ms │     346.35 / 349.85 ±4.01 / 357.62 ms │     no change │
│ QQuery 24 │      225.71 / 228.33 ±4.35 / 237.00 ms │     225.84 / 227.54 ±1.48 / 229.44 ms │     no change │
│ QQuery 25 │      111.38 / 112.27 ±0.84 / 113.69 ms │     110.88 / 111.94 ±0.72 / 113.14 ms │     no change │
│ QQuery 26 │         56.94 / 59.12 ±2.93 / 64.92 ms │        57.63 / 59.65 ±2.00 / 62.64 ms │     no change │
│ QQuery 27 │            6.22 / 6.33 ±0.13 / 6.56 ms │           6.17 / 6.32 ±0.18 / 6.67 ms │     no change │
│ QQuery 28 │         58.83 / 61.52 ±1.37 / 62.64 ms │        57.86 / 61.03 ±1.86 / 62.67 ms │     no change │
│ QQuery 29 │         97.50 / 98.61 ±0.87 / 99.79 ms │       97.38 / 98.65 ±1.25 / 100.63 ms │     no change │
│ QQuery 30 │         32.42 / 33.07 ±0.65 / 33.96 ms │        31.70 / 32.59 ±0.85 / 34.19 ms │     no change │
│ QQuery 31 │      111.94 / 113.05 ±0.87 / 114.20 ms │     111.34 / 112.27 ±0.53 / 112.92 ms │     no change │
│ QQuery 32 │         20.17 / 20.53 ±0.31 / 21.11 ms │        20.48 / 20.75 ±0.43 / 21.60 ms │     no change │
│ QQuery 33 │         37.38 / 37.83 ±0.29 / 38.20 ms │        37.74 / 38.28 ±0.44 / 38.92 ms │     no change │
│ QQuery 34 │          9.83 / 10.04 ±0.21 / 10.37 ms │         9.60 / 10.00 ±0.43 / 10.78 ms │     no change │
│ QQuery 35 │         73.13 / 74.31 ±1.10 / 75.93 ms │        72.21 / 73.44 ±1.19 / 75.66 ms │     no change │
│ QQuery 36 │            5.69 / 5.81 ±0.17 / 6.16 ms │           5.82 / 5.94 ±0.20 / 6.33 ms │     no change │
│ QQuery 37 │            6.85 / 6.98 ±0.10 / 7.14 ms │           6.98 / 7.00 ±0.03 / 7.05 ms │     no change │
│ QQuery 38 │         61.85 / 62.42 ±0.48 / 63.29 ms │        62.61 / 62.78 ±0.17 / 63.10 ms │     no change │
│ QQuery 39 │         89.73 / 90.93 ±1.00 / 92.34 ms │        90.13 / 92.40 ±1.99 / 96.07 ms │     no change │
│ QQuery 40 │         23.35 / 23.96 ±0.42 / 24.57 ms │        23.57 / 23.83 ±0.22 / 24.03 ms │     no change │
│ QQuery 41 │         11.42 / 11.58 ±0.21 / 11.98 ms │        11.55 / 11.71 ±0.15 / 11.99 ms │     no change │
│ QQuery 42 │         23.58 / 24.53 ±0.60 / 25.34 ms │        24.10 / 24.82 ±0.82 / 26.15 ms │     no change │
│ QQuery 43 │            4.88 / 4.96 ±0.09 / 5.14 ms │           4.94 / 5.03 ±0.10 / 5.19 ms │     no change │
│ QQuery 44 │            9.14 / 9.21 ±0.05 / 9.29 ms │           9.17 / 9.38 ±0.12 / 9.53 ms │     no change │
│ QQuery 45 │         37.81 / 38.32 ±0.64 / 39.54 ms │        37.59 / 38.01 ±0.52 / 39.03 ms │     no change │
│ QQuery 46 │         11.48 / 11.91 ±0.26 / 12.27 ms │        11.60 / 11.80 ±0.15 / 12.01 ms │     no change │
│ QQuery 47 │      227.70 / 232.25 ±3.44 / 236.49 ms │     226.73 / 229.48 ±2.34 / 233.25 ms │     no change │
│ QQuery 48 │         95.60 / 96.58 ±0.77 / 97.39 ms │       97.08 / 98.83 ±1.26 / 100.98 ms │     no change │
│ QQuery 49 │         75.93 / 76.90 ±0.59 / 77.46 ms │        77.41 / 77.96 ±0.30 / 78.31 ms │     no change │
│ QQuery 50 │         59.50 / 61.69 ±3.23 / 68.10 ms │        59.74 / 61.39 ±1.84 / 63.88 ms │     no change │
│ QQuery 51 │         93.02 / 93.93 ±0.62 / 94.59 ms │        92.03 / 93.42 ±1.12 / 95.45 ms │     no change │
│ QQuery 52 │         24.28 / 24.66 ±0.19 / 24.80 ms │        24.06 / 24.59 ±0.44 / 25.39 ms │     no change │
│ QQuery 53 │         29.46 / 29.63 ±0.09 / 29.71 ms │        29.35 / 29.59 ±0.17 / 29.87 ms │     no change │
│ QQuery 54 │         55.83 / 57.38 ±1.95 / 61.14 ms │        55.72 / 57.52 ±2.89 / 63.26 ms │     no change │
│ QQuery 55 │         23.46 / 23.97 ±0.64 / 25.25 ms │        23.49 / 23.77 ±0.22 / 24.02 ms │     no change │
│ QQuery 56 │         38.54 / 39.33 ±0.86 / 40.92 ms │        39.65 / 40.11 ±0.29 / 40.54 ms │     no change │
│ QQuery 57 │      174.60 / 178.26 ±3.31 / 183.75 ms │     175.67 / 177.58 ±1.59 / 179.71 ms │     no change │
│ QQuery 58 │      113.82 / 115.48 ±1.13 / 117.04 ms │     114.35 / 116.05 ±1.09 / 117.57 ms │     no change │
│ QQuery 59 │      117.13 / 118.08 ±0.67 / 119.20 ms │     118.84 / 120.18 ±1.24 / 122.34 ms │     no change │
│ QQuery 60 │         39.65 / 40.01 ±0.27 / 40.43 ms │        39.56 / 40.39 ±0.47 / 40.99 ms │     no change │
│ QQuery 61 │         11.87 / 12.02 ±0.22 / 12.46 ms │        12.02 / 12.21 ±0.22 / 12.62 ms │     no change │
│ QQuery 62 │         46.26 / 46.36 ±0.08 / 46.45 ms │        46.23 / 47.17 ±1.57 / 50.30 ms │     no change │
│ QQuery 63 │         30.14 / 31.44 ±2.36 / 36.16 ms │        29.27 / 29.90 ±0.35 / 30.20 ms │     no change │
│ QQuery 64 │      409.03 / 415.49 ±5.90 / 424.72 ms │     407.27 / 411.26 ±2.81 / 415.04 ms │     no change │
│ QQuery 65 │      147.51 / 150.85 ±2.23 / 153.71 ms │     145.26 / 147.88 ±2.07 / 151.17 ms │     no change │
│ QQuery 66 │         80.07 / 81.22 ±1.10 / 83.28 ms │        80.54 / 82.27 ±1.50 / 84.93 ms │     no change │
│ QQuery 67 │      237.97 / 242.97 ±5.24 / 252.56 ms │     236.04 / 243.89 ±5.17 / 251.14 ms │     no change │
│ QQuery 68 │         11.60 / 11.79 ±0.23 / 12.24 ms │        11.57 / 11.81 ±0.25 / 12.28 ms │     no change │
│ QQuery 69 │         57.09 / 57.33 ±0.13 / 57.47 ms │        57.49 / 59.41 ±3.13 / 65.63 ms │     no change │
│ QQuery 70 │     105.49 / 112.84 ±10.78 / 134.23 ms │     106.20 / 110.31 ±4.45 / 118.51 ms │     no change │
│ QQuery 71 │         35.10 / 35.70 ±0.54 / 36.68 ms │        35.13 / 35.83 ±0.50 / 36.62 ms │     no change │
│ QQuery 72 │ 2143.26 / 2237.91 ±102.81 / 2423.42 ms │ 2107.49 / 2167.70 ±44.61 / 2229.51 ms │     no change │
│ QQuery 73 │           9.42 / 9.66 ±0.20 / 10.03 ms │          9.65 / 9.94 ±0.28 / 10.43 ms │     no change │
│ QQuery 74 │      176.39 / 180.90 ±6.05 / 192.54 ms │     169.40 / 173.87 ±5.50 / 184.10 ms │     no change │
│ QQuery 75 │      150.38 / 154.30 ±4.17 / 161.64 ms │     148.67 / 154.22 ±5.06 / 161.21 ms │     no change │
│ QQuery 76 │         35.43 / 35.72 ±0.24 / 36.00 ms │        35.59 / 35.90 ±0.18 / 36.05 ms │     no change │
│ QQuery 77 │         61.45 / 61.76 ±0.29 / 62.17 ms │        61.84 / 62.49 ±0.67 / 63.49 ms │     no change │
│ QQuery 78 │      212.63 / 216.70 ±4.59 / 225.64 ms │     197.03 / 200.05 ±3.93 / 207.82 ms │ +1.08x faster │
│ QQuery 79 │         71.36 / 73.01 ±1.21 / 75.03 ms │        67.09 / 68.08 ±0.85 / 69.35 ms │ +1.07x faster │
│ QQuery 80 │      106.75 / 108.67 ±1.63 / 111.50 ms │      99.60 / 101.10 ±1.06 / 102.69 ms │ +1.07x faster │
│ QQuery 81 │         27.33 / 32.94 ±9.01 / 50.91 ms │        26.14 / 26.35 ±0.25 / 26.80 ms │ +1.25x faster │
│ QQuery 82 │         17.09 / 18.46 ±2.06 / 22.45 ms │        16.85 / 17.03 ±0.12 / 17.19 ms │ +1.08x faster │
│ QQuery 83 │         41.03 / 41.56 ±0.39 / 42.12 ms │        40.39 / 42.77 ±4.14 / 51.01 ms │     no change │
│ QQuery 84 │         30.20 / 30.83 ±0.68 / 32.12 ms │        30.61 / 32.24 ±2.84 / 37.91 ms │     no change │
│ QQuery 85 │      106.37 / 109.37 ±4.46 / 118.21 ms │     107.13 / 107.62 ±0.49 / 108.51 ms │     no change │
│ QQuery 86 │         25.18 / 26.36 ±0.86 / 27.72 ms │        25.15 / 25.43 ±0.30 / 25.99 ms │     no change │
│ QQuery 87 │         63.69 / 64.47 ±1.04 / 66.48 ms │        63.03 / 65.75 ±3.12 / 71.14 ms │     no change │
│ QQuery 88 │         62.04 / 62.43 ±0.23 / 62.76 ms │        64.31 / 64.63 ±0.30 / 65.13 ms │     no change │
│ QQuery 89 │         35.69 / 37.78 ±3.46 / 44.68 ms │        36.65 / 37.09 ±0.43 / 37.79 ms │     no change │
│ QQuery 90 │         17.03 / 17.23 ±0.13 / 17.40 ms │        17.48 / 17.84 ±0.24 / 18.11 ms │     no change │
│ QQuery 91 │         46.12 / 46.85 ±0.66 / 48.05 ms │        45.95 / 47.55 ±2.31 / 52.15 ms │     no change │
│ QQuery 92 │         29.11 / 29.95 ±0.54 / 30.68 ms │        29.81 / 31.38 ±1.87 / 35.03 ms │     no change │
│ QQuery 93 │         49.25 / 50.25 ±0.81 / 51.64 ms │        50.39 / 51.17 ±0.61 / 51.93 ms │     no change │
│ QQuery 94 │         37.54 / 38.03 ±0.51 / 38.90 ms │        38.14 / 38.67 ±0.40 / 39.34 ms │     no change │
│ QQuery 95 │         80.37 / 82.93 ±1.70 / 85.11 ms │        80.82 / 81.83 ±1.34 / 84.49 ms │     no change │
│ QQuery 96 │         24.09 / 24.32 ±0.26 / 24.80 ms │        24.55 / 25.40 ±1.21 / 27.80 ms │     no change │
│ QQuery 97 │         47.05 / 47.43 ±0.24 / 47.76 ms │        47.00 / 47.81 ±0.76 / 49.14 ms │     no change │
│ QQuery 98 │         42.69 / 43.40 ±0.63 / 44.56 ms │        43.28 / 44.04 ±0.74 / 45.26 ms │     no change │
│ QQuery 99 │         70.47 / 72.92 ±2.71 / 77.67 ms │        70.35 / 70.84 ±0.34 / 71.37 ms │     no change │
└───────────┴────────────────────────────────────────┴───────────────────────────────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary                     ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)                     │ 10062.67ms │
│ Total Time (lift-selectivity-stats)   │  9994.28ms │
│ Average Time (HEAD)                   │   101.64ms │
│ Average Time (lift-selectivity-stats) │   100.95ms │
│ Queries Faster                        │          5 │
│ Queries Slower                        │          0 │
│ Queries with No Change                │         94 │
│ Queries with Failure                  │          0 │
└───────────────────────────────────────┴────────────┘

Resource Usage

tpcds — base (merge-base)

Metric	Value
Wall time	55.0s
Peak memory	2.0 GiB
Avg memory	1.4 GiB
CPU user	229.2s
CPU sys	6.2s
Peak spill	0 B

tpcds — branch

Metric	Value
Wall time	55.0s
Peak memory	1.9 GiB
Avg memory	1.3 GiB
CPU user	229.2s
CPU sys	5.9s
Peak spill	0 B

File an issue against this benchmark runner

adriangbot · 2026-07-13T12:06:31Z

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Details

Comparing HEAD and lift-selectivity-stats
--------------------
Benchmark tpch_sf10.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃ Query     ┃                               HEAD ┃             lift-selectivity-stats ┃    Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ QQuery 1  │  313.93 / 315.83 ±1.11 / 317.17 ms │  315.48 / 317.59 ±1.36 / 319.69 ms │ no change │
│ QQuery 2  │  104.91 / 107.06 ±1.60 / 109.15 ms │  102.90 / 105.54 ±2.55 / 109.80 ms │ no change │
│ QQuery 3  │  235.63 / 238.87 ±2.71 / 243.03 ms │  233.51 / 236.96 ±4.15 / 245.00 ms │ no change │
│ QQuery 4  │  111.68 / 112.79 ±0.72 / 113.78 ms │  112.27 / 114.88 ±2.16 / 118.69 ms │ no change │
│ QQuery 5  │  354.37 / 359.88 ±3.18 / 363.29 ms │  352.35 / 362.81 ±6.88 / 372.20 ms │ no change │
│ QQuery 6  │  126.19 / 129.22 ±3.47 / 135.66 ms │  125.58 / 128.29 ±3.25 / 134.34 ms │ no change │
│ QQuery 7  │ 456.69 / 471.43 ±10.60 / 485.07 ms │ 454.49 / 468.48 ±15.78 / 498.78 ms │ no change │
│ QQuery 8  │  381.43 / 386.38 ±3.43 / 391.46 ms │  384.70 / 389.44 ±3.58 / 395.68 ms │ no change │
│ QQuery 9  │  549.77 / 558.15 ±5.84 / 566.36 ms │  550.15 / 558.80 ±9.64 / 576.92 ms │ no change │
│ QQuery 10 │  293.85 / 302.50 ±5.12 / 307.72 ms │  302.68 / 310.68 ±6.25 / 319.81 ms │ no change │
│ QQuery 11 │    84.39 / 90.48 ±7.00 / 103.11 ms │    87.56 / 93.37 ±4.97 / 101.57 ms │ no change │
│ QQuery 12 │  180.13 / 185.00 ±3.97 / 189.84 ms │  175.80 / 180.45 ±3.33 / 185.17 ms │ no change │
│ QQuery 13 │  289.11 / 300.04 ±8.04 / 312.76 ms │  286.99 / 291.03 ±3.05 / 295.81 ms │ no change │
│ QQuery 14 │  173.20 / 177.13 ±4.17 / 182.95 ms │  172.29 / 179.41 ±5.83 / 188.27 ms │ no change │
│ QQuery 15 │  306.48 / 309.36 ±3.67 / 316.23 ms │  306.79 / 310.68 ±2.60 / 313.66 ms │ no change │
│ QQuery 16 │     64.21 / 66.25 ±2.03 / 70.02 ms │     64.65 / 67.86 ±3.74 / 75.16 ms │ no change │
│ QQuery 17 │  620.66 / 632.08 ±7.91 / 642.64 ms │  616.47 / 625.90 ±6.57 / 633.92 ms │ no change │
│ QQuery 18 │  673.68 / 684.02 ±6.71 / 694.61 ms │ 670.14 / 685.90 ±12.83 / 700.26 ms │ no change │
│ QQuery 19 │ 249.21 / 262.36 ±11.29 / 278.69 ms │ 248.72 / 263.46 ±11.61 / 281.26 ms │ no change │
│ QQuery 20 │  279.58 / 286.46 ±6.21 / 297.11 ms │  273.66 / 289.13 ±9.78 / 300.55 ms │ no change │
│ QQuery 21 │  651.20 / 658.56 ±6.82 / 669.39 ms │  655.51 / 663.26 ±5.90 / 673.74 ms │ no change │
│ QQuery 22 │     61.05 / 64.00 ±2.87 / 68.89 ms │     61.08 / 63.77 ±2.12 / 66.53 ms │ no change │
└───────────┴────────────────────────────────────┴────────────────────────────────────┴───────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃ Benchmark Summary                     ┃           ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ Total Time (HEAD)                     │ 6697.85ms │
│ Total Time (lift-selectivity-stats)   │ 6707.70ms │
│ Average Time (HEAD)                   │  304.45ms │
│ Average Time (lift-selectivity-stats) │  304.90ms │
│ Queries Faster                        │         0 │
│ Queries Slower                        │         0 │
│ Queries with No Change                │        22 │
│ Queries with Failure                  │         0 │
└───────────────────────────────────────┴───────────┘

Resource Usage

tpch10 — base (merge-base)

Metric	Value
Wall time	35.0s
Peak memory	4.4 GiB
Avg memory	1.4 GiB
CPU user	340.0s
CPU sys	20.4s
Peak spill	0 B

tpch10 — branch

Metric	Value
Wall time	35.0s
Peak memory	4.2 GiB
Avg memory	1.4 GiB
CPU user	339.5s
CPU sys	20.5s
Peak spill	0 B

File an issue against this benchmark runner

adriangbot · 2026-07-13T12:06:44Z

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Details

Comparing HEAD and lift-selectivity-stats
--------------------
Benchmark tpcds_sf1.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query     ┃                                  HEAD ┃                lift-selectivity-stats ┃        Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 1  │           5.57 / 6.10 ±0.92 / 7.95 ms │           5.34 / 5.90 ±0.96 / 7.81 ms │     no change │
│ QQuery 2  │        82.15 / 82.41 ±0.18 / 82.66 ms │        80.84 / 81.02 ±0.14 / 81.17 ms │     no change │
│ QQuery 3  │        29.52 / 29.73 ±0.15 / 29.97 ms │        29.32 / 29.44 ±0.10 / 29.61 ms │     no change │
│ QQuery 4  │     490.39 / 494.16 ±2.76 / 497.32 ms │     488.91 / 493.97 ±4.63 / 502.07 ms │     no change │
│ QQuery 5  │        52.28 / 52.94 ±0.43 / 53.52 ms │        51.86 / 52.64 ±0.69 / 53.77 ms │     no change │
│ QQuery 6  │        36.76 / 37.01 ±0.16 / 37.15 ms │        36.38 / 36.86 ±0.32 / 37.23 ms │     no change │
│ QQuery 7  │        94.30 / 95.50 ±0.82 / 96.74 ms │        92.55 / 92.89 ±0.28 / 93.35 ms │     no change │
│ QQuery 8  │        37.40 / 38.50 ±1.93 / 42.37 ms │        36.49 / 37.84 ±1.71 / 41.20 ms │     no change │
│ QQuery 9  │        53.12 / 55.55 ±1.28 / 56.58 ms │        55.46 / 57.43 ±1.57 / 59.75 ms │     no change │
│ QQuery 10 │        63.41 / 64.08 ±0.37 / 64.52 ms │        63.14 / 63.71 ±0.41 / 64.43 ms │     no change │
│ QQuery 11 │     303.08 / 307.27 ±4.73 / 316.36 ms │     300.48 / 302.68 ±2.56 / 307.51 ms │     no change │
│ QQuery 12 │        28.65 / 29.06 ±0.45 / 29.87 ms │        28.61 / 28.90 ±0.40 / 29.67 ms │     no change │
│ QQuery 13 │     119.97 / 120.80 ±0.81 / 122.03 ms │     117.93 / 119.66 ±2.33 / 124.22 ms │     no change │
│ QQuery 14 │     414.44 / 416.76 ±3.20 / 422.99 ms │     410.32 / 415.11 ±2.81 / 418.24 ms │     no change │
│ QQuery 15 │        57.90 / 58.28 ±0.26 / 58.61 ms │        58.30 / 58.94 ±0.56 / 59.59 ms │     no change │
│ QQuery 16 │           6.52 / 6.77 ±0.21 / 7.10 ms │           6.50 / 6.68 ±0.21 / 7.09 ms │     no change │
│ QQuery 17 │        80.51 / 81.84 ±1.85 / 85.50 ms │        80.21 / 80.91 ±1.02 / 82.92 ms │     no change │
│ QQuery 18 │     125.96 / 127.30 ±1.96 / 131.17 ms │     123.72 / 127.39 ±3.33 / 132.87 ms │     no change │
│ QQuery 19 │        42.52 / 42.68 ±0.15 / 42.86 ms │        41.83 / 42.17 ±0.29 / 42.55 ms │     no change │
│ QQuery 20 │        36.14 / 37.61 ±1.06 / 38.96 ms │        36.23 / 36.75 ±0.37 / 37.22 ms │     no change │
│ QQuery 21 │        17.75 / 18.01 ±0.25 / 18.42 ms │        17.55 / 17.98 ±0.41 / 18.55 ms │     no change │
│ QQuery 22 │        63.21 / 65.50 ±3.07 / 71.55 ms │        64.02 / 64.51 ±0.37 / 65.14 ms │     no change │
│ QQuery 23 │     343.17 / 347.56 ±4.58 / 355.07 ms │    345.86 / 355.91 ±16.68 / 389.10 ms │     no change │
│ QQuery 24 │     225.74 / 230.46 ±4.83 / 239.47 ms │     224.93 / 229.26 ±5.14 / 239.17 ms │     no change │
│ QQuery 25 │     111.13 / 112.14 ±1.11 / 114.21 ms │     110.88 / 111.34 ±0.31 / 111.65 ms │     no change │
│ QQuery 26 │        58.25 / 60.43 ±2.18 / 64.57 ms │        56.62 / 58.39 ±2.35 / 63.02 ms │     no change │
│ QQuery 27 │           6.21 / 6.31 ±0.15 / 6.60 ms │           6.24 / 6.36 ±0.16 / 6.66 ms │     no change │
│ QQuery 28 │        57.31 / 61.24 ±2.07 / 63.06 ms │        54.90 / 58.09 ±1.88 / 60.48 ms │ +1.05x faster │
│ QQuery 29 │      98.36 / 101.06 ±2.43 / 104.08 ms │        96.82 / 98.17 ±0.73 / 99.05 ms │     no change │
│ QQuery 30 │        32.70 / 33.63 ±0.72 / 34.54 ms │        33.27 / 35.68 ±3.36 / 42.24 ms │  1.06x slower │
│ QQuery 31 │     112.21 / 112.61 ±0.39 / 113.08 ms │     111.20 / 112.51 ±1.38 / 115.09 ms │     no change │
│ QQuery 32 │        20.50 / 20.86 ±0.35 / 21.44 ms │        20.56 / 20.79 ±0.20 / 21.12 ms │     no change │
│ QQuery 33 │        38.26 / 40.42 ±3.25 / 46.88 ms │        37.41 / 37.75 ±0.30 / 38.22 ms │ +1.07x faster │
│ QQuery 34 │         9.98 / 10.23 ±0.18 / 10.52 ms │         9.94 / 10.47 ±0.46 / 11.30 ms │     no change │
│ QQuery 35 │        73.51 / 74.74 ±2.18 / 79.09 ms │        72.40 / 73.70 ±0.85 / 74.85 ms │     no change │
│ QQuery 36 │           5.88 / 6.03 ±0.21 / 6.45 ms │           5.66 / 5.78 ±0.16 / 6.09 ms │     no change │
│ QQuery 37 │           6.86 / 6.96 ±0.05 / 7.01 ms │           6.84 / 6.96 ±0.13 / 7.21 ms │     no change │
│ QQuery 38 │        63.48 / 63.74 ±0.15 / 63.94 ms │        62.75 / 63.20 ±0.31 / 63.51 ms │     no change │
│ QQuery 39 │        91.11 / 91.40 ±0.23 / 91.78 ms │        90.46 / 91.32 ±0.66 / 92.12 ms │     no change │
│ QQuery 40 │        23.49 / 23.97 ±0.32 / 24.51 ms │        23.41 / 23.77 ±0.21 / 24.05 ms │     no change │
│ QQuery 41 │        11.63 / 11.72 ±0.14 / 12.01 ms │        11.48 / 11.68 ±0.14 / 11.92 ms │     no change │
│ QQuery 42 │        24.14 / 24.71 ±0.52 / 25.66 ms │        23.80 / 24.28 ±0.57 / 25.32 ms │     no change │
│ QQuery 43 │           4.94 / 5.06 ±0.12 / 5.28 ms │           4.91 / 5.04 ±0.14 / 5.29 ms │     no change │
│ QQuery 44 │           9.32 / 9.39 ±0.05 / 9.46 ms │           9.23 / 9.41 ±0.14 / 9.65 ms │     no change │
│ QQuery 45 │        38.36 / 38.59 ±0.20 / 38.93 ms │        38.40 / 39.24 ±0.50 / 39.85 ms │     no change │
│ QQuery 46 │        11.64 / 13.01 ±2.06 / 17.09 ms │        11.90 / 11.96 ±0.04 / 12.02 ms │ +1.09x faster │
│ QQuery 47 │     228.30 / 232.28 ±3.14 / 237.51 ms │     228.15 / 231.05 ±3.20 / 236.73 ms │     no change │
│ QQuery 48 │        96.82 / 97.86 ±0.64 / 98.67 ms │       96.32 / 98.50 ±1.58 / 100.14 ms │     no change │
│ QQuery 49 │        76.96 / 80.54 ±5.97 / 92.45 ms │        76.06 / 76.48 ±0.34 / 77.10 ms │ +1.05x faster │
│ QQuery 50 │        59.72 / 60.24 ±0.53 / 61.14 ms │        59.35 / 60.77 ±1.45 / 63.51 ms │     no change │
│ QQuery 51 │        92.21 / 93.75 ±0.97 / 95.16 ms │        91.04 / 93.22 ±2.15 / 97.10 ms │     no change │
│ QQuery 52 │        24.32 / 24.60 ±0.22 / 24.99 ms │        24.05 / 24.55 ±0.35 / 25.14 ms │     no change │
│ QQuery 53 │        29.53 / 31.52 ±3.18 / 37.85 ms │        29.71 / 30.25 ±0.50 / 31.07 ms │     no change │
│ QQuery 54 │        56.39 / 57.23 ±1.25 / 59.71 ms │        56.17 / 58.35 ±3.20 / 64.69 ms │     no change │
│ QQuery 55 │        23.71 / 24.30 ±0.60 / 25.45 ms │        23.47 / 23.74 ±0.23 / 24.11 ms │     no change │
│ QQuery 56 │        39.15 / 39.60 ±0.31 / 39.98 ms │        38.77 / 39.69 ±0.51 / 40.26 ms │     no change │
│ QQuery 57 │     177.18 / 179.16 ±2.00 / 182.87 ms │     177.69 / 180.34 ±3.06 / 185.19 ms │     no change │
│ QQuery 58 │     115.39 / 119.38 ±2.70 / 122.78 ms │     114.42 / 117.20 ±2.46 / 121.42 ms │     no change │
│ QQuery 59 │     119.50 / 119.85 ±0.26 / 120.10 ms │     118.52 / 119.25 ±0.58 / 120.14 ms │     no change │
│ QQuery 60 │        39.78 / 40.30 ±0.39 / 40.72 ms │        39.40 / 39.99 ±0.31 / 40.29 ms │     no change │
│ QQuery 61 │        12.30 / 13.62 ±2.28 / 18.18 ms │        12.05 / 12.21 ±0.16 / 12.51 ms │ +1.12x faster │
│ QQuery 62 │        46.98 / 47.68 ±1.14 / 49.96 ms │        46.71 / 48.67 ±3.35 / 55.36 ms │     no change │
│ QQuery 63 │        29.59 / 30.08 ±0.38 / 30.53 ms │        29.59 / 30.54 ±1.27 / 32.99 ms │     no change │
│ QQuery 64 │     409.85 / 412.99 ±3.94 / 420.08 ms │     411.19 / 413.52 ±1.87 / 416.00 ms │     no change │
│ QQuery 65 │     150.92 / 153.67 ±3.71 / 160.85 ms │     150.96 / 153.56 ±2.11 / 156.52 ms │     no change │
│ QQuery 66 │        80.94 / 82.06 ±0.91 / 83.27 ms │        80.91 / 82.55 ±1.66 / 85.65 ms │     no change │
│ QQuery 67 │     241.97 / 244.75 ±1.77 / 246.89 ms │     244.21 / 249.40 ±4.07 / 256.49 ms │     no change │
│ QQuery 68 │        11.82 / 11.97 ±0.17 / 12.29 ms │        11.85 / 12.06 ±0.20 / 12.44 ms │     no change │
│ QQuery 69 │        58.10 / 59.00 ±0.89 / 60.54 ms │        57.62 / 59.52 ±3.41 / 66.34 ms │     no change │
│ QQuery 70 │     107.65 / 108.60 ±1.25 / 111.06 ms │     107.44 / 110.59 ±3.74 / 116.80 ms │     no change │
│ QQuery 71 │        35.24 / 35.65 ±0.25 / 36.03 ms │        34.99 / 35.91 ±0.91 / 37.66 ms │     no change │
│ QQuery 72 │ 2185.29 / 2243.34 ±45.21 / 2292.45 ms │ 2090.36 / 2167.65 ±73.76 / 2281.74 ms │     no change │
│ QQuery 73 │         9.63 / 10.06 ±0.37 / 10.73 ms │         9.82 / 10.26 ±0.45 / 11.04 ms │     no change │
│ QQuery 74 │     170.58 / 174.10 ±2.84 / 179.17 ms │     175.24 / 179.05 ±3.81 / 185.81 ms │     no change │
│ QQuery 75 │     149.46 / 150.47 ±1.36 / 152.99 ms │     148.79 / 151.51 ±3.46 / 158.33 ms │     no change │
│ QQuery 76 │        35.68 / 36.04 ±0.26 / 36.36 ms │        35.23 / 37.15 ±2.33 / 41.69 ms │     no change │
│ QQuery 77 │        60.89 / 61.73 ±0.67 / 62.63 ms │        61.29 / 61.72 ±0.59 / 62.85 ms │     no change │
│ QQuery 78 │     196.34 / 200.64 ±2.42 / 202.97 ms │     198.54 / 201.89 ±3.06 / 206.53 ms │     no change │
│ QQuery 79 │        67.23 / 68.20 ±1.14 / 70.39 ms │        67.76 / 70.00 ±2.40 / 74.24 ms │     no change │
│ QQuery 80 │     100.43 / 104.59 ±3.79 / 110.05 ms │      99.47 / 102.97 ±2.84 / 107.73 ms │     no change │
│ QQuery 81 │        26.41 / 27.00 ±0.38 / 27.56 ms │        25.88 / 26.50 ±0.42 / 27.01 ms │     no change │
│ QQuery 82 │        16.76 / 17.05 ±0.32 / 17.59 ms │        16.28 / 16.79 ±0.33 / 17.19 ms │     no change │
│ QQuery 83 │        40.37 / 41.46 ±1.49 / 44.41 ms │        40.29 / 41.86 ±2.78 / 47.39 ms │     no change │
│ QQuery 84 │        30.77 / 30.96 ±0.24 / 31.42 ms │        30.66 / 31.30 ±0.86 / 32.99 ms │     no change │
│ QQuery 85 │     107.55 / 111.85 ±3.77 / 117.87 ms │     107.40 / 109.34 ±2.51 / 114.20 ms │     no change │
│ QQuery 86 │        25.81 / 26.15 ±0.23 / 26.51 ms │        25.36 / 25.82 ±0.42 / 26.48 ms │     no change │
│ QQuery 87 │        64.64 / 65.45 ±0.68 / 66.68 ms │        64.10 / 66.55 ±3.63 / 73.71 ms │     no change │
│ QQuery 88 │        63.89 / 66.24 ±3.87 / 73.96 ms │        63.56 / 64.03 ±0.51 / 64.97 ms │     no change │
│ QQuery 89 │        36.44 / 37.62 ±0.73 / 38.57 ms │        36.13 / 36.59 ±0.39 / 37.11 ms │     no change │
│ QQuery 90 │        17.58 / 17.71 ±0.16 / 18.02 ms │        16.97 / 17.13 ±0.19 / 17.51 ms │     no change │
│ QQuery 91 │        46.63 / 47.41 ±0.50 / 48.08 ms │        46.22 / 47.75 ±1.61 / 50.61 ms │     no change │
│ QQuery 92 │        30.00 / 30.54 ±0.31 / 30.91 ms │        29.21 / 30.37 ±1.72 / 33.78 ms │     no change │
│ QQuery 93 │        49.81 / 50.84 ±0.80 / 52.20 ms │        50.10 / 51.55 ±0.91 / 52.73 ms │     no change │
│ QQuery 94 │        38.63 / 41.07 ±2.12 / 44.33 ms │        38.57 / 38.85 ±0.29 / 39.30 ms │ +1.06x faster │
│ QQuery 95 │        82.10 / 83.68 ±1.73 / 86.40 ms │        80.19 / 81.97 ±1.36 / 83.91 ms │     no change │
│ QQuery 96 │        24.56 / 24.78 ±0.15 / 24.99 ms │        24.17 / 24.89 ±1.17 / 27.21 ms │     no change │
│ QQuery 97 │        47.29 / 47.40 ±0.11 / 47.56 ms │        47.35 / 47.86 ±0.47 / 48.61 ms │     no change │
│ QQuery 98 │        43.06 / 44.43 ±1.50 / 47.23 ms │        43.67 / 44.20 ±0.51 / 45.11 ms │     no change │
│ QQuery 99 │        71.11 / 71.46 ±0.35 / 72.08 ms │        70.42 / 71.85 ±1.59 / 74.92 ms │     no change │
└───────────┴───────────────────────────────────────┴───────────────────────────────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary                     ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)                     │ 10107.03ms │
│ Total Time (lift-selectivity-stats)   │ 10013.69ms │
│ Average Time (HEAD)                   │   102.09ms │
│ Average Time (lift-selectivity-stats) │   101.15ms │
│ Queries Faster                        │          6 │
│ Queries Slower                        │          1 │
│ Queries with No Change                │         92 │
│ Queries with Failure                  │          0 │
└───────────────────────────────────────┴────────────┘

Resource Usage

tpcds — base (merge-base)

Metric	Value
Wall time	55.0s
Peak memory	2.1 GiB
Avg memory	1.5 GiB
CPU user	223.2s
CPU sys	5.8s
Peak spill	0 B

tpcds — branch

Metric	Value
Wall time	55.0s
Peak memory	2.2 GiB
Avg memory	1.5 GiB
CPU user	230.7s
CPU sys	6.1s
Peak spill	0 B

File an issue against this benchmark runner

adriangbot · 2026-07-13T12:09:29Z

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Details

Comparing HEAD and lift-selectivity-stats
--------------------
Benchmark clickbench_partitioned.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query     ┃                                  HEAD ┃                lift-selectivity-stats ┃        Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 0  │          1.23 / 3.94 ±5.34 / 14.62 ms │          1.21 / 3.89 ±5.31 / 14.50 ms │     no change │
│ QQuery 1  │        12.78 / 13.03 ±0.16 / 13.23 ms │        13.25 / 13.45 ±0.12 / 13.56 ms │     no change │
│ QQuery 2  │        35.97 / 36.29 ±0.29 / 36.65 ms │        36.95 / 37.14 ±0.21 / 37.54 ms │     no change │
│ QQuery 3  │        30.45 / 31.23 ±0.85 / 32.88 ms │        30.84 / 31.23 ±0.44 / 32.08 ms │     no change │
│ QQuery 4  │     220.17 / 223.14 ±1.90 / 225.23 ms │     220.53 / 222.93 ±2.28 / 227.25 ms │     no change │
│ QQuery 5  │     267.26 / 270.07 ±1.85 / 272.78 ms │     267.85 / 272.21 ±2.58 / 275.05 ms │     no change │
│ QQuery 6  │           1.23 / 1.39 ±0.23 / 1.85 ms │           1.26 / 1.41 ±0.22 / 1.85 ms │     no change │
│ QQuery 7  │        14.01 / 14.15 ±0.08 / 14.22 ms │        14.60 / 14.67 ±0.07 / 14.79 ms │     no change │
│ QQuery 8  │     318.93 / 321.97 ±2.10 / 325.06 ms │     319.34 / 323.94 ±3.26 / 328.36 ms │     no change │
│ QQuery 9  │     456.23 / 460.96 ±3.49 / 465.95 ms │     452.61 / 461.68 ±7.57 / 472.43 ms │     no change │
│ QQuery 10 │        69.77 / 70.58 ±0.65 / 71.39 ms │        70.07 / 71.01 ±0.61 / 71.67 ms │     no change │
│ QQuery 11 │        80.77 / 81.60 ±0.60 / 82.51 ms │        81.99 / 82.81 ±0.44 / 83.22 ms │     no change │
│ QQuery 12 │     264.57 / 269.42 ±4.94 / 277.99 ms │     263.72 / 272.29 ±7.73 / 285.52 ms │     no change │
│ QQuery 13 │     362.90 / 369.68 ±5.86 / 379.20 ms │    367.33 / 383.89 ±11.51 / 402.53 ms │     no change │
│ QQuery 14 │     279.83 / 282.86 ±2.55 / 285.89 ms │    280.71 / 293.16 ±12.02 / 313.72 ms │     no change │
│ QQuery 15 │     263.40 / 270.71 ±5.04 / 278.88 ms │    269.35 / 282.77 ±12.32 / 305.53 ms │     no change │
│ QQuery 16 │     599.91 / 608.29 ±5.71 / 615.96 ms │     608.14 / 613.89 ±4.66 / 620.36 ms │     no change │
│ QQuery 17 │     602.97 / 611.82 ±5.14 / 619.01 ms │    611.74 / 629.60 ±12.33 / 649.82 ms │     no change │
│ QQuery 18 │ 1235.16 / 1264.08 ±19.06 / 1292.02 ms │ 1243.95 / 1273.67 ±25.42 / 1310.16 ms │     no change │
│ QQuery 19 │        27.77 / 32.55 ±9.19 / 50.94 ms │       28.52 / 41.98 ±18.63 / 76.95 ms │  1.29x slower │
│ QQuery 20 │     512.62 / 519.75 ±8.34 / 536.05 ms │     514.12 / 523.68 ±7.38 / 531.10 ms │     no change │
│ QQuery 21 │     510.33 / 513.66 ±2.66 / 516.55 ms │     519.22 / 524.57 ±4.15 / 529.21 ms │     no change │
│ QQuery 22 │   972.57 / 984.82 ±12.53 / 1008.57 ms │     986.90 / 992.72 ±4.73 / 997.12 ms │     no change │
│ QQuery 23 │ 3068.08 / 3089.64 ±15.49 / 3113.55 ms │ 3088.62 / 3112.49 ±26.97 / 3162.56 ms │     no change │
│ QQuery 24 │        41.19 / 42.79 ±1.87 / 46.23 ms │        42.79 / 53.68 ±9.12 / 64.48 ms │  1.25x slower │
│ QQuery 25 │     112.25 / 118.43 ±9.87 / 138.04 ms │     111.64 / 114.65 ±4.69 / 124.00 ms │     no change │
│ QQuery 26 │        41.55 / 43.84 ±2.91 / 49.52 ms │        41.84 / 46.04 ±6.52 / 59.00 ms │  1.05x slower │
│ QQuery 27 │     662.44 / 669.09 ±5.77 / 678.78 ms │     667.35 / 672.13 ±2.95 / 675.85 ms │     no change │
│ QQuery 28 │ 3032.45 / 3053.65 ±15.56 / 3077.88 ms │ 2997.86 / 3015.41 ±12.60 / 3034.49 ms │     no change │
│ QQuery 29 │        40.66 / 41.05 ±0.29 / 41.56 ms │        41.39 / 44.59 ±5.56 / 55.68 ms │  1.09x slower │
│ QQuery 30 │     303.43 / 311.01 ±6.84 / 320.80 ms │     299.12 / 310.78 ±7.57 / 317.23 ms │     no change │
│ QQuery 31 │    281.08 / 298.04 ±14.84 / 325.65 ms │     290.40 / 297.64 ±6.11 / 308.34 ms │     no change │
│ QQuery 32 │     929.27 / 937.60 ±6.18 / 945.27 ms │    928.62 / 951.98 ±25.56 / 985.12 ms │     no change │
│ QQuery 33 │ 1436.28 / 1454.23 ±12.49 / 1474.95 ms │ 1436.13 / 1465.17 ±21.55 / 1500.05 ms │     no change │
│ QQuery 34 │ 1466.69 / 1495.68 ±18.72 / 1522.14 ms │ 1426.94 / 1516.36 ±77.49 / 1661.25 ms │     no change │
│ QQuery 35 │    273.56 / 291.85 ±25.74 / 343.00 ms │    278.59 / 304.04 ±46.29 / 396.57 ms │     no change │
│ QQuery 36 │        65.63 / 69.50 ±3.60 / 74.74 ms │        64.99 / 66.74 ±1.56 / 69.34 ms │     no change │
│ QQuery 37 │        35.49 / 37.77 ±2.63 / 41.57 ms │       36.09 / 47.04 ±10.43 / 59.90 ms │  1.25x slower │
│ QQuery 38 │        40.28 / 46.10 ±4.59 / 53.16 ms │        42.49 / 44.09 ±1.36 / 45.83 ms │     no change │
│ QQuery 39 │    141.00 / 149.82 ±12.04 / 172.14 ms │     146.39 / 156.86 ±7.60 / 169.47 ms │     no change │
│ QQuery 40 │        14.55 / 17.65 ±5.43 / 28.47 ms │        14.32 / 14.91 ±0.54 / 15.66 ms │ +1.18x faster │
│ QQuery 41 │        13.60 / 13.78 ±0.15 / 14.05 ms │        13.87 / 14.12 ±0.14 / 14.29 ms │     no change │
│ QQuery 42 │        13.10 / 15.50 ±3.05 / 20.74 ms │        13.15 / 13.42 ±0.24 / 13.82 ms │ +1.16x faster │
└───────────┴───────────────────────────────────────┴───────────────────────────────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary                     ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)                     │ 19453.03ms │
│ Total Time (lift-selectivity-stats)   │ 19630.72ms │
│ Average Time (HEAD)                   │   452.40ms │
│ Average Time (lift-selectivity-stats) │   456.53ms │
│ Queries Faster                        │          2 │
│ Queries Slower                        │          5 │
│ Queries with No Change                │         36 │
│ Queries with Failure                  │          0 │
└───────────────────────────────────────┴────────────┘

Resource Usage

clickbench_partitioned — base (merge-base)

Metric	Value
Wall time	100.0s
Peak memory	11.1 GiB
Avg memory	4.6 GiB
CPU user	1001.9s
CPU sys	66.9s
Peak spill	0 B

clickbench_partitioned — branch

Metric	Value
Wall time	100.0s
Peak memory	11.6 GiB
Avg memory	4.6 GiB
CPU user	1004.9s
CPU sys	69.2s
Peak spill	0 B

File an issue against this benchmark runner

adriangbot · 2026-07-13T12:10:34Z

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)

Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Details

Comparing HEAD and lift-selectivity-stats
--------------------
Benchmark clickbench_partitioned.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query     ┃                                  HEAD ┃                lift-selectivity-stats ┃        Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 0  │          1.23 / 4.01 ±5.46 / 14.92 ms │          1.21 / 4.02 ±5.49 / 14.99 ms │     no change │
│ QQuery 1  │        12.43 / 12.85 ±0.25 / 13.15 ms │        13.30 / 13.49 ±0.22 / 13.90 ms │     no change │
│ QQuery 2  │        36.19 / 36.53 ±0.32 / 36.98 ms │        36.84 / 37.17 ±0.27 / 37.65 ms │     no change │
│ QQuery 3  │        30.58 / 31.37 ±0.79 / 32.81 ms │        31.34 / 31.83 ±0.46 / 32.45 ms │     no change │
│ QQuery 4  │     225.92 / 228.06 ±1.40 / 230.06 ms │     227.27 / 233.49 ±3.78 / 237.83 ms │     no change │
│ QQuery 5  │     274.42 / 276.18 ±1.19 / 277.95 ms │     277.28 / 279.12 ±1.78 / 282.06 ms │     no change │
│ QQuery 6  │           1.23 / 1.39 ±0.23 / 1.84 ms │           1.24 / 1.40 ±0.22 / 1.82 ms │     no change │
│ QQuery 7  │        13.93 / 14.02 ±0.11 / 14.24 ms │        14.64 / 14.78 ±0.11 / 14.94 ms │  1.05x slower │
│ QQuery 8  │     327.75 / 332.59 ±2.44 / 334.06 ms │     329.66 / 335.14 ±4.19 / 341.66 ms │     no change │
│ QQuery 9  │     461.21 / 467.36 ±5.03 / 474.08 ms │     475.34 / 484.33 ±8.74 / 499.48 ms │     no change │
│ QQuery 10 │        69.14 / 71.07 ±1.54 / 73.78 ms │        70.37 / 74.76 ±7.21 / 89.15 ms │  1.05x slower │
│ QQuery 11 │        81.07 / 81.91 ±0.85 / 82.95 ms │        83.15 / 84.56 ±1.15 / 85.91 ms │     no change │
│ QQuery 12 │     266.51 / 272.32 ±4.89 / 280.93 ms │     278.98 / 291.14 ±9.21 / 302.52 ms │  1.07x slower │
│ QQuery 13 │    366.54 / 383.58 ±19.09 / 415.29 ms │    388.79 / 406.73 ±12.14 / 424.16 ms │  1.06x slower │
│ QQuery 14 │     339.25 / 344.05 ±4.22 / 351.78 ms │     297.65 / 302.08 ±2.82 / 305.91 ms │ +1.14x faster │
│ QQuery 15 │    290.90 / 342.47 ±27.78 / 368.97 ms │    275.96 / 292.38 ±13.69 / 316.59 ms │ +1.17x faster │
│ QQuery 16 │     618.31 / 627.38 ±6.82 / 635.27 ms │    622.77 / 635.40 ±11.47 / 655.80 ms │     no change │
│ QQuery 17 │     627.02 / 636.40 ±7.85 / 648.90 ms │     627.68 / 641.01 ±9.59 / 654.80 ms │     no change │
│ QQuery 18 │ 1256.95 / 1277.68 ±19.76 / 1314.40 ms │ 1282.76 / 1308.32 ±24.24 / 1352.72 ms │     no change │
│ QQuery 19 │        27.74 / 28.24 ±0.36 / 28.65 ms │        28.41 / 28.65 ±0.28 / 29.18 ms │     no change │
│ QQuery 20 │    511.07 / 526.05 ±14.74 / 547.67 ms │     518.59 / 528.71 ±7.85 / 540.14 ms │     no change │
│ QQuery 21 │     517.58 / 519.33 ±1.73 / 521.74 ms │     520.53 / 523.64 ±2.77 / 528.16 ms │     no change │
│ QQuery 22 │    991.31 / 997.86 ±9.65 / 1016.97 ms │   987.33 / 1000.29 ±7.18 / 1008.26 ms │     no change │
│ QQuery 23 │ 3111.36 / 3133.92 ±25.67 / 3178.18 ms │ 3093.24 / 3138.53 ±27.66 / 3171.95 ms │     no change │
│ QQuery 24 │        41.67 / 42.73 ±1.08 / 44.58 ms │        41.94 / 43.19 ±1.48 / 46.01 ms │     no change │
│ QQuery 25 │     111.11 / 112.11 ±0.78 / 113.52 ms │     112.61 / 115.55 ±4.02 / 123.32 ms │     no change │
│ QQuery 26 │        42.17 / 43.00 ±0.64 / 43.83 ms │        42.49 / 43.99 ±1.84 / 47.57 ms │     no change │
│ QQuery 27 │     671.02 / 676.34 ±8.71 / 693.67 ms │     674.47 / 678.38 ±3.91 / 684.97 ms │     no change │
│ QQuery 28 │ 3019.87 / 3052.76 ±30.43 / 3098.80 ms │ 3026.10 / 3053.39 ±16.03 / 3072.85 ms │     no change │
│ QQuery 29 │        41.32 / 45.53 ±7.95 / 61.43 ms │       41.90 / 57.84 ±19.88 / 95.94 ms │  1.27x slower │
│ QQuery 30 │     303.03 / 310.59 ±5.77 / 317.64 ms │    309.11 / 324.69 ±15.56 / 353.76 ms │     no change │
│ QQuery 31 │     286.06 / 294.69 ±7.31 / 306.80 ms │    281.60 / 300.31 ±12.16 / 319.29 ms │     no change │
│ QQuery 32 │    961.81 / 973.60 ±10.03 / 988.54 ms │   952.88 / 984.12 ±17.52 / 1001.26 ms │     no change │
│ QQuery 33 │ 1463.51 / 1488.38 ±19.00 / 1512.67 ms │ 1480.00 / 1501.47 ±23.53 / 1544.62 ms │     no change │
│ QQuery 34 │ 1515.68 / 1527.82 ±10.23 / 1546.71 ms │ 1531.53 / 1590.15 ±56.49 / 1671.89 ms │     no change │
│ QQuery 35 │    286.11 / 330.33 ±80.66 / 491.51 ms │    283.90 / 329.20 ±60.50 / 447.03 ms │     no change │
│ QQuery 36 │        65.95 / 69.30 ±3.58 / 76.14 ms │        67.20 / 78.39 ±9.33 / 92.45 ms │  1.13x slower │
│ QQuery 37 │        35.68 / 39.33 ±3.85 / 44.17 ms │        35.83 / 36.43 ±0.54 / 37.35 ms │ +1.08x faster │
│ QQuery 38 │        42.24 / 49.15 ±7.02 / 59.51 ms │        40.26 / 41.37 ±0.79 / 42.46 ms │ +1.19x faster │
│ QQuery 39 │     145.11 / 152.46 ±5.55 / 161.06 ms │    152.31 / 162.38 ±10.00 / 181.37 ms │  1.07x slower │
│ QQuery 40 │        14.10 / 19.70 ±7.19 / 31.60 ms │        14.25 / 14.42 ±0.27 / 14.95 ms │ +1.37x faster │
│ QQuery 41 │        13.94 / 14.56 ±0.62 / 15.39 ms │        14.27 / 14.40 ±0.13 / 14.63 ms │     no change │
│ QQuery 42 │        13.55 / 16.60 ±3.60 / 21.11 ms │        13.40 / 14.19 ±1.42 / 17.03 ms │ +1.17x faster │
└───────────┴───────────────────────────────────────┴───────────────────────────────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary                     ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)                     │ 19905.60ms │
│ Total Time (lift-selectivity-stats)   │ 20074.85ms │
│ Average Time (HEAD)                   │   462.92ms │
│ Average Time (lift-selectivity-stats) │   466.86ms │
│ Queries Faster                        │          6 │
│ Queries Slower                        │          7 │
│ Queries with No Change                │         30 │
│ Queries with Failure                  │          0 │
└───────────────────────────────────────┴────────────┘

Resource Usage

clickbench_partitioned — base (merge-base)

Metric	Value
Wall time	100.0s
Peak memory	11.6 GiB
Avg memory	4.8 GiB
CPU user	1025.3s
CPU sys	69.2s
Peak spill	0 B

clickbench_partitioned — branch

Metric	Value
Wall time	105.0s
Peak memory	10.6 GiB
Avg memory	4.1 GiB
CPU user	1025.5s
CPU sys	72.0s
Peak spill	0 B

File an issue against this benchmark runner

alamb

Thank you @adriangb and @xudong963

I mostly focused on the API and description of this PR, rather than the details of the adaptive implementation because as @adriangb says I think we expect the implementation to evolve over time.

I think there are two main points we should resolve:

PhysicalExpr?

Should this adaptive AND evaluator be its own structure or should it be part of the normal AND evaluation in BinaryExpr ?

I think it is easier to keep it separate initially, but longer term I think it would be great if we had only one bit of code to evlaute ANDs -- otherwise I fear trying to understand the behavior over time / integrate it will be quite challenging

State sharing / Distributed execution

As this code relies on in-memory state sharing, I worry that it will cause (more) problems with distributed engines like Ballista. Do we have some way to coordinate / manage such shared state? Will it be a problem

alamb · 2026-07-13T11:56:30Z

+        /// (experimental) When enabled, `FilterExec` adaptively reorders the
+        /// conjuncts of a conjunctive predicate at runtime. It measures each
+        /// conjunct's selectivity and evaluation cost on the rows that reach it
+        /// and runs the conjuncts that discard the most rows per unit of CPU
+        /// time first, so cheap-and-selective predicates gate expensive ones.
+        /// This never changes query results, but it is off by default because
+        /// it can change observable side effects of fallible predicates (even
+        /// when no reorder is adopted): while measuring, and after a reorder,
+        /// conjuncts are evaluated only on the rows that survived the conjuncts
+        /// before them, so an error the fused predicate would have raised on an
+        /// already-filtered row (e.g. `b <> 0 AND 1/b > 2` evaluating `1/b` on
+        /// every row) may not occur. Predicates containing volatile expressions
+        /// are never reordered.


The comment says "off by default because it can change observable side effects of fallible predicates"

Does this mean we can never turn it on by default? What do you think about using it by default but disabling it for fallable predicates -- perhaps we can reuse the 'short_circuits' logic in https://docs.rs/datafusion/latest/datafusion/logical_expr/trait.ScalarUDFImpl.html#method.short_circuits

I think we can make this description significantly more concise by omitting some internal details. Something like

Suggested change

/// (experimental) When enabled, `FilterExec` adaptively reorders the

/// conjuncts of a conjunctive predicate at runtime. It measures each

/// conjunct's selectivity and evaluation cost on the rows that reach it

/// and runs the conjuncts that discard the most rows per unit of CPU

/// time first, so cheap-and-selective predicates gate expensive ones.

/// This never changes query results, but it is off by default because

/// it can change observable side effects of fallible predicates (even

/// when no reorder is adopted): while measuring, and after a reorder,

/// conjuncts are evaluated only on the rows that survived the conjuncts

/// before them, so an error the fused predicate would have raised on an

/// already-filtered row (e.g. `b <> 0 AND 1/b > 2` evaluating `1/b` on

/// every row) may not occur. Predicates containing volatile expressions

/// are never reordered.

/// (experimental) When enabled, `FilterExec` adaptively reorders the

/// conjuncts of a conjunctive predicate at runtime by measuring each

/// conjunct's selectivity and evaluation cost. Off by default because

/// it can change observable side effects of fallible predicates

/// (e.g. reordering `b <> 0 AND 1/b > 2` may change if/when `1/b` results in a divide by zero

/// error. Predicates containing volatile expressions

/// are never reordered.

alamb · 2026-07-13T11:57:15Z

+//!
+//! Predicate evaluation order matters: a selective predicate run first gates
+//! the work of the predicates after it. DataFusion's `BinaryExpr` `AND`
+//! short-circuit only gates on the *leftmost* conjunct, so a conjunction whose


nit: it may help here to add a doc link to the And implementation

alamb · 2026-07-13T12:01:55Z

+//! non-selective conjuncts costs only cheap bitwise `AND`s, while a selective
+//! conjunct shrinks the batch the conjuncts after it must decode. This
+//! compaction is what makes ordering pay off (and is itself a win even without
+//! reordering): a left-deep fused `BinaryExpr` `AND` does *not* compact between


I don't understand what this left-deep fused BinaryExpr is -- what is fused? I kind of understand the compaction comment.

alamb · 2026-07-13T12:05:19Z

+//! warm-up — which is what makes the win materialise when each stream is only a
+//! handful of batches long.
+//!
+//! It is **off by default**


I recommend just mentioning this behavior is controlled by the flag and leave the documentation of the default value on the flag itself (otherwise this can get out of date if we change the deafult value)

alamb · 2026-07-13T12:05:56Z

+//! A `FilterExec` is split across many partition streams, each seeing only a
+//! slice of the data. Measurements are pooled into a shared
+//! [`AdaptiveFilterShared`] so the streams learn as one: the first stream to
+//! accumulate enough samples settles the order and publishes it, and the others


I think @andygrove and @milenkovicm hit issues with ballista when the execution plans are run on multiple nodes (so in memory sharing doesn't work as well). We should probably make sure this mechanism will work with that system

alamb · 2026-07-13T12:08:28Z

+02)--AggregateExec: mode=Final, gby=[], aggr=[count(Int64(1))]
+03)----CoalescePartitionsExec
+04)------AggregateExec: mode=Partial, gby=[], aggr=[count(Int64(1))]
+05)--------FilterExec: b@0 = 3 AND s@1 LIKE 1%, projection=[]


what do you think about adding some sort of metric to the FilterExec that would indicate that the adaptive filter reordering was being applied?

alamb · 2026-07-13T12:11:07Z

-                    let status = self.predicate.as_ref()
-                        .evaluate(&batch)
-                        .and_then(|v| v.into_array(batch.num_rows()))
+                    let array = match self.adaptive.as_mut() {


Would it be simpler if the adaptive conjunction also implemented PhysicalExpr ? That way there would be no change to the runtime path of FilterExec, and we could potentially reuse AdaptiveConjunction anywhere physical exprs were used

alamb · 2026-07-13T12:11:23Z

+//! Predicate evaluation order matters: a selective predicate run first gates
+//! the work of the predicates after it. DataFusion's `BinaryExpr` `AND`
+//! short-circuit only gates on the *leftmost* conjunct, so a conjunction whose
+//! selective member is written last (e.g.


Another suggestion would be to expand this example more explicitly

Something like

For example, given a predicate like

regexp_like(s,'a') AND … AND regexp_like(s,'rare')

where the regexp_like functions are expensive to evaluate and
regexp_like(s,'rare')) filters out many more rows thanregexp_like(s,'a') `
this module will reorder the evaluation so the more selective predicate is first.

regexp_like(s,'rare') AND … ANDregexp_like(s,'a')

alamb · 2026-07-13T12:11:59Z

See comment below -- I recommend we make this a PhysicalExpr (and put it in physical-expr) rather than in physical-plan

alamb · 2026-07-13T12:17:07Z

+/// materialization cost. Unlike a fused `BinaryExpr` chain, survivors stay
+/// compacted across the remaining conjuncts instead of being re-evaluated on
+/// every row.
+fn eval_conjuncts(


This feels fairly similar to the code in evaluation of binary_expr: https://github.com/apache/datafusion/blob/b79d3965b7a2c29853e83def04656408d9d7ee7f/datafusion/physical-expr/src/expressions/binary.rs#L295-L294

(the ShortCircuitStrategy in particular)

xudong963

For me, the core issue is implementing a second conjunction evaluation engine inside physical-plan. That duplicates existing BinaryExpr functionality, including null handling, batch filtering, short-circuiting, and scattering. It also forces a special branch into FilterExecStream.

How about retaining the learner but putting the adaptive conjunction in physical-expr? Once an order is chosen, rebuild a normal left-deep conjunction in that order and let BinaryExpr evaluate it. If the warm-up requires reusable selection logic, extract that logic from BinaryExpr rather than maintaining two implementations.

github-actions Bot added documentation Improvements or additions to documentation physical-expr Changes to the physical-expr crates sqllogictest SQL Logic Tests (.slt) common Related to common crate physical-plan Changes to the physical-plan crate labels Jun 1, 2026

This comment has been minimized.

Sign in to view

github-actions Bot added the auto detected api change Auto detected API change label Jun 1, 2026

This comment has been minimized.

Sign in to view

alamb mentioned this pull request Jun 1, 2026

perf: Reorder predicates in conjuncts via simple heuristic #22343

Merged

This comment has been minimized.

Sign in to view

adriangb force-pushed the lift-selectivity-stats branch 3 times, most recently from a24471d to 4d7b733 Compare June 2, 2026 02:29

adriangb and others added 2 commits July 10, 2026 13:00

adriangb force-pushed the lift-selectivity-stats branch from a456356 to 511ed51 Compare July 10, 2026 18:15

adriangb and others added 4 commits July 10, 2026 16:59

adriangb force-pushed the lift-selectivity-stats branch from 511ed51 to b42f823 Compare July 10, 2026 22:00

adriangb force-pushed the lift-selectivity-stats branch from 49dd3d8 to a991d38 Compare July 10, 2026 22:16

alamb reviewed Jul 13, 2026

View reviewed changes

xudong963 self-requested a review July 14, 2026 03:40

alamb added the performance Make DataFusion faster label Jul 16, 2026

xudong963 reviewed Jul 20, 2026

View reviewed changes

Uh oh!

Conversation

adriangb commented Jun 1, 2026 • edited by alamb Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

github-actions Bot commented Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

adriangb commented Jul 10, 2026

Uh oh!

alamb commented Jul 12, 2026

Uh oh!

alamb commented Jul 13, 2026

Uh oh!

alamb commented Jul 13, 2026

Uh oh!

adriangbot commented Jul 13, 2026

Uh oh!

adriangbot commented Jul 13, 2026

Uh oh!

adriangbot commented Jul 13, 2026

Uh oh!

adriangbot commented Jul 13, 2026

Uh oh!

adriangbot commented Jul 13, 2026

Uh oh!

adriangbot commented Jul 13, 2026

Uh oh!

adriangbot commented Jul 13, 2026

Uh oh!

adriangbot commented Jul 13, 2026

Uh oh!

adriangbot commented Jul 13, 2026

Uh oh!

adriangbot commented Jul 13, 2026

Uh oh!

adriangbot commented Jul 13, 2026

Uh oh!

adriangbot commented Jul 13, 2026

Uh oh!

alamb left a comment

Choose a reason for hiding this comment

PhysicalExpr?

State sharing / Distributed execution

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

adriangb commented Jun 1, 2026 •

edited by alamb

Loading

github-actions Bot commented Jun 1, 2026 •

edited

Loading