perf(exec): cache datasets opened from serialized protos in resolve_dataset by jmhsieh · Pull Request #7145 · lance-format/lance

jmhsieh · 2026-06-08T05:59:45Z

Problem

A distributed FilteredReadExec / ANN plan is serialized to proto and executed on remote workers. The serialized proto cannot carry a live Arc<Dataset>, so when a worker deserializes it, resolve_dataset() is called with dataset = None and falls through to open_dataset_from_table_identifier() → DatasetBuilder::from_uri(..).load() — a full cold open (ObjectStore init + manifest GET + Dataset construction), emitting one dataset_events event="loading" per plan.

There is no caching at this layer, so every distributed plan re-opens the dataset from scratch. On a many-fragment table that is thousands of redundant opens. In one production distributed backfill resume, a ~4,800-fragment table was cold-opened ~4,800 times (4,813 loading events), producing a ~3-hour planning wall in front of ~5 minutes of actual work — planning was ~36× longer than the work it gated, entirely from re-opens that carry no new information (the version is pinned and immutable).

Both the filtered-read path (io/exec/filtered_read_proto.rs) and the ANN paths (io/exec/ann_proto.rs) route through the same resolve_dataset, so both are affected.

Fix

Add a process-global, bounded moka cache in resolve_dataset, keyed by the immutable snapshot identity (uri, version, manifest_etag). Concurrent first-misses are coalesced into a single open via moka's single-flight try_get_with, so a wave of N plans collapses to one open per (uri, version, etag) per worker process.

A single change covers both the filtered-read and ANN paths since they share resolve_dataset.

Why this is safe

A pinned dataset version is an immutable snapshot, so reusing a cached Arc<Dataset> is always correct:

Version pinning / time-travel — the version is part of the key, so a read pinned to version V only ever sees the dataset opened at V.
Post-write invalidation — a write produces a new version (and a new manifest etag), i.e. a new key, rather than mutating an existing entry. The cache never serves stale data.
Storage options are intentionally excluded from the key. Credentials may be re-vended per plan, but the bytes identified by the etag are the same; a cached entry reuses the ObjectStore configured at first open. time_to_idle (5 min) bounds how long those first-open credentials are retained before an idle entry is dropped and re-opened.

Configuration

LANCE_PROTO_DATASET_CACHE_SIZE — max distinct (uri, version, etag) datasets cached per process (default 64). Set to 0 to disable caching entirely and restore the previous cold-open-per-plan behavior.
An info! log is emitted at the open (cache-miss) site so a heavy distributed plan is greppable without enabling lance::events tracing.

Tests

test_resolve_dataset_cached_returns_same_arc — a cache hit returns the same Arc, not a re-open.
test_resolve_dataset_cached_single_flight — 16 concurrent first-misses collapse to exactly one cached entry.
test_resolve_dataset_cached_distinct_versions — different versions never alias (version pinning preserved).
test_resolve_dataset_some_returns_passed_arc — the Some arm still passes the supplied handle through unchanged.
test_resolve_dataset_opens_once_across_plan_runs — deterministic, non-cloud repro: instruments the cache-miss opener (one opener call == one DatasetBuilder::load() == one manifest read) and asserts 50 sequential plans open the dataset once with the cache (and once under concurrency) vs 50 times without it.

Run with: cargo test -p lance --lib --features substrait io::exec::table_identifier

github-actions · 2026-06-08T06:00:04Z

ACTION NEEDED
Lance follows the Conventional Commits specification for release automation.

The PR title and description are used as the merge commit message. Please update your PR title and description to match the specification.

For details on the error please inspect the "PR Title Check" action.

codecov · 2026-06-08T06:48:24Z

Codecov Report

❌ Patch coverage is 89.57055% with 17 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
rust/lance/src/io/exec/table_identifier.rs	89.57%	16 Missing and 1 partial ⚠️

📢 Thoughts on this report? Let us know!

…ataset A distributed FilteredReadExec / ANN plan is serialized to proto and run on remote workers. The serialized proto cannot carry a live Arc<Dataset>, so resolve_dataset() falls through to a full cold open (ObjectStore init + manifest GET + Dataset construction) on every plan. With no caching at this layer, every distributed plan re-opens the dataset from scratch — on a many-fragment table that is thousands of redundant opens (one production distributed backfill cold-opened a ~4,800-fragment table ~4,800 times, producing a multi-hour planning wall in front of minutes of real work). Add a process-global, bounded moka cache in resolve_dataset keyed by the immutable snapshot identity (uri, version, manifest_etag). Concurrent first-misses coalesce into a single open via moka's single-flight try_get_with, so a wave of N plans collapses to one open per (uri, version, etag) per worker process. Both the filtered-read and ANN paths go through resolve_dataset, so one change covers both. Reusing a cached Arc<Dataset> for a pinned version is always safe: distinct versions/manifests get distinct keys, and a new write produces a new version (and etag) rather than mutating a cached entry, so the cache never serves stale data, breaks version pinning, or affects time-travel reads. Storage options are intentionally excluded from the key (credentials may be re-vended per plan, but the bytes identified by the etag are the same); a cached entry reuses the ObjectStore from first open, and time_to_idle bounds how long those credentials are retained. Tunable via LANCE_PROTO_DATASET_CACHE_SIZE (default 64; 0 disables and restores cold-open-per-plan). An info! log at the open site makes heavy distributed plans greppable without lance::events tracing. Tests: cache hit returns the same Arc; concurrent first-misses single-flight to one entry; distinct versions never alias (version pinning preserved); the Some-arm passthrough is unchanged; and a deterministic non-cloud repro (test_resolve_dataset_opens_once_across_plan_runs) instruments the cache-miss opener to prove 50 plan_runs open the dataset once with the cache (and once under concurrency) vs 50 times without it — one open == one manifest load. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Drop the internal ticket reference and deployment-specific numbers from the repro test's doc comment; keep the mechanism description generic. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

jmhsieh changed the title ~~lance dataset re loaded per distributed plan run rust side~~ bug: lance dataset re loaded per distributed plan run rust side Jun 8, 2026

jmhsieh changed the title ~~bug: lance dataset re loaded per distributed plan run rust side~~ perf(exec): cache datasets opened from serialized protos in resolve_dataset Jun 8, 2026

github-actions Bot added the performance label Jun 8, 2026

jmhsieh force-pushed the jon/gen-571-lance-dataset-re-loaded-per-distributed-plan_run-rust-side branch from 8f5415c to 8a1d3df Compare June 8, 2026 13:28

test(exec): genericize repro test doc comment

ba884fa

Drop the internal ticket reference and deployment-specific numbers from the repro test's doc comment; keep the mechanism description generic. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(exec): cache datasets opened from serialized protos in resolve_dataset#7145

perf(exec): cache datasets opened from serialized protos in resolve_dataset#7145
jmhsieh wants to merge 2 commits into
lance-format:mainfrom
jmhsieh:jon/gen-571-lance-dataset-re-loaded-per-distributed-plan_run-rust-side

jmhsieh commented Jun 8, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 8, 2026

Uh oh!

codecov Bot commented Jun 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

jmhsieh commented Jun 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Fix

Why this is safe

Configuration

Tests

Uh oh!

github-actions Bot commented Jun 8, 2026

Uh oh!

codecov Bot commented Jun 8, 2026

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

jmhsieh commented Jun 8, 2026 •

edited

Loading