Correctly purge the organization profile when updating an org's projects by woodruffw · Pull Request #19917 · pypi/warehouse

woodruffw · 2026-04-21T19:38:02Z

The bug here was actually kind of subtle: we use key_factory to build the purge keys, which can silently fail if the underlying attribute access fails (e.g. Project.organization returns None or raises AttributeError).

This in turn wouldn't happen in most normal cases, but can happen if we build the purge keys during after_flush, since at that point relationships like Project.organization haven't been materialized yet (if the Project was initialized with organization_id instead).

My original theory was wrong here.

The bug here stems from an interaction between flag_dirty and #19898. The basic problem is that add_organization_project calls organization.record_event, which adds events to the organization's committed_state. This in turn trips the optimization in #19898, since committed_state == {'events'}, causing us to skip the purges.

(committed_state is otherwise empty, since flag_dirty itself doesn't add anything else.)

Fixes #19911.

Signed-off-by: William Woodruff <william@astral.sh>

woodruffw · 2026-04-22T14:22:10Z

+            organization=self.db.get(Organization, organization_id),
+            project=self.db.get(Project, project_id),


Flagging: this is the core of the fix, but I'm also kind of unhappy with it (since now we're fetching an entire row from the DB). Maybe not a huge deal in context though, given that adding a project to an org isn't exactly a hot path operation.

If we don't watch to fetch the record, we can add a flag_dirty for the organization_project.project instead, but that also feels clunky.

Considering that the upstream caller already has both the Organization and Project objects, I think it's a better choice to pass the objects that we can then mutate, instead of working with IDs, and then we avoid some of the manual db.add,flush,flag_dirty, and let the ORM do its thing.

Signed-off-by: William Woodruff <william@astral.sh>

woodruffw · 2026-04-22T14:28:21Z

NB: This effectively "fixes" the purge behavior by bypassing the optimization in #19898 entirely. I'm not sure if that's the best approach 😅

miketheman

Thanks for digging in, and there's a few things inline - let me know if anything doesn't make sense.

miketheman · 2026-04-24T19:13:23Z

+    # Adding a project to an organization via `add_organization_project` must
+    # purge both the project profile (`project/{name}`) and the organization
+    # profile (`org/{name}`), otherwise the cached `/org/{orgname}` project
+    # list goes stale.
+    #
+    # Exercising `add_organization_project is load-bearing: the
+    # purge-key factory uses `if_attr_exists`, which resolves to `None` during
+    # `after_flush` if the row was built with only FK ids instead of
+    # objects, causing us to silently drop the purges.


Claude insists on using comment blocks over docstrings, I don't know why.

miketheman · 2026-04-24T19:28:40Z

-                    organization_id=organization_id, project_id=project.id
+                    organization=self.db.get(Organization, organization_id),
+                    project=project,


I reverted this change locally, and tests pass. So either we don't need this change, or the tests aren't expressing the condition well enough.

miketheman · 2026-04-24T19:29:08Z

        """
        Adds an association between the specified organization and project
        """
+        from warehouse.packaging.models import Project


I'm still not sure why our isort config doesn't flag inline imports without a disable comment - but this should be placed at top-of module. (Hey Claude, remember that preference).

miketheman · 2026-04-24T19:35:55Z

+            organization=self.db.get(Organization, organization_id),
+            project=self.db.get(Project, project_id),


If we don't watch to fetch the record, we can add a flag_dirty for the organization_project.project instead, but that also feels clunky.

Considering that the upstream caller already has both the Organization and Project objects, I think it's a better choice to pass the objects that we can then mutate, instead of working with IDs, and then we avoid some of the manual db.add,flush,flag_dirty, and let the ORM do its thing.

miketheman · 2026-04-24T19:37:30Z

        OrganizationProject,
        purge_keys=[
            key_factory("project/{attr.normalized_name}", if_attr_exists="project"),
+            key_factory("org/{attr.normalized_name}", if_attr_exists="organization"),


I think this is the main fix - since it was previously unregistered.

miketheman · 2026-04-24T19:59:35Z

+    assert f"org/{organization.normalized_name}" in purges
+    assert f"project/{project.normalized_name}" in purges


This test passes for the wrong reasons, and also believe it shouldn't work in its current behavior.

Adding an Event shouldn't dirty an object for cache purge, which is why it's excluded. Dirties that include events and other attributes continue to be purged correctly. One logged example from a few minutes ago:

{ "obj_class": "File", "state": "dirty", "keys": [ "project/ctboost" ], "changed_attrs": [ "events", "provenance" ], "event": "cache_purge_keys_generated", ... }

The session purge shows that the object had events + , and thus got a cache purge. see

warehouse/warehouse/cache/origin/__init__.py

Lines 69 to 70 in e2dd240

# Skip if only non-cache-relevant attributes changed

if changed and changed <= _NON_CACHE_RELEVANT_ATTRS:

maybe the word "only" needs to be bolded 😉

The reason this test is currently passing is that there's no "session purge pop" after organization_service.add_organization_project(...) call - so those changes will trigger the purge.

Moving db_request.db.flush() / db_request.db.info.pop("warehouse.cache.origin.purges", None) to be after the service call ensures the session state is "clean" before attempting the Event addition, and shows the failure.

Generally I think removing this test case is best, as the general idea is covered by this test:

warehouse/tests/unit/cache/origin/test_init.py

Lines 100 to 118 in 25bc404

@pytest.mark.parametrize(

"trigger",

[_trigger_event, _trigger_observation, _trigger_invitation],

ids=["events", "observations", "invitations"],

)

def test_store_purge_keys_skips_audit_only_collection_changes(

trigger, app_config, db_request

):

# Audit/admin-only collection mutations (events, observations, invitations)

# never affect publicly-cached content and should not trigger a Project purge.

project = ProjectFactory.create()

db_request.db.flush()

db_request.db.info.pop("warehouse.cache.origin.purges", None)

trigger(project, db_request)

db_request.db.flush()

purges = db_request.db.info.get("warehouse.cache.origin.purges", set())

assert f"project/{project.normalized_name}" not in purges

Maybe adding some more parameterizations there could be helpful, but might be more complexity than worth it? Your choice.

woodruffw added 3 commits April 21, 2026 15:28

Add an OrganizationProject purge key for the org profile

0d52a82

Signed-off-by: William Woodruff <william@astral.sh>

Add a backstop test

32f7aeb

Signed-off-by: William Woodruff <william@astral.sh>

Reformat

ef94b77

Signed-off-by: William Woodruff <william@astral.sh>

woodruffw commented Apr 22, 2026

View reviewed changes

Trim comment

25bc404

Signed-off-by: William Woodruff <william@astral.sh>

woodruffw marked this pull request as ready for review April 22, 2026 14:28

woodruffw requested a review from a team as a code owner April 22, 2026 14:28

miketheman self-assigned this Apr 24, 2026

miketheman added CDN/network Issues related to our CDN, users having problems connecting to PyPI organizations labels Apr 24, 2026

miketheman requested changes Apr 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Correctly purge the organization profile when updating an org's projects#19917

Correctly purge the organization profile when updating an org's projects#19917
woodruffw wants to merge 4 commits into
pypi:mainfrom
woodruffw-forks:ww/fix

woodruffw commented Apr 21, 2026 •

edited

Loading

Uh oh!

woodruffw Apr 22, 2026

Uh oh!

miketheman Apr 24, 2026

Uh oh!

woodruffw commented Apr 22, 2026

Uh oh!

miketheman left a comment

Uh oh!

miketheman Apr 24, 2026

Uh oh!

miketheman Apr 24, 2026

Uh oh!

miketheman Apr 24, 2026

Uh oh!

miketheman Apr 24, 2026

Uh oh!

miketheman Apr 24, 2026

Uh oh!

miketheman Apr 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		organization=self.db.get(Organization, organization_id),
		project=self.db.get(Project, project_id),

		assert f"org/{organization.normalized_name}" in purges
		assert f"project/{project.normalized_name}" in purges

	# Skip if only non-cache-relevant attributes changed
	if changed and changed <= _NON_CACHE_RELEVANT_ATTRS:

	@pytest.mark.parametrize(
	"trigger",
	[_trigger_event, _trigger_observation, _trigger_invitation],
	ids=["events", "observations", "invitations"],
	)
	def test_store_purge_keys_skips_audit_only_collection_changes(
	trigger, app_config, db_request
	):
	# Audit/admin-only collection mutations (events, observations, invitations)
	# never affect publicly-cached content and should not trigger a Project purge.
	project = ProjectFactory.create()
	db_request.db.flush()
	db_request.db.info.pop("warehouse.cache.origin.purges", None)

	trigger(project, db_request)
	db_request.db.flush()

	purges = db_request.db.info.get("warehouse.cache.origin.purges", set())
	assert f"project/{project.normalized_name}" not in purges

Conversation

woodruffw commented Apr 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

woodruffw commented Apr 22, 2026

Uh oh!

miketheman left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

woodruffw commented Apr 21, 2026 •

edited

Loading