naming scheme for generating ResourceSync documents #8

Description

@markmatney

I've observed the following about how this implementation names generated sitemaps (resourcelist, changelist, etc.) when ResourceSync.execute() is invoked on a set of 50,000 or fewer resources:

  • With strategy 0 (resourcelist):
    • On first invocation, a file resourcelist_0000.xml is created.
    • Subsequent invocations modify that file and keep the same name.
  • With strategy 1 (new_changelist):
    • On first invocation, a file changelist_0000.xml is created.
    • On subsequent invocations, a file changelist-index.xml is created, and points to a series of changelist_xxxx.xml files.
  • With strategy 2 (inc_changelist):
    • On first invocation, a file changelist_0000.xml is created.
    • Subsequent invocations modify that file and keep the same name.

I haven't tested collections with more than 50,000 resources, but per the spec I expect a resourcelist-index.xml would be generated that points to multiple resourcelists.

I think that whenever there is only one resourcelist or changelist for a set of resources, it should be named simply resourcelist.xml or changelist.xml, without the _0000 suffix. A numeric suffix is misleading and unnecessary when the file is the only such sitemap for that set.
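
To make the proposal concrete, here is a minimal sketch of the naming rule I have in mind. It is not taken from this implementation: `documents_needed` and `proposed_filenames` are hypothetical helper names, and the 4-digit zero-padded suffix simply mirrors the `_0000` pattern observed above.

```python
import math

# Per-document limit referred to above (assumed to be 50,000 entries).
MAX_ITEMS_PER_SITEMAP = 50000


def documents_needed(resource_count, max_items=MAX_ITEMS_PER_SITEMAP):
    """Number of sitemap documents needed to list resource_count entries."""
    # An empty set still gets one (empty) document.
    return max(1, math.ceil(resource_count / max_items))


def proposed_filenames(kind, document_count):
    """Return (index_name, document_names) under the proposed scheme.

    kind is "resourcelist" or "changelist" (hypothetical parameter).
    index_name is None when a single document is enough.
    """
    if kind not in ("resourcelist", "changelist"):
        raise ValueError(f"unsupported sitemap kind: {kind!r}")
    if document_count < 1:
        raise ValueError("document_count must be at least 1")
    if document_count == 1:
        # Only one document: no numeric suffix, no index.
        return None, [f"{kind}.xml"]
    # Several documents: numbered files plus an index pointing at them.
    names = [f"{kind}_{i:04d}.xml" for i in range(document_count)]
    return f"{kind}-index.xml", names
```

For example, `proposed_filenames("resourcelist", documents_needed(50000))` would yield only `resourcelist.xml`, while 50,001 resources would yield `resourcelist-index.xml` pointing at `resourcelist_0000.xml` and `resourcelist_0001.xml`.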

Thoughts?
