feat(agents): optimize RPI agent context management with discipline rules by katriendg · Pull Request #1492 · microsoft/hve-core

katriendg · 2026-04-30T14:34:41Z

Description

Tightened context discipline for the RPI (Research, Plan, Implement, Review) agent family by adding a shared Context Discipline section directly inside each parent agent and capping subagent chat responses to executive summaries. Disk-first tracking files remain the source of truth; chat responses now act as indices.

The discipline is built around three protocols, all inlined verbatim into every RPI parent agent:

Lean Post-Work Turn. After a subagent returns, the parent emits one compact line per subagent, makes at most one tracking-file edit, then stops. No re-reading of large planning, research, or details files in the closing turn, no re-quoting subagent payloads, no narrating the next phase.
Response Mode Selection. A four-tier ladder (Direct, Lightweight, Standard, Full) instructs parents to pick the lightest mode that satisfies the request rather than defaulting to full subagent dispatch.
Subagent Result Handling. Treat the subagent chat response as an index. Re-read the subagent file only when the next decision needs evidence the summary does not contain.

Why inline the rules into each agent

Two reasons drove keeping these rules inside each *.agent.md file rather than extracting them to a separate .instructions.md:

Self-contained agent packaging. RPI parent agents are distributed through extension and plugin packaging where each agent is expected to be runnable on its own. An external #file: reference adds a resolution dependency that complicates packaging, mirroring, and any future relocation of the artifact tree. Inlining keeps the agent self-contained and aligned with the existing hve-core convention for RPI agents.
Prompt-caching cache hits. GitHub Copilot uses prompt caching. Because the exact same Context Discipline block now appears as a stable, identically-worded region inside every parent agent, repeated invocations during a multi-phase RPI session are more likely to land on cache hits, reducing latency and token cost. A single shared file referenced via #file: does not deliver the same byte-identical-in-context advantage at the agent prompt level.

Agent updates

The Context Discipline section was inlined into all 5 RPI parent agents:

Each parent agent now contains the same Lean Post-Work Turn steps, the Response Mode Selection table, and the Subagent Result Handling bullets.

Subagent response format updates

Response Format sections were tightened in 4 RPI subagents to enforce executive-summary chat output with full fidelity persisted to the corresponding tracking file. Each updated section caps the chat response (≤7 prioritized bullets, ≤240 chars each, up to 3 clarifying questions only when blocking) and explicitly forbids pasting full file contents, code blocks, validation output, or long quotes:

researcher-subagent.agent.md : points at the subagent research file, retains the recommended next research checklist.
plan-validator.agent.md : points at the Planning Log, adds an explicit DR- / DD- delta line.
implementation-validator.agent.md : points at the review log, prioritizes blocking issues and regressions.
rpi-validator.agent.md : points at the review log, prioritizes schema violations.

Related Issue(s)

Closes #1536

Type of Change

Select all that apply:

Code & Documentation:

Bug fix (non-breaking change fixing an issue)
New feature (non-breaking change adding functionality)
Breaking change (fix or feature causing existing functionality to change)
Documentation update

Infrastructure & Configuration:

AI Artifacts:

Reviewed contribution with prompt-builder agent and addressed all feedback
Copilot instructions (.github/instructions/*.instructions.md)
Copilot prompt (.github/prompts/*.prompt.md)
Copilot agent (.github/agents/*.agent.md)
Copilot skill (.github/skills/*/SKILL.md)

Note for AI Artifact Contributors:

Agents: Research, indexing/referencing other project (using standard VS Code GitHub Copilot/MCP tools), planning, and general implementation agents likely already exist. Review .github/agents/ before creating new ones.

Skills: Must include both bash and PowerShell scripts. See Skills.

Model Versions: Only contributions targeting the latest Anthropic and OpenAI models will be accepted. Older model versions (e.g., GPT-3.5, Claude 3) will be rejected.

See Agents Not Accepted and Model Version Requirements.

Other:

Script/automation (.ps1, .sh, .py)
Other (please describe):

Sample Prompts (for AI Artifact Contributions)

User Request:
Existing RPI workflow invocations (@rpi-agent, @task-researcher, @task-planner, @task-implementor, @task-reviewer) remain unchanged. Context optimization operates transparently.

Execution Flow:

Parent agent receives user request.
Parent agent applies Response Mode Selection to choose invocation strategy (Direct, Lightweight, Standard, or Full).
Parent agent invokes a subagent when needed.
Subagent writes complete findings to the appropriate .copilot-tracking/ file.
Subagent returns an executive summary (1-line file path, 1-line status, ≤7 finding bullets ≤240 chars each, up to 3 clarifying questions, 1 "Full Detail" pointer).
Parent agent applies Lean Post-Work Turn: emit one compact line per subagent, update the tracking file once, stop.
Parent agent re-reads the subagent file only when the next decision requires detail beyond the summary bullets.

Output Artifacts:
RPI tracking files in .copilot-tracking/ directories remain the source of truth. Chat responses now contain executive summaries instead of full payloads.

Success Indicators:

Each RPI parent agent contains an inlined Context Discipline section with Lean Post-Work Turn, Response Mode Selection table, and Subagent Result Handling bullets.
Subagent chat responses are limited to ≤7 bullets with a "Re-read for complete..." pointer line.
Tracking files contain complete findings with evidence and citations.
Parent agents stop after updating the tracking file (no re-reading or re-narrating).

Testing

Automated Validation:

Markdown linting: ✅ Passed (npm run lint:md)
Frontmatter validation: ✅ Passed (npm run lint:frontmatter, 536 files)
Collection metadata: ✅ Passed (npm run lint:collections-metadata, 13 collections)
Plugin generation: ✅ Passed (npm run plugin:generate)

Security Analysis:

No sensitive data introduced.
No dependency changes.
No security-related scripts modified.
Conventional commit format verified across branch commits.

Checklist

Required Checks

Documentation is updated (if applicable) : (N/A : AI artifact body changes only)
Files follow existing naming conventions
Changes are backwards compatible (if applicable)
Tests added for new functionality (if applicable) : (N/A : no test harness for agent response format)

AI Artifact Contributions

Used /prompt-analyze to review contribution
Addressed all feedback from prompt-builder review
Verified contribution follows common standards and type-specific requirements

Required Automated Checks

Markdown linting: npm run lint:md
Spell checking: npm run spell-check
Frontmatter validation: npm run lint:frontmatter
Skill structure validation: npm run validate:skills
Link validation: npm run lint:md-links
PowerShell analysis: npm run lint:ps
Plugin freshness: npm run plugin:generate
Docusaurus tests: npm run docs:test

Security Considerations

This PR does not contain any sensitive or NDA information
Any new dependencies have been reviewed for security issues : (N/A : no dependency changes)
Security-related scripts follow the principle of least privilege : (N/A : no security script changes)

Additional Notes

Self-contained agents over shared instructions. An earlier iteration of this branch extracted the Context Discipline rules into a dedicated rpi-context-discipline.instructions.md file referenced from each parent agent via #file:. That approach was reverted in favor of inlining: it keeps RPI parent agents fully self-contained for extension and plugin packaging, and it lets prompt caching (Anthropic prompt caching) recognize the identical Context Discipline block as a stable, repeated region across RPI invocations within a session.

Disk-first architecture. Chat responses are now indices, not full results. Subagents always write complete findings to their tracking files before returning the summary. Parents re-read only when the next decision requires evidence beyond the summary bullets, which keeps long RPI sessions from accumulating large per-turn payloads.

… response formats - introduce hve-core/rpi-context-discipline.instructions.md for parent RPI agents - reference context discipline from rpi-agent, task-researcher, planner, implementor, reviewer - tighten subagent response formats to executive summaries with full detail on disk - update hve-core, hve-core-all, and project-planning collections plus generated plugins 🧹 - Generated by Copilot

…ditional output details - add count of DR- and DD- items in plan-validator - include relative path to research document in researcher subagent - add checklist of recommended next steps for both subagents 🔍 - Generated by Copilot

…t agents 🔒 - Generated by Copilot

codecov-commenter · 2026-04-30T14:44:03Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 86.49%. Comparing base (66b524a) to head (7cc78d3).

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1492      +/-   ##
==========================================
+ Coverage   85.46%   86.49%   +1.02%     
==========================================
  Files          82       76       -6     
  Lines       11802    10743    -1059     
==========================================
- Hits        10087     9292     -795     
+ Misses       1715     1451     -264

Flag	Coverage Δ
pester	`83.59% <ø> (-0.02%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.
see 8 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

…n subagent response formats - shorten rpi-context-discipline description to fit 120-char target - drop duplicated path bullet from researcher and rpi-validator response formats - reorder plan-validator response to put findings before DR-/DD- delta line - regenerate hve-core, hve-core-all, and project-planning plugin READMEs ✏️ - Generated by Copilot

… description and update applyTo path 🔧 - Generated by Copilot Co-authored-by: Copilot <copilot@github.com>

github-actions

Advisory review, this PR is from a maintainer. Findings are informational only.

Overview

This PR introduces a well-designed context-discipline mechanism for the RPI workflow. The single-source-of-truth approach (one shared instructions file referenced via #file: from all five parent agents) is the correct pattern for this type of cross-cutting concern, and the subagent response-format updates are clear and consistently applied. Overall the implementation is sound. A few items to consider:

Issue Alignment

⚠️ No linked issue. The PR explicitly states "Related Issue(s): None." While not every internal improvement requires a tracking issue, it's worth confirming this is intentional rather than an omission. Linking a discussion or issue helps future contributors understand the motivation and any considered alternatives.

PR Template Compliance

⚠️ AI Artifact Contributions checklist items are unchecked. The PR checks both Copilot instructions and Copilot agent under Type of Change — both correct — but leaves all three items under the "AI Artifact Contributions" sub-checklist unchecked:

[ ] Used /prompt-analyze to review contribution
[ ] Addressed all feedback from prompt-builder review
[ ] Verified contribution follows common standards and type-specific requirements

If /prompt-analyze review was performed, these should be checked. If skipped intentionally, mark them (N/A — internal workflow change) consistent with the pattern used elsewhere in this PR's checklist.

Coding Standards

💡 PR description contains an inaccurate applyTo pattern. The description reads:

The applyTo pattern targets .copilot-tracking/{research,plans,details,changes,reviews,challenges}/**

But the actual file uses {research,plans,details,changes,reviews} — correctly omitting challenges, which is consistent with the in-file note that Task Challenger is intentionally out of scope. The PR description should be corrected to match the real pattern to avoid confusing future readers.

💡 Writing style violations in the new instructions file (inline comment on lines 24–27). The Response Mode Selection section uses **Term** — Description bolded-prefix list items with em dashes, which conflicts with both the em-dash and bolded-prefix-list conventions in writing-style.instructions.md. See the inline comment for a suggested restructure.

💡 Missing attribution suffix. The description field reads "Context-discipline rules for RPI parent agents: lean post-work turns, response modes, subagent results". Per prompt-builder.instructions.md, artifacts should carry a - Brought to you by microsoft/hve-core suffix in the description frontmatter. Several peer files (pull-request.instructions.md, commit-message.instructions.md) follow this convention. Worth aligning for consistency.

Code Quality

✅ No bugs, logic errors, security concerns, or missing error handling identified. The #file: path resolution for agents at .github/agents/hve-core/ resolving to ../../instructions/hve-core/rpi-context-discipline.instructions.md is correct. Plugin symlinks and collection YAML entries are consistent across all three collections.

Generated by PR Review for issue #1492 · ● 1.5M

Co-authored-by: Copilot <copilot@github.com>

github-actions

PR Review: `feat(agents): optimize RPI agent context management with shared discipline rules`

Overall this is a well-structured, clearly documented PR with a coherent design. The single-source pattern via #file: references is idiomatic for this codebase, all five parent agents are updated consistently, and the four subagent response-format rewrites follow the new lean protocol faithfully. The collection and plugin outputs are regenerated correctly. Below are the findings from each review dimension.

Issue Alignment

Issue #1536 was inaccessible via the review toolchain due to an integrity policy filter, so full issue-to-implementation traceability cannot be confirmed automatically. Based on the PR description alone, the stated goals (disk-first architecture, lean post-work turns, executive-summary responses, response-mode selection) are well-addressed by the changes. No scope creep or obviously missing requirements are visible.

Action required: Please verify that issue #1536's acceptance criteria are fully satisfied and nothing was left out.

PR Template Compliance

Two items need attention:

1. AI Artifact Contributions checklist is entirely unchecked

Under Checklist → AI Artifact Contributions, all three items remain unchecked:

[ ] Used /prompt-analyze to review contribution
[ ] Addressed all feedback from prompt-builder review
[ ] Verified contribution follows common standards and type-specific requirements

This is inconsistent with the Type of Change → AI Artifacts section, where [x] Reviewed contribution with prompt-builder agent and addressed all feedback is checked. If the prompt-builder review was completed and feedback was addressed, the matching checklist items should be checked as well.

2. Spell-check checkbox marked as passing but was blocked

The Testing section marks npm run spell-check as ✅ passed while also annotating it "(Blocked: requires Node.js ≥22.18.0)". A check that could not run should not be marked as passed — it should remain unchecked with the blocking reason noted inline.

Coding Standards Review

rpi-context-discipline.instructions.md ✅

Frontmatter contains the required description and applyTo fields per instruction-file conventions.
Attribution suffix (- Brought to you by microsoft/hve-core) is present in the description value.
H1 heading is correct — no title: frontmatter field, so H1 is the right document title per the markdown rules.
applyTo pattern '**/.copilot-tracking/{research,plans,details,changes,reviews}/**' correctly targets the tracking files that agents write to during RPI sessions; agents additionally pull the file in explicitly via #file:, giving two activation paths as intended.
Bullet-point punctuation follows complete-sentence rules throughout. Table descriptions in the Response Mode Selection table are acceptable as fragments within table cells.

Agent #file: path resolution ✅

The path #file:../../instructions/hve-core/rpi-context-discipline.instructions.md resolves correctly from .github/agents/hve-core/ → .github/ → instructions/hve-core/... in both repo and plugin distribution contexts.

All five parent agents updated consistently ✅ (rpi-agent, task-implementor, task-planner, task-researcher, task-reviewer — all confirmed).

All four subagent response formats updated consistently ✅ (implementation-validator, plan-validator, researcher-subagent, rpi-validator — pattern is uniform: log-path line → status line → ≤7 bullets → ≤3 questions → "Re-read <path>..." pointer).

Collection and plugin outputs ✅ — three collections updated, three plugin README/symlink outputs regenerated.

Code Quality and Security

No bugs, logic errors, security vulnerabilities, or breaking changes identified. The change is purely additive for parent agents (new ## Context Discipline section) and a targeted rewrite of Response Format sections for subagents. No dependency changes, no secrets, no infrastructure modifications.

Action Items

#	Item	Severity
1	Check all three AI Artifact Contributions checklist boxes if the `prompt-builder` review was completed and feedback addressed	Minor — template compliance
2	Uncheck the spell-check item (or annotate it as N/A) rather than marking a blocked run as passed	Minor — template accuracy
3	Manually confirm issue #1536 acceptance criteria are fully addressed (automated check was blocked by integrity policy)	Advisory — cannot auto-verify

Note

🔒 Integrity filter blocked 1 item

The following item were blocked because they don't meet the GitHub integrity level.

#1536 issue_read: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".

To allow these resources, lower min-integrity in your GitHub frontmatter:

tools:
  github:
    min-integrity: approved  # merged | approved | unapproved | none

Generated by PR Review for issue #1492 · ● 2.5M

rezatnoMsirhC

Thank you for this contribution. The single-source approach for context-discipline rules is clean and the #file: propagation pattern is well suited to keeping these rules in sync across the RPI parent agents. Two minor comments left inline.

…remove separate instructions file - streamline response mode selection and subagent result handling - clarify lean post-work turn requirements - remove deprecated context-discipline instructions from various collections 🔧 - Generated by Copilot

github-actions

Advisory Review — `feat(agents): optimize RPI agent context management with shared discipline rules`

Advisory review: this PR is from a maintainer. Findings are informational only.

Issue Alignment

The PR links Closes #1536. A linked issue is present ✅. The stated intent — reducing context bloat in long-running RPI sessions by capping subagent chat payloads and enforcing lean post-work turns — is coherent and well-motivated.

PR Template Compliance

Item	Status
Description section filled in	✅
Related issue(s) present	✅ (`Closes #1536`)
Type of Change checked	✅ (New feature)
"Copilot instructions" checkbox checked	⚠️ See below
AI Artifact `prompt-builder` review attestation	✅
Testing section completed	✅
Required checklist items checked	✅

⚠️ "Copilot instructions" checkbox is checked, but no instructions file exists in the diff. The PR description prominently describes adding .github/instructions/hve-core/rpi-context-discipline.instructions.md, updating three collections, and regenerating three plugin outputs — yet none of these appear in the diff. The actual PR contains changes to exactly 9 agent files. The checkbox and substantial portions of the description describe scope that is not present.

⚠️ Sample Prompts — "User Request" field describes backward compatibility ("Context optimization operates transparently") rather than a natural language invocation. The template asks "What natural language request would trigger this agent/prompt/instruction?" For transparency instructions this is admittedly harder to answer concretely, but the intent field should reflect that.

Coding Standards

prompt-builder.instructions.md applies to all *.agent.md files.

Duplication vs. #file: convention (see inline comment on rpi-agent.agent.md): All five parent agents have an identical 23-line ## Context Discipline block pasted inline. The PR description claims this is backed by a shared instructions file with #file: references. The instructions file does not exist in this PR, and none of the agents contain #file: references. Per the prompt-builder single-source guidance, shared protocol blocks should use #file: to avoid divergence. As-is, any future change to the discipline rules requires editing five files.

Subagent response format changes are well-structured and align with the prompt-builder Subagent Structural Template conventions. The shift to prescriptive caps (≤7 bullets, ≤240 chars, ≤3 questions) is a clear improvement over the previous open-ended bullet lists. The plan-validator addition of DR-/DD- item counts is a useful signal for parent agents.

Writing style: No em-dash patterns, no bolded-prefix list items, appropriate use of tables. ✅

Code Quality

The functional intent of the changes is sound — disk-first architecture, executive summary chat responses, and explicit re-read-only-when-needed rules are a reasonable approach to managing context in long sessions.

Potential ambiguity in parent agents: The new ## Context Discipline section instructs the agent to "emit one compact line per subagent... stop." The existing ### Response Format sections in each parent agent describe full bottom-up presentations (e.g., task-researcher's summary table). The two sections can coexist if Context Discipline governs intermediate turns and Response Format governs final-turn summaries — but this boundary is not explicitly stated. A clarifying sentence (e.g., "This discipline applies to intermediate turns after subagent returns, not to final-turn user summaries.") would prevent an agent from short-circuiting its completion presentation.

Unbounded checklist in researcher-subagent and rpi-validator response formats (see inline comment): The "recommended next research" checklist lacks a cap, inconsistent with the bounded constraints added for all other response elements.

Action Items

Resolve the description/implementation gap: Either create the missing rpi-context-discipline.instructions.md and replace inline duplicates with #file: references, or update the PR description to accurately reflect the inline-copy approach and uncheck the "Copilot instructions" checkbox.
Add a cap to the "recommended next research" checklist in researcher-subagent.agent.md and rpi-validator.agent.md (e.g., "up to 5 items") for consistency with the bounded format.
Consider a clarifying sentence in each parent agent's Context Discipline section to make the scope boundary with Response Format explicit.

Note

🔒 Integrity filter blocked 11 items

The following items were blocked because they don't meet the GitHub integrity level.

#1536 issue_read: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".
47ab056 list_commits: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".
b3d55bd list_commits: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".
023b41c list_commits: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".
34f4898 list_commits: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".
7ce258f list_commits: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".
57ea279 list_commits: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".
5bedf80 list_commits: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".
b63642e list_commits: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".
6469168 list_commits: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".
a1e8878 list_commits: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".

To allow these resources, lower min-integrity in your GitHub frontmatter:

tools:
  github:
    min-integrity: approved  # merged | approved | unapproved | none

Generated by PR Review for issue #1492 · ● 1.8M

…alidator subagents 🔧 - Generated by Copilot

github-actions

Advisory review, this PR is from a maintainer. Findings are informational only.

Overview

This PR introduces a Context Discipline block across all five RPI parent agents and tightens subagent Response Format sections to cap chat output to executive summaries, with full fidelity written to .copilot-tracking/ files. The design rationale — self-contained agent packaging and prompt-caching cache hits — is clearly articulated in the description and Additional Notes. The changes are coherent, well-scoped, and the disk-first architecture decision is sound.

Two advisory observations are noted below.

Issue Alignment

Linked issue #1536 could not be read during this review (integrity policy restriction). Based on the PR description and the changes themselves, the work clearly targets RPI agent context verbosity — the stated purpose is consistent with what was implemented.

PR Template Compliance

Three Required Automated Checks are neither checked nor annotated as N/A:

npm run spell-check — unchecked, no annotation
npm run lint:md-links — unchecked, no annotation
npm run docs:test — unchecked, no annotation

The template notes these "must pass before merging." If they are not applicable for agent-only .agent.md changes, please annotate them consistently with the other N/A items in the checklist (e.g., (N/A — agent body changes only, no new links or documentation pages)). This keeps the audit trail clear for reviewers.

Coding Standards

ALL CAPS directives — prompt-builder.instructions.md (Patterns to Avoid section) explicitly lists ALL CAPS directives and emphasis markers as patterns to avoid in prompt authoring:

"The following patterns provide limited value as prompt instructions: ALL CAPS directives and emphasis markers."

Affected lines across all nine changed files:

File	Lines	Terms
`rpi-agent.agent.md`	96, 100	`MUST`, `Do NOT`
`task-implementor.agent.md`	61, 65	`MUST`, `Do NOT`
`task-planner.agent.md`	63, 67	`MUST`, `Do NOT`
`task-researcher.agent.md`	55, 59	`MUST`, `Do NOT`
`task-reviewer.agent.md`	42, 46	`MUST`, `Do NOT`
`implementation-validator.agent.md`	202, 211	`ALWAYS`, `Do NOT`
`researcher-subagent.agent.md`	61, 71	`ALWAYS`, `Do NOT`
`plan-validator.agent.md`	97, 107	`ALWAYS`, `Do NOT`
`rpi-validator.agent.md`	67, 77	`ALWAYS`, `Do NOT`

Inline comments are on representative examples (one parent agent, one subagent). A global find-and-replace of MUST → must, ALWAYS → always, and Do NOT → Do not across the nine files would resolve all instances without changing the instructional intent.

Code Quality

No bugs, security issues, or architectural concerns found. The inlining decision (vs. #file: shared instructions) is well-reasoned, and the response-format caps are practical and consistent across all four subagents.

Note

🔒 Integrity filter blocked 1 item

The following item were blocked because they don't meet the GitHub integrity level.

#1536 issue_read: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".

To allow these resources, lower min-integrity in your GitHub frontmatter:

tools:
  github:
    min-integrity: approved  # merged | approved | unapproved | none

Generated by PR Review for issue #1492 · ● 1.3M

…or consistency 🔧 - Generated by Copilot

github-actions

Advisory review — this PR is from a maintainer. Findings are informational only.

Review Summary

This PR is well-structured and clearly motivated. The Context Discipline section is verbatim-identical across all five RPI parent agents (by design, as documented in the PR description), and the subagent response format updates are consistent and purposeful. No critical issues found.

Issue Alignment

Linked issue: Closes #1536 ✅
The PR description thoroughly describes the intent (lean post-work turns, response mode selection, subagent result handling), and the diff directly reflects those goals.
No scope creep identified — all 9 changed files are the exact agents named in the description.

PR Template Compliance

Description, Related Issue, Type of Change, Sample Prompts, Testing, and Security Considerations are all filled in correctly.
Minor: Two Required Automated Checks are left unchecked without an N/A annotation:
- [ ] Spell checking: npm run spell-check — this script is active in the repo and covers *.md files; consider running it and checking the box (or annotating N/A if it was run and confirmed clean).
- [ ] Link validation: npm run lint:md-links — same note; agent files may contain #file: or relative references that benefit from link-checking.

Coding Standards Review

Applicable instruction file: prompt-builder.instructions.md (applyTo: **/*.agent.md)

✅ All five parent agents correctly use * bullet lists (no + or - mixing).
✅ No bolded-prefix list-item pattern (**Term:** description) introduced — the * 1 line: ... pattern in response format sections is not a violation.
✅ Ordered list steps in the Context Discipline section use imperative voice, consistent with the instructions' guidance for autonomous/action steps.
✅ Table columns align and use consistent Mode naming.
✅ No temporal markers, thought-process commentary, or narrative comments introduced.

Code Quality

The inlining rationale (self-contained packaging + prompt caching locality) is sound and well-documented.
The ≤ 240 chars and ≤ 7 bullets limits are consistent across all four updated subagents.
researcher-subagent and rpi-validator both cap the next-steps checklist at 5 items for symmetry — this is a good tightening.
The placement of the ## Context Discipline section varies slightly across agents (after the parallelism note in most, after the subagent behavior bullets in task-reviewer) — this is acceptable given each file has a different section structure.

Action Items

(Advisory) Run npm run spell-check and npm run lint:md-links before merge, and check or annotate the corresponding checklist boxes.

Note

🔒 Integrity filter blocked 1 item

The following item were blocked because they don't meet the GitHub integrity level.

#1536 issue_read: has lower integrity than agent requires. The agent cannot read data with integrity below "approved".

To allow these resources, lower min-integrity in your GitHub frontmatter:

tools:
  github:
    min-integrity: approved  # merged | approved | unapproved | none

Generated by PR Review for issue #1492 · ● 1.4M

…ompact loop fix (#1541) ## Description This PR introduces **per-agent model selection** via frontmatter, backed by a validated model catalog that tracks GitHub Copilot's evolving model lineup. Simple tasks (git operations, issue triage, research) now route to fast-tier models at a fraction of the cost, while complex agents inherit the session model for full capability. Additionally, this PR removes the self-referential `/compact` handoff from 12 agents, eliminating the root cause of **Autopilot infinite loops** reported in #1420. The disk-first `.copilot-tracking/` architecture and Memory Agent already provide equivalent persistence without the loop risk. ### Model Selection Infrastructure > Cost-first principle: use fast models for read-only research and validation; inherit session model for code generation and complex reasoning. - Added **model catalog** (`scripts/linting/model-catalog.json`) tracking 25 models across 5 tiers (free, fast, standard, premium, ultra) with multiplier values, vendor attribution, and GA/preview/retiring status - Added **catalog refresh script** (`scripts/linting/Update-ModelCatalog.ps1`) that fetches authoritative YAML from `github/docs` for model release status and multiplier data; marks removed models as *retiring* with 60-day grace period rather than deleting - Added **validation script** (`scripts/linting/Test-ModelReferences.ps1`) that scans all `.agent.md` and `.prompt.md` files for model frontmatter and validates references against the catalog; reports invalid models as errors, retiring models as warnings - Added **JSON schema** (`scripts/linting/schemas/model-catalog.schema.json`) for structural validation of the catalog file - Added **weekly CI workflow** (`.github/workflows/model-validation.yml`) running every Wednesday plus PR-triggered validation on agent/prompt/catalog changes; includes catalog freshness check and artifact upload - Integrated `lint:models` and `lint:models:refresh` into `package.json`; model validation runs as part of the `lint:all` chain ### Per-Agent and Per-Prompt Model Assignment Assigned fast-tier models to 7 **subagents** performing read-heavy validation tasks: *researcher-subagent*, *plan-validator*, *implementation-validator*, *prompt-evaluator*, *rpi-validator*, *codebase-profiler*, and *report-generator*. Each declares a prioritized fallback array: `Claude Haiku 4.5 → GPT-5.4 mini`. Assigned `Claude Haiku 4.5 (copilot)` to 7 **prompts** handling mechanical operations: *git-commit-message*, *git-commit*, *git-setup*, *github-add-issue*, *github-discover-issues*, *github-triage-issues*, and *checkpoint*. Added "Model Selection for Subagents" guidance to 6 **parent agents** (*task-researcher*, *task-planner*, *task-implementor*, *task-reviewer*, *prompt-builder*, *security-reviewer*) documenting cost-first dispatch decisions and VS Code tier constraint behavior. ### /compact Handoff Removal (Fixes #1420) Removed the `Compact` handoff entry from all 12 agents where it appeared. Eleven had it as their **first handoff**, causing Autopilot to auto-execute it on every turn completion, creating an infinite self-referential loop. - Updated `rai-identity.instructions.md` to remove the "Compact handoff" exit point reference from disclaimer display logic - Updated `docs/rpi/context-engineering.md` to recommend `/checkpoint` (Memory Agent) for cross-phase persistence and clarify that `/compact` remains available as a typed command PR #1492 (`feat/context-working`) adds Context Discipline to 5 RPI parent agents, enforcing disk-first lean responses. The `/compact` handoff is now architecturally redundant because: 1. **Disk-first `.copilot-tracking/` files** — all state already lives on disk 2. **Memory Agent** — provides structured session persistence with handoff to a different agent (non-looping) 3. **PR #1492 Context Discipline** — caps subagent responses to executive summaries, reducing context bloat at the source ### Test Coverage - Added 41 Pester tests (`Test-ModelReferences.Tests.ps1`) covering validation logic, frontmatter parsing, and error handling - Added 29 Pester tests (`Test-UpdateModelCatalog.Tests.ps1`) covering catalog merge, comparison, and refresh logic ## Related Issue(s) Fixes #1420 Closes #1540 ## Type of Change Select all that apply: **Code & Documentation:** * [ ] Bug fix (non-breaking change fixing an issue) * [x] New feature (non-breaking change adding functionality) * [x] Breaking change (fix or feature causing existing functionality to change) * [x] Documentation update **Infrastructure & Configuration:** * [x] GitHub Actions workflow * [ ] Linting configuration (markdown, PowerShell, etc.) * [ ] Security configuration * [ ] DevContainer configuration * [ ] Dependency update **AI Artifacts:** * [x] Reviewed contribution with `prompt-builder` agent and addressed all feedback * [x] Copilot instructions (`.github/instructions/*.instructions.md`) * [x] Copilot prompt (`.github/prompts/*.prompt.md`) * [x] Copilot agent (`.github/agents/*.agent.md`) * [ ] Copilot skill (`.github/skills/*/SKILL.md`) > Note for AI Artifact Contributors: > > * Agents: Research, indexing/referencing other project (using standard VS Code GitHub Copilot/MCP tools), planning, and general implementation agents likely already exist. Review `.github/agents/` before creating new ones. > * Skills: Must include both bash and PowerShell scripts. See [Skills](../docs/contributing/skills.md). > * Model Versions: Only contributions targeting the **latest Anthropic and OpenAI models** will be accepted. Older model versions (e.g., GPT-3.5, Claude 3) will be rejected. > * See [Agents Not Accepted](../docs/contributing/custom-agents.md#agents-not-accepted) and [Model Version Requirements](../docs/contributing/ai-artifacts-common.md#model-version-requirements). **Other:** * [x] Script/automation (`.ps1`, `.sh`, `.py`) * [ ] Other (please describe): ## Sample Prompts (for AI Artifact Contributions) **User Request:** Invoke any RPI agent (e.g., `task researcher`) with a research task. The agent dispatches its Researcher Subagent at fast-tier cost automatically. Run `npm run lint:models` to validate all model references. **Execution Flow:** 1. Parent agent evaluates task type (read-only vs code-generation) 2. For research/validation tasks, parent specifies `model: "Claude Haiku 4.5 (copilot)"` on `runSubagent` call 3. VS Code resolves model against cost tier constraint (cannot exceed parent model tier) 4. Subagent executes at fast-tier cost; results written to `.copilot-tracking/` disk files 5. If tier constraint blocks downgrade, platform falls back to session model gracefully **Output Artifacts:** - `logs/model-validation-results.json` — structured validation results with per-file status - `scripts/linting/model-catalog.json` — refreshed catalog after `lint:models:refresh` **Success Indicators:** - `npm run lint:models` exits 0 with no invalid model references - Subagent invocations show model name in VS Code chat header when explicitly set - No Autopilot infinite loops when agents complete their work ## Testing - `npm run lint:models` — model reference validation (validates all 14 model-annotated files) - Security analysis: no sensitive data exposure, no privilege escalation, workflow uses read-only permissions - Diff-based assessment: all changes are configuration-level (frontmatter, handoff entries, guidance sections); no business logic modified - Manual testing performed > [!NOTE] > Add manual testing descriptions when applicable. ## Checklist ### Required Checks * [x] Documentation is updated (if applicable) * [x] Files follow existing naming conventions * [ ] Changes are backwards compatible (if applicable) * [x] Tests added for new functionality (if applicable) ### AI Artifact Contributions  * [x] Used `/prompt-analyze` to review contribution * [x] Addressed all feedback from `prompt-builder` review * [x] Verified contribution follows common standards and type-specific requirements ### Required Automated Checks The following validation commands must pass before merging: * [x] Markdown linting: `npm run lint:md` * [x] Spell checking: `npm run spell-check` * [x] Frontmatter validation: `npm run lint:frontmatter` * [x] Skill structure validation: `npm run validate:skills` * [x] Link validation: `npm run lint:md-links` * [x] PowerShell analysis: `npm run lint:ps` * [x] Plugin freshness: `npm run plugin:generate` * [x] Docusaurus tests: `npm run docs:test` ## Security Considerations  * [x] This PR does not contain any sensitive or NDA information * [ ] Any new dependencies have been reviewed for security issues (N/A — no new runtime dependencies added) * [x] Security-related scripts follow the principle of least privilege > [!WARNING] > This PR includes **experimental** GHCP artifacts that may have breaking changes. > - `.github/agents/hve-core/task-challenger.agent.md` > - `.github/agents/experimental/experiment-designer.agent.md` > - `.github/agents/experimental/pptx.agent.md` > - `.github/agents/security/security-planner.agent.md` > - `.github/agents/security/sssc-planner.agent.md` > - `.github/agents/security/security-reviewer.agent.md` > - `.github/agents/security/subagents/codebase-profiler.agent.md` > - `.github/agents/security/subagents/report-generator.agent.md` > - `.github/agents/rai-planning/rai-planner.agent.md` ## GHCP Artifact Maturity | File | Type | Maturity | Notes | |------|------|----------|-------| | `.github/agents/hve-core/rpi-agent.agent.md` | Agent | ✅ stable | All builds | | `.github/agents/hve-core/task-researcher.agent.md` | Agent | ✅ stable | All builds | | `.github/agents/hve-core/task-planner.agent.md` | Agent | ✅ stable | All builds | | `.github/agents/hve-core/task-implementor.agent.md` | Agent | ✅ stable | All builds | | `.github/agents/hve-core/task-reviewer.agent.md` | Agent | ✅ stable | All builds | | `.github/agents/hve-core/prompt-builder.agent.md` | Agent | ✅ stable | All builds | | `.github/agents/hve-core/task-challenger.agent.md` | Agent | ⚠️ experimental | Pre-release only | | `.github/agents/experimental/experiment-designer.agent.md` | Agent | ⚠️ experimental | Pre-release only | | `.github/agents/experimental/pptx.agent.md` | Agent | ⚠️ experimental | Pre-release only | | `.github/agents/security/security-planner.agent.md` | Agent | ⚠️ experimental | Pre-release only | | `.github/agents/security/sssc-planner.agent.md` | Agent | ⚠️ experimental | Pre-release only | | `.github/agents/security/security-reviewer.agent.md` | Agent | ⚠️ experimental | Pre-release only | | `.github/agents/security/subagents/codebase-profiler.agent.md` | Agent | ⚠️ experimental | Pre-release only | | `.github/agents/security/subagents/report-generator.agent.md` | Agent | ⚠️ experimental | Pre-release only | | `.github/agents/rai-planning/rai-planner.agent.md` | Agent | ⚠️ experimental | Pre-release only | | `.github/prompts/hve-core/checkpoint.prompt.md` | Prompt | ✅ stable | All builds | | `.github/prompts/hve-core/git-commit-message.prompt.md` | Prompt | ✅ stable | All builds | | `.github/prompts/hve-core/git-commit.prompt.md` | Prompt | ✅ stable | All builds | | `.github/prompts/hve-core/git-setup.prompt.md` | Prompt | ✅ stable | All builds | | `.github/prompts/github/github-add-issue.prompt.md` | Prompt | ✅ stable | All builds | | `.github/prompts/github/github-discover-issues.prompt.md` | Prompt | ✅ stable | All builds | | `.github/prompts/github/github-triage-issues.prompt.md` | Prompt | ✅ stable | All builds | | `.github/instructions/rai-planning/rai-identity.instructions.md` | Instructions | ⚠️ experimental | Pre-release only | ### GHCP Maturity Acknowledgment - [ ] I acknowledge this PR includes non-stable GHCP artifacts - [ ] Non-stable artifacts are intentional for this change ## Additional Notes - The `/compact` removal is a **breaking change** for users who relied on the handoff button. The `/compact` typed command remains available; only the agent-surfaced handoff is removed. - Model catalog currently tracks 25 models; the automated refresh runs weekly to catch additions, removals, and multiplier changes from GitHub's upstream YAML sources. - The VS Code cost tier constraint means subagents can only use models at the same or lower tier than the parent. All guidance sections document this limitation and the graceful fallback behavior. ## Follow-up Tasks - Monitor weekly CI workflow for first catalog drift detection to confirm automation works end-to-end - Consider extending model selection to remaining prompts (pull-request, doc-ops) once cost savings are validated --------- Co-authored-by: Marcel Bindseil <marcelbindseil@gmail.com>

# Conflicts: # .github/agents/hve-core/task-implementor.agent.md # .github/agents/hve-core/task-planner.agent.md # .github/agents/hve-core/task-researcher.agent.md

katriendg added 3 commits April 30, 2026 13:56

feat(instructions): add shared context-discipline rules for RPI paren…

b63642e

…t agents 🔒 - Generated by Copilot

katriendg and others added 3 commits May 4, 2026 09:22

Merge remote-tracking branch 'origin/main' into feat/context-working

7ce258f

refactor(instructions): remove Task Challenger from RPI parent agents…

023b41c

… description and update applyTo path 🔧 - Generated by Copilot Co-authored-by: Copilot <copilot@github.com>

katriendg marked this pull request as ready for review May 4, 2026 09:34

katriendg requested a review from a team as a code owner May 4, 2026 09:34

github-actions Bot reviewed May 4, 2026

View reviewed changes

Comment thread .github/instructions/hve-core/rpi-context-discipline.instructions.md Outdated

chore: address PR comments

b3d55bd

Co-authored-by: Copilot <copilot@github.com>

github-actions Bot reviewed May 4, 2026

View reviewed changes

rezatnoMsirhC reviewed May 4, 2026

View reviewed changes

Comment thread .github/instructions/hve-core/rpi-context-discipline.instructions.md Outdated

Comment thread .github/agents/hve-core/subagents/implementation-validator.agent.md Outdated

katriendg changed the title ~~feat(agents): optimize RPI agent context management with shared discipline rules~~ feat(agents): optimize RPI agent context management with discipline rules May 5, 2026

github-actions Bot reviewed May 5, 2026

View reviewed changes

Comment thread .github/agents/hve-core/rpi-agent.agent.md

Comment thread .github/agents/hve-core/subagents/researcher-subagent.agent.md Outdated

katriendg requested a review from rezatnoMsirhC May 5, 2026 07:03

refactor(agents): limit checklist items to 5 for researcher and RPI v…

5367149

…alidator subagents 🔧 - Generated by Copilot

github-actions Bot reviewed May 5, 2026

View reviewed changes

Comment thread .github/agents/hve-core/rpi-agent.agent.md Outdated

Comment thread .github/agents/hve-core/subagents/implementation-validator.agent.md Outdated

refactor(agents): standardize response format language to lowercase f…

3a56ec0

…or consistency 🔧 - Generated by Copilot

github-actions Bot added the review-passed label May 5, 2026

github-actions Bot reviewed May 5, 2026

View reviewed changes

katriendg mentioned this pull request May 6, 2026

feat(agents)!: per-agent model selection for cost optimization and /compact loop fix #1541

Merged

36 tasks

katriendg and others added 2 commits May 11, 2026 09:43

Merge remote-tracking branch 'origin/main' into feat/context-working

2989324

# Conflicts: # .github/agents/hve-core/task-implementor.agent.md # .github/agents/hve-core/task-planner.agent.md # .github/agents/hve-core/task-researcher.agent.md

Merge branch 'main' into feat/context-working

0f6c087

WilliamBerryiii approved these changes May 11, 2026

View reviewed changes

Merge branch 'main' into feat/context-working

7cc78d3

mayurpatel312 approved these changes May 11, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(agents): optimize RPI agent context management with discipline rules#1492

feat(agents): optimize RPI agent context management with discipline rules#1492
katriendg wants to merge 13 commits into
mainfrom
feat/context-working

katriendg commented Apr 30, 2026 •

edited

Loading

Uh oh!

codecov-commenter commented Apr 30, 2026 •

edited

Loading

Uh oh!

github-actions Bot left a comment

Uh oh!

Uh oh!

github-actions Bot left a comment

Uh oh!

rezatnoMsirhC left a comment

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

katriendg commented Apr 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Why inline the rules into each agent

Agent updates

Subagent response format updates

Related Issue(s)

Type of Change

Sample Prompts (for AI Artifact Contributions)

Testing

Checklist

Required Checks

AI Artifact Contributions

Required Automated Checks

Security Considerations

Additional Notes

Uh oh!

codecov-commenter commented Apr 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Overview

Issue Alignment

PR Template Compliance

Coding Standards

Code Quality

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

PR Review: feat(agents): optimize RPI agent context management with shared discipline rules

Issue Alignment

PR Template Compliance

Coding Standards Review

Code Quality and Security

Action Items

Uh oh!

rezatnoMsirhC left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Advisory Review — feat(agents): optimize RPI agent context management with shared discipline rules

Issue Alignment

PR Template Compliance

Coding Standards

Code Quality

Action Items

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Overview

Issue Alignment

PR Template Compliance

Coding Standards

Code Quality

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Review Summary

Issue Alignment

PR Template Compliance

Coding Standards Review

Code Quality

Action Items

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

katriendg commented Apr 30, 2026 •

edited

Loading

codecov-commenter commented Apr 30, 2026 •

edited

Loading

PR Review: `feat(agents): optimize RPI agent context management with shared discipline rules`

Advisory Review — `feat(agents): optimize RPI agent context management with shared discipline rules`