Merge Stage 6 repeatability refresh

This commit is contained in:
Svrnty 2026-06-02 06:48:37 -04:00
commit 00d13f6b6f
5 changed files with 416 additions and 0 deletions

View File

@ -0,0 +1,84 @@
---
name: CTO Case Stage6 Repeatability Refresh Evidence
status: validated
lifecycle_classification: sot
owner: jp
created: 2026-06-02
last_reviewed: 2026-06-02
core_promotion_status: not-promoted
source: .sot/03-PROTOCOLS/CTO-CASE-STAGE6-REPEATABILITY-REFRESH-ISSUES.md
---
# CTO Case Stage6 Repeatability Refresh Evidence
Local planning SOT only. Not a Core Protocol. Not active Core authority.
Stage 6 repeatability refresh.
## Workboard
- `CTO-WORK-081`
- `CTO-WORK-082`
- `CTO-WORK-083`
- `CTO-WORK-084`
## Result
- status: validated
- Stage 5 repeatability proof
- second governed Stage 5 Case run
- repeatability_count: 2
- repeatability_status: pass
- runtime_default_activation: false
- Runtime default activation remains false.
- core_promotion_status: not-promoted
- No Core promotion occurs.
- Do not activate Case as default backend.
- This closeout does not authorize another Case run.
- no target repository mutation attempted
## Route Evidence
- Hermes implementation commit: `05ab2ff`
- Hermes merge commit: `40b0d9a`
- repeat case: `r1-slugify-punctuation`
- repeat target commit: `2fa1601`
- first target commit: `7706f99`
- focused validator: `python3 harness/runner/validate-case-stage6-real-refresh.py --json`
- focused validator result: `ok: true`
- focused WebUI summary validator: `python3 harness/runner/validate-webui-summary.py --json`
- focused WebUI summary validator result: `ok: true`
- post-merge aggregate Harness health: `pass`
## Artifact Evidence
- first report: `/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260601T105222Z-r1-src-string-slugify-180161/report.json`
- first Stage 5 proof: `/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260601T105222Z-r1-src-string-slugify-180161/stage5-owned-repo-proof.json`
- repeat report: `/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260602T101450Z-r1-slugify-punctuation-2663598/report.json`
- repeat Stage 5 proof: `/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260602T101450Z-r1-slugify-punctuation-2663598/stage5-owned-repo-proof.json`
- post-merge comparison: `/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260602T104507Z-stage6-real-governed-refresh/stage6-real-governed-refresh-comparison.json`
- post-merge WebUI summary: `/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260602T104552Z-run-all-fake-2738738/webui-summary.json`
## Validation Facts
- report shape: pass
- event validity: pass
- allowed-path compliance: pass
- failure closure: pass
- failure matrix coverage carry-forward: pass
- artifact completeness: pass
- forbidden-action closure: pass
- operator acceptance: pass
- approval provenance: pass
- source admission freshness: current
- target validation: pass
- Target Repository read-only status: pass
- fake lane evidence: represented by prior Stage 6 comparison
- Codex lane: blocked with rationale
- Pi lane: blocked with rationale
- runtime default activation: false
- core promotion status: not-promoted
## Approval Provenance Note
Both Stage 5 proof artifacts record approval_source as `JP chat approval on 2026-06-01`. The second repeatability prose records JP chat instruction on 2026-06-02. The validator records proof approval provenance exactly and does not rewrite proof artifacts.

View File

@ -0,0 +1,102 @@
---
name: CTO Case Stage6 Repeatability Refresh Issues
status: validated
lifecycle_classification: sot
owner: jp
created: 2026-06-02
last_reviewed: 2026-06-02
core_promotion_status: not-promoted
source: .sot/03-PROTOCOLS/CTO-CASE-STAGE6-REPEATABILITY-REFRESH-PRD.md
---
# CTO Case Stage6 Repeatability Refresh Issues
Local planning SOT only. Not a Core Protocol. Not active Core authority.
Stage 6 repeatability refresh.
## Issue: CTO-WORK-081 - Stage 6 Repeatability Refresh PRD
Type: AFK.
Status: validated.
Acceptance:
- Register `CTO-WORK-081`, `CTO-WORK-082`, `CTO-WORK-083`, and `CTO-WORK-084`.
- Define pinned inputs for `r1-src-string-slugify` and `r1-slugify-punctuation`.
- Require report shape, event validity, allowed-path compliance, failure closure, failure matrix coverage carry-forward, artifact completeness, forbidden-action closure, operator acceptance, approval provenance, source admission freshness, target validation, and Target Repository read-only status.
- Require fake lane evidence, Codex blocked rationale, and Pi blocked rationale.
- Require computed `repeatability_status` as `pass`, `blocked`, or `fail`.
- Require `repeatability_count: 2` for success.
- Require `runtime default activation: false`.
- Require `core_promotion_status: not-promoted`.
- Forbid new Case mutation, default activation, Core promotion, target mutation, push, merge, deploy, close, PR open, issue close, publication, vendor-source mutation, external developer repository mutation, unowned repository mutation, endpoint exposure, secret exposure, credential exposure, and raw Target Repository content exposure.
## Issue: CTO-WORK-082 - Stage 6 Repeatability Harness Route
Type: AFK.
Status: validated.
Blocked by: `CTO-WORK-081`.
Acceptance:
- Extend `validate-case-stage6-real-refresh.py`.
- Keep compatibility fields `real_stage5_pass_report` and `real_stage5_proof`.
- Add `repeat_stage5_pass_report` and `repeat_stage5_proof`.
- Add `real_stage5_passes`.
- Add `repeatability_required: 2`.
- Add `repeatability_count: 2`.
- Add `repeatability_status: pass`.
- Add `repeatability_passed: true`.
- Add `runtime_default_activation: false`.
- Add `core_promotion_status: not-promoted`.
- Focused validator passes.
- Aggregate Harness health passes before commit and after merge.
## Issue: CTO-WORK-083 - Hermes Repeatability Control Summary
Type: AFK.
Status: validated.
Blocked by: `CTO-WORK-082`.
Acceptance:
- Hermes Control Surface summary reads the extended real-governed refresh artifact.
- Summary exposes `repeatability_count: 2`.
- Summary exposes `repeatability_status: pass`.
- Summary exposes `candidate_default_repeatability_eligible: true`.
- Summary exposes `runtime_default_activation: false`.
- Summary exposes `core_promotion_status: not-promoted`.
- Summary keeps Codex lane and Pi lane blocked with rationale.
- Summary validator checks repeatability fields.
## Issue: CTO-WORK-084 - Stage 6 Repeatability Evidence Closeout
Type: AFK.
Status: validated.
Blocked by: `CTO-WORK-083`.
Acceptance:
- Evidence references Hermes implementation commit `05ab2ff`.
- Evidence references Hermes merge commit `40b0d9a`.
- Evidence references comparison artifact `/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260602T104507Z-stage6-real-governed-refresh/stage6-real-governed-refresh-comparison.json`.
- Evidence references WebUI summary artifact `/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260602T104552Z-run-all-fake-2738738/webui-summary.json`.
- Evidence records `repeatability_count: 2`.
- Evidence records `repeatability_status: pass`.
- Evidence records `runtime_default_activation: false`.
- Evidence records `core_promotion_status: not-promoted`.
- Evidence records no target repository mutation attempted.
- Evidence records no forbidden action.
- CTO validator checks PRD, issues, evidence, and Workboard statuses.
## Granularity Check
This sequence separates planning, Harness evidence refresh, Control Surface exposure, and closeout evidence. The Hermes code shipped the Harness and summary changes together because the summary is a read-only projection over the same artifact.

View File

@ -0,0 +1,95 @@
---
name: CTO Case Stage6 Repeatability Refresh PRD
status: validated
lifecycle_classification: sot
owner: jp
created: 2026-06-02
last_reviewed: 2026-06-02
core_promotion_status: not-promoted
source: .sot/03-PROTOCOLS/CTO-CASE-STAGE5-REPEATABILITY-PROOF-EVIDENCE.md
---
# CTO Case Stage6 Repeatability Refresh PRD
Local planning SOT only. Not a Core Protocol. Not active Core authority.
Stage 6 repeatability refresh.
## Problem
CTO has two governed Stage 5 Case passes in the admitted Target Repository. Existing Stage 6 real-governed refresh evidence consumed only the first pass. CTO needs a read-only refresh that proves repeatability across both passes before any later candidate-default decision discussion.
## Solution
Extend the existing Stage 6 real-governed refresh seam. The seam reads two pinned Harness Evidence Interface report and proof pairs, verifies both governed passes against Stage 6 dimensions, records computed repeatability status, and exposes repeatability through the Hermes Control Surface summary.
## Pinned Inputs
- first case_id: `r1-src-string-slugify`
- first run_id: `20260601T105222Z-r1-src-string-slugify-180161`
- first report: `/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260601T105222Z-r1-src-string-slugify-180161/report.json`
- first Stage 5 proof: `/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260601T105222Z-r1-src-string-slugify-180161/stage5-owned-repo-proof.json`
- first target commit: `7706f99`
- first expected changed files: `src/strings.py`, `test_strings.py`
- repeat case_id: `r1-slugify-punctuation`
- repeat run_id: `20260602T101450Z-r1-slugify-punctuation-2663598`
- repeat report: `/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260602T101450Z-r1-slugify-punctuation-2663598/report.json`
- repeat Stage 5 proof: `/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260602T101450Z-r1-slugify-punctuation-2663598/stage5-owned-repo-proof.json`
- repeat target commit: `2fa1601`
- repeat expected changed files: `src/strings.py`, `strings.py`, `test_strings.py`
## Scope
- Validate both report/proof pairs.
- Verify report shape, event validity, allowed-path compliance, failure closure, failure matrix coverage carry-forward, artifact completeness, forbidden-action closure, operator acceptance, approval provenance, source admission freshness, target validation, and Target Repository read-only status.
- Require fake lane evidence from prior Stage 6.
- Keep Codex lane blocked with rationale.
- Keep Pi lane blocked with rationale.
- Record computed `repeatability_status` as `pass`, `blocked`, or `fail`.
- Record `repeatability_count: 2` when both pinned passes validate.
- Record `runtime default activation: false`.
- Record `core_promotion_status: not-promoted`.
- Expose repeatability count, status, comparison path, and default activation false through Hermes Control Surface summary.
## Non-Goals
- Do not run a new Case mutation.
- Do not activate Case as default backend.
- Do not promote CTO artifacts into Core.
- Do not mutate a Target Repository.
- Do not push, merge, deploy, close, PR open, issue close, publish, mutate vendor source, mutate external developer repositories, mutate unowned repositories, expose endpoint values, expose secret values, expose credential values, or expose raw Target Repository content.
- Do not replace Core authority with CTO, Hermes, Harness, Case, Pi, Codex, or any execution backend.
## Acceptance Criteria
- `CTO-WORK-081`, `CTO-WORK-082`, `CTO-WORK-083`, and `CTO-WORK-084` are validated.
- `validate-case-stage6-real-refresh.py` consumes both pinned report/proof pairs.
- `validate-case-stage6-real-refresh.py` records `repeatability_count: 2`.
- `validate-case-stage6-real-refresh.py` records `repeatability_status: pass`.
- `validate-case-stage6-real-refresh.py` records `runtime_default_activation: false`.
- `validate-case-stage6-real-refresh.py` records `core_promotion_status: not-promoted`.
- Hermes Control Surface summary records `repeatability_count: 2`.
- Hermes Control Surface summary records `repeatability_status: pass`.
- Hermes Control Surface summary records `candidate_default_repeatability_eligible: true`.
- Hermes Control Surface summary records `runtime_default_activation: false`.
- Validators reject missing artifacts, failed pass facts, target mutation, default activation, Core promotion, forbidden actions, missing approval provenance, stale source admission, and secret-shaped keys.
## Validation
- `python3 harness/runner/validate-case-stage6-real-refresh.py --json`
- `python3 harness/runner/validate-webui-summary.py --json`
- `harness/evals/health.sh --json`
- `python3 tools/validate_cto_child.py`
- `python3 /home/svrnty/workspaces/cortex-os/core/tools/check_s69_caveman_prose_discipline.py`
## Risks
- Repeatability evidence can be misread as default authority. Mitigation: record runtime default activation false in every artifact.
- A second pass can overstate readiness if failure closure is ignored. Mitigation: require prior Stage 6 failure closure.
- Artifact paths can drift. Mitigation: validators load files from pinned artifact paths.
- Approval provenance can drift between prose and proof. Mitigation: validator records proof `approval_source` exactly and evidence records the mismatch.
- Control Surface wording can imply governance. Mitigation: Core promotion remains not-promoted.
## Success Definition
CTO has validated child-local evidence that Case passed two distinct governed Stage 5 tasks through the CTO Harness seam, Hermes exposes that repeatability evidence read-only, runtime default activation remains false, and Core promotion remains not-promoted.

View File

@ -401,3 +401,23 @@ items:
status: validated
source: .sot/03-PROTOCOLS/CTO-CASE-STAGE5-REPEATABILITY-PROOF-EVIDENCE.md
owner: ""
- id: CTO-WORK-081
title: Stage 6 Repeatability Refresh PRD
status: validated
source: .sot/03-PROTOCOLS/CTO-CASE-STAGE6-REPEATABILITY-REFRESH-PRD.md
owner: ""
- id: CTO-WORK-082
title: Stage 6 Repeatability Harness Route
status: validated
source: .sot/03-PROTOCOLS/CTO-CASE-STAGE6-REPEATABILITY-REFRESH-ISSUES.md
owner: ""
- id: CTO-WORK-083
title: Hermes Repeatability Control Summary
status: validated
source: .sot/03-PROTOCOLS/CTO-CASE-STAGE6-REPEATABILITY-REFRESH-ISSUES.md
owner: ""
- id: CTO-WORK-084
title: Stage 6 Repeatability Evidence Closeout
status: validated
source: .sot/03-PROTOCOLS/CTO-CASE-STAGE6-REPEATABILITY-REFRESH-EVIDENCE.md
owner: ""

View File

@ -81,6 +81,9 @@ REQUIRED_FILES = [
".sot/03-PROTOCOLS/CTO-CASE-STAGE5-REPEATABILITY-PROOF-PRD.md",
".sot/03-PROTOCOLS/CTO-CASE-STAGE5-REPEATABILITY-PROOF-ISSUES.md",
".sot/03-PROTOCOLS/CTO-CASE-STAGE5-REPEATABILITY-PROOF-EVIDENCE.md",
".sot/03-PROTOCOLS/CTO-CASE-STAGE6-REPEATABILITY-REFRESH-PRD.md",
".sot/03-PROTOCOLS/CTO-CASE-STAGE6-REPEATABILITY-REFRESH-ISSUES.md",
".sot/03-PROTOCOLS/CTO-CASE-STAGE6-REPEATABILITY-REFRESH-EVIDENCE.md",
".sot/03-PROTOCOLS/CTO-FIRST-REAL-GOVERNED-WORKFLOW-PRD.md",
".sot/03-PROTOCOLS/CTO-FIRST-REAL-GOVERNED-WORKFLOW-ISSUES.md",
".sot/03-PROTOCOLS/CTO-FIRST-REAL-GOVERNED-WORKFLOW-APPROVAL-PACKET.md",
@ -1327,6 +1330,84 @@ REQUIRED_STAGE5_REPEATABILITY_PHRASES = [
"No Core promotion occurs.",
]
REQUIRED_STAGE6_REPEATABILITY_REFRESH_PRD_PHRASES = [
"Local planning SOT only. Not a Core Protocol. Not active Core authority.",
"Stage 6 repeatability refresh",
"CTO-WORK-081",
"CTO-WORK-082",
"CTO-WORK-083",
"CTO-WORK-084",
"r1-src-string-slugify",
"r1-slugify-punctuation",
"/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260601T105222Z-r1-src-string-slugify-180161/report.json",
"/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260602T101450Z-r1-slugify-punctuation-2663598/report.json",
"report shape",
"event validity",
"allowed-path compliance",
"failure closure",
"failure matrix coverage carry-forward",
"artifact completeness",
"forbidden-action closure",
"operator acceptance",
"approval provenance",
"source admission freshness",
"repeatability_count: 2",
"repeatability_status: pass",
"runtime_default_activation: false",
"core_promotion_status: not-promoted",
]
REQUIRED_STAGE6_REPEATABILITY_REFRESH_ISSUE_PHRASES = [
"Local planning SOT only. Not a Core Protocol. Not active Core authority.",
"Stage 6 repeatability refresh",
"CTO-WORK-081",
"CTO-WORK-082",
"CTO-WORK-083",
"CTO-WORK-084",
"validate-case-stage6-real-refresh.py",
"real_stage5_passes",
"repeat_stage5_pass_report",
"repeat_stage5_proof",
"repeatability_required: 2",
"repeatability_count: 2",
"repeatability_status: pass",
"repeatability_passed: true",
"runtime_default_activation: false",
"core_promotion_status: not-promoted",
"05ab2ff",
"40b0d9a",
"/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260602T104507Z-stage6-real-governed-refresh/stage6-real-governed-refresh-comparison.json",
"/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260602T104552Z-run-all-fake-2738738/webui-summary.json",
]
REQUIRED_STAGE6_REPEATABILITY_REFRESH_EVIDENCE_PHRASES = [
"Local planning SOT only. Not a Core Protocol. Not active Core authority.",
"Stage 6 repeatability refresh",
"CTO-WORK-081",
"CTO-WORK-082",
"CTO-WORK-083",
"CTO-WORK-084",
"05ab2ff",
"40b0d9a",
"r1-slugify-punctuation",
"2fa1601",
"7706f99",
"/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260602T104507Z-stage6-real-governed-refresh/stage6-real-governed-refresh-comparison.json",
"/home/svrnty/.hermes/profiles/cto-planb/harness-runs/20260602T104552Z-run-all-fake-2738738/webui-summary.json",
"repeatability_count: 2",
"repeatability_status: pass",
"runtime_default_activation: false",
"core_promotion_status: not-promoted",
"no target repository mutation attempted",
"report shape: pass",
"event validity: pass",
"approval provenance: pass",
"source admission freshness: current",
"Codex lane: blocked with rationale",
"Pi lane: blocked with rationale",
"JP chat approval on 2026-06-01",
]
def workboard_status(text: str, issue_id: str) -> str | None:
pattern = rf"- id: {re.escape(issue_id)}\n(?: .+\n)*? status: ([^\n]+)"
@ -1845,6 +1926,36 @@ def main() -> int:
if phrase not in text:
errors.append(f"missing_stage5_repeatability_evidence_phrase:{phrase}")
stage6_repeatability_prd = ROOT / ".sot/03-PROTOCOLS/CTO-CASE-STAGE6-REPEATABILITY-REFRESH-PRD.md"
if stage6_repeatability_prd.is_file():
text = stage6_repeatability_prd.read_text(encoding="utf-8")
if "core_promotion_status: not-promoted" not in text:
errors.append("stage6_repeatability_prd_missing_not_promoted_frontmatter")
for phrase in REQUIRED_STAGE6_REPEATABILITY_REFRESH_PRD_PHRASES:
checked.append(f"stage6_repeatability_prd_phrase:{phrase}")
if phrase not in text:
errors.append(f"missing_stage6_repeatability_prd_phrase:{phrase}")
stage6_repeatability_issues = ROOT / ".sot/03-PROTOCOLS/CTO-CASE-STAGE6-REPEATABILITY-REFRESH-ISSUES.md"
if stage6_repeatability_issues.is_file():
text = stage6_repeatability_issues.read_text(encoding="utf-8")
if "core_promotion_status: not-promoted" not in text:
errors.append("stage6_repeatability_issues_missing_not_promoted_frontmatter")
for phrase in REQUIRED_STAGE6_REPEATABILITY_REFRESH_ISSUE_PHRASES:
checked.append(f"stage6_repeatability_issue_phrase:{phrase}")
if phrase not in text:
errors.append(f"missing_stage6_repeatability_issue_phrase:{phrase}")
stage6_repeatability_evidence = ROOT / ".sot/03-PROTOCOLS/CTO-CASE-STAGE6-REPEATABILITY-REFRESH-EVIDENCE.md"
if stage6_repeatability_evidence.is_file():
text = stage6_repeatability_evidence.read_text(encoding="utf-8")
if "core_promotion_status: not-promoted" not in text:
errors.append("stage6_repeatability_evidence_missing_not_promoted_frontmatter")
for phrase in REQUIRED_STAGE6_REPEATABILITY_REFRESH_EVIDENCE_PHRASES:
checked.append(f"stage6_repeatability_evidence_phrase:{phrase}")
if phrase not in text:
errors.append(f"missing_stage6_repeatability_evidence_phrase:{phrase}")
prd = ROOT / ".sot/03-PROTOCOLS/CTO-CASE-CANDIDATE-BACKEND-PRD.md"
if prd.is_file():
text = prd.read_text(encoding="utf-8")
@ -2456,6 +2567,10 @@ def main() -> int:
"CTO-WORK-078": "validated",
"CTO-WORK-079": "validated",
"CTO-WORK-080": "validated",
"CTO-WORK-081": "validated",
"CTO-WORK-082": "validated",
"CTO-WORK-083": "validated",
"CTO-WORK-084": "validated",
}
for issue_id, expected in expected_statuses.items():
checked.append(f"workboard_status:{issue_id}:{expected}")