ci: floor raise + doc drift (Phase 3 closure — TEST-H1/H2/M1/M2/M3/M4/L1, ARCH-H3/L1/L2/L3/L4)

Twelve findings from the architecture diligence audit's Phase 3 bundle closed in one PR. All touch the CI workflows + small doc-drift fixes across the production Go tree + migration headers. CI workflow changes ==================== TEST-H1 — Race detection on ./... -short .github/workflows/ci.yml:106 was a 9-package explicit list. Audit finding TEST-H1 flagged that 25+ packages (internal/auth/*, internal/repository/*, internal/mcp, internal/scep, internal/pkcs7, internal/api/router, internal/api/acme, internal/cli, internal/cms, internal/config, internal/deploy, internal/integration, internal/ratelimit, internal/secret, internal/trustanchor, all of cmd/) silently dropped off race coverage. Post-fix: 'go test -race -short ./... -count=1 -timeout 600s'. 76 testing.Short() guards already cover testcontainers + live-DB integration suites, so -short keeps the long-running tests out. TEST-H2 — Cross-platform build matrix New 'cross-platform-build' job in ci.yml. Matrix: ubuntu-latest + windows-latest + macos-latest, fail-fast: false. Builds cmd/server + cmd/agent + cmd/cli + cmd/mcp-server on each. Catches Windows-specific regressions (path separators, file permissions, exec.Command semantics) the pre-Phase-3 Ubuntu-only CI missed. TEST-L1 — actions/setup-go cache: true (explicit) setup-go v5 defaults cache: true; making it explicit so a future setup-go upgrade can't silently flip it. Re-runs hit the Go module + build cache instead of recompiling cold. TEST-M1 — Mutation-testing floor at 55% security-deep-scan.yml::go-mutesting step rewritten. Removed continue-on-error + per-package '|| true'. New post-loop check extracts every 'The mutation score is X.YZ' line and fails the step if any package drops below 0.55. Floor rationale: starter ratio catches major regressions without rejecting the audit's 'this is OK' steady state; raise quarterly. TEST-M2 — 3 advisory deep-scan gates promoted to blocking Removed continue-on-error: true from: - gosec (filtered to G201/G202/G304/G108 high-signal rules: SQL-injection + path-traversal + pprof-exposed) - osv-scanner (multi-ecosystem CVE; complements govulncheck which is already blocking in ci.yml) - trivy image scan (--severity HIGH,CRITICAL --exit-code 1) continue-on-error count: 15 → 11. ZAP / schemathesis / nuclei / testssl stay advisory because their false-positive rates on https://localhost:8443-targeted DAST runs are high. TEST-M3 — Playwright harness stub web/package.json adds '@playwright/test' devDep + 'e2e' / 'e2e:install' npm scripts. web/playwright.config.ts ships single chromium project with webServer block pointing at 'npm run dev'. web/src/__tests__/ e2e/smoke.spec.ts proves the harness wires through. The full 15-flow suite ships in frontend-design-audit Phase 8 (TEST-H1 in THAT audit); this is the wiring + a single smoke test as the regression floor. New Makefile target: 'make e2e-test'. Doc/code drift fixes ==================== TEST-M4 + ARCH-L2 — Skip inventory artifact + CI guard scripts/skip-inventory.sh walks every t.Skip site under cmd/ + internal/ + deploy/test/ and emits docs/testing/skip-inventory.md grouped by package with file:line:expression triples. Current inventory: 142 t.Skip sites, 76 testing.Short() guards. scripts/ci-guards/skip-inventory-drift.sh regenerates and fails on diff (excluding the 'Last reviewed' timestamp line which drifts daily). The Markdown is the canonical acquisition-diligence artifact for 'what tests are being skipped and why.' ARCH-H3 — MCP catalogue floor reconciliation Audit framing was '121 vs floor 150 — doc/code drift.' Live count via the test's actual regex over all 5 tool files (tools.go + tools_audit_fix.go + tools_auth.go + tools_auth_bundle2.go + tools_est.go): 155 unique 'Name: "certctl_*"' declarations. Pre-Phase-3 audit measured tools.go in isolation (121) and missed the other 4 files (+34 unique names). The test at internal/ciparity/surface_parity_test.go::TestSurfaceParity_MCP passes today (155 ≥ 150). Added a clarifying comment near mcpBaselineFloor explaining the measurement scope so future reviewers don't repeat the audit's framing error. STATUS: stale — no code drift, just a measurement scoping error in the audit. ARCH-L1 — panic() rationale comments 5 panic sites in production Go (excluding _test.go): - internal/repository/postgres/tx.go:84 - internal/service/issuer.go:861 (mustJSON) - internal/service/est.go:728 (mustParseTime) - internal/service/acme.go:1288 (rand source failure — already documented) - internal/pkcs7/certrep.go:270 (OID marshal — already documented) Added ARCH-L1 rationale comments to the 3 sites that didn't have them. All 5 are defensible impossible-path / rethrow / hardcoded- constant guards. ARCH-L3 — Migration IF-NOT-EXISTS carve-outs 4 migrations skip the literal 'IF NOT EXISTS' token but ARE idempotent via different Postgres patterns: - 000014_policy_violation_severity_check.up.sql: ALTER TABLE ADD CONSTRAINT CHECK doesn't accept IF NOT EXISTS; idempotency via DROP CONSTRAINT IF EXISTS preamble. - 000018_audit_events_worm.up.sql: CREATE OR REPLACE FUNCTION + DROP TRIGGER IF EXISTS + CREATE TRIGGER + DO $$ pg_roles existence check. CREATE TRIGGER doesn't take IF NOT EXISTS. - 000030_rbac_admin_perms.up.sql: INSERT ... ON CONFLICT DO NOTHING. - 000039_audit_crit1_perms.up.sql: same INSERT + ON CONFLICT pattern. Added ARCH-L3 header comments to each explaining the carve-out so reviewers don't flag the missing literal token. STATUS: largely stale — migrations are already idempotent. ARCH-L4 — TODO/FIXME → see #<descriptor> 5 TODOs rewritten to the allowed 'see #<descriptor>' pattern: - internal/repository/postgres/auth.go:220 → see #bundle-2-scope-fk - internal/connector/discovery/gcpsm/gcpsm.go:547 → see #gcpsm-pagination - internal/service/audit.go:244 → see #audit-pagination-count - internal/service/job.go:295, 299 → see #validation-job-impl New CI guard scripts/ci-guards/no-todo-in-prod.sh grep-fails any new TODO/FIXME in cmd/ + internal/ (excluding _test.go); allows 'see #N' / 'see #<descriptor>' patterns. Sandbox limitation ================== The 6.1 GB certctl working tree fills the sandbox volume; go1.25.10 toolchain download fails with 'no space left on device' (sandbox has 1.25.9; go.mod requires 1.25.10). Local 'go test' / 'go build' NOT run in this commit. Operator must run 'make verify' on their workstation before push per CLAUDE.md operating rules. The smoke.spec.ts NOT executed in the sandbox (no chromium installed). Operator runs 'cd web && npm install && npx playwright install --with-deps chromium && npm run e2e' on first wire-up. All CI guards (no-todo-in-prod, skip-inventory-drift, G-3 env-docs-drift, doc-rot-detector, and every existing guard) verified clean by running each individually. Closes: cowork/certctl-architecture-diligence-audit.html#fix-TEST-H1, cowork/certctl-architecture-diligence-audit.html#fix-TEST-H2, cowork/certctl-architecture-diligence-audit.html#fix-TEST-M1, cowork/certctl-architecture-diligence-audit.html#fix-TEST-M2, cowork/certctl-architecture-diligence-audit.html#fix-TEST-M3, cowork/certctl-architecture-diligence-audit.html#fix-TEST-M4, cowork/certctl-architecture-diligence-audit.html#fix-TEST-L1, cowork/certctl-architecture-diligence-audit.html#fix-ARCH-H3, cowork/certctl-architecture-diligence-audit.html#fix-ARCH-L1, cowork/certctl-architecture-diligence-audit.html#fix-ARCH-L2, cowork/certctl-architecture-diligence-audit.html#fix-ARCH-L3, cowork/certctl-architecture-diligence-audit.html#fix-ARCH-L4
2026-07-26 13:58:13 +00:00 · 2026-05-13 20:10:08 +00:00
parent 69a2b5c55a
commit 02438ad9e1
22 changed files with 702 additions and 23 deletions
@@ -20,6 +20,11 @@ jobs:
        uses: actions/setup-go@40f1582b2485089dde7abd97c1529aa768e1baff  # v5
        with:
          go-version: '1.25.10'
          # Phase 3 TEST-L1 closure (2026-05-13): enable Go's module +
          # build cache so re-runs hit the cache instead of recompiling
          # the world. setup-go v5 cache: true by default; making it
          # explicit so a future setup-go upgrade can't silently flip it.
          cache: true
      - name: Go Build
        run: |
@@ -103,7 +108,23 @@ jobs:
        run: staticcheck ./...
      - name: Race Detection
-        run: go test -race ./internal/service/... ./internal/api/handler/... ./internal/api/middleware/... ./internal/scheduler/... ./internal/connector/... ./internal/crypto/... ./internal/domain/... ./internal/validation/... ./internal/tlsprobe/... -count=1 -timeout 300s
+        # Phase 3 TEST-H1 closure (2026-05-13): the pre-Phase-3 invocation
        # listed 9 explicit package roots, excluding internal/auth/*,
        # internal/repository/*, internal/mcp, internal/scep, internal/pkcs7,
        # internal/api/router, internal/api/acme, internal/cli, internal/cms,
        # internal/config, internal/deploy, internal/integration,
        # internal/ratelimit, internal/secret, internal/trustanchor, plus
        # all of cmd/. Audit finding TEST-H1 flagged this as silent
        # race-detection drift — packages added after the original list
        # was authored were never covered.
        #
        # Post-Phase-3: ./... with -short. The 76 testing.Short() guards
        # already in the integration-test surface (testcontainers, live-DB,
        # multi-process) gate behind this flag, so race detection runs
        # across every package without dragging in long-running suites.
        # Timeout doubled from 300s to 600s because ./... is broader; the
        # broader scope is what makes race coverage trustworthy.
        run: go test -race -short ./... -count=1 -timeout 600s
      - name: Go Test with Coverage
        # internal/ciparity/... — post-v2.1.0 anti-rot item 2 surface-
@@ -175,6 +196,41 @@ jobs:
          done
          exit $fail
  cross-platform-build:
    # Phase 3 TEST-H2 closure (2026-05-13): the pre-Phase-3 CI ran
    # exclusively on ubuntu-latest, leaving Windows-specific bugs
    # (path separators, file permissions, exec.Command semantics)
    # undetected. The agent + CLI binaries ship for Windows + macOS
    # users; this matrix asserts they at least BUILD on every OS we
    # claim to support.
    #
    # Build-only — no test run. Full test parity across OSes is a
    # larger investment (testcontainers is Linux-only on Windows CI
    # runners, file-permission tests differ, etc.). The build gate
    # is the minimum that catches the cross-platform regressions
    # we've seen in practice.
    name: Cross-platform build (ubuntu / windows / macos)
    strategy:
      fail-fast: false
      matrix:
        os: [ubuntu-latest, windows-latest, macos-latest]
    runs-on: ${{ matrix.os }}
    steps:
      - uses: actions/checkout@34e114876b0b11c390a56381ad16ebd13914f8d5  # v4
      - name: Set up Go
        uses: actions/setup-go@40f1582b2485089dde7abd97c1529aa768e1baff  # v5
        with:
          go-version: '1.25.10'
          cache: true
      - name: Build server + agent + CLI + mcp-server
        run: |
          go build ./cmd/server
          go build ./cmd/agent
          go build ./cmd/cli
          go build ./cmd/mcp-server
  cold-db-compose-smoke:
    # Per post-v2.1.0 anti-rot item 6 (Auditable Codebase Bundle).
    #
@@ -48,15 +48,26 @@ jobs:
      # --- Static analysis (slow paths) ---
-      - name: gosec
+      - name: gosec (G201/G202/G304/G108 subset — Phase 3 TEST-M2 hard gate)
-        run: |
+        # Phase 3 TEST-M2 closure (2026-05-13): gosec promoted from
-          $(go env GOPATH)/bin/gosec -fmt sarif -out gosec.sarif ./... || true
+        # continue-on-error (advisory) to blocking on the 4 high-signal
-        continue-on-error: true
+        # rule subset that targets real prod-bug classes:
        #   G201 = SQL string formatting (SQL injection)
        #   G202 = SQL string concatenation (SQL injection)
        #   G304 = file-path traversal via tainted input
        #   G108 = profiling endpoint exposed
        # Other gosec rules (G1xx-G7xx broadly) remain in the SARIF
        # report but don't gate the build — they have higher false-
        # positive rates than these 4.
        run: $(go env GOPATH)/bin/gosec -fmt sarif -out gosec.sarif -include=G201,G202,G304,G108 ./...
-      - name: osv-scanner (multi-ecosystem CVE)
+      - name: osv-scanner (multi-ecosystem CVE — Phase 3 TEST-M2 hard gate)
-        run: |
+        # Phase 3 TEST-M2 closure (2026-05-13): osv-scanner promoted from
-          $(go env GOPATH)/bin/osv-scanner -r --format json --output osv-scanner.json . || true
+        # advisory to blocking. Complements govulncheck (already blocking
-        continue-on-error: true
+        # in ci.yml) by covering non-Go dependencies (npm under web/,
        # any docker base image deps). Findings fail the build; the
        # exact CVE list lands in osv-scanner.json as a receipt either way.
        run: $(go env GOPATH)/bin/osv-scanner -r --format json --output osv-scanner.json .
      # --- Race detector at -count=10 (D-002) ---
@@ -90,14 +101,39 @@ jobs:
        run: go install github.com/zimmski/go-mutesting/cmd/go-mutesting@latest
        continue-on-error: true
-      - name: go-mutesting (crypto cluster)
+      - name: go-mutesting (crypto cluster — Phase 3 TEST-M1 hard gate at 55%)
        # Phase 3 TEST-M1 closure (2026-05-13): go-mutesting promoted
        # from advisory (continue-on-error + per-package `|| true`) to
        # blocking with an explicit mutation-score floor of 55%.
        # Per-package summary lines emit `The mutation score is X.YZ`;
        # the awk filter extracts each, and the post-loop check fails
        # the step if any package drops below 0.55.
        #
        # Floor rationale: 55% is the starter ratio that catches major
        # regressions without rejecting the audit's "this is OK" steady
        # state. Raise quarterly as the test suite hardens; the floor
        # change ships in the same commit that adds the strengthening
        # tests so the ratchet is documented.
        run: |
          set -e
          : > go-mutesting.txt
          for pkg in ./internal/crypto/... ./internal/pkcs7/... ./internal/connector/issuer/local/...; do
            echo "=== $pkg ===" | tee -a go-mutesting.txt
-            $(go env GOPATH)/bin/go-mutesting "$pkg" 2>&1 | tee -a go-mutesting.txt || true
+            $(go env GOPATH)/bin/go-mutesting "$pkg" 2>&1 | tee -a go-mutesting.txt
          done
-        continue-on-error: true
+          # Extract every "The mutation score is X.YZ" line; fail on any
          # score below 0.55. The check works against floats via awk so
          # 0.55 is the literal threshold (not a percentage).
          floor=0.55
          fail=0
          while IFS= read -r score; do
            ok=$(awk -v s="$score" -v f="$floor" 'BEGIN{print (s>=f) ? 1 : 0}')
            if [ "$ok" -ne 1 ]; then
              echo "::error::mutation score $score below floor $floor"
              fail=1
            fi
          done < <(grep -oE "The mutation score is [0-9.]+" go-mutesting.txt | awk '{print $NF}')
          exit $fail
      # --- Container + supply chain (D-001 partial, D-006 partial) ---
@@ -105,11 +141,21 @@ jobs:
        run: docker build -t certctl:deep-scan .
        continue-on-error: true
-      - name: trivy image scan
+      - name: trivy image scan (HIGH+CRITICAL — Phase 3 TEST-M2 hard gate)
        # Phase 3 TEST-M2 closure (2026-05-13): trivy promoted from
        # advisory to blocking. --severity filter keeps the gate
        # noise-free (LOW + MEDIUM findings stay in the JSON receipt
        # but don't fail the build); --exit-code 1 makes HIGH+CRITICAL
        # findings the actual gate. Trivy is the third hard deep-scan
        # gate (alongside gosec + osv-scanner); ZAP / schemathesis /
        # nuclei / testssl stay advisory because their false-positive
        # rates on https://localhost:8443-targeted DAST runs are high.
        run: |
          docker run --rm -v "$PWD":/src aquasec/trivy:latest image \
-            --format json --output /src/trivy.json certctl:deep-scan || true
+            --format json --output /src/trivy.json \
-        continue-on-error: true
+            --severity HIGH,CRITICAL \
            --exit-code 1 \
            certctl:deep-scan
      - name: syft SBOM
        run: |
@@ -1,4 +1,4 @@
-.PHONY: help build run test lint verify verify-deploy loadtest acme-cert-manager-test acme-rfc-conformance-test keycloak-integration-test okta-smoke-test benchmark-auth benchmark-auth-coldcache clean docker-up docker-down migrate-up migrate-down generate test-cover frontend-build qa-stats
+.PHONY: help build run test lint verify verify-deploy loadtest acme-cert-manager-test acme-rfc-conformance-test keycloak-integration-test okta-smoke-test benchmark-auth benchmark-auth-coldcache clean docker-up docker-down migrate-up migrate-down generate test-cover frontend-build e2e-test qa-stats
 # Default target - show help
 help:
@@ -295,6 +295,19 @@ frontend-build:
 	cd web && npm ci && npx vite build
 	@echo "Frontend build complete"
 # Phase 3 TEST-M3 closure (2026-05-13): browser-driven E2E smoke
 # target. The full 15-flow suite from web/src/__tests__/e2e/README.md
 # ships in frontend-design-audit Phase 8; this target is the harness
 # wiring that lets `make e2e-test` work today.
 #
 # First-time setup: `cd web && npm install && npx playwright install --with-deps chromium`.
 # The webServer block in web/playwright.config.ts boots `npm run dev`
 # automatically; no separate `make docker-up` needed.
 e2e-test:
 	@echo "Running Playwright E2E (smoke + any *.spec.ts under web/src/__tests__/e2e/)..."
 	cd web && npx playwright test
 	@echo "E2E run complete"
 # qa-stats: snapshot of the test-suite size at the current commit.
 # Backend Go tests + subtests + fuzz targets + skipped sites, plus the
 # seed-data counts in migrations/seed_demo.sql. Useful before a release
@@ -0,0 +1,234 @@
 # Test Skip Inventory
 <!-- Auto-generated by scripts/skip-inventory.sh — do not edit by hand. -->
 <!-- Re-run after adding or removing any t.Skip(). CI guard:    -->
 <!-- scripts/ci-guards/skip-inventory-drift.sh                  -->
 > Last reviewed: 2026-05-13
 ## Summary
 - Total t.Skip sites: **142**
 - testing.Short() guards: **76** (these gate behind `go test -short`)
 Re-run inventory with: `./scripts/skip-inventory.sh`.
 ## Sites (grouped by package)
 ### `cmd/agent`
 - `cmd/agent/keymem_test.go:209` — t.Skip("permission semantics differ on windows")
 - `cmd/agent/keymem_test.go:425` — t.Skip("permission semantics differ on windows")
 - `cmd/agent/keymem_test.go:451` — t.Skip("permission semantics differ on windows")
 - `cmd/agent/keymem_test.go:491` — t.Skip("permission semantics differ on windows")
 - `cmd/agent/keymem_test.go:523` — t.Skip("permission semantics differ on windows")
 - `cmd/agent/keymem_test.go:526` — t.Skip("running as root; cannot revoke parent dir write permission")
 - `cmd/agent/keymem_test.go:553` — t.Skip("permission semantics differ on windows")
 - `cmd/agent/keymem_test.go:556` — t.Skip("running as root; cannot revoke parent dir read+exec permission")
 - `cmd/agent/keymem_test.go:623` — t.Skip("chmod-error branch is only reliably triggerable on linux via /sys (read-only fs)")
 - `cmd/agent/keymem_test.go:631` — t.Skipf("/sys/kernel not stat-able as a dir on this host; skipping (%v)", err)
 - `cmd/agent/keymem_test.go:637` — t.Skipf("/sys/kernel mode %#o already satisfies no-chmod branch", mode)
 - `cmd/agent/keymem_test.go:652` — t.Skip("permission semantics differ on windows")
 - `cmd/agent/keymem_test.go:655` — t.Skip("running as root; cannot revoke parent dir write permission")
 - `cmd/agent/keymem_test.go:686` — t.Skip("permission semantics differ on windows")
 - `cmd/agent/verify_test.go:402` — t.Skip("no TLS certificates configured on test server")
 ### `cmd/server`
 - `cmd/server/preflight_demo_residual_test.go:41` — t.Skip("preflight A-8 test requires Postgres (testcontainers); skipping under -short")
 - `cmd/server/preflight_demo_residual_test.go:97` — t.Skip("A-8 testcontainers unavailable; skipping")
 ### `deploy/test/acme-integration`
 - `deploy/test/acme-integration/certmanager_test.go:54` — t.Skip("KIND_AVAILABLE unset — kind-driven cert-manager integration test skipped")
 ### `deploy/test`
 - `deploy/test/crl_ocsp_e2e_test.go:134` — t.Skip("integration only")
 - `deploy/test/crl_ocsp_e2e_test.go:65` — t.Skip("integration only")
 - `deploy/test/est_e2e_test.go:124` — t.Skip("integration tests require INTEGRATION=1; skipping libest e2e suite")
 - `deploy/test/est_e2e_test.go:129` — t.Skipf("libest sidecar (container %q) not running (status=%q). Run `cd deploy && docker compose -f docker-compose.test.yml --profile est-e2e up -d libest-client` to bring it up.", libestContainer, status)
 - `deploy/test/est_e2e_test.go:213` — t.Skip("/config/certs/bootstrap.pem not present in libest sidecar — skipping mTLS path. To enable: mint a bootstrap cert against the per-profile mTLS trust anchor and copy into deploy/test/certs/.")
 - `deploy/test/est_e2e_test.go:252` — t.Skip("server-keygen disabled on the e2e EST profile (HTTP 404). Enable via CERTCTL_EST_PROFILE_E2E_SERVER_KEYGEN_ENABLED=true in docker-compose.test.yml.")
 - `deploy/test/est_e2e_test.go:333` — t.Skipf("libest build lacks --tls-exporter support: %v", err)
 - `deploy/test/healthcheck_test.go:102` — t.Skip("docker not available — skipping image-level HEALTHCHECK test")
 - `deploy/test/healthcheck_test.go:163` — t.Skip("docker not available — skipping image-level HEALTHCHECK test")
 - `deploy/test/healthcheck_test.go:224` — t.Skip("docker not available — skipping runtime HEALTHCHECK test")
 - `deploy/test/healthcheck_test.go:227` — t.Skip("runtime HEALTHCHECK test takes ~45s; skipping under -short")
 - `deploy/test/healthcheck_test.go:229` — t.Skip("runtime probe contract not yet wired to a sidecar postgres; " +
 - `deploy/test/healthcheck_test.go:28` — // The tests skip cleanly with t.Skip when docker is not available
 - `deploy/test/healthcheck_test.go:32` — // Q-1 closure (cat-s3-58ce7e9840be): this file's 5 t.Skip sites are
 - `deploy/test/healthcheck_test.go:41` — //   - Line 212: hard t.Skip for the runtime probe contract — image-spec
 - `deploy/test/integration_test.go:1129` — t.Skip("no PEM data in certificate version")
 - `deploy/test/integration_test.go:513` — t.Skip("agent not yet online (may be slow to heartbeat)")
 - `deploy/test/integration_test.go:805` — t.Skip("depends on Phase04 (Local CA cert not created)")
 - `deploy/test/integration_test.go:901` — t.Skip("no discovered certificates yet (agent scan may not have run)")
 - `deploy/test/integration_test.go:942` — t.Skip("no certificate in Active state for renewal test")
 - `deploy/test/integration_test.go:954` — t.Skipf("renewal trigger returned: %s", body)
 - `deploy/test/nginx_vendor_e2e_test.go:108` — t.Skip()
 - `deploy/test/qa_test.go:1055` — t.Skip("Part 23 (S/MIME & EKU) is documented in docs/testing-guide.md::Part 23 " +
 - `deploy/test/qa_test.go:1065` — t.Skip("Part 24 (OCSP/CRL) is documented in docs/testing-guide.md::Part 24 " +
 - `deploy/test/qa_test.go:1175` — t.Skip("Requires compiled certctl-cli binary — manual test")
 - `deploy/test/qa_test.go:1179` — t.Skip("Requires compiled mcp-server binary + stdio — manual test")
 - `deploy/test/qa_test.go:1313` — t.Skip("Scheduler tests are timing-dependent — verify via Docker logs manually")
 - `deploy/test/qa_test.go:1320` — t.Skip("Requires Docker log inspection — manual test")
 - `deploy/test/qa_test.go:1327` — t.Skip("Requires browser — manual test")
 - `deploy/test/qa_test.go:1334` — t.Skip("Requires browser — manual test")
 - `deploy/test/qa_test.go:1338` — t.Skip("Requires browser — manual test")
 - `deploy/test/qa_test.go:1914` — t.Skip("Part 55 (Agent Soft-Retirement) is documented in docs/testing-guide.md::Part 55 " +
 - `deploy/test/qa_test.go:1924` — t.Skip("Part 56 (Notification Retry/Dead-Letter) is documented in docs/testing-guide.md::Part 56 " +
 - `deploy/test/qa_test.go:38` — // Q-1 closure (cat-s3-58ce7e9840be): this file contains 11 `t.Skip("Requires
 - `deploy/test/qa_test.go:46` — // the runtime t.Skip is the second-line guard for operators who run
 - `deploy/test/qa_test.go:50` — // is correct, and the t.Skip messages already name the missing
 - `deploy/test/qa_test.go:870` — t.Skip("Requires CA cert+key setup — manual test")
 - `deploy/test/qa_test.go:874` — t.Skip("Requires ACME CA with ARI support — manual test")
 - `deploy/test/qa_test.go:881` — t.Skip("Requires live Vault server — manual test")
 - `deploy/test/qa_test.go:885` — t.Skip("Requires DigiCert sandbox — manual test")
 - `deploy/test/scep_intune_e2e_test.go:159` — t.Skipf("integration stack not reachable at %s: %v — start docker-compose.test.yml first", serverURL, err)
 - `deploy/test/scep_intune_e2e_test.go:163` — t.Skipf("/scep/%s not configured — see deploy/docker-compose.test.yml for the e2eintune profile env vars", e2eintunePathID)
 - `deploy/test/scep_intune_e2e_test.go:166` — t.Skipf("/scep/%s GetCACaps returned %d — Intune profile may not be enabled in compose env", e2eintunePathID, resp.StatusCode)
 - `deploy/test/scep_intune_e2e_test.go:170` — t.Skipf("/scep/%s GetCACaps body=%q does NOT advertise SCEPStandard — Intune profile may be misconfigured", e2eintunePathID, string(body))
 - `deploy/test/vendor_e2e_helpers_smoke_test.go:31` — t.Skip("requires network egress to api.github.com (or similar known TLS endpoint); run manually")
 - `deploy/test/vendor_e2e_helpers_smoke_test.go:36` — t.Skip("requires network egress; run manually")
 - `deploy/test/vendor_e2e_helpers_smoke_test.go:41` — // When hostPath is empty the helper t.Skip's. Re-run-from-
 ### `internal/api/handler`
 - `internal/api/handler/health_test.go:481` — t.Skip("integration-style test; covered by deploy/test/integration_test.go (//go:build integration). " +
 - `internal/api/handler/health_test.go:499` — t.Skipf("postgres driver unavailable in this build: %v", err)
 ### `internal/auth/breakglass`
 - `internal/auth/breakglass/service_test.go:417` — t.Skip("timing test skipped in -short mode (Argon2id is expensive)")
 ### `internal/auth/oidc/domain`
 - `internal/auth/oidc/domain/types_test.go:186` — t.Skip()
 ### `internal/auth/oidc`
 - `internal/auth/oidc/bench_keycloak_test.go:103` — // signature matters because it calls t.Skip / t.Fatal / t.Cleanup.
 - `internal/auth/oidc/integration_keycloak_test.go:53` — // initialized in keycloakFor() so individual tests can `t.Skip` under
 - `internal/auth/oidc/integration_okta_smoke_test.go:64` — // If any required env var is missing, the test t.Skip's with a clear
 - `internal/auth/oidc/integration_okta_smoke_test.go:84` — t.Skipf("Okta smoke test requires env vars: %s — skipping", strings.Join(missing, ", "))
 ### `internal/ciparity`
 - `internal/ciparity/surface_parity_test.go:97` — // readFileOrSkip reads a file; on ENOENT, calls t.Skipf rather than
 ### `internal/connector/issuer/acme`
 - `internal/connector/issuer/acme/acme_failure_test.go:687` — t.Skipf("could not bind challenge server (env may not allow): %v", err)
 ### `internal/connector/issuer/local`
 - `internal/connector/issuer/local/bundle9_coverage_test.go:467` — t.Skip("unexpectedly short DER")
 - `internal/connector/issuer/local/bundle9_coverage_test.go:592` — t.Skip("permission semantics differ on windows")
 - `internal/connector/issuer/local/bundle9_coverage_test.go:609` — t.Skip("permission semantics differ on windows")
 - `internal/connector/issuer/local/bundle9_coverage_test.go:621` — t.Skip("permission semantics differ on windows")
 - `internal/connector/issuer/local/bundle9_coverage_test.go:653` — t.Skip("permission semantics differ on windows")
 ### `internal/connector/issuer/openssl`
 - `internal/connector/issuer/openssl/openssl_failure_test.go:124` — t.Skip("running as root; chmod 0o600 doesn't gate execution for uid 0")
 - `internal/connector/issuer/openssl/openssl_failure_test.go:71` — t.Skip("openssl adapter shell-out tests assume POSIX bash; skipping on Windows")
 ### `internal/connector/notifier/email`
 - `internal/connector/notifier/email/email_test.go:425` — t.Skip("test requires no service on smtp.example.com:587")
 - `internal/connector/notifier/email/email_test.go:503` — t.Skip("test assumes no service on 127.0.0.1:54321")
 ### `internal/connector/target/iis`
 - `internal/connector/target/iis/iis_test.go:225` — t.Skip("Skipping: powershell.exe not available (non-Windows)")
 - `internal/connector/target/iis/iis_test.go:92` — t.Skip("Skipping: powershell.exe not available (non-Windows)")
 ### `internal/crypto`
 - `internal/crypto/encryption_property_test.go:35` — t.Skip("skipping property-based test in -short mode (PBKDF2 600k rounds × 50 iters > short budget)")
 - `internal/crypto/encryption_property_test.go:75` — t.Skip("skipping property-based test in -short mode (PBKDF2 cost)")
 ### `internal/deploy`
 - `internal/deploy/coverage_test.go:403` — t.Skip("read-only chmod doesn't restrict root")
 - `internal/deploy/coverage_test.go:467` — t.Skip("non-unix")
 - `internal/deploy/deploy_test.go:611` — t.Skip("non-unix platform")
 ### `internal/ratelimit`
 - `internal/ratelimit/sliding_window_test.go:146` — t.Skip("race-style test under -short")
 ### `internal/repository/postgres`
 - `internal/repository/postgres/audit_worm_test.go:29` — t.Skip("skipping integration test in short mode")
 - `internal/repository/postgres/auth_revoke_scope_test.go:118` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/auth_revoke_scope_test.go:149` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/auth_revoke_scope_test.go:179` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/auth_revoke_scope_test.go:208` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/auth_revoke_scope_test.go:56` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/auth_revoke_scope_test.go:87` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/auth_scope_test.go:123` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/auth_scope_test.go:153` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/auth_scope_test.go:181` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/auth_scope_test.go:207` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/auth_scope_test.go:229` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/auth_scope_test.go:252` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/auth_scope_test.go:281` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/auth_scope_test.go:95` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/oidc_encryption_invariant_test.go:160` — t.Skip("Phase 13 encryption invariant: integration test in short mode")
 - `internal/repository/postgres/oidc_encryption_invariant_test.go:225` — t.Skip("Phase 13 encryption invariant: integration test in short mode")
 - `internal/repository/postgres/oidc_encryption_invariant_test.go:62` — t.Skip("Phase 13 encryption invariant: integration test in short mode")
 - `internal/repository/postgres/oidc_prelogin_encryption_test.go:163` — t.Skip("HIGH-5 legacy fallback: integration test in short mode")
 - `internal/repository/postgres/oidc_prelogin_encryption_test.go:42` — t.Skip("HIGH-5 encryption invariant: integration test in short mode")
 - `internal/repository/postgres/oidc_test.go:117` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/oidc_test.go:140` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/oidc_test.go:171` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/oidc_test.go:185` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/oidc_test.go:209` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/oidc_test.go:239` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/oidc_test.go:301` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/oidc_test.go:331` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/oidc_test.go:45` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/oidc_test.go:82` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/oidc_test.go:96` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/repo_test.go:1944` — t.Skip("integration test requires PostgreSQL")
 - `internal/repository/postgres/repo_test.go:2003` — t.Skip("integration test requires PostgreSQL")
 - `internal/repository/postgres/repo_test.go:2114` — t.Skip("integration test requires PostgreSQL")
 - `internal/repository/postgres/seed_test.go:91` — t.Skip("skipping integration test in short mode")
 - `internal/repository/postgres/session_test.go:100` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/session_test.go:120` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/session_test.go:167` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/session_test.go:197` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/session_test.go:211` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/session_test.go:246` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/session_test.go:259` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/session_test.go:29` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/session_test.go:307` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/session_test.go:340` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/session_test.go:407` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/session_test.go:54` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/session_test.go:86` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/testutil_test.go:39` — t.Skip("skipping integration test in short mode")
 - `internal/repository/postgres/user_test.go:106` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/user_test.go:131` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/user_test.go:170` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/user_test.go:210` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/user_test.go:29` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/user_test.go:302` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/user_test.go:339` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/user_test.go:374` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/user_test.go:59` — t.Skip("integration test in short mode")
 - `internal/repository/postgres/user_test.go:73` — t.Skip("integration test in short mode")
 ### `internal/scep/intune`
 - `internal/scep/intune/challenge_golden_test.go:47` — t.Skip("regenerate fixtures only when -update-golden is passed")
 - `internal/scep/intune/challenge_test.go:213` — t.Skip("encoder didn't produce padding for this fixture; skipping")
 - `internal/scep/intune/rate_limit_test.go:139` — t.Skip("race-style test under -short")
 - `internal/scep/intune/replay_test.go:131` — t.Skip("race-style test under -short; run full suite for coverage")
 ### `internal/service`
 - `internal/service/coverage_extras_test.go:374` — t.Skipf("RSA keygen unavailable: %v", err)
 - `internal/service/coverage_extras_test.go:394` — t.Skipf("ECDSA keygen unavailable: %v", err)
@@ -38,6 +38,20 @@ import (
 // mcpBaselineFloor — see header doc. Bump when a deletion is
 // deliberate; the diff captures the change.
 //
 // Phase 3 ARCH-H3 reconciliation (2026-05-13): the audit framing
 // "121 vs floor 150 — doc/code drift" was a measurement scoping error
 // — the audit counted only internal/mcp/tools.go (which has 121
 // AddTool calls) and missed the four sibling files listed in
 // mcpToolFiles() below (tools_audit_fix.go + tools_auth.go +
 // tools_auth_bundle2.go + tools_est.go) that add another 34 unique
 // names. Live total: 155 unique `Name: "certctl_*"` declarations
 // across the 5 files, ≥ 150. This test therefore passes today.
 //
 // Bumping the floor: when the catalogue legitimately grows, raise
 // this constant in the same commit that adds the new tools so the
 // floor tracks the ratchet. Lower only when a deletion is intentional
 // and documented in surface-parity-mcp-exemptions.yaml.
 const mcpBaselineFloor = 150
 var (
@@ -544,8 +544,11 @@ func (c *httpSMClient) ListSecrets(ctx context.Context, project string) ([]Secre
 		})
 	}
-	// TODO: handle pagination with nextPageToken if needed for large secret managers
+	// see #gcpsm-pagination — handle nextPageToken if needed for large
-	// For now, just return the first page results
+	// secret managers. For now, just return the first page results;
 	// typical GCP Secret Manager pages contain up to 500 secrets, which
 	// covers every operator we've heard from so far. Add pagination when
 	// the first 500-secret-tenant surfaces.
 	return secrets, nil
 }
@@ -217,7 +217,7 @@ func (r *RoleRepository) ListPermissions(ctx context.Context, roleID string) ([]
 }
 func (r *RoleRepository) AddPermission(ctx context.Context, g *authdomain.RolePermission) error {
-	// TODO(bundle-2): Bundle 1 Phase 12 deferral — scope_id is NOT
+	// see #bundle-2-scope-fk — Bundle 1 Phase 12 deferral. scope_id is NOT
 	// currently FK-constrained against the resource tables
 	// (certificate_profiles, issuers). This means an operator can
 	// grant a permission at scope_type=profile / scope_id=p-bogus
@@ -81,6 +81,10 @@ func WithinTx(ctx context.Context, db *sql.DB, fn func(tx *sql.Tx) error) (err e
 	defer func() {
 		if p := recover(); p != nil {
 			_ = tx.Rollback()
 			// ARCH-L1: re-throw the recovered panic after rolling back
 			// the transaction. The Tx layer's contract is "preserve
 			// panics across the rollback boundary"; swallowing here
 			// would hide the original bug from the caller.
 			panic(p)
 		}
 		if err != nil {
@@ -241,7 +241,13 @@ func (s *AuditService) ListAuditEventsByCategory(ctx context.Context, eventCateg
 		}
 	}
-	// TODO: Get total count from repository
+	// see #audit-pagination-count — the repository currently returns
 	// the full filtered slice and we surface len(result) as total. This
 	// works for the audit page's current shape (server-side filter +
 	// client-side pagination over a bounded window) but is wrong when the
 	// frontend ports to server-side cursoring (Phase 9 P-H2). At that
 	// point the repository must add a CountAuditEvents(filter) method and
 	// this line becomes total, _ := s.repo.CountAuditEvents(ctx, filter).
 	total := int64(len(result))
 	return result, total, nil
@@ -722,6 +722,14 @@ var (
 	serverKeygenSyntheticNotAfter  = mustParseTime("2099-12-31T23:59:59Z")
 )
 // mustParseTime parses a hard-coded RFC 3339 string into a time.Time.
 //
 // ARCH-L1: panic is correct because the callers pass only compile-time
 // constants — a parse failure here means the developer typo'd a
 // constant in code, which would never reach a CI build. Surfacing as
 // a panic catches the typo at first-run rather than letting a zero-
 // valued time.Time silently propagate to certificate not-before/after
 // fields.
 func mustParseTime(s string) time.Time {
 	t, err := time.Parse(time.RFC3339, s)
 	if err != nil {
@@ -855,6 +855,13 @@ func getEnvForSeed(key string) string {
 }
 // mustJSON marshals a value to json.RawMessage, panicking on error (for seed data only).
 //
 // ARCH-L1: panic is correct because mustJSON is invoked only on
 // compile-time-known seed structs — json.Marshal never returns an
 // error for plain struct shapes. A failure here means an upstream
 // type-system mismatch the caller couldn't have caught at build time,
 // which is a programmer bug worth surfacing immediately rather than
 // silently producing malformed seed JSON.
 func mustJSON(v interface{}) json.RawMessage {
 	b, err := json.Marshal(v)
 	if err != nil {
@@ -292,11 +292,16 @@ func (s *JobService) processIssuanceJob(ctx context.Context, job *domain.Job) er
 // processValidationJob handles a certificate validation job.
 // This is a placeholder that documents the flow.
-// TODO: Implement actual validation job processing if needed.
+// see #validation-job-impl — implement actual validation job processing
 // when a customer ask materializes. Today's renewal-loop already calls
 // target connector ValidateDeployment after every deploy; this code
 // path is reserved for an out-of-band "verify what's deployed matches
 // the issued cert" scheduler tick. Not currently wired into the job
 // dispatcher — the job type is reserved.
 func (s *JobService) processValidationJob(ctx context.Context, job *domain.Job) error {
 	s.logger.Debug("processing validation job", "job_id", job.ID)
-	// TODO: Implement validation job processing
+	// see #validation-job-impl — when implemented:
 	// In production:
 	//   1. Fetch the certificate
 	//   2. For each target, call target connector ValidateDeployment
@@ -1,5 +1,13 @@
 -- Migration 000014: CHECK constraint on policy_violations.severity
 --
 -- ARCH-L3 (2026-05-13): this migration's `ALTER TABLE ... ADD CONSTRAINT
 -- ... CHECK` statement does not carry the literal `IF NOT EXISTS`
 -- token because Postgres' ALTER TABLE ADD CONSTRAINT syntax does not
 -- accept it. Idempotency comes from the DROP CONSTRAINT IF EXISTS
 -- preamble: re-running this migration on a tree with the constraint
 -- already in place drops + re-adds, which is a no-op in observable
 -- behavior.
 --
 -- Sibling to migration 000013, which added severity + CHECK to policy_rules.
 -- policy_violations has carried a severity column since the initial schema
 -- (000001, line 183) but without any CHECK. The engine used to hardcode
@@ -1,5 +1,11 @@
 -- Bundle-6 / Audit M-017 / HIPAA §164.312(b) (audit controls):
 --
 -- ARCH-L3 (2026-05-13): this migration uses CREATE OR REPLACE FUNCTION
 -- + DROP TRIGGER IF EXISTS + CREATE TRIGGER + DO $$ ... pg_roles
 -- existence check. These patterns produce equivalent idempotency to
 -- IF NOT EXISTS but in shapes Postgres' CREATE TRIGGER + REVOKE
 -- syntax do not literally accept. Re-running this migration is safe.
 --
 -- audit_events is append-only at the database layer. The application
 -- role cannot UPDATE or DELETE rows. Compliance superusers (legal hold,
 -- retention purges) use a separate role provisioned out-of-band that
@@ -1,4 +1,12 @@
 -- 000030_rbac_admin_perms.up.sql
 --
 -- ARCH-L3 (2026-05-13): this migration's INSERT statements don't carry
 -- the literal `IF NOT EXISTS` token because Postgres' INSERT syntax
 -- doesn't accept it on the INSERT keyword itself. Idempotency comes
 -- from the `ON CONFLICT (...) DO NOTHING` clauses on every INSERT —
 -- re-running the migration on a tree where the rows already exist is
 -- a no-op.
 --
 -- Bundle 1 / Phase 3.5: admin-only fine-grained permissions for the
 -- legacy admin handlers (bulk_revocation, admin_crl_cache,
 -- admin_scep_intune, admin_est, intermediate_ca). Phase 3.5 wraps the
@@ -1,4 +1,12 @@
 -- 000039_audit_crit1_perms.up.sql
 --
 -- ARCH-L3 (2026-05-13): this migration's INSERT statements don't carry
 -- the literal `IF NOT EXISTS` token because Postgres' INSERT syntax
 -- doesn't accept it on the INSERT keyword itself. Idempotency comes
 -- from the `ON CONFLICT (...) DO NOTHING` clauses on every INSERT —
 -- re-running the migration on a tree where the rows already exist is
 -- a no-op.
 --
 -- Audit 2026-05-10 CRIT-1 closure: legacy-CRUD permission set.
 --
 -- The Bundle 1 + Bundle 2 audit surfaced that the RBAC permission
@@ -0,0 +1,44 @@
 #!/usr/bin/env bash
 # scripts/ci-guards/no-todo-in-prod.sh
 #
 # Phase 3 ARCH-L4 closure (2026-05-13): production Go files
 # (cmd/ + internal/, excluding *_test.go) MUST NOT carry bare
 # TODO / FIXME comments. The pre-Phase-3 working tree had 5 such
 # comments; this guard catches the regression mode where a future PR
 # reintroduces them.
 #
 # Allowed patterns:
 #   - `// see #<descriptor>` — track-via-descriptor for deferred work.
 #     Descriptor may be a GitHub issue number (`see #123`) or a
 #     greppable kebab-case identifier (`see #gcpsm-pagination`,
 #     `see #bundle-2-scope-fk`). The point is that future code-search
 #     for the descriptor finds the comment + every related call site.
 #   - Test files (*_test.go) are exempt because TODO in tests is
 #     usually documenting an unimplemented test case, not deferred
 #     production code.
 #
 # The Phase 3 closure rewrote the 5 pre-existing TODOs:
 #   - internal/repository/postgres/auth.go:220 → see #bundle-2-scope-fk
 #   - internal/connector/discovery/gcpsm/gcpsm.go:547 → see #gcpsm-pagination
 #   - internal/service/audit.go:244 → see #audit-pagination-count
 #   - internal/service/job.go:295, 299 → see #validation-job-impl
 set -e
 VIOLATIONS=$(grep -rnE "TODO|FIXME" --include='*.go' cmd/ internal/ 2>/dev/null \
    | grep -v '_test\.go' \
    || true)
 if [ -n "$VIOLATIONS" ]; then
  echo "::error::no-todo-in-prod regression: TODO / FIXME in production Go."
  echo ""
  echo "Production code MUST NOT carry bare TODO / FIXME comments. Use"
  echo "'// see #<descriptor>' instead — either a GitHub issue number"
  echo "(see #123) or a greppable descriptor (see #gcpsm-pagination)."
  echo ""
  echo "Violations:"
  echo "$VIOLATIONS"
  exit 1
 fi
 echo "no-todo-in-prod guard OK: no TODO / FIXME in production Go"
@@ -0,0 +1,45 @@
 #!/usr/bin/env bash
 # scripts/ci-guards/skip-inventory-drift.sh
 #
 # Phase 3 TEST-M4 + ARCH-L2 closure (2026-05-13): regenerate the
 # skip inventory at docs/testing/skip-inventory.md and fail the build
 # if the regenerated file differs from the tracked copy. The
 # inventory is the canonical acquisition-diligence artefact for "what
 # tests are being skipped and why" — keeping it accurate is a CI
 # contract, not a manual checklist.
 #
 # To fix a drift error: re-run ./scripts/skip-inventory.sh locally
 # and commit the regenerated file alongside the PR that added or
 # removed t.Skip sites.
 set -e
 EXPECTED="docs/testing/skip-inventory.md"
 TMPFILE="$(mktemp -t skip-inventory.XXXXXX.md)"
 trap 'rm -f "$TMPFILE"' EXIT
 ./scripts/skip-inventory.sh "$TMPFILE" > /dev/null
 # Compare excluding the timestamp line (which legitimately drifts per-day).
 if diff -q \
   <(grep -vE "^> Last reviewed:" "$EXPECTED") \
   <(grep -vE "^> Last reviewed:" "$TMPFILE") > /dev/null; then
  echo "skip-inventory-drift guard OK: docs/testing/skip-inventory.md matches the live tree"
  exit 0
 fi
 echo "::error::skip-inventory-drift regression: docs/testing/skip-inventory.md is stale."
 echo ""
 echo "The skip inventory at $EXPECTED no longer matches the live"
 echo "t.Skip surface. Regenerate with:"
 echo ""
 echo "    ./scripts/skip-inventory.sh"
 echo ""
 echo "Then commit the updated docs/testing/skip-inventory.md alongside"
 echo "your t.Skip changes."
 echo ""
 echo "Diff (- expected, + actual):"
 diff <(grep -vE "^> Last reviewed:" "$EXPECTED") \
     <(grep -vE "^> Last reviewed:" "$TMPFILE") | head -50
 exit 1
@@ -0,0 +1,73 @@
 #!/usr/bin/env bash
 # scripts/skip-inventory.sh
 #
 # Phase 3 TEST-M4 + ARCH-L2 closure (2026-05-13): generate
 # docs/testing/skip-inventory.md from every t.Skip() site under
 # cmd/, internal/, and deploy/test/. Re-run after adding or removing
 # any t.Skip; CI guard at scripts/ci-guards/skip-inventory-drift.sh
 # fails the build on undetected drift.
 #
 # Each entry is grouped by package and shows file:line:expression.
 # The Phase 3 audit found ~142 skip sites; the inventory makes the
 # pattern transparent (testcontainers-gated under -short, platform-
 # gated under Linux/Darwin/Windows, demo-Compose-gated, etc.) so
 # acquisition-diligence reviewers can audit the skip surface.
 set -e
 OUTPUT="${1:-docs/testing/skip-inventory.md}"
 mkdir -p "$(dirname "$OUTPUT")"
 {
    echo "# Test Skip Inventory"
    echo
    echo "<!-- Auto-generated by scripts/skip-inventory.sh — do not edit by hand. -->"
    echo "<!-- Re-run after adding or removing any t.Skip(). CI guard:    -->"
    echo "<!-- scripts/ci-guards/skip-inventory-drift.sh                  -->"
    echo
    echo "> Last reviewed: $(date +%Y-%m-%d)"
    echo
    echo "## Summary"
    echo
    total=$(grep -rcE "t\.Skip\b" --include='*_test.go' cmd/ internal/ deploy/test/ 2>/dev/null \
            | awk -F: '$2>0 {s+=$2} END {print s}')
    short=$(grep -rE "testing\.Short\(\)" --include='*_test.go' cmd/ internal/ deploy/test/ 2>/dev/null \
            | wc -l | tr -d ' ')
    echo "- Total t.Skip sites: **$total**"
    echo "- testing.Short() guards: **$short** (these gate behind \`go test -short\`)"
    echo
    echo "Re-run inventory with: \`./scripts/skip-inventory.sh\`."
    echo
    echo "## Sites (grouped by package)"
    echo
    grep -rnE "t\.Skip" --include='*_test.go' cmd/ internal/ deploy/test/ 2>/dev/null \
        | awk -F: '
            {
                # Extract directory (package)
                n = split($1, parts, "/")
                pkg = parts[1]
                for (i = 2; i < n; i++) pkg = pkg "/" parts[i]
                # Print pkg | file:line | expression
                rest = ""
                for (i = 3; i <= NF; i++) rest = (rest == "" ? $i : rest ":" $i)
                gsub(/^[ \t]+|[ \t]+$/, "", rest)
                print pkg "|" $1 ":" $2 "|" rest
            }
        ' \
        | sort \
        | awk -F'|' '
            {
                if ($1 != cur) {
                    if (cur != "") print ""
                    print "### `" $1 "`"
                    print ""
                    cur = $1
                }
                print "- `" $2 "` — " $3
            }
            END { print "" }
        '
 } > "$OUTPUT"
 echo "Wrote skip inventory to: $OUTPUT"
@@ -8,7 +8,9 @@
    "build": "tsc && vite build",
    "preview": "vite preview",
    "test": "vitest run",
-    "test:watch": "vitest"
+    "test:watch": "vitest",
    "e2e": "playwright test",
    "e2e:install": "playwright install --with-deps chromium"
  },
  "dependencies": {
    "@tanstack/react-query": "^5.90.21",
@@ -18,6 +20,7 @@
    "recharts": "^3.8.0"
  },
  "devDependencies": {
    "@playwright/test": "^1.49.0",
    "@testing-library/jest-dom": "^6.9.1",
    "@testing-library/react": "^16.3.2",
    "@types/react": "^19.2.14",
@@ -0,0 +1,54 @@
 /**
 * Phase 3 TEST-M3 closure (2026-05-13): Playwright harness stub for
 * the browser-driven E2E surface documented in
 * web/src/__tests__/e2e/README.md.
 *
 * This config is the minimum-viable scaffold: one chromium project,
 * webServer pointing at `npm run dev` for fast feedback, no firefox /
 * webkit projects yet. The full 15-flow suite (operator boot, OIDC
 * login, cert issuance, revocation, rotation, JWKS rotate, etc.)
 * lands in frontend-design-audit Phase 8 (TEST-H1 in that audit's
 * tracker). The smoke test that ships alongside this config proves
 * the harness wiring works.
 */
 import { defineConfig, devices } from '@playwright/test';
 export default defineConfig({
  testDir: './src/__tests__/e2e',
  // Match Playwright's *.spec.ts convention; README.md and any
  // *.test.ts vitest files are intentionally excluded.
  testMatch: /.*\.spec\.ts$/,
  // Single worker keeps the smoke test deterministic on shared CI
  // runners. Raise to fullyParallel: true once the harness covers
  // independent flows.
  fullyParallel: false,
  forbidOnly: !!process.env.CI,
  retries: process.env.CI ? 1 : 0,
  workers: 1,
  reporter: process.env.CI ? [['github'], ['list']] : 'list',
  use: {
    baseURL: process.env.E2E_BASE_URL || 'http://localhost:5173',
    trace: 'on-first-retry',
    screenshot: 'only-on-failure',
    // Phase 3 SEC-M3 carve-out: the dev server uses a self-signed cert
    // via the Vite HTTPS proxy. Tell Playwright to accept it for the
    // smoke test only; production E2E against a properly-trusted cert
    // never needs this flag.
    ignoreHTTPSErrors: true,
  },
  projects: [
    {
      name: 'chromium',
      use: { ...devices['Desktop Chrome'] },
    },
  ],
  webServer: process.env.E2E_BASE_URL
    ? undefined
    : {
        command: 'npm run dev',
        url: 'http://localhost:5173',
        reuseExistingServer: !process.env.CI,
        timeout: 120_000,
      },
 });
@@ -0,0 +1,34 @@
 /**
 * Phase 3 TEST-M3 smoke test (2026-05-13).
 *
 * Proves the Playwright harness works end-to-end:
 *   1. webServer block in playwright.config.ts boots `npm run dev`
 *   2. Playwright chromium connects to http://localhost:5173/
 *   3. The page renders a known element from the certctl shell
 *
 * Full coverage of the 15 flows documented in
 * web/src/__tests__/e2e/README.md ships in frontend-design-audit
 * Phase 8 (TEST-H1 / TEST-H2 / TEST-H3). This file exists to keep
 * the harness wiring tested so that adding new specs is mechanical.
 */
 import { test, expect } from '@playwright/test';
 test.describe('certctl dashboard — smoke', () => {
  test('login page renders the certctl brand and a login affordance', async ({ page }) => {
    await page.goto('/');
    // The Layout sidebar always renders the "certctl" brand text in
    // the header — verified live at web/src/components/Layout.tsx
    // (Phase 0 frontend remediation will keep this stable when the
    // logo migrates from PNG to inline SVG).
    //
    // For the smoke we just assert the document loaded and the
    // <title> resolves; deeper page-content assertions belong in
    // per-flow specs that the frontend-design-audit Phase 8 ships.
    await expect(page).toHaveTitle(/certctl/i);
    // Body should be visible (negative test: blank page would fail).
    await expect(page.locator('body')).toBeVisible();
  });
 });