mirror of https://github.com/shankar0123/certctl.git synced 2026-06-07 20:41:30 +00:00


shankar0123 3c81531398 ci: OpenAPI parity reconciliation + codegen scaffolding (Phase 5 — ARCH-H1 / ARCH-M6)

Phase 5 reconciliation: the audit's headline framing 'ARCH-H1 = 62-route
OpenAPI gap' was a measurement scoping error. Every one of the 209
unique router routes is already accounted for — 154 in api/openapi.yaml,
55 in api/openapi-handler-exceptions.yaml. The existing
openapi-handler-parity.sh CI guard already enforces this and passes
clean today. The audit subtracted operation-count from route-count
without accounting for the documented exceptions YAML.

Where real work remains (and what this PR does about it)
=========================================================

Of the 64 documented exceptions, 35 are legitimate wire-protocol
carve-outs that MUST stay (SCEP RFC 8894 × 8 entries, ACME RFC 8555
default + per-profile × 27 entries — they're protocol contracts, not
REST resources). The remaining 29 are REST-shaped routes whose
OpenAPI ops were deferred during their original Bundle 2 /
audit-2026-05-10 / 2026-05-11 work:

  - auth/sessions (3)
  - auth/oidc admin (9)
  - auth/breakglass admin (4)
  - auth/users mgmt (3)
  - auth/runtime-config (1)
  - auth/demo-residual/cleanup (1)
  - audit/export (1)
  - auth/logout (1)
  - auth/breakglass/login (1)
  - auth/oidc {login,callback,bcl} (3)
  - oidc/providers/{id}/jwks-status (1)
  - + 2 other auth-flow routes

Burn-down plan in 3 sprints (documented in
api/openapi-handler-exceptions.yaml header):
  Sprint A: Cluster 1 — sessions + oidc admin (12 ops)
  Sprint B: Cluster 2 — breakglass + users + runtime-config (8 ops)
  Sprint C: Cluster 3 — audit/export + auth flows (9 ops)

This PR does NOT author the 29 OpenAPI ops; each needs request/
response schemas, not placeholders, and the design work is too
large for one PR. The reconciliation here is documentation + a CI
guard that will fail any future schema-drift, plus the scaffolding
needed for sub-phase 5b.

Sub-phase 5b: codegen scaffolding
==================================

Adds the orval scaffolding without running npm install (sandbox
disk-full; first 'npm install' + 'npm run generate' happens on the
operator's workstation):

  - web/orval.config.ts — codegen config emits react-query hooks
    from api/openapi.yaml into web/src/api/generated/
  - web/package.json — adds orval@^7.0.0 devDep + 'generate' npm script
  - web/CODEGEN.md — operator-facing migration doc:
    first-time setup, per-consumer migration pattern, burn-down plan,
    CI-guard rules
  - scripts/ci-guards/openapi-codegen-drift.sh — blocks the build
    when api/openapi.yaml changes but web/src/api/generated/ wasn't
    regenerated alongside. Currently no-op (the directory doesn't
    exist yet); activates from the first 'npm run generate' run.

The legacy web/src/api/client.ts stays in tree per the phase prompt's
'do not delete in same PR as codegen' rule. Consumers migrate one
page at a time as their OpenAPI ops land; client.ts deletion is a
SEPARATE follow-up PR after the last consumer migrates.

Updates to existing guard + exceptions YAML
============================================

  - scripts/ci-guards/openapi-handler-parity.sh header rewritten
    with the Phase 5 reconciliation numbers (220/158/64/0) and the
    wire-protocol vs REST-deferred classification.
  - api/openapi-handler-exceptions.yaml header rewritten with the
    35/29 split + the 3-sprint burn-down plan. Each exception entry
    is unchanged; the header now documents which entries are
    permanent (wire-protocol) vs temporary (REST-deferred).

Sandbox limitations + operator follow-up
=========================================

  - 'npm install' was NOT run from the sandbox (sessions volume
    99%-full, 142 MB free). The operator runs 'cd web && npm install'
    on their workstation; this lands orval@^7.0.0 in node_modules,
    then 'cd web && npm run generate' produces the initial
    web/src/api/generated/ tree.
  - First per-consumer migration (suggested: web/src/pages/AuthSettings
    or one of the operator-decision pages) lands in a follow-up PR
    after npm install completes.
  - The 29-op OpenAPI burn-down is a 3-sprint effort tracked under
    ARCH-H1 in cowork/certctl-architecture-diligence-audit.html.

All CI guards (openapi-handler-parity, openapi-codegen-drift, plus
every existing guard) verified clean by running each individually.

Closes:
  - cowork/certctl-architecture-diligence-audit.html#fix-ARCH-H1
    (reconciliation: gap is 0 with exceptions accounted for; burn-down
    plan documented for follow-up sprints)
  - cowork/certctl-architecture-diligence-audit.html#fix-ARCH-M6
    (codegen scaffolding shipped; client.ts deletion follows in a
    subsequent PR after consumers migrate)

2026-05-13 20:24:20 +00:00

B2-compose-base-no-demo-env.sh

fix(security): close BUNDLE 2 — safe first run, demo mode, agent bootstrap

2026-05-13 00:14:59 +00:00

B3-helm-chart-coherence.sh

fix(helm): close BUNDLE 3 — Helm chart hardening + enterprise deploy

2026-05-13 00:40:42 +00:00

B6-no-private-keys-in-tree.sh

docs(b6): secret-custody reference + config-encryption upgrade runbook + private-key CI guard

2026-05-13 01:48:40 +00:00

B-1-orphan-crud.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

bundle-1-compat-regression.sh

auth-bundle-2 Phase 6: session middleware + CSRF token plumbing +

2026-05-10 06:22:25 +00:00

bundle-1-to-2-upgrade-regression.sh

auth-bundle-2 Phase 6: session middleware + CSRF token plumbing +

2026-05-10 06:22:25 +00:00

bundle-8-L-015-target-blank-rel-noopener.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

bundle-8-L-019-dangerously-set-inner-html.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

bundle-8-M-009-bare-usemutation.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

complete-path-config-coverage-exceptions.yaml

feat(ci): item-1 complete-path config-coverage guard (PARTIAL — sandbox could not verify Go test)

2026-05-12 14:02:04 +00:00

complete-path-config-coverage.sh

feat(ci): item-1 complete-path config-coverage guard (PARTIAL — sandbox could not verify Go test)

2026-05-12 14:02:04 +00:00

cors-wildcard-allowlist.sh

fix(api/cors): narrow Bundle-2 routes from wildcard to NewCORS(corsCfg)

2026-05-10 20:12:19 +00:00

D-1-D-2-statusbadge-phantom.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

digest-validity.sh

ci: add exponential-backoff retry to digest-validity guard

2026-05-13 20:17:08 +00:00

doc-rot-detector-exceptions.yaml

feat(ci): item-5 doc rot detector (90d warn / 120d fail)

2026-05-12 14:10:27 +00:00

doc-rot-detector.sh

feat(ci): item-5 doc rot detector (90d warn / 120d fail)

2026-05-12 14:10:27 +00:00

G-1-jwt-auth-literal.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

G-2-api-key-hash-json.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

G-3-env-docs-drift.sh

docs: remove internal engineering docs; docs must be tool- or story-relevant

2026-05-13 02:44:27 +00:00

H-1-encryption-key-min-length.sh

fix(deploy/test) + ci(guard): unblock deploy-vendor-e2e — encryption-key length

2026-05-01 00:57:43 +00:00

H-001-bare-from.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

H-009-readme-jwt.sh

ci: restore +x bit on scripts/ci-guards/*.sh (sandbox stripped exec bit)

2026-05-05 04:56:43 +00:00

L-1-bulk-action-loop.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

L-001-insecure-skip-verify.sh

ci: restore +x bit on scripts/ci-guards/*.sh (sandbox stripped exec bit)

2026-05-05 04:56:43 +00:00

M-012-no-root-user.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

multi-tenant-query-coverage.sh

chore(ci-guards): close 4 CI-guard regressions surfaced by v2.1.0 release-gate Phase 5

2026-05-11 14:19:35 +00:00

N-bundle-2-security-empty-preserved.sh

auth-bundle-2 Phase 5: OIDC + session HTTP surface (13 endpoints),

2026-05-10 06:08:27 +00:00

no-change-me-in-prod-compose.sh

config: default hardening + operator docs (Phase 2 closure — SEC-H1, SEC-H3, SEC-M4, DEPL-H1, DEPL-M2 + doc-only carve-outs)

2026-05-13 19:50:00 +00:00

no-new-synthetic-admin.sh

harden(auth): demo-mode residual-grants detector + cleanup endpoint + CI guard (A-8)

2026-05-11 11:45:54 +00:00

no-precompiled-binary.sh

ci: supply-chain hardening (Phase 1 closure — RED-1, RED-2, TEST-L2)

2026-05-13 19:30:53 +00:00

no-tag-pinned-actions.sh

ci: supply-chain hardening (Phase 1 closure — RED-1, RED-2, TEST-L2)

2026-05-13 19:30:53 +00:00

no-todo-in-prod.sh

ci: floor raise + doc drift (Phase 3 closure — TEST-H1/H2/M1/M2/M3/M4/L1, ARCH-H3/L1/L2/L3/L4)

2026-05-13 20:10:08 +00:00

openapi-codegen-drift.sh

ci: OpenAPI parity reconciliation + codegen scaffolding (Phase 5 — ARCH-H1 / ARCH-M6)

2026-05-13 20:24:20 +00:00

openapi-handler-parity.sh

ci: OpenAPI parity reconciliation + codegen scaffolding (Phase 5 — ARCH-H1 / ARCH-M6)

2026-05-13 20:24:20 +00:00

P-1-documented-orphan-fns.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

README.md

docs: remove internal engineering docs; docs must be tool- or story-relevant

2026-05-13 02:44:27 +00:00

S-1-hardcoded-source-counts.sh

docs: remove internal engineering docs; docs must be tool- or story-relevant

2026-05-13 02:44:27 +00:00

S-2-strings-contains-err.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

skip-inventory-drift.sh

ci: floor raise + doc drift (Phase 3 closure — TEST-H1/H2/M1/M2/M3/M4/L1, ARCH-H3/L1/L2/L3/L4)

2026-05-13 20:10:08 +00:00

surface-parity-mcp-exemptions.yaml

feat(ci): item-2 cross-surface contract parity (stdlib-only package)

2026-05-12 14:09:32 +00:00

T-1-frontend-page-coverage.sh

web, docs: IssuerHierarchyPage + sysadmin runbook + connectors row (Rank 8 commit 5)

2026-05-04 02:33:48 +00:00

test-compose-scep-coherence.sh

fix(deploy/test) + ci(guard): drop dead SCEP profile from test compose

2026-05-01 01:39:18 +00:00

test-naming-convention.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

U-2-plaintext-healthcheck.sh

ci: restore +x bit on scripts/ci-guards/*.sh (sandbox stripped exec bit)

2026-05-05 04:56:43 +00:00

U-3-migration-mount.sh

ci-pipeline-cleanup Phase 1: extract 20 regression guards to scripts/ci-guards/

2026-04-30 20:36:26 +00:00

README.md

`scripts/ci-guards/` — Regression-guard scripts

Each <id>.sh script in this directory keeps one closed audit finding from regressing. CI runs the full set on every push via the Regression guards step in .github/workflows/ci.yml. Operators can run any script locally:

bash scripts/ci-guards/G-3-env-docs-drift.sh

Contract

Every script in this directory MUST:

Exit with code 0 on a clean repo (no regression present).
Exit with a non-zero code on regression, printing a ::error:: annotation prefix so PR reviewers see the failing line in the GitHub Actions UI.
Be runnable from repo root via bash scripts/ci-guards/<id>.sh with NO arguments and NO env-var requirements. The CI loop step (for g in scripts/ci-guards/*.sh; do bash "$g"; done) iterates every .sh here without args; any script that requires an arg or env var WILL fail in that loop.
Carry a head-comment block matching the in-source justification from the original ci.yml entry: the audit-finding reference, the closure rationale, the exempt-surface list (if any).
Use set -e early to fail-fast on internal command errors.
Produce no output on the happy path beyond a final echo "<id>: clean." confirmation line.
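
As an illustration of the contract above, here is a minimal sketch of a guard skeleton. The guard ID `X-000-example`, the marker string `FORBIDDEN_EXAMPLE_MARKER`, and the `check_tree` helper are all hypothetical and do not correspond to any real guard in this directory; only the head-comment block, `set -e`, the `::error::` prefix, and the final `clean.` line come from the contract itself.

```shell
#!/usr/bin/env bash
# X-000-example.sh  (hypothetical guard ID, for illustration only)
#
# Audit finding:     X-000 (hypothetical), a forbidden marker string in deploy configs.
# Closure rationale: the marker was removed; this guard keeps it from returning.
# Exempt surfaces:   none.
set -e

# check_tree DIR: succeed when no file under DIR contains the forbidden marker.
# FORBIDDEN_EXAMPLE_MARKER is a made-up pattern, not a real certctl string.
check_tree() {
  local dir="$1" hits
  [ -d "$dir" ] || return 0   # nothing to scan on this checkout
  # grep exits 1 when nothing matches; '|| true' keeps set -e from aborting.
  hits="$(grep -rn 'FORBIDDEN_EXAMPLE_MARKER' "$dir" || true)"
  if [ -n "$hits" ]; then
    # The ::error:: prefix surfaces the failure as an annotation in GitHub Actions.
    echo "::error::X-000-example: forbidden marker found:"
    echo "$hits"
    return 1
  fi
}

# No arguments, no required env vars: the scan root is fixed relative to repo root.
check_tree deploy
echo "X-000-example: clean."
```

Because the scan root is hard-coded relative to the repo root, the script needs neither arguments nor environment variables, which is what lets the CI loop run it unmodified.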

Helpers vs guards

Scripts that consume input artifacts (a test-output log, a coverage.out file) or env vars (PR_NUMBER, GH_TOKEN) are HELPERS, not guards. They live in scripts/, NOT scripts/ci-guards/.

Current helpers:

scripts/vendor-e2e-skip-check.sh — consumes test-output.log arg from the deploy-vendor-e2e job
scripts/coverage-pr-comment.sh — consumes coverage.out + PR_NUMBER + GH_TOKEN env from the go-build-and-test job
scripts/check-coverage-thresholds.sh — consumes coverage.out + .github/coverage-thresholds.yml

Adding a new guard

Drop a new <id>.sh in this directory with the head-comment block describing the audit finding it closes.
Make it executable: chmod +x scripts/ci-guards/<id>.sh.
Verify it fails on a deliberate regression and passes on a clean repo.
CI auto-picks up new scripts via the for g in scripts/ci-guards/*.sh loop in the Regression guards step — no ci.yml change required.
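
The verification step above (fails on a deliberate regression, passes on a clean repo) could be checked with a small harness along these lines. This is only a sketch: `expect_status` and `demo_guard` are hypothetical names, and `demo_guard` is a stand-in that greps a scratch file rather than a real guard.

```shell
#!/usr/bin/env bash
# Sketch: confirm a guard passes on a clean state and fails on a planted regression.

# expect_status WANT CMD...: run CMD and compare its result to WANT ("pass" or "fail").
expect_status() {
  local want="$1" got
  shift
  if "$@" >/dev/null 2>&1; then got=pass; else got=fail; fi
  if [ "$got" = "$want" ]; then
    echo "ok: expected $want"
  else
    echo "MISMATCH: expected $want, got $got"
    return 1
  fi
}

# Stand-in guard for the demo: fails when the scratch file contains "REGRESSION".
scratch="$(mktemp)"
demo_guard() { ! grep -q 'REGRESSION' "$scratch"; }

expect_status pass demo_guard      # clean state
echo 'REGRESSION' >> "$scratch"    # plant the regression
expect_status fail demo_guard      # the guard must now trip
rm -f "$scratch"
```

For a real guard, the same helper would wrap the script itself, for example `expect_status pass bash scripts/ci-guards/<id>.sh` before planting the regression and `expect_status fail ...` after.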

Guards in this directory

Count: re-derive on demand via ls scripts/ci-guards/*.sh | wc -l. The table below names each one — keep it in sync as guards are added.

Per-finding regression guards

| ID | Finding | Catches |
|---|---|---|
| `G-1-jwt-auth-literal` | G-1 JWT silent auth downgrade | `"jwt"` literal in additive auth-type surfaces |
| `L-001-insecure-skip-verify` | L-001 unjustified InsecureSkipVerify | `InsecureSkipVerify: true` without `//nolint:gosec` |
| `H-001-bare-from` | H-001 (CWE-829) tag-swap attack | Bare `FROM` line without `@sha256` digest pin |
| `M-012-no-root-user` | M-012 (CWE-250) container-as-root | Dockerfile missing terminal `USER <non-root>` |
| `H-009-readme-jwt` | H-009 README JWT advertising | README.md re-introducing JWT-as-supported claim |
| `G-2-api-key-hash-json` | G-2 cat-s5-apikey_leak | `api_key_hash` in JSON-emitting surface |
| `U-2-plaintext-healthcheck` | U-2 healthcheck protocol mismatch | Plaintext `http://` in HEALTHCHECK directive |
| `U-3-migration-mount` | U-3 seed initdb schema drift | Migration file mounted into postgres initdb |
| `D-1-D-2-statusbadge-phantom` | D-1 + D-2 dead keys + TS phantoms | StatusBadge dead keys + 5 Certificate / 5 Agent / 1 Issuer / 1 Notification phantom fields |
| `L-1-bulk-action-loop` | L-1 client-side bulk loops | `for ... await triggerRenewal/updateCertificate` in CertificatesPage |
| `B-1-orphan-crud` | B-1 orphan-CRUD client fns | 8 update/create/delete fns lose their page consumer |
| `S-2-strings-contains-err` | S-2 brittle error-dispatch | `strings.Contains(err.Error(), "not found"\|"violates foreign key")` in handlers |
| `G-3-env-docs-drift` | G-3 env-var docs drift | `CERTCTL_*` env var defined OR documented but not both |
| `test-naming-convention` | I-001-extended | `func Testxxx` (lowercase first letter after `Test`); Go silently skips it |
| `S-1-hardcoded-source-counts` | S-1 stale numeric prose | Hardcoded "N issuer connectors" / "N MCP tools" in README + docs |
| `P-1-documented-orphan-fns` | P-1 documented orphans | 16 read-fn names removed from client.ts exports |
| `T-1-frontend-page-coverage` | T-1 untested frontend pages | New page in `web/src/pages/` without sibling `.test.tsx` and not on the deferred allowlist |
| `bundle-8-L-015-target-blank-rel-noopener` | L-015 (CWE-1022) reverse-tabnabbing | `target="_blank"` without `rel="noopener noreferrer"` |
| `bundle-8-L-019-dangerously-set-inner-html` | L-019 (CWE-79) XSS | `dangerouslySetInnerHTML` outside `safeHtml.ts` |
| `bundle-8-M-009-bare-usemutation` | M-009 + M-029 mutation contract | Bare `useMutation()` outside `useTrackedMutation` wrapper |
| `H-1-encryption-key-min-length` | H-1 closure follow-up (post-Phase-5 surfacing) | `CERTCTL_CONFIG_ENCRYPTION_KEY` literal in any `deploy/docker-compose*.yml` shorter than the 32-byte floor enforced by `internal/config/config.go::Validate()` |
| `test-compose-scep-coherence` | post-Phase-5 surfacing of dead SCEP test config | `CERTCTL_SCEP_ENABLED=true` in test compose without (a) a CI job that runs the SCEP integration test, (b) the `ra.crt` + `ra.key` + `intune_trust_anchor.pem` fixtures committed to `deploy/test/fixtures/`, AND (c) the matching volume mount |
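
To make the "Catches" column concrete, the following is a rough sketch of the kind of check the `H-001-bare-from` row describes: flagging `FROM` lines that lack an `@sha256` digest pin. It is not the actual guard's implementation, and `find_bare_from` is a hypothetical name. The sketch is deliberately simplified and would also flag `FROM scratch` or references to earlier build stages, which a real guard would presumably need to exempt.

```shell
#!/usr/bin/env bash
# find_bare_from FILE: print FROM lines (with line numbers) that have no @sha256 pin.
# Simplification: stage references and "FROM scratch" are not exempted here.
find_bare_from() {
  local file="$1"
  # First grep selects FROM instructions; second drops the digest-pinned ones.
  # '|| true' keeps the function from failing when nothing is left to report.
  grep -nE '^[[:space:]]*FROM[[:space:]]' "$file" | grep -v '@sha256:' || true
}

# Demo against a throwaway Dockerfile with one pinned and one unpinned base image.
demo="$(mktemp)"
printf 'FROM alpine:3.19\nRUN true\nFROM alpine@sha256:0123abcd\n' > "$demo"
find_bare_from "$demo"
rm -f "$demo"
```

A guard built on this would exit non-zero with a `::error::` line whenever the function prints anything.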

Forward-looking guards (Auditable Codebase Bundle, post-v2.1.0 anti-rot)

These guards catch defect classes BEFORE they get audit findings — they pin invariants on the codebase that the v2.0 audit history showed are easy to lose.

| ID | Item | Catches |
|---|---|---|
| `complete-path-config-coverage` | post-v2.1.0 / item-1 | "Lying field" — `CERTCTL_*` env var defined in `internal/config/config.go` that no consumer outside `internal/config/` actually reads. Operator-facing config that the docs claim works but the code never honors. Companion Go test at `internal/config/coverage_test.go`. |
| `doc-rot-detector` | post-v2.1.0 / item-5 | Docs older than 90 days warn (yellow), older than 120 days fail (red). Uses HEAD commit timestamp for reproducibility. `docs/archive/` allowlisted in bulk. |
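
The 90-day warn / 120-day fail rule in the `doc-rot-detector` row can be sketched as a small classification function. This is an assumption about how such a check might be structured, not the guard's actual code; `classify_doc_age` is a hypothetical name, and both timestamps are passed in as Unix epoch seconds.

```shell
#!/usr/bin/env bash
# classify_doc_age HEAD_EPOCH DOC_EPOCH: print ok, warn, or fail for one document.
# Age is measured against the HEAD commit time rather than the wall clock,
# so the same commit always yields the same verdict.
classify_doc_age() {
  local head_epoch="$1" doc_epoch="$2"
  local age_days=$(( (head_epoch - doc_epoch) / 86400 ))
  if [ "$age_days" -gt 120 ]; then
    echo fail
  elif [ "$age_days" -gt 90 ]; then
    echo warn
  else
    echo ok
  fi
}

# Demo with synthetic timestamps: a document last touched 100 days before HEAD.
classify_doc_age $((200 * 86400)) $((100 * 86400))
```

In a repository, the two inputs could plausibly come from `git log -1 --format=%ct HEAD` and `git log -1 --format=%ct -- <path>`, with anything under `docs/archive/` skipped before classification.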

The cold-DB compose smoke (post-v2.1.0 / item-6) is NOT a script in this directory — it is inlined directly into .github/workflows/ci.yml::cold-db-compose-smoke because there is no value in a developer running it locally (the whole point of the gate is that CI owns the cold-DB state). To inspect or modify the smoke logic, read that workflow job; there is intentionally no scripts/ci-guards/cold-db-compose-smoke.sh.

The fourth Bundle artifact (internal/ciparity/) is a set of Go tests, not a shell guard, so it runs under the standard Go test step. It pins the MCP tool catalogue floor and naming convention, and reports CLI/MCP/OpenAPI surface counts as a trend metric.

Running the full set locally

for g in scripts/ci-guards/*.sh; do
  echo "=== $(basename "$g") ==="
  bash "$g" || echo "  FAILED"
done
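
The loop above prints FAILED but still ends with exit status 0. Where a single pass/fail result is wanted (for example in a pre-push hook), a variant along these lines may help. It is a sketch only; `run_guards` is a hypothetical helper, not something shipped in this directory.

```shell
#!/usr/bin/env bash
# run_guards DIR: run every *.sh in DIR, report failures, return non-zero if any failed.
run_guards() {
  local dir="$1" failed=0 g
  for g in "$dir"/*.sh; do
    [ -e "$g" ] || continue          # the glob matched nothing
    if ! bash "$g"; then
      echo "  FAILED: $(basename "$g")"
      failed=$((failed + 1))
    fi
  done
  echo "$failed guard(s) failed"
  [ "$failed" -eq 0 ]
}

# Run against the real guard directory when invoked from the repo root.
if [ -d scripts/ci-guards ]; then run_guards scripts/ci-guards; fi
```

Unlike a fail-fast loop, this runs every guard before reporting, so one invocation lists all regressions at once.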
