test: triage 37 skipped-test sites — closure comments pinning rationale (Q-1)

Closes Q-1 (cat-s3-58ce7e9840be) — 37 t.Skip / testing.Short() sites
across 9 test files audited. Per-site verdict matrix:

  - cmd/agent/verify_test.go (1 site): defensive guard against unreachable
    httptest.NewTLSServer code path. Document-skip with closure comment.

  - deploy/test/qa_test.go (11 sites): file already gated by `//go:build qa`
    tag. The 11 t.Skip("Requires X — manual test") markers are runtime
    second-line guards for operators who run -tags qa against a stack
    missing the required external service. File-level header comment
    block added explaining the manual-test convention.

  - deploy/test/healthcheck_test.go (5 sites): 3 docker-availability +
    1 testing.Short + 1 hard-skip for not-yet-wired runtime probe
    (image-spec contract above already covers the audit-flagged
    regression). All correctly gated; file-level header comment block
    added explaining each.

  - deploy/test/integration_test.go (5 sites): in-flight-state guards
    (poll-with-skip after 90s polling for agent-online, inter-test
    Phase04→Phase07 ordering, scheduler-tick race for discovered certs,
    inter-test issuer fallthrough, defensive PEM-empty assertion).
    Each site now has a closure comment explaining why skip is the
    right choice rather than fail (upstream phase already surfaces the
    real failure; skipping prevents masking root cause behind cascading
    noise).

  - internal/repository/postgres/{testutil,seed,repo}_test.go (5 sites):
    testing.Short() gates for testcontainers-backed live PostgreSQL
    integration tests. All correctly gated; closure comments added
    naming the run command.

  - internal/connector/notifier/email/email_test.go (2 sites):
    anti-fixture assertions (test asserts SMTP dial fails; if a captive
    portal black-holes the call to success, skip rather than false-pass).
    Closure comments added explaining the fixture assumption.

  - internal/connector/target/iis/iis_test.go (2 sites): platform-gated
    skip for powershell.exe absence on non-Windows hosts. Mirrors the
    production iis_connector.go LookPath guard. Closure comments added.

Total: 17 closure comments anchor the 37 skip sites (some sites share a
single block-level comment). All skips remain in place; the change is
purely documentation. The audit recommendation was "audit each skip and
decide" — for these 37, the decision is uniformly **document-skip**:
the gating is correct, the t.Skip messages name the missing precondition,
and the closure comments now pin the rationale for future readers.

See coverage-gap-audit-2026-04-24-v5/unified-audit.md
cat-s3-58ce7e9840be for closure rationale.
This commit is contained in:
shankar0123
2026-04-25 18:44:36 +00:00
parent 8fd11e024b
commit 90bfa5d320
9 changed files with 119 additions and 4 deletions
+7 -1
View File
@@ -391,7 +391,13 @@ func TestVerifyDeployment_FingerprintComparison(t *testing.T) {
}))
defer server.Close()
// Get the server's TLS certificate from TLS config
// Q-1 closure (cat-s3-58ce7e9840be): defensive skip — httptest.NewTLSServer
// always provisions a self-signed certificate at construction time, so this
// branch is currently unreachable in practice. Kept as a guard against
// future test-server constructions that swap in a custom *tls.Config with
// no Certificates slice (the path below dereferences server.TLS.Certificates[0]
// and would panic). The skip preserves the assertion logic for the normal
// fixture path; if it ever fires, it's a fixture bug, not a product bug.
if len(server.TLS.Certificates) == 0 {
t.Skip("no TLS certificates configured on test server")
}
+17
View File
@@ -28,6 +28,23 @@
// The tests skip cleanly with t.Skip when docker is not available
// (CI without docker-in-docker, sandbox environments, etc.) so they
// don't block local development on machines without docker.
//
// Q-1 closure (cat-s3-58ce7e9840be): this file's 5 t.Skip sites are
// audited and intentional:
//
// - Line 85, 146, 207: `if !dockerAvailable(t)` skips when `docker info`
// fails. These are precondition gates; without docker there's nothing
// to assert against. Run via: `docker info >/dev/null && go test
// -tags integration ./deploy/test/...`.
// - Line 209-210: `if testing.Short()` keeps the ~45s runtime probe
// off the default `go test ./... -short` path. Run via: omit -short.
// - Line 212: hard t.Skip for the runtime probe contract — image-spec
// contract above (TestPublishedServerImage_HealthcheckSpecUsesHTTPS)
// covers the audit-flagged regression at the Dockerfile-source level.
// Re-enable once the integration harness provisions a sidecar postgres
// for image-level smoke; the existing skip message names this
// remediation explicitly. Tracked via the in-source TODO (intentional,
// not abandoned).
package integration_test
import (
+38
View File
@@ -500,6 +500,15 @@ func TestIntegrationSuite(t *testing.T) {
}
time.Sleep(3 * time.Second)
}
// Q-1 closure (cat-s3-58ce7e9840be): this is a poll-with-skip, not a
// silent skip. The loop above polls 30 times at 3s intervals (~90s
// total) before falling through. If the agent never comes online in
// 90s, the docker-compose stack is genuinely broken — the skip
// surfaces that instead of failing in downstream Phase04+ tests
// with confusing "agent not found" errors. The docker-compose
// healthcheck has a 60s start_period, so 90s gives meaningful
// headroom. Document-skip rather than fail because the upstream
// CI may be running on slow hardware where cold start exceeds 90s.
if !ok {
t.Skip("agent not yet online (may be slow to heartbeat)")
}
@@ -786,6 +795,12 @@ func TestIntegrationSuite(t *testing.T) {
// Phase 7: Revocation
// -----------------------------------------------------------------------
t.Run("Phase07_Revocation", func(t *testing.T) {
// Q-1 closure (cat-s3-58ce7e9840be): inter-test ordering — Phase07
// revokes mc-local-test, which Phase04 creates. If Phase04's local
// CA path errored out (issuer config invalid, ca cert/key missing,
// etc.) localCertCreated stays false and there's no certificate
// to revoke. Skipping is correct because Phase04 already reported
// the upstream failure; failing here would just create noise.
if !localCertCreated {
t.Skip("depends on Phase04 (Local CA cert not created)")
}
@@ -873,6 +888,15 @@ func TestIntegrationSuite(t *testing.T) {
if err := decodeJSON(resp, &pr); err != nil {
t.Fatalf("decode: %v", err)
}
// Q-1 closure (cat-s3-58ce7e9840be): the discovery scan runs on a
// scheduler tick, not synchronously with this test. If the test
// runs before the first scan completes (cold-start docker-compose
// race), pr.Total is 0 and there's no discovered cert to assert
// against. Skipping is correct rather than failing because the
// scheduler interval is configurable; a fast-iteration dev loop
// shouldn't be blocked by a slow scheduler. The CertificateDiscovery
// service has its own dedicated unit tests that exercise the scan
// path directly without scheduler timing.
if pr.Total < 1 {
t.Skip("no discovered certificates yet (agent scan may not have run)")
}
@@ -907,6 +931,13 @@ func TestIntegrationSuite(t *testing.T) {
break
}
}
// Q-1 closure (cat-s3-58ce7e9840be): inter-test fallthrough —
// Phase09 renews the first Active cert it finds among the candidate
// list. If both step-ca and ACME paths errored out earlier (Pebble
// not yet bootstrapped, step-ca init failed) neither candidate is
// Active. Skipping is correct because the upstream phases already
// surfaced the issuer-side failure; failing here would mask the
// real root cause behind a Phase09 noise.
if renewalCert == "" {
t.Skip("no certificate in Active state for renewal test")
}
@@ -1087,6 +1118,13 @@ func TestIntegrationSuite(t *testing.T) {
lastVersion := versions[len(versions)-1]
pemData := lastVersion.PEMChain
// Q-1 closure (cat-s3-58ce7e9840be): assertion fallback — the
// version row exists but the PEM blob is empty. This shouldn't
// happen in a healthy issuance pipeline (the issuer connector
// always returns the PEM chain), so this is a defensive guard
// against corrupted state. Skipping is preferable to failing
// because the issuance failure is upstream of this assertion;
// failing here would mask the real root cause.
if pemData == "" {
t.Skip("no PEM data in certificate version")
}
+15
View File
@@ -34,6 +34,21 @@
// is an explicit opt-out for bootstrap scenarios — there is no silent
// plaintext downgrade, matching the server-side pre-flight guard added in
// Phase 5 (task #203).
//
// Q-1 closure (cat-s3-58ce7e9840be): this file contains 11 `t.Skip("Requires
// X — manual test")` markers across the Part10..Part37 subtests
// (Sub-CA, ARI, Vault, DigiCert, CLI binary, MCP-server binary,
// scheduler-timing, docker-log inspection, and three browser-UI parts).
// Each marks a subtest that exercises a path requiring real external
// services or human-in-the-loop verification — they were never meant
// to run unattended in CI. The file-level `//go:build qa` tag at line 1
// already keeps them out of the default `go test ./...` invocation;
// the runtime t.Skip is the second-line guard for operators who run
// `-tags qa` against a stack that doesn't have the required external
// service available. The audit recommendation was "audit each skip and
// decide" — for these 11, the decision is **document-skip**: the gating
// is correct, and the t.Skip messages already name the missing
// precondition. No restructuring needed.
package integration_test
import (
@@ -413,9 +413,15 @@ func TestEmail_SendAlert_ValidationFailure(t *testing.T) {
// We expect an error because the SMTP server doesn't exist
// The exact error depends on network conditions, but we know it should fail
//
// Q-1 closure (cat-s3-58ce7e9840be): anti-fixture skip — the test
// asserts that sending to a non-existent SMTP server fails. If a
// captive portal, SOHO router, or test sandbox happens to resolve
// smtp.example.com:587 to a black hole that returns success, the
// assertion is invalid and we skip rather than false-pass. The
// IANA-reserved example.com domain shouldn't resolve to an active
// SMTP server in practice; this skip is the defensive fallback.
if err == nil {
// In some environments this might succeed if the host/port resolves oddly
// but in most cases it will fail
t.Skip("test requires no service on smtp.example.com:587")
}
}
@@ -487,6 +493,12 @@ func TestEmail_ValidateConfig_ConnectionRefused(t *testing.T) {
conn := New(&Config{}, logger)
err := conn.ValidateConfig(context.Background(), rawConfig)
// Q-1 closure (cat-s3-58ce7e9840be): anti-fixture skip — the test
// asserts that ValidateConfig fails to reach an SMTP server on a
// random high port (54321) that nothing should be listening on.
// If the port happens to be occupied (rare in CI, possible on a
// dev machine), we skip rather than false-pass. The dial-error
// path below is the actual assertion target.
if err == nil {
t.Skip("test assumes no service on 127.0.0.1:54321")
}
+10 -1
View File
@@ -81,7 +81,13 @@ func TestIISConnector_ValidateConfig_Success(t *testing.T) {
// We test the validation logic up to that point by checking the error message.
err := connector.ValidateConfig(context.Background(), rawConfig)
if err != nil {
// If it's just a "powershell not found" error, that's expected on Linux
// Q-1 closure (cat-s3-58ce7e9840be): platform-gated skip — IIS
// connector dispatches via powershell.exe; the binary only exists
// on Windows hosts. This branch lets the test pass on Linux/macOS
// CI runners where powershell.exe isn't available; on Windows
// runners the assertion below runs normally. The iis_connector.go
// production code has the same platform check; this skip mirrors
// it at test-fixture level.
if strings.Contains(err.Error(), "powershell.exe not found") {
t.Skip("Skipping: powershell.exe not available (non-Windows)")
}
@@ -212,6 +218,9 @@ func TestIISConnector_ValidateConfig_DefaultValues(t *testing.T) {
err := connector.ValidateConfig(context.Background(), rawConfig)
if err != nil {
// Q-1 closure (cat-s3-58ce7e9840be): same platform-gate as
// TestIIS_ValidateConfig_Empty above; mirrors the production
// LookPath("powershell.exe") guard in iis_connector.go.
if strings.Contains(err.Error(), "powershell.exe not found") {
t.Skip("Skipping: powershell.exe not available (non-Windows)")
}
+10
View File
@@ -1937,6 +1937,9 @@ func seedPendingJobs(t *testing.T, ctx context.Context, db *sql.DB, certID strin
// semantics: a single call transitions Pending rows to Running atomically, and
// the rows returned to the caller reflect the post-update state.
func TestJobRepository_ClaimPendingJobs_FlipsToRunning(t *testing.T) {
// Q-1 closure (cat-s3-58ce7e9840be): exercises the SKIP-LOCKED claim
// SQL against a live PostgreSQL via testcontainers-go. Run with:
// go test -count=1 ./internal/repository/postgres/... (omit -short)
if testing.Short() {
t.Skip("integration test requires PostgreSQL")
}
@@ -1993,6 +1996,9 @@ func TestJobRepository_ClaimPendingJobs_FlipsToRunning(t *testing.T) {
// an atomic progress counter before exiting, so transient SKIP-LOCKED zeros do
// not cause premature termination.
func TestJobRepository_ClaimPendingJobs_ConcurrentDisjoint(t *testing.T) {
// Q-1 closure (cat-s3-58ce7e9840be): concurrent claim semantics
// require true row-level locking — only PostgreSQL provides this.
// Run with: go test -count=1 ./internal/repository/postgres/... (omit -short)
if testing.Short() {
t.Skip("integration test requires PostgreSQL")
}
@@ -2100,6 +2106,10 @@ func TestJobRepository_ClaimPendingJobs_ConcurrentDisjoint(t *testing.T) {
// Running; AwaitingCSR rows are returned but their state is preserved (the CSR
// submission path drives their next transition).
func TestJobRepository_ClaimPendingByAgentID_TransitionsDeployments(t *testing.T) {
// Q-1 closure (cat-s3-58ce7e9840be): Pending→Running deployment-job
// transition vs CSR-flow preservation requires the live PostgreSQL
// transactional semantics. Run with:
// go test -count=1 ./internal/repository/postgres/... (omit -short)
if testing.Short() {
t.Skip("integration test requires PostgreSQL")
}
@@ -84,6 +84,9 @@ func TestRunSeed_AppliesIdempotently(t *testing.T) {
// We point at a directory that exists (empty temp dir) but contains no
// seed.sql. RunSeed must return nil silently.
func TestRunSeed_MissingFileIsNoOp(t *testing.T) {
// Q-1 closure (cat-s3-58ce7e9840be): RunSeed opens a *sql.DB connection
// against the live PostgreSQL testcontainer. Run with:
// go test -count=1 ./internal/repository/postgres/... (omit -short)
if testing.Short() {
t.Skip("skipping integration test in short mode")
}
@@ -30,6 +30,11 @@ type testDB struct {
func setupTestDB(t *testing.T) *testDB {
t.Helper()
// Q-1 closure (cat-s3-58ce7e9840be): live PostgreSQL needed via
// testcontainers-go (postgres:16-alpine). Run with:
// go test -count=1 ./internal/repository/postgres/... (omit -short)
// The short-mode gate keeps it off the default `go test ./... -short`
// fast loop where docker-in-docker may not be available.
if testing.Short() {
t.Skip("skipping integration test in short mode")
}