7.5 KiB
Fortress Roadmap
Roadmap for closing the remaining assurance-system gap in UltrafastSecp256k1.
The objective is not to wait for outside validation before becoming robust. The objective is to make the repository itself increasingly difficult to break, increasingly easy to audit, and increasingly strict about public claims.
Goal
Move the assurance model from strong and transparent to fortress-grade.
That means:
- Claims are machine-linked to evidence.
- Review blind spots shrink over time.
- External-style review loops are absorbed into deterministic checks.
- Documentation drift becomes harder to introduce.
- Performance and capability governance match public positioning.
- Known failure classes are covered by explicit self-audit surfaces rather than implied confidence.
Practical interpretation:
The target is not to claim literal proof that "all possible bugs" are gone. The target is to drive the self-audit system toward near-total coverage of all known failure classes and to leave any residual risk explicit, scoped, and hard to hide.
Operating posture:
This roadmap assumes the audit class should be strong enough for owner-grade responsibility, even if the team is not currently using the engine to manage its own high-value assets. Because of that, the assurance target is internal safety and operational trustworthiness first, not external optics or marketing trust.
Workstream 1: Canonical Assurance Ledger
Status: in progress
Deliverables:
docs/ASSURANCE_LEDGER.md- Machine-readable companion format
- Claim ID references in public docs
- Stale-claim review checklist in preflight
Outcome:
Public claims become versioned, scoped, and auditable.
Workstream 2: AI-Assisted Audit Formalization
Status: started
Deliverables:
docs/AI_AUDIT_PROTOCOL.md- Review mode taxonomy
- Accepted-finding conversion rules
- Review event logging format
Outcome:
AI review becomes a governed input into the assurance system rather than an informal side channel.
Workstream 3: Graph-Driven Assurance
Status: in progress
Deliverables:
- Graph coverage checks for new source/doc/test surfaces
- Stale-claim detection linked to the graph
- Impact reports for public-surface changes
- Graph-first audit bundles for reviewers
Outcome:
The source graph becomes part of the assurance engine, not just review tooling.
Current progress:
- Claim-surface graph coverage is validated through
scripts/validate_assurance.py. scripts/preflight.py --claimsexposes graph-driven stale-claim checks locally.- Public-surface drift now fails closed when claim evidence resolves on disk but not in the indexed graph.
Self-audit completeness criteria for this workstream:
- Every public claim surface should resolve to concrete repository evidence.
- Every new subsystem should appear in the graph, in docs, and in at least one deterministic validation path.
- Missing graph coverage should be treated as an assurance gap, not a documentation detail.
Workstream 4: GPU Governance Hardening
Status: in progress
Deliverables:
- Stronger GPU-specific benchmark governance
- Clear distinction between stable host ABI and experimental/internal kernels
- Optional ROCm/HIP promotion criteria tied to hardware-backed validation
- Better workflow coverage for GPU public differentiators
Outcome:
The strongest public performance claims become guarded by comparably strong enforcement.
Scope note:
ROCm/HIP hardware-backed validation is a future expansion path for backend coverage and publishability. It is not a prerequisite for the validity of the current audit surfaces for CPU, CUDA, OpenCL, or Metal, and it does not weaken already validated backends while AMD hardware is absent.
Current progress:
docs/GPU_BACKEND_EVIDENCE.jsonnow defines backend status, publishability, and required artifact classes.scripts/validate_assurance.pyvalidates fail-closed ROCm/HIP promotion rules.scripts/preflight.py --gpu-evidenceexposes the local gate for backend publishability checks.- Public narrative surfaces now cross-reference the assurance ledger claim IDs for top-level trust statements.
- GPU parity documentation has been reconciled around the real 13/13 CUDA/OpenCL/Metal stable C ABI surface.
- ROCm/HIP promotion is now documented as a hardware-backed checklist rather than a vague future task.
Workstream 5: Drift-Resistant Documentation
Status: in progress
Deliverables:
- Canonical link graph for README, WHY, SECURITY, AUDIT_GUIDE, and backend docs
- Reduced metric drift across files
- Clearer distinction between active, partial, planned, and experimental states
- Faster stale-count detection
Outcome:
Top-level narrative stays aligned with repository truth as the code evolves.
Workstream 6: Self-Audit Completeness
Status: in progress
Deliverables:
- Failure-class inventory for correctness, parser boundaries, CT, protocol misuse, ABI misuse, GPU host misuse, docs drift, and benchmark overclaim risk
- Deterministic mapping from each failure class to one or more audit surfaces
- Residual-risk accounting for anything not yet covered by a deterministic check
- Fail-closed policy for new features that land without an audit-path mapping
Outcome:
The self-audit system becomes harder to bypass because coverage is measured by problem class, not by raw test counts.
Current progress:
docs/SELF_AUDIT_FAILURE_MATRIX.mdnow maps major failure classes to deterministic audit surfaces and named residual-risk notes.docs/OWNER_GRADE_AUDIT_TODO.mdnow translates the remaining assurance gap into concrete code-and-tooling work for an owner-grade audit class.
Coverage target:
The project should aim for near-total coverage of known failure classes. Practically, that means pushing the residual risk surface toward the smallest possible set of explicitly named gaps instead of treating "lots of tests" as a proxy for completeness.
Immediate Tasks
- Expand claim ID references from top-level docs into more subsystem-specific docs.
- Keep adding review-event logging for newly accepted AI-assisted passes.
- Expand graph-driven stale-claim checks from core public claims into more subsystem-specific docs.
- Expand GPU-facing performance governance beyond backend publishability into reproducibility and artifact-retention checks.
- Push the optional ROCm/HIP promotion checklist into more GPU-facing docs and release procedures without treating it as an audit blocker.
- Expand the failure-class matrix as new subsystems and ABI surfaces land.
- Keep shrinking the named residual-risk set with deterministic gates where possible.
- Implement the owner-grade audit tooling backlog, starting with executable failure-class gates and CT evidence aggregation.
Promotion Criteria For "Fortress" Status
The project should only claim fortress-grade self-audit when all of the following are true:
- Every top-level trust claim maps to ledger evidence.
- AI-assisted review is governed by a documented protocol.
- Graph tooling participates directly in assurance checks.
- GPU public claims have enforcement close to CPU public claims.
- Public docs have a low stale-drift rate over time.
- Known failure classes have named coverage paths or explicit residual-risk entries.
What This Roadmap Does Not Assume
This roadmap does not assume:
- A paid third-party audit is required first
- Trust should depend on a one-time report
- External review is optional
External audit remains welcome. The point is to ensure the repository is already strong, transparent, and rerunnable before anyone arrives.