sparrowwallet/UltrafastSecp256k1

Fork 0

shrec 8af7320c60

Harden audit and fix Windows CUDA build

2026-03-25 14:36:36 +00:00

4.8 KiB

Raw Permalink Blame History

Local CI (GitHub Parity)

This guide is the practical local gate for Linux developers.

Goal: if local parity passes, GitHub Linux CI should not fail for the same reasons.

1. One-command entrypoint

Use the wrapper:

./scripts/ci-local.sh --list
./scripts/ci-local.sh quick
./scripts/ci-local.sh pre-push
./scripts/ci-local.sh gh-parity

ci-local.sh auto-detects docker compose or docker-compose.

2. Recommended flow

Fast iteration:

./scripts/ci-local.sh quick

Before push:

./scripts/ci-local.sh pre-push

Before PR/merge (max Linux parity):

./scripts/ci-local.sh gh-parity

3. What `gh-parity` runs

gh-parity runs Linux jobs mapped to GitHub blockers/advisory checks:

warnings (-Werror)
linux-gcc (Release + Debug)
linux-clang (Release + Debug)
asan
tsan
msan (advisory, non-blocking to match workflow behavior)
valgrind
clang-tidy
cppcheck
ct-verif (deterministic CT LLVM/IR checks)
bench-regression (local baseline compare)
audit (unified_audit_runner on GCC + Clang)
arm64 (cross-compile)
wasm (build + KAT)

4. Benchmark regression baseline

bench-regression stores local baseline in:

.ci-baseline/bench_quick_baseline.json

Behavior:

first run creates baseline and passes
next runs compare current quick bench against baseline
default threshold is 120 percent (20 percent slower fails)

Override threshold:

BENCH_ALERT_THRESHOLD=130 ./scripts/ci-local.sh bench-regression

5. GPU Audit (Local Only)

The GPU audit cannot run on GitHub CI because hosted runners do not expose the required GPU devices. It is still reproducible through the repository's local infrastructure:

run the standard Linux parity stack in Docker via docker-compose.ci.yml or Dockerfile.local-ci
then execute the GPU-specific slices on a host that exposes the required CUDA/OpenCL device stack

In other words, GPU validation is not "developer laptop only"; it is locally reproducible for anyone who brings compatible hardware and maps that hardware into the same repo-shipped environment.

Prerequisites

NVIDIA GPU (any compute capability >= 5.0)
CUDA Toolkit >= 12.0
CMake >= 3.24, Ninja

Build

# From library root:
cmake --preset cuda-audit-5060ti
cmake --build --preset cuda-audit-5060ti -j

Run

# Confirm the GPU audit slice is visible in the active build tree
ctest --test-dir build/cuda-release-5060ti -N

# Run the GPU C ABI validation slice
ctest --preset cuda-audit-5060ti

# Run the standalone CUDA audit runner as well
./build/cuda-release-5060ti/cuda/gpu_audit_runner

Expected Output

The runner executes 43 modules across 10 sections and produces:

Section	Modules	Coverage
Mathematical Invariants	12	Field, scalar, point arithmetic, group order
Signature Operations	3	ECDSA + Schnorr roundtrip, wrong-key rejection
Batch Operations	4	Batch inversion, bloom filter, batch ECDSA verify, MSM
CPU-GPU Differential	1	Generator mul cross-check
Device Memory	2	Alloc/free stress, CUDA error state
Constant-Time Layer	6	CT field/scalar/point, CT ECDSA/Schnorr, CT-FAST parity
Standard Test Vectors	3	BIP-340, RFC-6979, BIP-32
Protocol Security	6	Multi-key ECDSA/Schnorr, ECDH, recovery, BIP-32 chain, Hash160
Fuzzing	4	Edge scalars, zero-key rejection, serialization roundtrip
Performance Smoke	2	ECDSA 100-iter stress, Schnorr 50-iter stress

Verdict: AUDIT-READY when all 43/43 pass.

The canonical GPU-enabled ctest -N output should also include these extra GPU C ABI tests:

gpu_abi_gate
gpu_ops_equivalence
gpu_host_api_negative
gpu_backend_matrix

Reports are written to the build directory:

gpu_audit_report.json -- machine-readable
gpu_audit_report.txt -- human-readable summary

Notes

First run may take ~5 minutes due to PTX JIT compilation (subsequent runs are faster)
The selftest_core module runs 41+ GPU kernel tests and dominates total runtime
CMAKE_CUDA_ARCHITECTURES="native" auto-detects your GPU; explicit SM values avoid JIT overhead
If you prefer a containerized Linux toolchain, use the repo's Docker local CI environment first and then run the GPU-enabled build on a host/container with GPU passthrough.

6. What is still not reproducible on Linux local Docker

These GitHub jobs need non-Linux or hosted integrations:

windows (MSVC)
macOS/iOS (Apple toolchain and Metal runtime)
CodeQL / Scorecard / dependency-review / other GitHub-native services

GPU-backed Linux validation is reproducible locally, but it still depends on host hardware exposure and cannot be reduced to a pure CPU-only container image.

Use local parity for fast prevention, and keep GitHub CI as final cross-platform confirmation.

4.8 KiB Raw Permalink Blame History