Skip to content

Benchmark Overview

This suite validates a single systems claim:

  • Public-path recomputation should be bounded by dependency complexity (k), not dataset size (n).
  • Secret-path overhead must be measurable, bounded, and continuously improved without changing DSL semantics.

Benchmark Matrix

BenchmarkFileWhat It Proves
#5 Sustained Mutationtests/Benchmarks/benchmark.5.sustained-mutation.test.tsThroughput stability over long mutation streams; p95 drift over time windows.
#6 Fan-Out Sensitivitytests/Benchmarks/benchmark.6.fanout-sensitivity.test.tsLatency behavior as fan-out grows, with constant derivation complexity k.
#7 Cold vs Warmtests/Benchmarks/benchmark.7.cold-warm-profiles.test.tsSeparation of cold setup cost vs warm and steady-state runtime.
#8 Explain Overheadtests/Benchmarks/benchmark.8.explain-overhead.test.tsObservability overhead of explain(path) vs baseline mutation/read loops.
#9 Secret-Scope Impacttests/Benchmarks/benchmark.9.secret-scope-impact.test.tsPublic vs secret latency envelope under equivalent workloads.
#10 Push vs Pulltests/Benchmarks/benchmark.10.push-vs-pull.test.tsIsolation of write-only (push) and first-read-after-write (pull) in eager vs lazy modes.
#11 Secret Push vs Pulltests/Benchmarks/benchmark.11.secret-push-vs-pull.test.tsSecret/public split of push vs pull; confirms secret-path cost structure after chunking/cache refactors.
Regression Gatetests/Benchmarks/benchmark.regression-gate.test.tsCI pass/fail checks for p95 latency, k complexity bound, and stealth masking correctness.

Latest Results (Local Baseline)

Machine: Suis-MacBook-Air
Run context: local, March 2026

#5 Throughput Under Sustained Mutation

MetricValue (ms)
p500.0041
p950.0077
p990.0132
max0.2680

Windowed p95 drift: -65.79% (end window vs start window).

Interpretation:

  • No upward drift under sustained updates.
  • Throughput remains stable as history grows.

#6 Fan-Out Sensitivity Curves

Fanoutkp50 (ms)p95 (ms)p99 (ms)
1020.00980.01850.0919
10020.00590.01050.0158
50020.00570.00910.0144
100020.00430.00680.0181
250020.00400.00560.0114
500020.00340.00380.0072

Interpretation:

  • k stays constant at 2.
  • Latency stays in micro-to-low-millisecond range across fanout values.

#7 Cold vs Warm Runtime Profiles

NodesCold (ms)Warm (ms)Steady Avg (ms)Steady Min (ms)Steady Max (ms)
1000.18990.09480.01610.00770.3110
10000.00820.01370.00440.00410.0067
50000.00950.00960.00470.00360.0109

Interpretation:

  • Cold penalty is isolated.
  • Warm/steady paths are consistently fast.

#8 Explain Overhead Budget

Modep50 (ms)p95 (ms)p99 (ms)
baseline0.00550.01010.0173
with_explain0.00920.01380.0244

p95 overhead: 36.63%.

Interpretation:

  • explain(path) adds bounded overhead while preserving traceability.
  • Absolute overhead remains sub-millisecond, making it production-safe for real-time auditing.

#9 Secret-Scope Performance Impact

Scopep50 (ms)p95 (ms)p99 (ms)
public0.00710.01500.1111
secret0.24920.29680.5470

Secret p95 slowdown vs public: 1878.33%.

Interpretation:

  • Secret path remains slower than public by design cost (crypto + boundary logic), but now sits in sub-millisecond p95 absolute range for this scenario.

#10 Push vs Pull (Eager vs Lazy)

Selected rows (fanout = 5000):

ModeFanoutkMutation p95 (ms)Read p95 (ms)
eager500020.00170.0022
lazy500020.00280.0027

Interpretation:

  • Both modes are now low-latency.
  • Lazy/eager semantics are selectable without destabilizing performance envelopes.

#11 Secret Push vs Pull (Chunked Secret Storage)

PlaneNodesMutation p95 (ms)Read p95 (ms)
public1000.00560.0121
secret1000.02160.3438
public3000.00270.0045
secret3000.01220.1566
public6000.00270.0041
secret6000.01170.2653

Slowdown ratios (secret/public p95):

NodesMutation slowdown xRead slowdown x
1003.86x28.41x
3004.52x34.80x
6004.33x64.71x

Interpretation:

  • Chunking and cache reduced secret mutation slowdown to single-digit multiples.
  • Read slowdown remains higher than write slowdown, but absolute read p95 remains sub-millisecond.

Regression Gate Status

Latest gate output:

  • latency_p95: ✅ 0.0125ms (threshold 20ms)
  • complexity_k: ✅ k=2 (threshold <=4)
  • stealth_masking: ✅ origin=stealth, masked=true, value=●●●●

What Is Proven Now

  • Public-path performance is stable and effectively bounded by small k.
  • Lazy/eager recompute modes are operational and benchmarked.
  • Explainability overhead is measurable and bounded.
  • Secret-path cost is no longer monolithic; chunked secret storage materially improved write-side scaling.
  • Privacy and semantic invariants remain intact (test:prebuild green).

Next Optimization Frontier

  • Further reduce secret read p95 under high node counts by:
  • increasing chunk locality for hot-read paths,
  • optional multi-read amortization benchmarks (reads_per_mutation),
  • and tuning chunk size/hash-bucket parameters with empirical thresholds.