PhiLab Workbench · Decision Logic · Interactive Prototype
This subpage is not a mood board for the PRD. It is an interactive V1 of the core logic: constraints, hypotheses, evidence, gate decisions, and pivots run together as one research workbench.
The page follows the PRD on purpose: compiler logic and research state first, benchmark sweeps second, gate consequences third. That keeps the project close to DG-1 instead of drifting into elegant dashboards without scientific consequences.
The workbench combines a constraint compiler, a hypothesis scoreboard, a gate dashboard, pivot logic, and two executable browser benchmarks. In addition, a real QuTiP trajectory run is now linked below as an exportable backend bundle.
Self-dual AA specs are blocked explicitly. Monitoring without a physical window is not softened into vague optimism. If kill criteria trigger, a pivot path appears immediately instead of narrative drift.
This page translates hypotheses, theoretical constraints, evidence records, and decision gates into an interactive interface. For students it is a practical introduction to kill criteria and pivots. For researchers it is a prototype research-OS layer: specs are compiled, risk structure is made explicit, and benchmark sweeps are tied directly to gate states instead of remaining presentation slides.
The workbench follows the PRD directly: theoretical constraints block invalid specs, hypotheses are scored through evidence records, and decision gates translate results into concrete research branches. This is the minimum viable form of a serious research OS.
The scoreboard and constraint inspector expose the same state from two angles: what is active, what is blocked, where formal damage already exists, and where genuine room to explore still survives.
Status transitions only happen through evidence records, kill criteria, and formal pivot choices.
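The rule that status only changes through explicit recorded events can be sketched as a small transition table. This is a minimal illustration, not the workbench's real schema: the status names, event names, and allowed transitions below are assumptions chosen to show the shape of the rule.

```python
# Sketch of "status transitions only happen through explicit events".
# Status and event names are illustrative assumptions, not the real schema.

ALLOWED = {
    ("active", "evidence"): "supported",   # evidence record updates the score
    ("active", "kill"):     "killed",      # a kill criterion fired
    ("killed", "pivot"):    "pivoted",     # a formal pivot choice was made
}

def transition(status: str, event: str) -> str:
    """Reject any status change that is not an explicit recorded event."""
    key = (status, event)
    if key not in ALLOWED:
        raise ValueError(f"no narrative drift: {status!r} cannot change via {event!r}")
    return ALLOWED[key]

print(transition("active", "kill"))   # prints "killed"
```

The point of the lookup-table shape is that there is no default branch: any transition not written down is an error, which is exactly the "no narrative drift" property the page describes.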
TC-1 and TC-2 remain visible as active priors. When a spec is compiled, the exact blocking rule is highlighted immediately.
Choose a preset, modify the regime, or intentionally provoke a failure case. The workbench checks the allowed search space first and only then executes the benchmark. That makes the PRD logic tangible instead of decorative.
Each run shows transparently why a spec was accepted or blocked.
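The compile-before-run order described above can be sketched in a few lines. Everything here is an illustrative assumption except the rule IDs TC-1 and TC-2 and the two blocked cases the page names (self-dual AA specs, monitoring without a physical window): the spec fields, the exact predicates, and the function names are hypothetical.

```python
# Minimal sketch of the compile-then-run order: check the allowed search
# space first, only then execute a benchmark. Rule IDs (TC-1, TC-2) come
# from the page; the spec fields and predicates are illustrative assumptions.

from dataclasses import dataclass

@dataclass
class Spec:
    model: str          # e.g. "aa" for an Aubry-Andre-type chain
    lam: float          # quasiperiodic potential strength lambda
    monitored: bool     # whether a monitoring window is requested
    gamma: float        # measurement rate (relevant only if monitored)

def compile_spec(spec: Spec) -> tuple[bool, str]:
    """Return (accepted, reason); a blocked run names the exact rule."""
    # TC-1 (illustrative predicate): the self-dual AA point is blocked.
    if spec.model == "aa" and abs(spec.lam - 1.0) < 1e-9:
        return False, "TC-1: self-dual AA spec (lambda = 1) is blocked"
    # TC-2 (illustrative predicate): monitoring needs a physical window.
    if spec.monitored and spec.gamma <= 0.0:
        return False, "TC-2: monitoring without a physical window is blocked"
    return True, "spec accepted"

ok, reason = compile_spec(Spec(model="aa", lam=1.0, monitored=False, gamma=0.0))
print(ok, reason)   # the blocked run reports the exact rule that fired
```

Returning the rule identifier alongside the boolean is what makes the "why was this blocked" transparency cheap: the UI only has to display the reason string.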
No benchmark active yet.
The focus point shows how a local metric immediately turns into a gate-relevant decision.
Depending on the mode, the page renders the core observables against lambda or gamma.
The gapped mode renders cut profiles, while the monitoring mode switches to explicit trajectory fans.
DG-1 and DG-2 are presented here as operational switches. That makes the core PRD demand visible: the project should move through explicit research branches, not elegant narratives alone.
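A gate as an "operational switch" can be made concrete as a pure mapping from evidence to a research branch. The gate IDs DG-1 and DG-2 come from the page; the branch labels and the decision rule below are hypothetical placeholders meant to show the shape, not the project's actual branch definitions.

```python
# Sketch of a decision gate as an operational switch: evidence in,
# research branch out. Gate IDs (DG-1, DG-2) are from the page;
# branch labels are hypothetical.

def gate_decision(gate: str, kill_criterion_fired: bool) -> str:
    branches = {
        ("DG-1", False): "continue: gapped sensor path",
        ("DG-1", True):  "pivot: drop H1-gapped, open alternative spec",
        ("DG-2", False): "continue: monitoring-window search",
        ("DG-2", True):  "pivot: re-scope monitoring branch",
    }
    return branches[(gate, kill_criterion_fired)]

print(gate_decision("DG-1", True))  # prints the pivot branch for DG-1
```

Because every (gate, outcome) pair maps to a named branch, there is no state in which a negative result leaves the project without a next step, which is the anti-narrative-drift property the gates enforce.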
The browser workbench intentionally stays lightweight. At the same time, the current Python reference runs from the separate philab-workbench project are embedded here as complete bundles. That keeps it explicit which results belong to the browser simulation and which exist as external artifacts with manifest, summary, raw data, and report. Newly added are a small Quimb ED run and a compact Quimb scaling scan for the same gapped sensor path, both currently showing a clear negative result.
exp_aa_gapped_quimb_001 · small ED check for H1-gapped
Exact reference run with quimb_exactdiag on an 8-site proxy chain. The current result is unambiguous: KC-1 is triggered because the minimum edge-support metric, 0.345, sits well below the support threshold.
exp_aa_gapped_quimb_scaling_001 · small scaling scan for H1-gapped
Multi-size run with quimb_exactdiag across 8 to 12 sites. The result stays hard negative here as well: KC-1 remains active, and edge support stays at only 0.115 even for the largest tested size.
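The shape of a KC-1-style check over a size scan can be sketched as follows. Only the edge-support values 0.345 (8 sites) and 0.115 (12 sites) are taken from the bundle summaries above; the threshold value, the function name, and the data layout are placeholder assumptions, since the page does not state the bundle's actual threshold.

```python
# Hedged sketch of a KC-1-style kill check over a multi-size scan.
# Only 0.345 (8 sites) and 0.115 (12 sites) are quoted from the bundles;
# the threshold and all names are placeholder assumptions.

SUPPORT_THRESHOLD = 0.5   # assumed placeholder, not the bundle's real value

def kc1_triggered(edge_support_by_size: dict[int, float],
                  threshold: float = SUPPORT_THRESHOLD) -> bool:
    """KC-1 fires when the minimum edge support stays below threshold."""
    return min(edge_support_by_size.values()) < threshold

scan = {8: 0.345, 12: 0.115}
print(kc1_triggered(scan))  # -> True: KC-1 stays active across the scan
```

Taking the minimum over all tested sizes encodes the scaling question directly: the kill criterion only clears if every size in the scan recovers above-threshold support.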
exp_monitoring_qutip_001 · reduced Z3 monitoring window
2-site toy model, 12 QuTiP trajectories, backend qutip_mcsolve. The current export should be read as weakened: the reduced model does not yet open a robust pre-Zeno window, but it is deliberately not treated as a hard falsification of the full monitoring path.
The two earlier Python bundles remain visible on purpose. They mark the first reproducible baseline for the gapped sensor path and for the monitoring-window search before heavier backends are switched on. Together with the two Quimb runs, they now form a clearer progression of reference paths.
The same bundle logic now appears directly inside the interactive session report whenever one of these presets is active.
qutip_mcsolve from a real Python run, not a browser mockup. gamma_c_target = 0.0 weakens H3 in the toy model, but it does not replace a larger interacting study.

The small Quimb ED run is the hardest local reality check for H1-gapped. Already at 8 sites, edge support falls clearly below the support threshold. This is not a soft warning but an early no-go in the small proxy model.
The scaling run answers the next fair question: was the 8-site result just an artifact? The current 8-to-12-site scan says no. Edge support remains weak even at the largest tested size.
The QuTiP path answers a different question. It does not test H1-gapped but the monitoring branch H3. The language remains intentionally more careful there: weakened rather than hard kill, because the reduced toy model does not replace the larger interacting study.
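The three-valued language the page insists on (supported / weakened / killed) can be pinned down as a tiny verdict rule. The function and argument names are hypothetical; the substance, that a negative result in a reduced toy model weakens a hypothesis without killing it, is taken from the text above.

```python
# Sketch of the page's graded verdict language for the monitoring branch.
# Names are hypothetical; the weakened-vs-killed distinction is from the text.

def verdict(window_opened: bool, model_is_reduced: bool) -> str:
    """Grade a monitoring result: supported, weakened, or killed."""
    if window_opened:
        return "supported"
    # A negative result in a reduced toy model weakens but does not kill:
    # only the full interacting study can trigger a hard falsification.
    return "weakened" if model_is_reduced else "killed"

print(verdict(window_opened=False, model_is_reduced=True))  # prints "weakened"
```

Encoding the distinction explicitly prevents the failure mode the page warns about: a toy-model negative quietly hardening into a full kill in later summaries.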
A theoretical constraint is not an intuition marker but a formal limit on the search space. In the workbench it blocks specifications that contradict the known state of the research program.
Because the project should not get trapped in elegant intermediate stories. The gates force numerical and analytical results to become explicit go / no-go decisions.
The workbench shows how research logic becomes operational: hypothesis, kill criterion, benchmark, evidence, and pivot appear here as concrete interface states rather than vague project language.
@software{lanz2026philab-page-en,
author = {Lanz, Marc},
title = {PhiLab Workbench},
year = {2026},
url = {https://phi.lanz.es/phi_workbench-en.html},
repository = {https://github.com/Devdorado/philab-workbench}
}