dual-research · spec dashboard
read-only view of
specs/, handoffs/ and dashboard/events/ at mainpause_circle
Queue · idle
Nothing in flight. Run
/dev-next in your queue session to start the next spec.0 in flight0 queuedlast shipped 48s ago
48s ago
last deploy · 0216
Drafts
0
ideation
Queued
0
pending · queue empty
In flight
0
idle
Shipped
76
all-time · 72 with full timings
Avg cycle (last 10)
16m 21s
↓ 4m 54s vs prior 10
Queue
/dev-next picks position 1 first
#
Spec
Title
Type
Waiting
Queue is empty. Promote a draft with
/spec-promote <id> or queue a new one with /spec-queue.Recent activity
last 24 hours
13:27:22 UTC
bookmark
handoff written0216 · handoffs/2026-05-25-spec-0216-raiser-self-address-observability.md
—
13:25:49 UTC
check_circle
deployed0216 · Make raiser self-ADDRESS visible via ProtocolViolation · v1.44.21
—
13:02:37 UTC
bookmark
handoff written0215 · handoffs/2026-05-25-spec-0215-fix-in-round-partner-blindness.md
—
12:41:01 UTC
bookmark
handoff written0214 · handoffs/2026-05-25-spec-0214-drop-phase4-agent-emitted-hash-gate.md
—
12:40:08 UTC
check_circle
deployed0214 · Drop Phase 4 agent-emitted SHA-256 gate; converge on version equality · v1.44.19
—
12:22:58 UTC
flag_circle
in progress0214 · Drop Phase 4 agent-emitted SHA-256 gate; converge on version equality
—
12:04:25 UTC
add_task
queued0214 · Drop Phase 4 agent-emitted SHA-256 gate; converge on version equality
—
Showing 1–10 of 40
Drafts
backlog · promote with
/spec-promote <id>—
No drafts parked.
—
Total time spent
across 72 timed cycles
Total elapsed
1d 17h
across 72 timed cycles
Mean cycle
16m 46s
excluding 4 outliers > 1h
Median cycle
16m 7s
p50 of timed runs
Fastest / Slowest
4m 20s / 10h 50m
0159 / 0152
All specs
76 shipped · 0 queued · 0 in flight
Spec
Title
Type
Status
Lifetime
Cycle
Refactor: collapse 11-stage timeline to 7 honest spans + decimal sub-spec treatment
refactoring
deployed
11h 37m
23m 18s
Tests: lock the spec 0212 explicit-cleanup ordering invariant (re-detach must precede `git branch -D`)
test
deployed
10h 53m
10m 12s
Refactor: /dev-next — eliminate the post-merge race window (split `gh pr merge` from branch-cleanup, buffer all post-merge `--push-to-main` writes until step 23)
refactoring
deployed
10h 29m
16m 15s
Refactor: /dev-next step 20 — survive deploy-main concurrency-group cancellation of the merge-commit run
refactoring
deployed
9h 57m
9m 24s
Refactor: /dev-next step 20 — capture MERGE_SHA at merge time, not after subsequent --push-to-main commits advance origin/main
refactoring
deployed
1d 9h
16m 15s
Refactor: scripts/sweep_stale_blues.sh — prefer flyctl over fly so the CI sweep step in deploy.yml actually runs
refactoring
deployed
1d 9h
24m 9s
Refactor: /dev-next step 21 — delegate fly deploy to .github/workflows/deploy.yml (eliminate local-vs-GHA deploy race)
refactoring
deployed
13h 33m
15m 56s
Refactor: /dev-next step 19/20/24 — eliminate worktree cross-conflict on main checkout (push-via-plumbing for handoff + archive, queue worktree never holds main)
refactoring
deployed
13h 14m
22m 34s
Fix: CollapsibleSection bodies stay hidden when open due to legacy how-it-works CSS bleed
bug
deployed
12h 34m
22m 54s
Fix: provider chip dim in critique — promote tone-claude/gpt override into the base Chip primitive
bug
deployed
12h 8m
20m 22s
Fix: link-variant icon missing from registry — evidence/sources badges render as blank blue square
bug
deployed
11h 44m
7m 21s
Tests: canonicalize the JSX-loaded-via-babel test doctrine (Playwright harness or source-pattern as the documented project pattern)
test
deployed
11h 30m
15m 18s
Refactor: remove dead .kind-tab / .kind-tabs CSS + shared.jsx variant="kind" / variant="kind-tabs" branches (post-spec-0205 cleanup)
refactoring
deployed
11h 7m
18m 20s
Tests: ItemCard 8-capture parity grid — refresh references against post-0205 anatomy and retroactively attach to spec 0205
test
deployed
10h 44m
39m 43s
Fix: P4 critique card — five visual regressions (sources layout/icon, lifecycle-first ordering, coloured kind filter chips, raiser badge defaulting to System)
bug
deployed
3h 59m
29m 53s
Timeline pane V2 — promote workshop lock to live (T1 card height, T2.a click/modal split, T2.b cost precision parity)
new-feature
deployed
3h 23m
23m 43s
Fix: cycle-mutable status readers — overlay queue-state.json in current_queue AND queue_drain_supervisor
bug
deployed
2h 54m
12m 32s
Fix: dashboard live API + L-spec event vocab parity with queue-state.json
bug
deployed
2h 17m
21m 45s
Main-commit noise reduction — queue-state file + handoff archival + event consolidation
new-feature
deployed
4m 21s
31m 7s
Branch & safety hygiene — verified deletes, dirty-tree refusal, pre-push branch assertion
refactoring
deployed
0s
10m 57s
Fly deploy simplification — drop bluegreen, no-retry-on-failure rule, codify recovery path
refactoring
deployed
0s
12m 56s
Queue mechanics — decimal sub-numbering, promote-as-next, drop queue_position
new-feature
deployed
22h 47m
22m 41s
Dev-spec quality guardrails — no questions, source traceability, BDD acceptance criteria
new-feature
deployed
22h 4m
20m 14s
Refactor: rewrite dashboard mockup to use canonical DS token names so the parity allowlist shrinks to empty
refactoring
deployed
17h 17m
8m 18s
Fix: authoring funnel `promo_pct` numerator counts all queued specs, blows past 100% when most specs skip the draft step
bug
deployed
17h 3m
8m 21s
Fix: All-Runs renders `0s ago` for abandoned rows that have no transcript
bug
deployed
16h 52m
10m 1s
Fix: build_headless_command passes --cwd which the installed claude CLI rejects
bug
deployed
15m 28s
9m 30s
Stale-blue sweep filter — catch machines on an out-of-release image
new-feature
deployed
16h 39m
11m 13s
L-spec checkpoint budget heuristic — wall-clock session age trigger
new-feature
deployed
16h 26m
8m 30s
Refactor: extract queue-drain supervisor to Python with unit tests
refactoring
deployed
16h 15m
9m 3s
Refactor: TimelineTabs `Tab variant=solid` branch from `.is-active` to `data-active=true`, drop the dual CSS selector
refactoring
deployed
16h 2m
10m 1s
Fix: statusCounts.open misses items in `addressed` state — All ≠ Open + Resolved + Drift
bug
deployed
13h 31m
30m 28s
Route ItemCardIssueBody and ItemCardCommentBody through ItemCardThreadView
new-feature
deployed
12h 58m
9m 11s
Refactor: DS reference §13 ItemCard examples — match post-0173 head, evidence-inline, collapse, and QuestionThread bubbles
refactoring
deployed
12h 47m
15m 17s
Refactor: cycle-time line chart Y-axis cap from fixed 60m to a percentile-based / auto-ranged cap
refactoring
deployed
1d 12h
16m 41s
Tests: pixel + structural fidelity check of the live dashboard against `dashboard-redesign-v3-horizontal.html`
test
deployed
1d 12h
9m 24s
Fix: authoring funnel DRAFTS bucket only counts promoted drafts, omits current backlog under `specs/drafts/`
bug
deployed
1d 11h
21m
Fix: bootstrap timeline shows `—` for completed-stage durations after `/api/data` refresh
bug
deployed
1d 11h
8m 4s
Fix: All-Runs reports `running` for runs that died days ago — add time-based liveness check + new `abandoned` lifecycle status
bug
deployed
1d 11h
18m 52s
Fix: Consumption card V2 anatomy — split combined Total tokens bar into total-input + total-output, add output totals block, move cache-savings line
bug
deployed
1d 10h
23m 44s
Fix: critique-card body redundancies (verdict row, bottom anchor, seen-row) + mandate side-by-side parity grid in PR
bug
deployed
1d 10h
9m 23s
Fix: three-section Input panel reveals empty body — drop inner per-piece default-collapse so one click on outer chevron shows content
bug
deployed
1d 10h
7m 57s
Dashboard redesign v3 — horizontal hero + timeline, full-width counter row, populated Metrics tab, pagination, pastel chart palette, light default
new-feature
deployed
21h 40m
38m
Login screen v2 — animated hero, theme toggle continuity, looping chatter
new-feature
deployed
1d 9h
22m 26s
Summary tab v2 — celebratory close-out with verdict, stats, critique outcomes, and markdown download
new-feature
deployed
1d 9h
21m 27s
Fix: /api/data subrequest-limit blowup (502) + drop dashboard poll from 15s to 5s
bug
deployed
20h 28m
11m 52s
Drain deferrals — pulse-info dot, .tab-group-solid rename, segment counts, item-card head/lifecycle/sources/collapse, upstream [object Object] fix
new-feature
deployed
1d
0s
Fix: critique cards render literal ** markdown and re-show cryptic compound IDs
bug
deployed
1d 8h
11m 25s
Fix: Agent Input sub-tab renders two narrow cards causing horizontal scroll in split-pane modals
bug
deployed
1d 8h
8h 14m
/canvas workshop skill — registry-driven pane iteration sandbox with verbatim live + DS snapshots
new-feature
deployed
23h 58m
28m 20s
Dashboard redesign v2 — condensed callouts, tabs, light/dark themes, History total-elapsed banner
new-feature
deployed
22h
50m
Critique pane — item-card frame + head rebuild + expanded lifecycle view + source attribution + affordances
new-feature
deployed
20h 56m
8m
Critique pane — bar2 segmented controls + bar1 drift chip + phase-tab + kind-cluster DS catch-up
new-feature
deployed
20h 45m
10m
Timeline pane — System + Error chip primitives + brief-card refactor + live-state agent-strip wiring + turn-render data-layer fix
new-feature
deployed
20h 31m
16m
Timeline pane — chip polish (identity / activity / category bubble / cost) + light-mode token drift fix
new-feature
deployed
20h 13m
13m 30s
Timeline pane — M3 card chrome + phase header simplification + narrow-view strip equalisation
new-feature
deployed
19h 57m
2h 42m
Deferred-spec subagent in /dev-next — auto-capture in-flight deferrals as queued specs or drafts
new-feature
deployed
14h 7m
6m 8s
/spec-queue auto-decomposition — conservative bundle-by-default, auto-chained sub-specs
new-feature
deployed
13h 55m
5m 19s
Dashboard live-ness — cycle-started anchor, browser auto-refresh, live elapsed ticker, deploy-pages cleanup
new-feature
deployed
13h 45m
18m 23s
Spec workflow hardening — design-system steering at spec time, run-queue-until-empty skill, project CLAUDE.md
new-feature
deployed
13h 6m
9m
Dashboard redesign — design-system primitives, expandable in-flight hero with stage timeline, activity feed
new-feature
deployed
12h 13m
20m 43s
Spec lifecycle system v1 — typed drafts, dev queue, dashboard, session naming
new-feature
deployed
10h 50m
10h 50m
Design-system parity for critique surface + canonical Agent Input grouping + run-ID copy affordance
bug
deployed
7h 16m
—
Canonical prompt-pieces registry + per-attachment token tracking — protocol, persistence, and consumption surfaces
—
ready
—
—
Sources & provenance — investigation outcome and per-critique-card surface (incl. Phase 4 Issue/Comment patches)
—
ready
—
—
Phase 4 deadlock — draft extractor body retention + escape-valve breadth for terminal-ledger AGREED
—
ready
—
—
Active-agent gradient pulse · uniform critique card heights · run-id chip in header row 2
—
ready
—
—
Substantive-convergence escape valve — canonical-promote when both AGREED with terminal ledger but artifact hashes drift
—
ready
—
—
Unify run-status derivation — single source of truth across the All-Runs list, the run-detail page, and the orchestrator's exit-code emission
—
ready
—
—
Phase 0 round cards + side-by-side critique modal — close the round-by-round visibility gap for the brief negotiation
—
ready
—
—
Run-detail surface rework — agent chips into Timeline pane, narrow critique compaction, M3 segmented phase progress, timeline card chip slim-down
—
in-flight
—
—
CSS finalization + v1 token block removal + IBM Plex removal (close the migration arc)
—
proposed
—
—
Remaining JSX (app / errors / compare / auth / search / shared) v1 → v2 token migration
—
proposed
—
—
Onboarding tour visual rewrite + unified Admin/Settings + server-persisted onboarding state
—
proposed
—
—
Critique filter header height parity + responsive compaction + timeline card right-alignment
—
in-flight
—
—
Deep Research consumption & cost tracking — collapsed/unfolded redesign, canonical piece aggregation
—
implemented
—
—
Critique cards — bucket correctness, expanded-card scroll, badge cleanup, height parity with Timeline
—
merged
—
—
M3 chrome + run-detail header JSX wiring (plus restore live agent strip dropped by spec 0099)
—
proposed
—
—
Loading + States + A11y + Light + Responsive verification sweep (cross-cutting polish + verification of all earlier specs in both themes and three breakpoints)
—
proposed
—
—
Onboarding tour overlay + ProgressSegs admin (8-step tour over the live app via data-tour-anchor attributes; no redraw of underlying surfaces; ProgressSegs 8-segment per-user track)
—
proposed
—
—
How It Works overlay + right-side menu + Changelog (full-screen M3 dialog · sticky right menu · How It Works ↔ Changelog toggle · 9 collapsible sub-sections with inline diagrams)
—
proposed
—
—
Consumption pane full rework — collapsed + unfolded with sub-rows + uniform width across phases with round chip above card + sticky bottom legend
—
proposed
—
—
Timeline pane M3 rework — header chrome + vertical phase rail outside column anchored to header centers + tl-turn variants + single dashed top border on unfold + REPAIR row variant with explainer
—
proposed
—
—
Critique pane M3 rework — Bar 1 (title · phase tabs · totals · drift chip) + Bar 2 (kind tabs · agent · status filters) + collapsible status-grouped sections + Σ Summary state + phase-header sizing taller than card headers
—
proposed
—
—
QuestionThread + unified item-card family (Q · D · I · C cards with one anatomy — who → when → what → quote, six-word verdict vocabulary, tonal bubble + quote inside bubble, dashed footer)
—
proposed
—
—
M3 modal primitive (basic 560 dp + rich 1080 dp, shape-xl, elevation-3, agent-tinted left border)
—
proposed
—
—
M3 tabs + top app bar + chrome compaction (run-list 3→2 headers, top-bar layout when viewing a run)
—
proposed
—
—
M3 cards + AgentStrip + badge inventory + hover elevation-2 rule + AgentStrip badge sizing/symmetry
—
proposed
—
—
M3 atoms — buttons (5 variants) + FAB + icon button + chips (4 kinds) + status pills (canonical OK) + switches + segmented buttons
—
proposed
—
—
Material 3 token & foundation layer (palette, type, shape, elevation, state, motion, density, fonts, icons, base)
—
proposed
—
—
Parser robustness for cross-round Q/A/issue linkage + code-fence awareness (data-integrity overhaul)
—
proposed
—
—
Convergence escape hatches for stuck-AGREED loops (canonical-FSD synthesis, stuck-AGREED escape valve, hard ledger feedback)
—
proposed
—
—
Consumption tab rework — phase headers above rows, single card per agent, no duplicate compact bar
—
in-review
—
—
Agent Input panel — system-prompt fallback, structural hierarchy, modal vertical space
—
in-review
—
—
Hosted UI modal-load perf — warm machine, gzip, immutable cache headers, server-side LRU
—
merged
—
—
Consumption tab agent-card restructure — equal-height, data-top-bars-bottom, wider bars
—
merged
—
—
Run-detail header — agent strip equalization, phase-tab info hierarchy, remove blocking banner
—
merged
—
—
Modal primitive (single + split) + sub-tabs + RoundScrubber + provider-symmetric SourceCard
—
merged
—
—
Timeline + critique restructure (PhaseRail + ChipCluster + 3-axis filter + DriftCluster + Summary panel + CardHeadline migration)
—
merged
—
—
Primitive vocabulary — Button, StatusBadge, Chip, RunIDChip, ThemeToggle, Card, AgentStrip + today's-component migration
—
merged
—
—
Critique data integrity — parser coverage, badge wiring, modal load paths, count reconciliation
—
merged
—
—
Critique rework — Phase 4 answer linkage fix, compact cards, summary tab, timeline-pane re-alignment
—
merged
—
—
Cost-pipeline integrity — preserve metrics on resume, price the cache tier that was actually used, fold tool spend into the headline
—
merged
—
—
Web search audit foundation + protocol parser fixes + resume hardening + --notion repeatable
—
merged
—
—
Critique navigation — first-class Q+D, side-by-side rework, sentiment cards, click-to-highlight
—
merged
—
—
Consumption-tab follow-ups — tier-lookup window, click-to-expand bars, per-phase web-search count
—
merged
—
—
Timeline UX pass — inline unfold, per-input segments, real context windows, parser repairs
—
merged
—
—
Visualisation foundations — modal pattern, summary cards, preflight tabs, attachment ingest
—
merged
—
—
Live data fidelity — round counts, disagreement parsing, terminal-status pills, chip math
—
merged
—
—
⏱
Cycle time WoW
—
Needs ≥ 1 deployed cycle in each of the last two ISO weeks.
⏱
Where time goes
45% in deploy
Mean deploy = 6m 28s. Next: Implement 5m 49s, Handoff 1m 4s.
!
Reconcile drift
—
No reconcile events on the last 10 cycles.
Cycle time over last 22 cycles
Each point is one deployed spec. Dashed = rolling-10 mean. Outliers > 45m (1) clipped to the top.
Cycle timeRolling-10 mean
Where time goes (mean stage durations)
Stacked, last 10 deployed cycles · total mean = 14m 27s
ImplementTestShipDeployHandoff
Throughput · specs deployed per week
last 8 ISO weeks · current week in purple
By type · last 30 days
76 specs total · mean cycle by category
new-feature
29 · 49m 25s
bug
26 · 34m 17s
refactoring
16 · 15m 17s
test
5 · 17m 41s
Success rate · last 30 days
deployed vs failed cycles
Deployed · 76Failed · 0
Spec authoring funnel
how ideas reach deploy
Last 30 days · 0 drafts in backlog + 9 promoted · 100% reached queue · 123% of queued shipped