dual-research · spec dashboard
read-only view of
specs/, handoffs/ and dashboard/events/ at mainpause_circle
Queue · idle
Nothing in flight. Run
/dev-next in your queue session to start the next spec.0 in flight0 queuedlast shipped 8m 12s ago
8m 12s ago
last deploy · 0197
Drafts
0
ideation
Queued
0
pending · queue empty
In flight
0
idle
Shipped
49
all-time · 45 with full timings
Avg cycle (last 10)
12m 2s
↓ 3m 51s vs prior 10
Queue
/dev-next picks position 1 first
#
Spec
Title
Type
Waiting
Queue is empty. Promote a draft with
/spec-promote <id> or queue a new one with /spec-queue.Recent activity
last 24 hours
17:18:27 UTC
bookmark
handoff written0197 · handoffs/2026-05-23-spec-0197-mockup-canonical-token-vocabulary-cleanup.md
—
17:17:36 UTC
check_circle
deployed0197 · Refactor: rewrite dashboard mockup to use canonical DS token names so the parity allowlist shrinks to empty · v1.39.3
—
17:10:09 UTC
flag_circle
in progress0197 · Refactor: rewrite dashboard mockup to use canonical DS token names so the parity allowlist shrinks to empty
—
17:04:02 UTC
bookmark
handoff written0196 · handoffs/2026-05-23-spec-0196-authoring-funnel-promo-pct-numerator-includes-direct-queue-specs.md
—
17:03:05 UTC
check_circle
deployed0196 · Fix: authoring funnel `promo_pct` numerator counts all queued specs, blows past 100% when most specs skip the draft step · v1.39.2
—
16:55:09 UTC
flag_circle
—
16:55:07 UTC
flag_circle
in progress0196 · Fix: authoring funnel `promo_pct` numerator counts all queued specs, blows past 100% when most specs skip the draft step
—
16:53:11 UTC
bookmark
handoff written0195 · handoffs/2026-05-23-spec-0195-all-runs-zero-seconds-ago-for-no-transcript-rows.md
—
16:52:03 UTC
check_circle
deployed0195 · Fix: All-Runs renders `0s ago` for abandoned rows that have no transcript · v1.39.1
—
Showing 1–10 of 40
Drafts
backlog · promote with
/spec-promote <id>—
No drafts parked.
—
Total time spent
across 45 timed cycles
Total elapsed
1d 8h
across 45 timed cycles
Mean cycle
15m 21s
excluding 3 outliers > 1h
Median cycle
13m 30s
p50 of timed runs
Fastest / Slowest
4m 20s / 10h 50m
0159 / 0152
All specs
49 shipped · 0 queued · 0 in flight
Spec
Title
Type
Status
Lifetime
Cycle
Refactor: rewrite dashboard mockup to use canonical DS token names so the parity allowlist shrinks to empty
refactoring
deployed
17h 17m
8m 18s
Fix: authoring funnel `promo_pct` numerator counts all queued specs, blows past 100% when most specs skip the draft step
bug
deployed
17h 3m
8m 21s
Fix: All-Runs renders `0s ago` for abandoned rows that have no transcript
bug
deployed
16h 52m
10m 1s
Fix: build_headless_command passes --cwd which the installed claude CLI rejects
bug
deployed
15m 28s
9m 30s
Stale-blue sweep filter — catch machines on an out-of-release image
new-feature
deployed
16h 39m
11m 13s
L-spec checkpoint budget heuristic — wall-clock session age trigger
new-feature
deployed
16h 26m
8m 30s
Refactor: extract queue-drain supervisor to Python with unit tests
refactoring
deployed
16h 15m
9m 3s
Refactor: TimelineTabs `Tab variant=solid` branch from `.is-active` to `data-active=true`, drop the dual CSS selector
refactoring
deployed
16h 2m
10m 1s
Fix: statusCounts.open misses items in `addressed` state — All ≠ Open + Resolved + Drift
bug
deployed
13h 31m
30m 28s
Route ItemCardIssueBody and ItemCardCommentBody through ItemCardThreadView
new-feature
deployed
12h 58m
9m 11s
Refactor: DS reference §13 ItemCard examples — match post-0173 head, evidence-inline, collapse, and QuestionThread bubbles
refactoring
deployed
12h 47m
15m 17s
Refactor: cycle-time line chart Y-axis cap from fixed 60m to a percentile-based / auto-ranged cap
refactoring
deployed
1d 12h
16m 41s
Tests: pixel + structural fidelity check of the live dashboard against `dashboard-redesign-v3-horizontal.html`
test
deployed
1d 12h
9m 24s
Fix: authoring funnel DRAFTS bucket only counts promoted drafts, omits current backlog under `specs/drafts/`
bug
deployed
1d 11h
21m
Fix: bootstrap timeline shows `—` for completed-stage durations after `/api/data` refresh
bug
deployed
1d 11h
8m 4s
Fix: All-Runs reports `running` for runs that died days ago — add time-based liveness check + new `abandoned` lifecycle status
bug
deployed
1d 11h
18m 52s
Fix: Consumption card V2 anatomy — split combined Total tokens bar into total-input + total-output, add output totals block, move cache-savings line
bug
deployed
1d 10h
23m 44s
Fix: critique-card body redundancies (verdict row, bottom anchor, seen-row) + mandate side-by-side parity grid in PR
bug
deployed
1d 10h
9m 23s
Fix: three-section Input panel reveals empty body — drop inner per-piece default-collapse so one click on outer chevron shows content
bug
deployed
1d 10h
7m 57s
Dashboard redesign v3 — horizontal hero + timeline, full-width counter row, populated Metrics tab, pagination, pastel chart palette, light default
new-feature
deployed
21h 40m
38m
Login screen v2 — animated hero, theme toggle continuity, looping chatter
new-feature
deployed
1d 9h
22m 26s
Summary tab v2 — celebratory close-out with verdict, stats, critique outcomes, and markdown download
new-feature
deployed
1d 9h
21m 27s
Fix: /api/data subrequest-limit blowup (502) + drop dashboard poll from 15s to 5s
bug
deployed
20h 28m
11m 52s
Drain deferrals — pulse-info dot, .tab-group-solid rename, segment counts, item-card head/lifecycle/sources/collapse, upstream [object Object] fix
new-feature
deployed
1d
0s
Fix: critique cards render literal ** markdown and re-show cryptic compound IDs
bug
deployed
1d 8h
11m 25s
Fix: Agent Input sub-tab renders two narrow cards causing horizontal scroll in split-pane modals
bug
deployed
1d 8h
8h 14m
/canvas workshop skill — registry-driven pane iteration sandbox with verbatim live + DS snapshots
new-feature
deployed
23h 58m
28m 20s
Dashboard redesign v2 — condensed callouts, tabs, light/dark themes, History total-elapsed banner
new-feature
deployed
22h
50m
Critique pane — item-card frame + head rebuild + expanded lifecycle view + source attribution + affordances
new-feature
deployed
20h 56m
8m
Critique pane — bar2 segmented controls + bar1 drift chip + phase-tab + kind-cluster DS catch-up
new-feature
deployed
20h 45m
10m
Timeline pane — System + Error chip primitives + brief-card refactor + live-state agent-strip wiring + turn-render data-layer fix
new-feature
deployed
20h 31m
16m
Timeline pane — chip polish (identity / activity / category bubble / cost) + light-mode token drift fix
new-feature
deployed
20h 13m
13m 30s
Timeline pane — M3 card chrome + phase header simplification + narrow-view strip equalisation
new-feature
deployed
19h 57m
2h 42m
Deferred-spec subagent in /dev-next — auto-capture in-flight deferrals as queued specs or drafts
new-feature
deployed
14h 7m
6m 8s
/spec-queue auto-decomposition — conservative bundle-by-default, auto-chained sub-specs
new-feature
deployed
13h 55m
5m 19s
Dashboard live-ness — cycle-started anchor, browser auto-refresh, live elapsed ticker, deploy-pages cleanup
new-feature
deployed
13h 45m
18m 23s
Spec workflow hardening — design-system steering at spec time, run-queue-until-empty skill, project CLAUDE.md
new-feature
deployed
13h 6m
9m
Dashboard redesign — design-system primitives, expandable in-flight hero with stage timeline, activity feed
new-feature
deployed
12h 13m
20m 43s
Spec lifecycle system v1 — typed drafts, dev queue, dashboard, session naming
new-feature
deployed
10h 50m
10h 50m
Design-system parity for critique surface + canonical Agent Input grouping + run-ID copy affordance
bug
deployed
7h 16m
—
Canonical prompt-pieces registry + per-attachment token tracking — protocol, persistence, and consumption surfaces
—
ready
—
—
Sources & provenance — investigation outcome and per-critique-card surface (incl. Phase 4 Issue/Comment patches)
—
ready
—
—
Phase 4 deadlock — draft extractor body retention + escape-valve breadth for terminal-ledger AGREED
—
ready
—
—
Active-agent gradient pulse · uniform critique card heights · run-id chip in header row 2
—
ready
—
—
Substantive-convergence escape valve — canonical-promote when both AGREED with terminal ledger but artifact hashes drift
—
ready
—
—
Unify run-status derivation — single source of truth across the All-Runs list, the run-detail page, and the orchestrator's exit-code emission
—
ready
—
—
Phase 0 round cards + side-by-side critique modal — close the round-by-round visibility gap for the brief negotiation
—
ready
—
—
Run-detail surface rework — agent chips into Timeline pane, narrow critique compaction, M3 segmented phase progress, timeline card chip slim-down
—
in-flight
—
—
CSS finalization + v1 token block removal + IBM Plex removal (close the migration arc)
—
proposed
—
—
Remaining JSX (app / errors / compare / auth / search / shared) v1 → v2 token migration
—
proposed
—
—
Onboarding tour visual rewrite + unified Admin/Settings + server-persisted onboarding state
—
proposed
—
—
Critique filter header height parity + responsive compaction + timeline card right-alignment
—
in-flight
—
—
Deep Research consumption & cost tracking — collapsed/unfolded redesign, canonical piece aggregation
—
implemented
—
—
Critique cards — bucket correctness, expanded-card scroll, badge cleanup, height parity with Timeline
—
merged
—
—
M3 chrome + run-detail header JSX wiring (plus restore live agent strip dropped by spec 0099)
—
proposed
—
—
Loading + States + A11y + Light + Responsive verification sweep (cross-cutting polish + verification of all earlier specs in both themes and three breakpoints)
—
proposed
—
—
Onboarding tour overlay + ProgressSegs admin (8-step tour over the live app via data-tour-anchor attributes; no redraw of underlying surfaces; ProgressSegs 8-segment per-user track)
—
proposed
—
—
How It Works overlay + right-side menu + Changelog (full-screen M3 dialog · sticky right menu · How It Works ↔ Changelog toggle · 9 collapsible sub-sections with inline diagrams)
—
proposed
—
—
Consumption pane full rework — collapsed + unfolded with sub-rows + uniform width across phases with round chip above card + sticky bottom legend
—
proposed
—
—
Timeline pane M3 rework — header chrome + vertical phase rail outside column anchored to header centers + tl-turn variants + single dashed top border on unfold + REPAIR row variant with explainer
—
proposed
—
—
Critique pane M3 rework — Bar 1 (title · phase tabs · totals · drift chip) + Bar 2 (kind tabs · agent · status filters) + collapsible status-grouped sections + Σ Summary state + phase-header sizing taller than card headers
—
proposed
—
—
QuestionThread + unified item-card family (Q · D · I · C cards with one anatomy — who → when → what → quote, six-word verdict vocabulary, tonal bubble + quote inside bubble, dashed footer)
—
proposed
—
—
M3 modal primitive (basic 560 dp + rich 1080 dp, shape-xl, elevation-3, agent-tinted left border)
—
proposed
—
—
M3 tabs + top app bar + chrome compaction (run-list 3→2 headers, top-bar layout when viewing a run)
—
proposed
—
—
M3 cards + AgentStrip + badge inventory + hover elevation-2 rule + AgentStrip badge sizing/symmetry
—
proposed
—
—
M3 atoms — buttons (5 variants) + FAB + icon button + chips (4 kinds) + status pills (canonical OK) + switches + segmented buttons
—
proposed
—
—
Material 3 token & foundation layer (palette, type, shape, elevation, state, motion, density, fonts, icons, base)
—
proposed
—
—
Parser robustness for cross-round Q/A/issue linkage + code-fence awareness (data-integrity overhaul)
—
proposed
—
—
Convergence escape hatches for stuck-AGREED loops (canonical-FSD synthesis, stuck-AGREED escape valve, hard ledger feedback)
—
proposed
—
—
Consumption tab rework — phase headers above rows, single card per agent, no duplicate compact bar
—
in-review
—
—
Agent Input panel — system-prompt fallback, structural hierarchy, modal vertical space
—
in-review
—
—
Hosted UI modal-load perf — warm machine, gzip, immutable cache headers, server-side LRU
—
merged
—
—
Consumption tab agent-card restructure — equal-height, data-top-bars-bottom, wider bars
—
merged
—
—
Run-detail header — agent strip equalization, phase-tab info hierarchy, remove blocking banner
—
merged
—
—
Modal primitive (single + split) + sub-tabs + RoundScrubber + provider-symmetric SourceCard
—
merged
—
—
Timeline + critique restructure (PhaseRail + ChipCluster + 3-axis filter + DriftCluster + Summary panel + CardHeadline migration)
—
merged
—
—
Primitive vocabulary — Button, StatusBadge, Chip, RunIDChip, ThemeToggle, Card, AgentStrip + today's-component migration
—
merged
—
—
Critique data integrity — parser coverage, badge wiring, modal load paths, count reconciliation
—
merged
—
—
Critique rework — Phase 4 answer linkage fix, compact cards, summary tab, timeline-pane re-alignment
—
merged
—
—
Cost-pipeline integrity — preserve metrics on resume, price the cache tier that was actually used, fold tool spend into the headline
—
merged
—
—
Web search audit foundation + protocol parser fixes + resume hardening + --notion repeatable
—
merged
—
—
Critique navigation — first-class Q+D, side-by-side rework, sentiment cards, click-to-highlight
—
merged
—
—
Consumption-tab follow-ups — tier-lookup window, click-to-expand bars, per-phase web-search count
—
merged
—
—
Timeline UX pass — inline unfold, per-input segments, real context windows, parser repairs
—
merged
—
—
Visualisation foundations — modal pattern, summary cards, preflight tabs, attachment ingest
—
merged
—
—
Live data fidelity — round counts, disagreement parsing, terminal-status pills, chip math
—
merged
—
—
⏱
Cycle time WoW
—
Needs ≥ 1 deployed cycle in each of the last two ISO weeks.
⏱
Where time goes
44% in deploy
Mean deploy = 5m 16s. Next: Implement 4m 48s, Tests 41s.
!
Reconcile drift
1 of 10
10% needed mechanical patches before code landed.
Cycle time over last 22 cycles
Each point is one deployed spec. Dashed = rolling-10 mean. Outliers > 45m (1) clipped to the top.
Cycle timeRolling-10 mean
Where time goes (mean stage durations)
Stacked, last 10 deployed cycles · total mean = 11m 52s
Pre-flightRead & planReconcileImplementTestsPR + mergeDeploy
Throughput · specs deployed per week
last 8 ISO weeks · current week in purple
By type · last 30 days
49 specs total · mean cycle by category
new-feature
24 · 53m 23s
bug
17 · 43m 24s
refactoring
6 · 11m 52s
test
2 · 11m 36s
Success rate · last 30 days
deployed vs failed cycles
Deployed · 49Failed · 0
Spec authoring funnel
how ideas reach deploy
Last 30 days · 0 drafts in backlog + 8 promoted · 100% reached queue · 109% of queued shipped