dual-research · spec dashboard
read-only view of specs/, handoffs/ and dashboard/events/ at main
v1.18.0 liveupdated repo ↗
pause_circle
Queue · idle
Nothing in flight. Run /dev-next in your queue session to start the next spec.
0 in flight0 queuedlast shipped 4m 42s ago
4m 42s ago
last deploy · 0153
Drafts
4
Queued
0
In progress
0
Merged today
5
Deployed (all)
5
Throughput & cycle time
Avg cycle time
5h 35m
rolling 2
Throughput
5 / day
last 24h · 5 / week trend
Reconcile patches
0 of 1
0% needed handoff drift fix
Failed cycles
0
last 30 days
Queue
/dev-next picks position 1 first
#
Spec
Title
Type
Waiting
Queue is empty. Promote a draft with /spec-promote <id> or queue a new one with /spec-queue.
Recent activity
last 24 hours
12:14:13 UTC
bookmark
handoff written0153 · handoffs/2026-05-22-spec-0153-dashboard-redesign-staged-hero-and-activity-feed.md
12:13:38 UTC
check_circle
deployed0153 · Dashboard redesign — design-system primitives, expandable in-flight hero with stage timeline, activity feed · v1.18.0
12:04:34 UTC
merge
merged0153 · PR #176 · admin squash
12:03:23 UTC
rule
implement0153 · 1 commits
12:03:23 UTC
task_alt
tests green0153 · 1474 passed
11:53:17 UTC
flag_circle
branched0153 · branch spec/0153-dashboard-redesign-staged-hero-and-activity-feed
11:53:02 UTC
flag_circle
pre-flight0153 · pre-flight clean
11:53:02 UTC
flag_circle
read handoff0153 · prior handoff read
11:53:02 UTC
flag_circle
read spec0153 · spec parsed
11:53:02 UTC
rule
reconcile0153 · clean · 0 patches
11:53:02 UTC
flag_circle
in progress0153 · Dashboard redesign — design-system primitives, expandable in-flight hero with stage timeline, activity feed
11:44:43 UTC
add_task
queued0153 · Dashboard redesign — design-system primitives, expandable in-flight hero with stage timeline, activity feed · position 1
10:50:17 UTC
merge
merged0152 · PR #175 · admin squash
10:50:17 UTC
check_circle
deployed0152 · Spec lifecycle system v1 — typed drafts, dev queue, dashboard, session naming · v1.17.0
10:50:17 UTC
bookmark
handoff written0152 · handoffs/2026-05-22-spec-0152-spec-lifecycle-system-v1.md
00:00:00 UTC
add_task
queued0152 · Spec lifecycle system v1 — typed drafts, dev queue, dashboard, session naming
00:00:00 UTC
flag_circle
in progress0152 · Spec lifecycle system v1 — typed drafts, dev queue, dashboard, session naming
Drafts
backlog · promote with /spec-promote <id>
All specs
5 shipped · 0 queued · 0 in flight
Spec
Title
Type
Status
Version
Dashboard redesign — design-system primitives, expandable in-flight hero with stage timeline, activity feed
new-feature
deployed
1.18.0
Spec lifecycle system v1 — typed drafts, dev queue, dashboard, session naming
new-feature
deployed
1.17.0
Design-system parity for critique surface + canonical Agent Input grouping + run-ID copy affordance
bug
deployed
1.16.0
Legacy-shim sunset + historical input-bundle backfill
refactoring
deployed
1.15.0
Post-batch cleanup + Anthropic cache engagement + protocol follow-ups
new-feature
deployed
1.14.0
Consumption-card backend follow-ups + protocol-violation UI surface
ready
Phase 0 critique section grouping + live timeline rendering determinism
ready
Consumption card visual rework — CcxCard M3 polish + spec-preview rendering
ready
Canonical prompt-pieces registry + per-attachment token tracking — protocol, persistence, and consumption surfaces
ready
Sources & provenance — investigation outcome and per-critique-card surface (incl. Phase 4 Issue/Comment patches)
ready
Cost & token attribution correction + run-detail header total-cost copy affordance
ready
Prompt capture for full-view modals — Initial Brief persistence and modal hydration
ready
Critique aggregation invariants and resolved-view integrity
ready
Phase 4 deadlock — draft extractor body retention + escape-valve breadth for terminal-ledger AGREED
ready
Active-agent gradient pulse · uniform critique card heights · run-id chip in header row 2
ready
Substantive-convergence escape valve — canonical-promote when both AGREED with terminal ledger but artifact hashes drift
ready
Unify run-status derivation — single source of truth across the All-Runs list, the run-detail page, and the orchestrator's exit-code emission
ready
Phase 0 round cards + side-by-side critique modal — close the round-by-round visibility gap for the brief negotiation
ready
Migrate hardcoded JSX fontWeight numbers to --md-w-* tokens (close the bypass gap)
proposed
Diagram skill v2.0.0 — mode-aware split + Material mode + how-it-works regen
proposed
Run-detail surface rework — agent chips into Timeline pane, narrow critique compaction, M3 segmented phase progress, timeline card chip slim-down
in-flight
Design-system pollution scrub (post-arc cleanup)
proposed
CSS finalization + v1 token block removal + IBM Plex removal (close the migration arc)
proposed
Remaining JSX (app / errors / compare / auth / search / shared) v1 → v2 token migration
proposed
design-language.jsx v1 → v2 token migration
accepted
run-detail.jsx v1 → v2 token migration
proposed
Design system v2 canonicalization — promote v2 to single source of truth, archive v1
proposed
Changelog — spec rendered in a full-view modal + working screenshots (or none)
proposed
Onboarding tour visual rewrite + unified Admin/Settings + server-persisted onboarding state
proposed
Critique filter header height parity + responsive compaction + timeline card right-alignment
in-flight
How-It-Works as a full-page route + click-to-enlarge SVG viewer
proposed
Persist item lifecycle events to the transcript + backfill historical runs
proposed
How-It-Works overlay + Changelog tab — full content & component rewrite
proposed
Turn-modal items panel — provider badge, segment labelling, Claims panel removal
implemented
Badge governance — unified chip primitive, vocabulary, and surface-level rollout
proposed
Deep Research consumption & cost tracking — collapsed/unfolded redesign, canonical piece aggregation
implemented
Deep Research artifact naming, how-it-works diagrams, and timeline full-view
proposed
Turn / Cross-review modal cleanup + Timeline-Critique pane visual harmonisation
merged
Deep Research UI — timeline, critique cards with sources, appendix, validate-run CLI
proposed
Deep Research protocol — canonical methodology, lifecycle, and prompts
proposed
Full-view modals — fixed 92vh height + remove the 360px accordion-body cap
merged
Agent strip — stop wrapping the model id and activity label when the box is tight
merged
Critique cards — bucket correctness, expanded-card scroll, badge cleanup, height parity with Timeline
merged
Modal sizing + phase rail anchoring + button contrast + critique card layout
proposed
Modal chrome + overlay JSX M3 token migration
proposed
Run-list page M3 token migration
proposed
Timeline phase/turn card treatment + RunDetailHeader children M3 token migration
proposed
M3 chrome + run-detail header JSX wiring (plus restore live agent strip dropped by spec 0099)
proposed
Loading + States + A11y + Light + Responsive verification sweep (cross-cutting polish + verification of all earlier specs in both themes and three breakpoints)
proposed
Onboarding tour overlay + ProgressSegs admin (8-step tour over the live app via data-tour-anchor attributes; no redraw of underlying surfaces; ProgressSegs 8-segment per-user track)
proposed
How It Works overlay + right-side menu + Changelog (full-screen M3 dialog · sticky right menu · How It Works ↔ Changelog toggle · 9 collapsible sub-sections with inline diagrams)
proposed
Agent input panel + PhaseRail + RoundScrubber M3 anatomy sweep
proposed
Consumption pane full rework — collapsed + unfolded with sub-rows + uniform width across phases with round chip above card + sticky bottom legend
proposed
Timeline pane M3 rework — header chrome + vertical phase rail outside column anchored to header centers + tl-turn variants + single dashed top border on unfold + REPAIR row variant with explainer
proposed
Critique pane M3 rework — Bar 1 (title · phase tabs · totals · drift chip) + Bar 2 (kind tabs · agent · status filters) + collapsible status-grouped sections + Σ Summary state + phase-header sizing taller than card headers
proposed
QuestionThread + unified item-card family (Q · D · I · C cards with one anatomy — who → when → what → quote, six-word verdict vocabulary, tonal bubble + quote inside bubble, dashed footer)
proposed
M3 modal primitive (basic 560 dp + rich 1080 dp, shape-xl, elevation-3, agent-tinted left border)
proposed
M3 tabs + top app bar + chrome compaction (run-list 3→2 headers, top-bar layout when viewing a run)
proposed
M3 cards + AgentStrip + badge inventory + hover elevation-2 rule + AgentStrip badge sizing/symmetry
proposed
M3 atoms — buttons (5 variants) + FAB + icon button + chips (4 kinds) + status pills (canonical OK) + switches + segmented buttons
proposed
Material 3 token & foundation layer (palette, type, shape, elevation, state, motion, density, fonts, icons, base)
proposed
Phase 4 drafter-engagement gate (close the round-1 sycophantic-APPROVED loophole)
proposed
Parser robustness for cross-round Q/A/issue linkage + code-fence awareness (data-integrity overhaul)
proposed
Convergence escape hatches for stuck-AGREED loops (canonical-FSD synthesis, stuck-AGREED escape valve, hard ledger feedback)
proposed
Stop hiding Phase 3 / Phase 4 timeline rows on errored & deadlocked runs
proposed
Cross-cutting polish — every remaining gap from the 2026-05-18 tweak-cycle audit
in-review
Consumption tab rework — phase headers above rows, single card per agent, no duplicate compact bar
in-review
Agent Input panel — system-prompt fallback, structural hierarchy, modal vertical space
in-review
Unified LoadingState primitive — one delightful loading visual everywhere
merged
Robust run-list loading state + spinner visual + hide 5455
merged
Run-list loading state + server-side hidden-runs filter
merged
Cache /api/runs/{id} snapshot by (run_id, latest_event_seq)
merged
Hotfix: disable Fly auto_stop_machines to stop the proxy flap
merged
Hosted UI modal-load perf — warm machine, gzip, immutable cache headers, server-side LRU
merged
Fly VM memory bump to fix UI-server OOM-on-boot
merged
Hotfix: run-detail.jsx parse error (white-screen regression)
merged
Stable-worktree helper — isolate CLI runs from active orchestrator/feature-branch work
merged
Consumption tab agent-card restructure — equal-height, data-top-bars-bottom, wider bars
merged
Agent Input tab rework — rename, reorder, structural restructure
merged
Question/Disagreement render unification + markdown rendering fix
merged
Critique pane structural — filter strip, Phase 4 split, summary copy
merged
Timeline structural pass — PhaseRail, phase headers, card size, collapsibility
merged
Run-detail header — agent strip equalization, phase-tab info hierarchy, remove blocking banner
merged
Run-list & chrome polish (status pills, top bar, top tabs, right cluster)
merged
Brand-icon system + Design-page DNA reskin
merged
Chip vocabulary + code-cluster expansion
merged
Onboarding (3-screen first-time flow) + landing demo capsule
merged
Cross-run dashboards — /compare (two-run side-by-side) + /search (cross-run query)
merged
Keyboard contract + shortcuts overlay + search palette
merged
Modal primitive (single + split) + sub-tabs + RoundScrubber + provider-symmetric SourceCard
merged
Timeline + critique restructure (PhaseRail + ChipCluster + 3-axis filter + DriftCluster + Summary panel + CardHeadline migration)
merged
Run detail header + chrome restructure + ActiveRunChip
merged
Run list -- sort + attention promotion + filter Tabs + URL state + /-bound search
merged
QuestionThread + QuestionRef + AP-01 enforcement
merged
Tab system (3 variants) + table header
merged
Primitive vocabulary — Button, StatusBadge, Chip, RunIDChip, ThemeToggle, Card, AgentStrip + today's-component migration
merged
Consumption tab — content-vs-billing split + output bar + cross-turn lineage
proposed
Design-system foundation — tokens, base, a11y, MDI icons, emoji removal
merged
Reconcile-costs reads run-cost data from Supabase (re-enable daily cron)
merged
Always-on cost verification against provider invoices + pricing-version snapshot
merged
Run-detail resilience + Phase 4 sibling-key separation for repair turns
merged
Critique panel + Summary tab + Consumption tab rework + design unification
merged
Full-view shell standardisation + model pill layout
merged
Turn-input semantics + per-turn badges + side-by-side framing
merged
Cross-round ledger + standing-items input + conservative convergence
merged
Critique data integrity — parser coverage, badge wiring, modal load paths, count reconciliation
merged
Critique classification + load-time resilience + sentiment paragraph + tighter cards
merged
Critique rework — Phase 4 answer linkage fix, compact cards, summary tab, timeline-pane re-alignment
merged
Cost-pipeline integrity — preserve metrics on resume, price the cache tier that was actually used, fold tool spend into the headline
merged
Web search audit UI + agent-pill alignment fix
merged
Web search audit foundation + protocol parser fixes + resume hardening + --notion repeatable
merged
Consumption rework + header-placement fix + app-version chip
merged
Critique navigation — first-class Q+D, side-by-side rework, sentiment cards, click-to-highlight
merged
Inputs foundation — universal Input view, Phase 0 split, two-row live header
merged
Phase-2 hash-drift escape, P2 summaries, live-push flag, dual-research-run skill
merged
Consumption-tab follow-ups — tier-lookup window, click-to-expand bars, per-phase web-search count
merged
Timeline UX pass — inline unfold, per-input segments, real context windows, parser repairs
merged
Token-consumption tab — per-turn context-window visualisation
merged
Cross-review inline comments — Phase 4 side-by-side modal
merged
Negotiate inline comments — side-by-side modal with anchored critique cards
merged
How-it-works restructure — chat-lifecycle diagram, phase accordions, v3.5 process map
merged
Visualisation foundations — modal pattern, summary cards, preflight tabs, attachment ingest
merged
Run-detail header pass 2 + compact theme toggle
merged
Compact run-detail header, "How it works" page, and release notes
merged
Admin allowlist UI, profile menu, and landing-page redesign
merged
Google OAuth + email allowlist via Supabase Auth
merged
Fly.io deployment of the UI server with Supabase-backed aggregator
merged
Supabase schema + `--push` CLI for hosted-deployment track
merged
Hosted deployment kickoff handoff package
merged
Render the last round in deadlocked / errored phase-2 timelines
merged
Live data fidelity — round counts, disagreement parsing, terminal-status pills, chip math
merged
Integration kickoff handoff package
merged
Clearer card stats — Phase 1 badges and explicit labels
merged
Run-id pill and timeline card stats
merged
UI polish and navigation
merged
UI bundle integration
merged
UI HTTP server with SSE
merged
UI run aggregator
merged
Frontend handoff package
merged
Rate-limit-aware retry + resume from prior session
merged
Prompt caching to unblock prod-tier rate limit and cut multi-round cost
merged
Web search wiring + prod-tier full-convergence E2E
merged
Phases 3 + 4 — drafting, review, and final document emission
merged
Phase 2 — plan negotiation with caps, repair, and drafter tiebreak
merged
Orchestrator scaffold + Phase 0/1 end-to-end
merged
Engineering workflow — specs, branches, PRs, semver
merged