Login screen v2 — animated hero, theme toggle continuity, looping chatter
new-feature
queued
TBD
Summary tab v2 — celebratory close-out with verdict, stats, critique outcomes, and markdown download
new-feature
queued
TBD
Fix: /api/data subrequest-limit blowup (502) + drop dashboard poll from 15s to 5s
bug
deployed
1.30.1
Drain deferrals — pulse-info dot, .tab-group-solid rename, segment counts, item-card head/lifecycle/sources/collapse, upstream [object Object] fix
new-feature
queued
TBD
Fix: critique cards render literal ** markdown and re-show cryptic compound IDs
bug
queued
TBD
Fix: Agent Input sub-tab renders two narrow cards causing horizontal scroll in split-pane modals
bug
queued
TBD
/canvas workshop skill — registry-driven pane iteration sandbox with verbatim live + DS snapshots
new-feature
queued
TBD
Dashboard redesign v2 — condensed callouts, tabs, light/dark themes, History total-elapsed banner
new-feature
deployed
1.30.0
Critique pane — item-card frame + head rebuild + expanded lifecycle view + source attribution + affordances
new-feature
deployed
1.29.0
Critique pane — bar2 segmented controls + bar1 drift chip + phase-tab + kind-cluster DS catch-up
new-feature
deployed
1.28.0
Timeline pane — System + Error chip primitives + brief-card refactor + live-state agent-strip wiring + turn-render data-layer fix
new-feature
deployed
1.27.0
Timeline pane — chip polish (identity / activity / category bubble / cost) + light-mode token drift fix
new-feature
deployed
1.26.0
Timeline pane — M3 card chrome + phase header simplification + narrow-view strip equalisation
new-feature
deployed
1.25.0
Push /dev-next events to main during feature-branch phase
new-feature
deployed
1.24.0
Fix: post-deploy sweep for safe_to_destroy blue machines
bug
deployed
1.23.2
Tests: JS test stack for Pages Function and dashboard-bootstrap.js
test
deployed
1.23.1
Dashboard live data via Cloudflare Pages Function
new-feature
deployed
1.23.0
Fix: Fly machines-API mid-rolling-deploy timeout (9-in-a-row)
bug
deployed
1.22.1
Deferred-spec subagent in /dev-next — auto-capture in-flight deferrals as queued specs or drafts
new-feature
deployed
1.22.0
/spec-queue auto-decomposition — conservative bundle-by-default, auto-chained sub-specs
new-feature
deployed
1.21.0
Dashboard live-ness — cycle-started anchor, browser auto-refresh, live elapsed ticker, deploy-pages cleanup
new-feature
deployed
1.20.0
Fix: lifecycle skills' session-title stamping is unimplemented
bug
deployed
1.19.1
Spec workflow hardening — design-system steering at spec time, run-queue-until-empty skill, project CLAUDE.md
new-feature
deployed
1.19.0
Dashboard redesign — design-system primitives, expandable in-flight hero with stage timeline, activity feed
new-feature
deployed
1.18.0
Spec lifecycle system v1 — typed drafts, dev queue, dashboard, session naming
new-feature
deployed
1.17.0
Design-system parity for critique surface + canonical Agent Input grouping + run-ID copy affordance
bug
deployed
1.16.0
Legacy-shim sunset + historical input-bundle backfill
refactoring
deployed
1.15.0
Post-batch cleanup + Anthropic cache engagement + protocol follow-ups
new-feature
deployed
1.14.0
Consumption-card backend follow-ups + protocol-violation UI surface
—
ready
—
Phase 0 critique section grouping + live timeline rendering determinism
—
ready
—
Consumption card visual rework — CcxCard M3 polish + spec-preview rendering
—
ready
—
Canonical prompt-pieces registry + per-attachment token tracking — protocol, persistence, and consumption surfaces
—
ready
—
Sources & provenance — investigation outcome and per-critique-card surface (incl. Phase 4 Issue/Comment patches)
—
ready
—
Cost & token attribution correction + run-detail header total-cost copy affordance
—
ready
—
Prompt capture for full-view modals — Initial Brief persistence and modal hydration
—
ready
—
Critique aggregation invariants and resolved-view integrity
—
ready
—
Phase 4 deadlock — draft extractor body retention + escape-valve breadth for terminal-ledger AGREED
—
ready
—
Active-agent gradient pulse · uniform critique card heights · run-id chip in header row 2
—
ready
—
Substantive-convergence escape valve — canonical-promote when both AGREED with terminal ledger but artifact hashes drift
—
ready
—
Unify run-status derivation — single source of truth across the All-Runs list, the run-detail page, and the orchestrator's exit-code emission
—
ready
—
Phase 0 round cards + side-by-side critique modal — close the round-by-round visibility gap for the brief negotiation
—
ready
—
Migrate hardcoded JSX fontWeight numbers to --md-w-* tokens (close the bypass gap)
—
proposed
—
Diagram skill v2.0.0 — mode-aware split + Material mode + how-it-works regen
—
proposed
—
Run-detail surface rework — agent chips into Timeline pane, narrow critique compaction, M3 segmented phase progress, timeline card chip slim-down
—
in-flight
—
Design-system pollution scrub (post-arc cleanup)
—
proposed
—
CSS finalization + v1 token block removal + IBM Plex removal (close the migration arc)
—
proposed
—
Remaining JSX (app / errors / compare / auth / search / shared) v1 → v2 token migration
—
proposed
—
design-language.jsx v1 → v2 token migration
—
accepted
—
run-detail.jsx v1 → v2 token migration
—
proposed
—
Design system v2 canonicalization — promote v2 to single source of truth, archive v1
—
proposed
—
Changelog — spec rendered in a full-view modal + working screenshots (or none)
—
proposed
—
Onboarding tour visual rewrite + unified Admin/Settings + server-persisted onboarding state
—
proposed
—
Critique filter header height parity + responsive compaction + timeline card right-alignment
—
in-flight
—
How-It-Works as a full-page route + click-to-enlarge SVG viewer
—
proposed
—
Persist item lifecycle events to the transcript + backfill historical runs
—
proposed
—
How-It-Works overlay + Changelog tab — full content & component rewrite
—
proposed
—
Turn-modal items panel — provider badge, segment labelling, Claims panel removal
—
implemented
—
Badge governance — unified chip primitive, vocabulary, and surface-level rollout
—
proposed
—
Deep Research consumption & cost tracking — collapsed/unfolded redesign, canonical piece aggregation
—
implemented
—
Deep Research artifact naming, how-it-works diagrams, and timeline full-view
—
proposed
—
Turn / Cross-review modal cleanup + Timeline-Critique pane visual harmonisation
—
merged
—
Deep Research UI — timeline, critique cards with sources, appendix, validate-run CLI
—
proposed
—
Deep Research protocol — canonical methodology, lifecycle, and prompts
—
proposed
—
Full-view modals — fixed 92vh height + remove the 360px accordion-body cap
—
merged
—
Agent strip — stop wrapping the model id and activity label when the box is tight
—
merged
—
Critique cards — bucket correctness, expanded-card scroll, badge cleanup, height parity with Timeline
—
merged
—
Modal sizing + phase rail anchoring + button contrast + critique card layout
—
proposed
—
Modal chrome + overlay JSX M3 token migration
—
proposed
—
Run-list page M3 token migration
—
proposed
—
Timeline phase/turn card treatment + RunDetailHeader children M3 token migration
—
proposed
—
M3 chrome + run-detail header JSX wiring (plus restore live agent strip dropped by spec 0099)
—
proposed
—
Loading + States + A11y + Light + Responsive verification sweep (cross-cutting polish + verification of all earlier specs in both themes and three breakpoints)
—
proposed
—
Onboarding tour overlay + ProgressSegs admin (8-step tour over the live app via data-tour-anchor attributes; no redraw of underlying surfaces; ProgressSegs 8-segment per-user track)
—
proposed
—
How It Works overlay + right-side menu + Changelog (full-screen M3 dialog · sticky right menu · How It Works ↔ Changelog toggle · 9 collapsible sub-sections with inline diagrams)
—
proposed
—
Agent input panel + PhaseRail + RoundScrubber M3 anatomy sweep
—
proposed
—
Consumption pane full rework — collapsed + unfolded with sub-rows + uniform width across phases with round chip above card + sticky bottom legend
—
proposed
—
Timeline pane M3 rework — header chrome + vertical phase rail outside column anchored to header centers + tl-turn variants + single dashed top border on unfold + REPAIR row variant with explainer
—
proposed
—
Critique pane M3 rework — Bar 1 (title · phase tabs · totals · drift chip) + Bar 2 (kind tabs · agent · status filters) + collapsible status-grouped sections + Σ Summary state + phase-header sizing taller than card headers
—
proposed
—
QuestionThread + unified item-card family (Q · D · I · C cards with one anatomy — who → when → what → quote, six-word verdict vocabulary, tonal bubble + quote inside bubble, dashed footer)
—
proposed
—
M3 modal primitive (basic 560 dp + rich 1080 dp, shape-xl, elevation-3, agent-tinted left border)
—
proposed
—
M3 tabs + top app bar + chrome compaction (run-list 3→2 headers, top-bar layout when viewing a run)
—
proposed
—
M3 cards + AgentStrip + badge inventory + hover elevation-2 rule + AgentStrip badge sizing/symmetry
—
proposed
—
M3 atoms — buttons (5 variants) + FAB + icon button + chips (4 kinds) + status pills (canonical OK) + switches + segmented buttons
—
proposed
—
Material 3 token & foundation layer (palette, type, shape, elevation, state, motion, density, fonts, icons, base)
—
proposed
—
Phase 4 drafter-engagement gate (close the round-1 sycophantic-APPROVED loophole)
—
proposed
—
Parser robustness for cross-round Q/A/issue linkage + code-fence awareness (data-integrity overhaul)
—
proposed
—
Convergence escape hatches for stuck-AGREED loops (canonical-FSD synthesis, stuck-AGREED escape valve, hard ledger feedback)
—
proposed
—
Stop hiding Phase 3 / Phase 4 timeline rows on errored & deadlocked runs
—
proposed
—
Cross-cutting polish — every remaining gap from the 2026-05-18 tweak-cycle audit
—
in-review
—
Consumption tab rework — phase headers above rows, single card per agent, no duplicate compact bar
—
in-review
—
Agent Input panel — system-prompt fallback, structural hierarchy, modal vertical space
—
in-review
—
Unified LoadingState primitive — one delightful loading visual everywhere
—
merged
—
Robust run-list loading state + spinner visual + hide 5455
—
merged
—
Run-list loading state + server-side hidden-runs filter
—
merged
—
Cache /api/runs/{id} snapshot by (run_id, latest_event_seq)
—
merged
—
Hotfix: disable Fly auto_stop_machines to stop the proxy flap
—
merged
—
Hosted UI modal-load perf — warm machine, gzip, immutable cache headers, server-side LRU
—
merged
—
Fly VM memory bump to fix UI-server OOM-on-boot
—
merged
—
Hotfix: run-detail.jsx parse error (white-screen regression)
—
merged
—
Stable-worktree helper — isolate CLI runs from active orchestrator/feature-branch work
—
merged
—
Consumption tab agent-card restructure — equal-height, data-top-bars-bottom, wider bars
—
merged
—
Agent Input tab rework — rename, reorder, structural restructure
—
merged
—
Question/Disagreement render unification + markdown rendering fix
—
merged
—
Critique pane structural — filter strip, Phase 4 split, summary copy
—
merged
—
Timeline structural pass — PhaseRail, phase headers, card size, collapsibility
—
merged
—
Run-detail header — agent strip equalization, phase-tab info hierarchy, remove blocking banner
—
merged
—
Run-list & chrome polish (status pills, top bar, top tabs, right cluster)
—
merged
—
Brand-icon system + Design-page DNA reskin
—
merged
—
Chip vocabulary + code-cluster expansion
—
merged
—
Onboarding (3-screen first-time flow) + landing demo capsule
—
merged
—
Cross-run dashboards — /compare (two-run side-by-side) + /search (cross-run query)
—
merged
—
Keyboard contract + shortcuts overlay + search palette
—
merged
—
Modal primitive (single + split) + sub-tabs + RoundScrubber + provider-symmetric SourceCard
—
merged
—
Timeline + critique restructure (PhaseRail + ChipCluster + 3-axis filter + DriftCluster + Summary panel + CardHeadline migration)
—
merged
—
Run detail header + chrome restructure + ActiveRunChip
—
merged
—
Run list -- sort + attention promotion + filter Tabs + URL state + /-bound search
—
merged
—
QuestionThread + QuestionRef + AP-01 enforcement
—
merged
—
Tab system (3 variants) + table header
—
merged
—
Primitive vocabulary — Button, StatusBadge, Chip, RunIDChip, ThemeToggle, Card, AgentStrip + today's-component migration
—
merged
—
Consumption tab — content-vs-billing split + output bar + cross-turn lineage
—
proposed
—
Design-system foundation — tokens, base, a11y, MDI icons, emoji removal
—
merged
—
Reconcile-costs reads run-cost data from Supabase (re-enable daily cron)
—
merged
—
Always-on cost verification against provider invoices + pricing-version snapshot
—
merged
—
Run-detail resilience + Phase 4 sibling-key separation for repair turns
—
merged
—
Critique panel + Summary tab + Consumption tab rework + design unification
—
merged
—
Full-view shell standardisation + model pill layout
—
merged
—
Turn-input semantics + per-turn badges + side-by-side framing
—
merged
—
Cross-round ledger + standing-items input + conservative convergence
—
merged
—
Critique data integrity — parser coverage, badge wiring, modal load paths, count reconciliation
—
merged
—
Critique classification + load-time resilience + sentiment paragraph + tighter cards
—
merged
—
Critique rework — Phase 4 answer linkage fix, compact cards, summary tab, timeline-pane re-alignment
—
merged
—
Cost-pipeline integrity — preserve metrics on resume, price the cache tier that was actually used, fold tool spend into the headline
—
merged
—
Web search audit UI + agent-pill alignment fix
—
merged
—
Web search audit foundation + protocol parser fixes + resume hardening + --notion repeatable
—
merged
—
Consumption rework + header-placement fix + app-version chip
—
merged
—
Critique navigation — first-class Q+D, side-by-side rework, sentiment cards, click-to-highlight
—
merged
—
Inputs foundation — universal Input view, Phase 0 split, two-row live header
—
merged
—
Phase-2 hash-drift escape, P2 summaries, live-push flag, dual-research-run skill
—
merged
—
Consumption-tab follow-ups — tier-lookup window, click-to-expand bars, per-phase web-search count
—
merged
—
Timeline UX pass — inline unfold, per-input segments, real context windows, parser repairs
—
merged
—
Token-consumption tab — per-turn context-window visualisation
—
merged
—
Cross-review inline comments — Phase 4 side-by-side modal
—
merged
—
Negotiate inline comments — side-by-side modal with anchored critique cards
—
merged
—
How-it-works restructure — chat-lifecycle diagram, phase accordions, v3.5 process map
—
merged
—
Visualisation foundations — modal pattern, summary cards, preflight tabs, attachment ingest
—
merged
—
Run-detail header pass 2 + compact theme toggle
—
merged
—
Compact run-detail header, "How it works" page, and release notes
—
merged
—
Admin allowlist UI, profile menu, and landing-page redesign
—
merged
—
Google OAuth + email allowlist via Supabase Auth
—
merged
—
Fly.io deployment of the UI server with Supabase-backed aggregator
—
merged
—
Supabase schema + `--push` CLI for hosted-deployment track
—
merged
—
Hosted deployment kickoff handoff package
—
merged
—
Render the last round in deadlocked / errored phase-2 timelines
—
merged
—
Live data fidelity — round counts, disagreement parsing, terminal-status pills, chip math
—
merged
—
Integration kickoff handoff package
—
merged
—
Clearer card stats — Phase 1 badges and explicit labels
—
merged
—
Run-id pill and timeline card stats
—
merged
—
UI polish and navigation
—
merged
—
UI bundle integration
—
merged
—
UI HTTP server with SSE
—
merged
—
UI run aggregator
—
merged
—
Frontend handoff package
—
merged
—
Rate-limit-aware retry + resume from prior session
—
merged
—
Prompt caching to unblock prod-tier rate limit and cut multi-round cost
—
merged
—
Web search wiring + prod-tier full-convergence E2E
—
merged
—
Phases 3 + 4 — drafting, review, and final document emission
—
merged
—
Phase 2 — plan negotiation with caps, repair, and drafter tiebreak
—
merged
—
Orchestrator scaffold + Phase 0/1 end-to-end
—
merged
—
Engineering workflow — specs, branches, PRs, semver
—
merged
—