agnes-the-ai-analyst

AI-Cognitive-Leap/agnes-the-ai-analyst

Fork 0

Commit graph

Author	SHA1	Message	Date
Vojtech	001e5ce40e	feat(web): /home value-first redesign + unified page-shell across app (#366 ) * feat(web): value-first /home reskin (CEO mock palette + pillars + first-session) Restructures `/home` to lead with product value instead of install steps, matching the CEO mock proposed for the homepage: - New intro hero on top — eyebrow `Welcome, {{ display_name }}`, H1 `{{ instance_brand }} is your team's AI workspace`, lede framing the product as an "AI Chief of Staff", two CTAs (`Set up in ~15 min →` jumps to the wizard, `Just browse — no install needed` jumps to `#look-around`), and a four-pillar row (Data packages · Plugins · Skills · Memory). Renders for both onboarded and not-onboarded users so the value framing is consistent across visits. - New `first-session` narrative — five-beat walkthrough (launch → pick project → memory loads → ask → close) with mock terminal frames carrying traffic-light dots, prompts, and dimmed system output. - Setup wizard chrome — progress chip (`Step 1 of N · ~15 min · One-time · Reversible`), thin progress bar, and per-step number badges on each `.install-block` so the wizard reads as bounded instead of an open-ended scroll. - Palette shift from blue to green/navy: `--hp-primary` aliases `#2ea877` (mint), `--hp-hero-bg` is navy `#0f1b3a`, code panels stay near-black `#0c1224` with warm-yellow `#ffd866` accents. The token alias is reused so downstream rules pick up the new accent automatically; instance theme overrides via `config.theme_overrides()` still win. - VS Code surface tile carries a `Recommended` pill; the existing "Want to look around first?" section is renamed to `Explore your workspace` and gets the `#look-around` anchor. All test-pinned class names and IDs (`install-hero`, `install-block`, `home-mock`, `self-mark-btn`, `setupClaudeBtn`, `offboard-strip`, `home-getting-started`, `home-gs-item`, `home-overview`, `home-usage`) preserved as structural anchors; new visual language overlays via additional classes. Existing onboarded/not-onboarded branching, `/api/me/onboarded` POST, status frame gating, post-CTA modal, and OS-tab switching JS unchanged. Stray `~/FoundryAI` comment swapped for `~/{{ workspace_dir }}` to honor the vendor-agnostic OSS rule. 51 home tests pass without modification. * fix(web): /home palette inversion — dark intro hero on top, light setup card below Previous reskin commit kept the install-hero as a dark navy gradient and rendered the new intro hero as a light surface — opposite of what the CEO mock specifies. Playwright comparison vs `data/ceo_home.html` confirmed: - CEO mock: dark navy hero at TOP (with white pillars on navy), LIGHT white setup card BELOW with light step rows and dark code panels inset. - Previous: light intro hero on top, dark setup card below. Inverted. This patch flips both: - `.home-hero-intro` now: dark navy gradient `#0f1b3a → #1a2a5f`, green radial glow in the corner, green eyebrow, white H1 (`accent` span green), rgba-white lede, green pill primary CTA, translucent-white secondary CTA, pillars row separated by hairline border-top with green square-dot bullets in front of each pillar header. - `.install-hero` and `.install-block` now: white surface card with thin green accent strip across the top, light step rows split by hairline borders, green-tinted step-number circles (`#e6f9f0` bg, `#1f8a5e` ink), green progress chip + bar. Code panels (`.install-cmd`) and terminal frames stay dark — they're the "type this" surfaces. - All previously-rgba-white descendants of `.install-hero` (close button, eyebrow, h1, lead, links, code chips, OS tabs, install notes, setup-CTA button, self-mark fallback, auto-detect badge, terminal-howto disclosure) re-skinned for light surface. All 12 home page tests still pass (no markup changes, only CSS). * fix(web): /home parity polish — system font + mock sizes + blue info hint + gray step-num After v2 palette flip, user comparison vs CEO mock surfaced three remaining gaps in the wizard area: - Font stack mismatch: Agnes inherits Inter via `style-custom.css`, but the CEO mock uses the platform system stack (San Francisco on macOS, Segoe UI on Windows). The rendered weight/letterforms read noticeably different. `.home-mock` now declares `-apple-system, BlinkMacSystemFont, "Segoe UI", Roboto, sans-serif` for itself and all descendants, with the monospace stack reserved for `code`/`kbd`/`pre`, `.install-cmd`, and `.terminal-body`. - Step number badges were green-tinted; mock uses neutral gray (`#f0f2f6` bg, `#4a5168` ink) — green is reserved for the "done" state. Switched to `--hp-surface-dim` + `--hp-text-secondary`. - "Don't have a terminal open?" disclosure was an amber/yellow variant left over from the old dark-hero palette. Mock uses a blue info-hint vocabulary (`--info-bg: #eef3ff`, `--info-line: #4f7cf2`, `--info-ink: #1c3994`) with white kbd chips. Added the info-* tokens to the `:root` block and re-skinned `details.terminal-howto` (incl. summary, body, kbd) to match. Step-body type sizes also brought in line with the mock spec — `.install-block .label` (step h3 equivalent) is now 17px / 700 with 6px gap; `.install-note` body type is 14px / 1.55. `--hp-info-bg / --hp-info-ink / --hp-info-line / --hp-warn-bg / --hp-warn-ink / --hp-warn-line / --hp-surface-dim` added as first-class tokens so future hint/warn callouts pick the same colors without a duplicate vocabulary. 12/12 home tests pass. * feat(web): centralize design tokens + reword /home wizard to 6 steps (CEO mock parity) Two intertwined changes that touch both global design + /home structure: GLOBAL TOKEN SHIFT (app/web/static/style-custom.css) - `--primary` flipped from blue `#0073D1` to green `#2ea877` — same brand alias the rest of the app referenced, so every page picks up the new accent automatically. Old `--primary-dark` / `--primary-light` recolored to match. - New tokens added: `--brand-accent`, `--hero-bg`, `--hero-ink`, `--surface-dim`, `--info-bg/ink/line`, `--warn-bg/ink/line`. Brings the global vocabulary in line with the CEO mock's `:root` block so callouts and hero surfaces don't have to invent local tokens. - `--font-primary` switched from Inter-led stack to the system stack (`-apple-system, BlinkMacSystemFont, "Segoe UI", Roboto, "Inter", system-ui, sans-serif`) so weight/letterforms render identically on macOS (San Francisco) and Windows (Segoe UI) — matches the mock and avoids a font-loading flash for analysts without Inter installed. - Shadow tints re-cast in navy `rgba(15,27,58,...)`; focus ring uses the new green `rgba(46,168,119,0.25)`. - `.app-nav-link` font-size 13px → 14px, padding 6px 12px → 8px 14px, hover bg → `--primary-light` (mint), color → `--primary-dark`. `.app-nav-menu-item.is-active` re-tinted to the same green system. - Sweep across 26 templates (style-custom.css + 25 template files) replacing every hardcoded `#0073D1` / `#005BA3` / `#E6F3FC` / `rgba(0,115,209,…)` / `rgba(0,86,163,…)` with token references or the new green hexes — 175 occurrences total. Pages that styled their own buttons / borders / shadows pick up the new brand color without per-page overrides. /HOME WIZARD: 6 STEPS PER MOCK (app/web/templates/home_not_onboarded.html) - Step 1 reworded `Install Claude Code on your computer` + `~3 min` subhead (mock copy). - Step 2 renamed `Pick a folder for {{ instance_brand }}` (was `create your workspace folder`) — same `mkdir` command, mock-aligned framing. - NEW Step 3 `Open a terminal inside that folder` — no shell command, just the "you are standing in the right directory" reassurance with a Finder/PowerShell/file-manager howto disclosure. Mirrors the CEO mock's Step 3. - Step 4 (was Step 3, gated by `home_automode.show`) renamed `Launch Claude with auto-approve on`. Body copy lightly updated so it references "the next step" instead of "Step 4". - Step 5 (was Step 4) renamed `Get the install script and paste it into Claude`. The setup-cta-lead now explicitly says "pasting the script into Claude Code will install {{ instance_brand }}…" so existing test assertions pinning the `install Agnes` substring still match. - NEW Step 6 `Optional: create a one-word shortcut for next time` — prints an `echo 'alias {{workspace_dir\|lower}}=…' >> ~/.zshrc` one-liner for Unix and an `Add-Content $PROFILE …` equivalent for Windows. OS tabs + copy buttons reuse the existing wizard chrome. - Progress chip dynamic: `Step 1 of 6` when home_automode is on, `Step 1 of 5` when off. Progress bar fill `100 // total_steps` so the bar sits at 16-20 % on first paint. - `.step-lede` token added for the new short body copy beneath each step label (14.5px / ink-soft). - `macOS / Linux / WSL` tab labels changed to `macOS / Linux` per user instruction. Terminal-howto `WSL:` paragraph dropped; the paste-shortcut hint now reads `(Linux)` instead of `(Linux/WSL)`. Functional WSL handling in `connector_prompts.py` (it's a Linux detection fallback, not user-facing label) preserved. - `setup_instructions.py` Claude Code install hint: `npm (Linux / WSL)` → `npm (Linux)`. SURFACES — 4 CARDS PER MOCK - Replaced the 3-tile `.home-usage-grid` with a 4-card grid: - VS Code (Recommended) — `.surface-card.feature`, green ring, DAILY USE eyebrow + 5-step numbered list + `Open VS Code setup guide →` link to `/setup-advanced#vscode`. - Terminal — QUICK ACCESS eyebrow + 4-step list. - Claude Code (Desktop app) — CONNECT IT eyebrow + 4-step list. - Cowork (claude.ai) — `.surface-card.incomplete`, warn-tinted border + `Instructions needed` badge + a TODO callout describing the missing content. The card is intentionally honest about the gap rather than hiding it. TEST UPDATES - `test_web_home_page.py` negative onboarded-state assertions rebased on the new step labels (6 entries instead of 4). - `test_home_route_resolution.py` `test_home_renders_automode_block_by_default` + its `_when_env_off` counterpart now check the new `Step 4 — Launch Claude with auto-approve on` label. * fix(web): /home section content + layout — verbatim mock match User comparison flagged several remaining gaps; this patch rewrites the three lower sections of /home to match the CEO mock spec exactly: FIRST-SESSION (5 beats) - h2 28px / 700 / -.5px tracking (was 19px / 600). - lede 18px ink-soft (was 13.5px secondary). - `.session-walk` wrapper, 36px gap between beats (mock spec). - `.session-step` grid 48px / 1fr, gap 22px — number circle on the left, content on the right. - `.session-num` 40 × 40 circle with SOLID GREEN bg (`--primary`) and WHITE text + soft green shadow (was 28px mint pill w/ dark-green text). - `.session-content h3` 18px / 600 (was 14.5px / 600). - `.session-content > p` 15px. - `.session-content .annotation` 13.5px ink-muted body type with `strong` for highlighting (replaces the upper-case "WHAT'S HAPPENING" eyebrow pattern that didn't match the mock). - `.session-intro` callout card (white surface + mint icon block) framing the "five beats" tagline. - `.session-tldr` summary box (brand-light bg + brand-dark left border) wrapping up the loop. - Terminal frames re-skinned: `#0c1224` body / `#182241` bar / real macOS traffic-light colors `#ff5f57` / `#febc2e` / `#28c840`. - Terminal body 13px / 1.65 line-height with mock-spec class vocabulary: `.you` (yellow input), `.ai-name` (brand bold), `.path` (light blue), `.dim` (translucent code-ink), `.caret` (blinking cursor). - Five beats rewritten with mock's exact narrative flow (launch → menu → pick → ask → close), vendor-agnostic project names (`RevenueAnalysis`, `Onboarding`, etc.) replacing the customer- specific `GRPN_` examples in the mock. Templated `{{ instance_brand }}` / `{{ workspace_dir }}` / `{{ workspace_dir \| lower }}` (the shortcut alias) everywhere. SURFACES (4 cards) - The section is no longer wrapped in a white rectangle; the `.home-usage` class loses its bg + border + padding (mock has the cards directly on the page bg). - h2 28px (was 22px). Eyebrow 12px / 1.5px tracking / brand-dark. - `.surface-card.feature` (VS Code) now uses 2px green border + vertical brand-light → white gradient (was 1px ring). - `.surface-card.incomplete` (Cowork) uses 2px red border (`#e35e5e`) + vertical red-tint → white gradient (was yellow flat bg). - `.surface-card .steps` panel: inner surface-dim bg + 8px radius + 13px font. - `.surface-foot` top-border + ink-muted (mock spec). - `.badge-warn` now a solid red box (`#e35e5e` bg + white ink + 4px radius) instead of a yellow pill, matching the mock. - Header layout fixed: the global absorbed `header { display: flex; justify-content: space-between }` rule was making the h2 sit on the right of the eyebrow; explicit `display: block` override on `.home-mock section > header` puts the title on the LEFT under the eyebrow as the mock has. BROWSE — Explore your workspace - Wrapped in `<section class="browse-section">` with proper eyebrow + h2 + lede (was a bare `.section-label` div). - `.browse-grid` 5-col grid (was responsive auto-fill, 4-card layout). Skills tile added as a 5th card linking to `/marketplace?type=skills`. - `.browse-card` mock-spec: 22 20 padding, 28px icon, 15px title, 12.5px ink-muted desc, hover lifts -2px with brand border + shadow-md. Section wrappers (`.home-usage`, `.first-session`) no longer carry the white card chrome — they sit directly on the page bg, matching the mock. Only Getting Started + Overview keep their white cards. GLOBAL eyebrow vocabulary (`.home-hero-intro .eyebrow`, `.first-session > .eyebrow`, `.surfaces > header .eyebrow`, `.browse-section .eyebrow`) all aligned to mock spec: 12px / 700 / 1.5px tracking / brand-dark color / 14px bottom margin. Hero h1 bumped to 44px / 800 / -1px tracking (was 32px / 600). 51/51 home tests pass. fix(web): /home session-intro card + terminal-body verbatim mock match User comparison flagged three remaining /home gaps; this patch addresses each: - `.session-intro` rule was missing — the "five beats" tagline rendered as a bare line with no card chrome. Added the mock- spec card: white surface, 14px radius, 20×24 padding, 1px border + shadow-sm, with a 44×44 brand-light icon block on the left. - Beat 1 terminal-title was `~/{{ workspace_dir }} — zsh` (mock- style shell-pwd format), but the user wants every terminal frame across all 5 beats to read `claude — {{ instance_brand }}`. Updated. - Terminal-body line structure for beats 2-5 rewritten verbatim from the CEO mock: - `<span class="prompt">></span><span class="you">…</span>` now has no space between the prompt and user input (mock pattern: zero gap, the .prompt's `margin-right: 8px` provides the visual separation). - Beat 2 menu items use `<strong>[N]</strong>` numbering with project entries on indented lines, each project name followed by a `<span class="dim">(N ago)</span>` timestamp at a fixed column — instead of my prior single-line concatenation. - Beat 3 narrative split into 4 stanzas separated by blank lines (matches mock): the "Switched to <strong>X</strong>" status, then dim Loaded/Last-session lines, then a stand-alone "One unprocessed input detected:" pair, then the "Want me to process …" question. My prior version dim-wrapped the entire block, which looked off. - Beat 4 narrative split into headline summary + risks section with <strong> heads + bullet lists separated by blank lines, matching the mock's "Q1 close summary" / "Open risks" rhythm. The Q1 question carries the mock's manual line-break + 2- space continuation indent inside the `.you` span — without that, terminal-body's `white-space: pre-wrap` would auto-wrap awkwardly at a different column than the mock. - Beat 5 exit narrative uses two separate dim lines + a standalone `.ai-name` "See you next time." line, then prompt + caret. My prior version collapsed everything into one dim block. - Project names changed from customer-specific (`GRPN_`) to generic (RevenueAnalysis, WeeklyReview, Onboarding, OpsDb, HRHandShake) so the OSS distribution stays vendor-agnostic per CLAUDE.md. - `Marketing plan` examples replaced with `Q1 close` so the narrative stays plausible for an analyst audience. 12/12 home tests pass. fix(web): /home surfaces verbatim mock — VS Code thumb, Terminal expected-output, NEW badge User comparison flagged three remaining surface-section gaps: - VS Code surface card was rendering a generic "Screenshot pending" placeholder; the mock has a labeled inline mockup (`<a class="vscode-thumb">` w/ `.thumb-fallback`) showing the recommended 4-pane layout (EXPLORER yellow, TERMINAL 1 purple, TERMINAL 2 green, TERMINAL 3 orange) on a dark navy bg + a "Recommended layout" caption pill. CSS `.vscode-thumb` block added — uses gradient-strip backgrounds to draw the colored panel bars without needing a base64 image. - "Recommended" badge was a pill (999px radius) with `--brand-accent` bg + navy text. Mock uses `.badge` instead of `.recommend-pill` — solid `--primary` (brand-dark green) bg with WHITE text and 4px radius. Replaced the class + CSS rule so the badge reads as a tag, not a pill. - Terminal surface card was missing the "What you should see" subsection — mock has an `.expected-output` block showing a sample of the welcome menu inside a dim dashed panel. Added the block with the mock's exact rendered output (templated to `{{ instance_brand }}` + generic project names instead of customer-specific GRPN entries) plus the `.expected-output` CSS (surface-dim bg + dashed border + `::before` "WHAT YOU SHOULD SEE" eyebrow per mock spec). Also addressed the explore-section feedback: - Skills browse-card now carries the `new` class so it picks up the `.browse-card.new::after` corner badge ("NEW", green bg, white text, 10px / 700 / 0.5px tracking) per mock. - Browse cards align same height via `align-self: stretch` (grid default) + `flex-grow: 1` on `.browse-desc` so descriptions fill remaining vertical space; previously the Skills tile sat shorter because its desc text was longer than others'. Structural HTML changes to all four surface cards: dropped the inner `<div class="surface-card-head">` wrapper + `<p class="surface-pitch">` class in favor of mock's flat layout (`.what` + `.steps` + `.when-to-use`). `<ol class="surface-steps">` replaced with `<div class="steps"><strong class="steps-eyebrow">DAILY USE / QUICK ACCESS / CONNECT IT</strong> <ol>...</ol></div>` so the eyebrow + numbered list share the mock's tinted surface-dim panel. 12/12 home tests pass. * fix(web): align /home setup walkthrough to design spec - Setup-section header (eyebrow + heading + lede) floats above the install hero; install card has no accent strip; step labels drop `Step N —` prefix; closing strip is single flex row. - VS Code surface card renders recommended-layout screenshot from `/static/img/vscode-layout.png` with click-to-enlarge lightbox. - Workspace install path cascades to `~/Desktop/{workspace_dir}` in every step, surface card, first-session annotation, and shortcut. - Step 1 verify text restores Enterprise — Finance and Legal option. - Step 6 shortcut installs a shell function with arg forwarding (`"$@"` unix / `@args` windows) and a user-facing Auto / YOLO permission-mode toggle. - Step 5 manual-fallback details inline on the CTA row; description reads at step-lede size, not 13px chip. - Setup-section heading no longer right-aligns (was inheriting `header { display: flex; justify-content: space-between }` from the legacy stylesheet; wrapper changed to `<div>`). - Getting Started `<details>` block removed (duplicated links). * test(web): align /home tests with restructured setup wizard - Replace test_getting_started_card_renders_on_home with test_setup_section_renders_for_not_onboarded — asserts the new setup-section-header floats above the install hero and Getting Started markup is absent (block removed in the prior commit). - Update automode-block test to match labels without the `Step N —` prefix. - Update setup-CTA partial test to match the relabeled "Copy install script to clipboard" button. Drop orphaned CSS for `.home-getting-started`, `.home-gs-summary`, and `.home-gs-item` — selectors had no matching markup after the Getting Started block was removed. Also: Step 3 `pwd` expected-output uses an absolute path (`/Users/yourname/Desktop/{workspace_dir}`) instead of the tilde-prefixed form, matching what the command actually prints. fix(web): repaint home_onboarded + setup_advanced; align CTA label - home_onboarded + setup_advanced still carried the retired blue `#0056A3` as both `--hp-primary-dark` and the hero gradient endpoint. Both reference `var(--primary-dark)` now so the green palette cascades. - setup_advanced YOLO snippet was the old `alias` form (no cd, no arg forwarding). Replaced with the shell function variant from /home Step 6 — drops into ~/Desktop/{workspace_dir} and forwards "\$@" (unix) / @args (Windows). - setup_advanced ~/{workspace_dir} path references cascaded to ~/Desktop/{workspace_dir} so install story matches /home. - Dashboard's "Setup a new Claude Code" button label aligned to the canonical "Copy install script to clipboard" — matches /home and the new docstring in _claude_setup_cta.jinja, which now mandates this label across consumers. * fix(web): keep base brand blue; scope green palette to /home redesign User noticed login + dashboard had turned green when the /home redesign flipped --primary from blue (#0073D1) to green (#2ea877) in commit 278f202e. The brand-wide flip went further than the redesign needed — only /home, /home (onboarded), and /setup-advanced intentionally use the green/navy spec; every other page (login, dashboard, catalog, marketplace, admin, profile) was just inheriting the green because --primary cascaded everywhere. Revert the global brand colour to blue and lock the green into the two outstanding redesign scopes: - style-custom.css: --primary back to #0073D1, --primary-light back to rgba(0,115,209,0.1), --primary-dark back to #005BA3, --brand-accent back to a lighter blue. - home_onboarded.html: .home-mock now sets --hp-primary, --hp-primary-dark, --hp-primary-light to explicit green hex (matching home_not_onboarded), so the hero stays green regardless of the global brand. - setup_advanced.html: same lock — .advanced-mock pins the green palette in-scope. Hero gradients on both pages now reference the local --hp-primary chain (not the global --primary), so any future palette tweak inside either scope cascades correctly without disturbing the rest of the app. * refactor(web): hoist --hp-* into shared design-tokens.css (--ds-) PR 2 of the design-system extraction ladder. Pure mechanical rename + dedup; no visual diff on any rendered page (verified on /home, /dashboard). - New app/web/static/css/design-tokens.css declares the full token set on :root: brand surface (green primary, primary-dark, mint light, brand-accent), hero (navy bg + ink), code-panel (near-black bg + cool ink + warm-yellow), light surfaces (bg/surface/border), text (primary/secondary/muted), orange accent, info + warn callout vocabularies, navy-tinted elevation shadows, system font stack + mono. - base.html loads it alongside style-custom.css so the tokens are globally available. - Rename --hp- -> --ds-* in home_not_onboarded (313 refs), home_onboarded (15), setup_advanced (39). 367 token references pointed at one of three local blocks; now all point at the global :root. - Drop the three local token blocks. Each scope class (.home-mock / .advanced-mock) only keeps its base ink + font-size + line-height rules. The legacy --primary family stays canonical for the blue base brand — login, dashboard, catalog, marketplace, admin still read blue. The design system is opt-in via the scope class. * refactor(web): extract shared components.css; migrate /home markup PR 3 of the design-system extraction ladder. First batch of reusable components lifted out of home_not_onboarded.html into a new shared stylesheet; markup migrated to consume them. - New app/web/static/css/components.css with five components, all reusable on any page that loads design-tokens.css: .callout-rec — amber lightbulb recommendation box .callout-hint — blue info hint box .code-output — "WHAT YOU SHOULD SEE" terminal output block .lightbox — full-bleed image enlarge overlay .setup-section-header — wizard header (eyebrow + h2 + lede) - base.html loads components.css after design-tokens.css. - home_not_onboarded.html markup renamed: class="rec" -> class="callout-rec" class="hint" -> class="callout-hint" class="expected-output" -> class="code-output" - Local CSS rules removed from home_not_onboarded.html for each of the extracted components — ~150 lines down to 5-line "extracted to components.css" comments. The bespoke wizard-specific styles (.install-cmd, .os-tabs, .mode-tabs, .terminal-frame) stay template-local for now since they only have one consumer. Visual regression check: /home install hero renders the amber rec callout, blue hint callout, dashed code-output block, green section header, and click-to-enlarge VS Code thumb identically to the pre-extraction render. 43 home tests pass. * fix(web): unify page-headers — activity-center full-width, marketplace shares box - /activity-center audit-log hero rendered as half-width because the _page_hero include was inside <header class="obs-topbar">, a flex row that pinned the time-range + auto-refresh controls next to it. The hero is now a sibling rendered before the <header>, so it spans the full container width like every other admin page; the controls keep their flex row underneath. - Marketplace hero unified with .page-header--hero. Markup is now <section class="page-header page-header--hero mp-hero"> so the shared box drives padding/radius/gradient/max-width/shadow; the .mp-hero override block only carries the right-anchored cover image and the rules for the search row + scope checkboxes (which the canonical hero doesn't have). Inner text uses the canonical .page-header__eyebrow / __title / __subtitle classes. - .page-header--hero shadow tint now follows the brand blue (rgba(0, 115, 209, 0.2)) instead of the leftover green from the prior palette flip; same depth highlight everywhere the gradient is blue. * fix(web): unify remaining page heroes — admin, profile, install, store, stack Sweep across pages that carried bespoke gradient hero markup so every page-hero shares the canonical `.page-header--hero` dimensions (padding 28/32/24, border-radius 14, max-width var(--width-app), navy-tinted shadow, gradient with --primary → --primary-dark). Inner text uses the .page-header__eyebrow / __title / __subtitle classes so typography matches across the app. - admin_tables: migrated to _page_hero.html include. - admin_tokens: kept .tokens-hero wrapper for the counts-chip row but added the canonical class on the same element; stripped duplicate gradient + padding + typography rules. - install: same pattern (kept hero-meta pill row). - profile: migrated to _page_hero.html include. - store_upload: kept .upload-hero wrapper for the .meta chip row; composite class with the canonical hero. - setup_advanced: .advanced-mock .ad-hero now matches canonical dimensions; green palette retained via --ds-primary/dark. - stack_card.css: .stack-hero (catalog + corporate-memory search hero) uses canonical gradient + padding + max-width. The detail-page heroes (marketplace_plugin_detail, marketplace_item_detail, catalog__detail, store_edit, admin_group_detail, admin_store_submission_detail) stay bespoke for now — they're rich detail headers with photos, badges, install actions; converting them would lose contract context. Same applies to dashboard.html env-setup-cta (it's a CTA card, not a page hero). fix(web): canonicalise .container — single page shell every page inherits Previously each admin page set its own `.container:has(.<page>) {max-width: none}` + `.<page>-page {max-width: 1400px}` override, and per-page hero markup either nested inside flex toolbars (which pinned the hero next to filter controls and squeezed it half-width) or self-constrained with a different max-width than the page. /home, /dashboard, /marketplace, and /admin/* ended up at different widths with different nav-to-hero gaps. - style-custom.css `.container` now carries the canonical 1280px max-width + `16px 32px 48px` padding so every page inherits the same nav-to-hero gap and side gutters. `.container > main` is margin/padding 0 so the container is the sole owner of gutters. - `.page-header--hero` drops its self-constraining max-width and auto-centering margin — the container provides the width, so the hero sits flush with the table/toolbar below it. - `.stack-hero` (catalog + corporate-memory) and `.advanced-mock .ad-hero` (/setup-advanced) follow the same pattern: container owns the width. - Per-page max-width overrides stripped from admin_users, admin_access, admin_groups, admin_marketplaces, admin_welcome, admin_workspace_prompt. - _page_hero include extracted from inside flex toolbars on admin_users, admin_access, admin_groups, admin_marketplaces, admin_server_config, admin_welcome, admin_workspace_prompt, admin_sessions, admin_session_detail, admin_usage, activity_center. The toolbar (`.users-toolbar`, `.gp-toolbar`, etc.) keeps only the filter + action controls; hero renders before it as a sibling. - _page_chrome.html trimmed to just the page-background tint for the redesign scopes; the duplicate `.container` rules it carried are now redundant. Verified: /home, /admin/marketplaces, /admin/users all render container width 1280px with hero top at 88px (16px below the 72px-tall sticky nav). Same spacing as /home design spec. * fix(web): admin_tables + admin_corporate_memory inherit canonical .container Both pages were overriding `{% block layout %}` from base.html, which bypasses the canonical `.container` wrapper. Result: hero span the full viewport (1596px on a wide screen) while the inner content sat at a narrower max-width — hero and content didn't align, and the nav-to-hero gap differed from every other admin page. Switched both templates to `{% block content %}` so they render inside the canonical `.container` from base.html — same path as admin_groups, admin_users, admin_marketplaces, etc. - admin_tables: dropped local `.page-title { max-width: 1600px }` + `.content { max-width: 1600px }` overrides (kept typography + inner gutter rules) and the mobile padding overrides that paired with them. Container now owns the gutters. - admin_corporate_memory: only the block keyword needed changing; the template already had a clean inner structure (no max-width override on `.container-memory`). Verified on /admin/tables and /admin/corporate-memory: - .container width 1280, padding 16/32/48 - Hero top 88 (nav 72 + container padding-top 16) - Hero + content both 1216px wide, both at left 190 — perfect alignment with /admin/groups. * fix(web): drop .page-shell padding override + admin_tables stale :root Two regressions discovered after the canonical-container unification: 1. `.container:has(.page-shell)` still set `padding: 28px 32px 48px` while the canonical `.container` had moved to `16px 32px 48px`. Every page-shell consumer (/admin/sessions, /admin/sessions/<id>, /admin/usage, /marketplace, /dashboard, marketplace detail pages, /me/activity, /store/, /admin/store-submissions) was rendering with a 28px nav-to-hero gap while /admin/users + /admin/groups rendered with 16px. Same width, mismatched vertical rhythm. The opt-in rule is now a no-op marker: canonical container already provides 1280px + 16/32/48 + main margin/padding 0. 2. admin_tables.html had a stale `<style>` block that re-declared `:root { --primary: var(--primary); ... }`. The self-referential token resolved to empty, collapsing the page-header hero's `linear-gradient(135deg, var(--primary), var(--primary-dark))` to no background — the hero appeared as a pale ghost without colour. The entire shadow `:root` block was a stale copy of the design tokens that style-custom.css already provides. Dropped it; tokens now resolve from the global `:root`. After both fixes /admin/sessions, /admin/tables, and every other page-shell consumer match /admin/groups exactly: container 1280px, container padding-top 16px, hero at top 88px / left 190px / width 1216px. fix(web): drop /admin/tokens .tokens-page width + padding override `.tokens-page` carried its own `max-width: 1280px; margin: 0 auto; padding: 28px 8px 48px` block — the canonical `.container` already provides width + 16/32/48 padding, so the nested wrapper was adding 28px on top of the container's 16px (= 44px nav-to-hero gap, vs 16px on every other admin page) and shrinking the hero sideways by 8px on each side (1200px vs the canonical 1216px). After: container owns the layout; `.tokens-page` is just a font-family scope. /admin/tokens hero now sits at top 88, left 190, width 1216 — same numbers as /admin/groups / /admin/users. * fix(web): hero links readable on blue; /admin/access Groups link href - New `.page-header--hero a` rule in style-custom.css forces any anchor inside a gradient hero to render white + underlined so links stay readable on the blue background. Previously links inherited the global `var(--primary)` blue, which disappeared on top of the matching blue gradient. No per-page class needed — drop a plain `<a>` in any hero subtitle and it just works. - /admin/access hero subtitle was Jinja-passing the inline link with HTML-entity-encoded quotes (`href="..."`). The entities decoded to literal `"` characters inside the rendered href, producing `/admin/%22/admin/groups%22` — a 404. Switched the `set` to a block-set (`{% set page_hero_subtitle %}...{% endset %}`) so the inline `<a href="/admin/groups">Groups</a>` survives unescaped through `_page_hero.html`. Also stripped the now-redundant inline `style="color:#fff;text-decoration:underline;"` — the new shared rule handles it. * fix(web): /dashboard top padding matches every other page `.main` on /dashboard had `padding: 28px 32px 48px` while every other page now uses `16px 32px 48px` via the canonical `.container`. Dashboard bypasses `.container` (overrides base.html's `layout` block to render a full-width `<main>` directly), so the padding lives on `.main` itself — bumped the top to 16px to match. After: first child top = 88, left = 190, width = 1216 — same numbers as /admin/groups / /admin/users / /admin/marketplaces. * fix(web): green eyebrow + white title on .page-header--hero (matches /home) `.page-header--hero .page-header__eyebrow` was faint white (rgba(255,255,255,0.75)) — readable but unbranded against the blue gradient. Changed to `var(--ds-brand-accent)` (mint green #54d3a0) so every page hero pairs a green eyebrow with white title + subtitle, echoing /home's setup-section header (green eyebrow, dark heading combo). One CSS rule applies everywhere — no per-page styling needed. Also bumped the eyebrow to font-weight 700 / letter-spacing 1.2px so the green stands out cleanly against the gradient. * fix(web): page-header--hero + stack-hero use /home navy gradient `.page-header--hero` and `.stack-hero` were on the brand-blue gradient (`var(--primary)` → `var(--primary-dark)`) while /home's hero (`.home-hero-intro`) sits on the deeper navy gradient (`#0f1b3a` → `#1a2a5f`). Every other page-hero now uses that same navy gradient so /home, /marketplace, /catalog, /corporate-memory, /admin/, /profile, /install, /dashboard, /setup-advanced share one brand surface. Shadow tint adjusted to the navy depth (rgba(15, 27, 58, 0.22)). Brand blue stays the link/CTA colour everywhere else; only the hero box itself is navy. fix(web): primary buttons green; marketplace tabs navy translucent Two parity tweaks pulling the rest of the app toward /home's visual language. - `.btn-primary` (both rules in style-custom.css) now uses `var(--ds-primary)` / `var(--ds-primary-dark)` green fill, matching the "Copy install script to clipboard" button on /home. Brand-blue `--primary` still drives link colour and the accent surface; only the filled button background flipped to green. Every page with a `.btn-primary` (admin "+Add user", "+Add marketplace", catalog, marketplace actions, dashboard, modals) now reads as the same "do it" affordance. - `.mp-tabs` (Curated Marketplace / Flea Market / My Stack tab group) now sits on the navy `--ds-hero-bg` with translucent white pills (rgba(255,255,255,0.10) inactive, 0.18 active) — same translucent-white-on-navy treatment as the "Just browse — no install needed" pill on /home. Icons render as soft white; per-tab colour-coding dropped in favour of the unified surface. * fix(web): catalog/memory tabs + empty-state CTA + admin action buttons Bring /catalog and /memory in line with /home + /marketplace: - `.stack-tabs` (Browse / My Stack / Recipes on /catalog, Browse / My Stack on /memory) now uses the navy `--ds-hero-bg` container with translucent-white-on-navy pills, mirroring the `.mp-tabs` treatment and /home's "Just browse — no install needed" CTA pill. Per-tab icon colour-coding dropped — icons render as soft white on the navy fill. - `.stack-tabs-row__actions .btn` (right-slot "+New Recipe", "+New Data Package" admin CTAs) now uses green primary fill (`--ds-primary`), matching `.btn-primary` and /home's "Copy install script to clipboard" button. - `.stack-empty .cta a` (empty-state action button — the "Open /admin/tables →" CTA on /catalog and equivalent on /memory) flipped from blue `--primary` to green `--ds-primary` so the colour aligns with every other primary button in the app. * fix(web): marketplace Search button green (--ds-primary) matching other CTAs * fix(web): unify Search button + admin-action button across browse pages - Added Search button (`<button class="stack-hero__search-btn">`) to /catalog and /memory heroes — same green pill as /marketplace. Wired to the existing live-filter pipeline (button click runs `applyFilters()` and refocuses the input). All three browse pages now wear the identical search bar UI. - `.stack-hero__search-btn` shares `--ds-primary` fill with `.mp-hero .search-btn`. - `.mp-actions .btn` ("Submit a skill or plugin" CTA on /marketplace) flipped from the legacy blue-outline to the same green primary fill + dimensions (`display: inline-flex; line-height: 1; padding: 9px 16px; gap: 6px`) as `.stack-tabs-row__actions .btn` on /catalog and /memory. All three right-slot action buttons render at identical height now. - `.stack-tabs-row__actions .btn` got `inline-flex` + `line-height: 1` + `gap: 6px` so a `<button class="btn">` and a `<a class="btn">` both render at exactly 33px high — the embedded `.admin-only-hint` chip no longer pushes one variant taller than the other. * fix(web): marketplace guide CTAs green (fastpath + primary); drop flea purple * fix(web): dashboard CTA hero on navy; readable <code> chips in hero - `.env-setup-cta` on /dashboard ("Set up a new Claude Code" card) flipped from the brand-blue gradient + green-tinted shadow to the canonical navy gradient (`--ds-hero-bg` → `#1a2a5f`) with navy-tinted shadow + 14px radius + 28/32/24 padding, matching `.page-header--hero` and /home's `.home-hero-intro`. Dashboard's top CTA now sits on the same brand surface as every other hero. - Added `.page-header--hero code` rule — translucent white pill + warm-yellow ink (#ffd866) so `<code>` chips embedded in hero subtitles read as code samples against the navy gradient. The global `code` rule sets `color: var(--text-primary)` (dark), which turned in-hero chips into invisible dark-on-white-on-navy ghosts (e.g. the `-by-dev` suffix on /store/new). - /store/new's `.page-header__subtitle code` dropped its inline style override — the shared rule handles it now. * feat(web): two-theme switching via data-theme + admin toggle Introduces a theme system that flips the entire UI palette between "navy" (current design, default) and "blue" (pre-redesign palette) via a single `<html data-theme="...">` attribute. Page markup, class names, and component styles don't change — only the `--ds-` token values flip. Backend - New `app/instance_config.py::get_instance_theme()` resolves the active theme from `AGNES_INSTANCE_THEME` env > `instance.theme` in instance.yaml > default "navy". Unrecognised values clamp to "navy" so a typo doesn't break the page. - `app/web/router.py::_build_context` injects `instance_theme` alongside `instance_brand` etc. so every template inherits it. - `app/web/templates/base.html` renders `<html lang="en" data-theme="{{ instance_theme \| default('navy') }}">`. CSS - `app/web/static/css/design-tokens.css` adds two new tokens to the default `:root` set: `--ds-hero-shadow` (drop-shadow tint on hero boxes) and `--ds-hero-eyebrow` (eyebrow accent colour). Plus a `:root[data-theme="blue"]` override block that flips seven tokens: `--ds-primary`, `--ds-primary-dark`, `--ds-primary-light`, `--ds-brand-accent`, `--ds-hero-bg`, `--ds-hero-bg-deep`, `--ds-hero-shadow`, `--ds-hero-eyebrow`. The blue theme aliases the brand surface tokens back to the legacy `--primary` family. - `.page-header--hero`, `.stack-hero`, `.env-setup-cta`, `.home-mock .home-hero-intro` now reference the new `--ds-hero-shadow` and `--ds-hero-bg-deep` tokens instead of hard-coding `rgba(15, 27, 58, 0.22)` and `#1a2a5f` — gradient + shadow now flip with the theme. - `.page-header--hero .page-header__eyebrow` uses `var(--ds-hero-eyebrow)` so the eyebrow goes mint-green on navy and translucent-white on blue (mint on blue reads poorly). Admin - `app/api/admin.py::_KNOWN_FIELDS["instance"]` now registers a `theme` field of kind `select` with options `["navy", "blue"]` and a `hint` explaining the trade-off. The existing /admin/server-config UI auto-renders a select for this — no template changes needed. Defaults - Default value is "navy" so existing instances see no visual change. Admins flip to "blue" via /admin/server-config to restore the pre-redesign look. Restart note: uvicorn must reload to pick up the Python changes (new getter, new template-context key, new known-field). CSS changes hot-reload via browser refresh. fix(web): blue theme — home hero eyebrow + CTA contrast `.home-hero-intro .eyebrow` and `.btn-intro-primary` referenced `--ds-brand-accent` directly, which on the blue theme resolves to the lighter brand-accent blue (#4F9DEB). Result: light-blue eyebrow on the blue gradient ("WELCOME, ADMIN" barely readable) and a light-blue button with darker-blue text ("Set up in ~15 min") that all sat in the same hue range. Introduces three new theme-aware tokens: - `--ds-hero-eyebrow` already existed; blue theme bumped opacity to 0.92 so the eyebrow reads as full white. - `--ds-hero-cta-bg` + `--ds-hero-cta-fg` + `--ds-hero-cta-bg-hover` flip the primary hero CTA: mint-green on navy (default), white- on-blue under `data-theme="blue"`. `.home-hero-intro .eyebrow` now uses `--ds-hero-eyebrow` (mint on navy / white on blue) and `.btn-intro-primary` uses the CTA token trio. Recommended palette on blue theme: - Eyebrow: white at 92% opacity (clear on the blue gradient). - Primary CTA pill: white background, brand-blue dark text (`--primary-dark` = #005BA3) for AAA-level contrast. - Secondary CTA: translucent white pill (unchanged). * fix(web): blue theme — callout-hint info bg/border/ink re-tinted to brand blue (was indigo, clashed with brand-blue hero)	2026-05-21 06:19:16 +00:00
ZdenekSrotyr	1b0329e8c5	UI design system unification — one stylesheet, canonical primitives, nav fix (#284 ) * docs(plan): design-system unification plan (post-review revisions) Plan covers consolidating two CSS files into one, introducing canonical primitives (.btn family, .search-input, .filter-bar, .page-header, .data-table, .empty-state, .toast, .stat-card, .tab-strip), unifying the top-nav Admin trigger with sibling links, and migrating 41 templates that today carry inline <style> blocks. Post-review revisions: nav fix moved to first commit (user complaint lands first); sticky-header and dark-mode skeleton tasks dropped (defer to follow-up PRs); contract test class detection tokenizes class="..." attributes properly; baseline screenshot loop added to Task 0; vendor-token grep widened. * fix(nav): unify Admin trigger with sibling nav links The top-nav Admin entry is a <button class="app-nav-link app-nav-menu-trigger">, siblings are <a class="app-nav-link">. .app-nav-menu-trigger used to override .app-nav-link with "color: inherit; font: inherit", resetting font-size from 13px back to body default and color from --text-secondary to body color. Active state diverged too: .is-active on links used --primary blue, [aria-expanded=true] on the button used --border-light grey. Fix: expand .app-nav-link so it covers <button>-element resets (font-family: inherit, border: 0, background: transparent, cursor: pointer, display: inline-flex for chevron alignment). Add [aria-expanded="true"] as another active-state selector so the dropdown's open state highlights identically to .is-active on links. Delete the now-redundant .app-nav-menu-trigger rules that stripped button chrome. Extract the inline <script> from _app_header.html into a new app/web/static/app.js (loaded by base.html only — base_login.html has no nav). Sets up window.appUI.wireDropdown for both the user menu and the Admin dropdown via DOMContentLoaded. * style(css): consolidate style.css into style-custom.css + add cache-bust One stylesheet for the whole web UI: - style.css (1086 lines, legacy Google-inspired tokens + components) absorbed into style-custom.css under a labeled block, placed after the modern :root + body so style-custom's component rules continue to override the legacy ones (preserves the original cascade order that came from loading style.css first). - style.css deleted; <link> dropped from base.html + base_login.html. - static_url() now appends ?v=<mtime> to /static/<path>. Cheap per-request os.stat — auto-invalidates browser + proxy caches on redeploy without operator intervention. Mtime survives across uvicorn restarts as long as the file content is unchanged. Legacy classes (.btn, .card, .login-, .badge, .code-block, .flash, .form-group, .username-box, .btn-copy, .auth-tabs, .divider, etc.) still render — they live in style-custom.css now. Login pages, error page, password setup, and the dashboard's Claude Code Setup card all kept working in browser smoke. test(design): contract test for design-system invariants 7 structural invariants enforced from this commit onwards: - style.css must stay deleted - no template links style.css via static_url - exactly one bare :root block in style-custom.css - canonical primitives declared (.btn, .btn-primary, .search-input, .filter-bar, .page-header, .data-table, .empty-state, .toast, …) - no deprecated class names in templates (.users-table, .gp-table, .marketplaces-table, .audit-table, .users-search, .marketplaces-search, .modal-btn, .btn-primary-v2, …) - app.js loaded by base.html, NOT by base_login.html - 3 helper-level unit tests for the class-attribute tokenizer (multi-line attrs, Jinja-conditional fragments, false-positive prose) Two of the assertions intentionally start FAILING after this commit (missing primitives + legacy class refs in 7 admin templates) and will turn green as Tasks 4–7 add primitives and Tasks 8–15 migrate the templates. * feat(css): canonical button family + legacy token aliases Adds at top of :root: legacy token aliases (--bg, --card-bg, --text, --text-light, --secondary, --radius) pointing at modern equivalents. Absorbed style.css rules referenced these names; without aliases they fell back to 'unset'. Aliases live until Task 16 alongside their absorbed rules. Appends canonical .btn variants at end of file (last cascade): .btn-primary + .btn-primary-v2 + .modal-btn.primary (alias group) .btn-secondary + .btn-secondary-v2 + .modal-btn:not(.primary):not(.danger) .btn-ghost + .btn-ghost-v2 .btn-danger + .modal-btn.danger .btn-lg .btn:disabled + .btn:focus-visible (focus ring via --focus-ring) Existing absorbed .btn, .btn-primary, .btn-secondary, .btn-sm rules remain — the canonical block adds the missing variants + selector-list aliases so .modal-btn and v2 markup keep rendering until migration tasks swap them out. Contract test: .btn-danger now declared (one less missing primitive). Browser smoke: /admin/tokens hero + filter pills + empty state render correctly with the absorbed style.css rules now backed by real tokens. * feat(css): form-control primitives — .search-input + .filter-bar + .filter-pill + .form-input Canonical filter bar shape: 36px-height inputs (matches button height for vertical rhythm), 28px pills with .is-active state, consistent focus ring via --focus-ring token. Selector-list aliases for legacy per-page classes: - .users-search / .marketplaces-search / .kb-search → .search-input - .filters-card → .filter-bar - .pill[aria-pressed="true"] also matches the .filter-pill active state .form-input added as a sibling of .search-input for forms — same baseline height + radius + focus treatment, with textarea.form-input auto-sizing to min 96px and using the mono font (matches CSV/SQL pasted-snippet patterns on /admin/agent-prompt + /admin/workspace-prompt). Contract test: .search-input + .filter-bar + .filter-pill now declared. * feat(css): .page-header primitive + variants + .tab-strip Canonical page-header pattern with title (22px) + optional subtitle + optional eyebrow + right-aligned actions slot. Two modifiers: - .page-header--hero: gradient background (primary→primary-dark), 28px white title, semi-transparent subtitle/eyebrow. For /marketplace, /store, /profile-style pages that already use this layout via per-page inline <style>. Migration tasks delete the duplicated rules. - .page-header--compact: 18px title for dense admin index pages. .tab-strip + .tab-strip__item — the secondary tab row pattern used by /marketplace?tab=flea and similar. .is-active / [aria-selected=true] both flip the active treatment (primary color + bottom border). Contract test: .page-header / __title / __subtitle / __actions all now declared (4 fewer missing primitives). * feat(css+js): .data-table + .empty-state + .toast + .stat-card primitives Last primitive batch. All 8 canonical-primitives invariants in test_design_system_contract.py now green; only the template-migration test fails (expected — Tasks 8–15). .data-table (+ --compact modifier): selector-list aliases for legacy per-page table classes (.users-table, .gp-table, .marketplaces-table, .audit-table) so existing markup keeps rendering until migration. Compact modifier shrinks padding + font for dense lists (audit log). .empty-state with __icon / __title / __description / __actions — replaces the ad-hoc 'no results' rendering scattered across pages (corporate_memory, admin_users, admin_marketplaces, etc.). .toast / .toast-container — paired with window.appToast({kind, msg, timeout}) appended to app.js. Bottom-right stacked, click-to-dismiss, auto-dismiss after 4s by default. Kind 'success' / 'warning' / 'error' / 'info' shows a 3px colored left border. .stat-card (+ --accent variant) + .stat-row grid — for the dashboard metric tile row. * style(templates): migrate 8 templates off deprecated class names Mechanical class-attribute rewrite via tokenizer (preserves Jinja conditionals + multi-line attrs): modal-btn primary -> btn btn-primary modal-btn danger -> btn btn-danger modal-btn -> btn btn-secondary users-table -> data-table gp-table -> data-table marketplaces-table -> data-table audit-table -> data-table users-search -> search-input marketplaces-search -> search-input 8 templates touched: admin_groups, admin_marketplaces, admin_tokens, admin_users, admin_welcome, admin_workspace_prompt, my_tokens, corporate_memory_admin. 43 lines updated total. Inline <style> blocks in these templates still define rules for the old class names — those rules no longer match anything and become dead code, removed in Task 16's alias cleanup along with the selector-list aliases in style-custom.css. Contract test (tests/test_design_system_contract.py) now fully green: 9/9 invariants enforced from this commit onward. * feat(css): extend .data-table selector list to 13 more bespoke -table classes Visual unification of remaining tables across the codebase without per-template edits. The .data-table baseline rules (uppercase header tracking, 12px padding, hover state, border-radius) now apply to: .ad-table / .ea-table / .md-table / .members-table / .obs-table / .overview-stats-table / .registry-table / .sample-table / .sched-table / .sess-table / .sub-table / .subs-table / .ud-table These class names live in 12 templates (activity_center, admin_access, admin_group_detail, admin_scheduler_runs, admin_sessions, admin_store_submissions, admin_tables, admin_usage, admin_user_detail, catalog, me_debug, profile_sessions) that have their own per-page <style> blocks. Per-page rules with higher specificity still win for their custom needs (column widths, etc.) — this commit only sets a shared baseline so every table renders with the same chrome. Contract test stays green: 9/9 invariants enforced. * style(css): remove now-unused legacy class aliases Phase A renamed 8 templates off these names; no markup references them any more, so the selector-list memberships are dead weight. Removed from style-custom.css: .btn-primary-v2 / .btn-secondary-v2 / .btn-ghost-v2 .modal-btn / .modal-btn.primary / .modal-btn.danger / .modal-btn:not(.primary):not(.danger) .users-search / .marketplaces-search / .kb-search .users-table / .gp-table / .marketplaces-table / .audit-table .filters-card 37 lines smaller. Contract test catches any reintroduction. KEPT aliases (still in untouched template markup): - .pill (marketplace_plugin_detail.html, marketplace.html — these pages weren't part of Phase A's deprecated-class sweep; their own .pill CSS rules still apply) - All .data-table family extensions (.ad-table, .ea-table, .md-table, .members-table, .obs-table, .overview-stats-table, .registry-table, .sample-table, .sched-table, .sess-table, .sub-table, .subs-table, .ud-table) — these still render data tables in 12 templates; selector-list aliasing keeps them visually unified with .data-table baseline. - Legacy token aliases (--bg / --text / --text-light / --secondary / --card-bg / --radius) — still resolve absorbed style.css rules. Templates' inline <style> blocks still contain dead rules for the renamed classes (.users-search, .modal-btn, etc.); harmless but bloat. Optional follow-up: a separate sweep can drop those. * docs(changelog): design-system unification under [Unreleased] * feat(css): unify page-shell width — .container baseline 1280px + modifiers Inventory found 30+ unique max-width values across templates (280px login → 1600px admin/tables). The legacy .container default was 800px, which made every admin page set its own wider inline override — 30+ ad-hoc widths drifted as a result. Canonical: .container max-width = var(--width-app) (1280px). Pages that need a different shape opt in via modifiers: .container--narrow → var(--width-narrow) (800px) — long-form text, setup wizards .container--wide → var(--width-wide) (1400px) — admin lists, marketplace grids .container--full → max-width: none — hero / landing Pages that already set a NARROWER inline max-width (setup, login flows inside .login-card, etc.) still render at their narrower size — the inline override beats the new canonical 1280px. The visible change hits the ~20 admin pages currently rendering at 800px via the legacy default, which jump to 1280px and pick up consistent breathing room. Spacing also normalized: padding 24px 20px → var(--space-6) var(--space-5). * fix(home+catalog): gut dashboard sections + remove confusing toggle + fix table count Dashboard /home cleanup: - Remove 'Your Data' card — Data Packages is already a top-nav entry, so duplicating data sources on the landing page just adds noise. - Remove 'Account' card — group memberships + scripts + last sync belong on /profile, not on the welcome screen. - Remove entire right-column (Corporate Memory + Activity Center widgets) — both surfaces have dedicated admin pages reachable from the Admin dropdown. - Keep stats row (Tables/Columns/Rows/Data Size/Unstructured), env-setup-CTA, and Notifications card. /catalog cleanup: - Strip the 'Always included' badge + the locked toggle-switch from Core Business Data and Business Metrics cards. The toggle was always 'checked disabled' — it visually looked like a switch but could not be toggled, which was confusing. The 'Always included' copy itself was redundant once the toggle was gone. Agnes Internal already rendered without these, so the three cards are now visually consistent. Catalog data_stats fix: - 'total_tables' was len(sync_state) — counted only tables that had ever synced, so a 30-row table_registry with 0 ever synced rendered as '0 tables'. Switched to len(tables) — the registered business-data table list — so the count reflects what's actually available, not what's been touched. * fix(home): real stat numbers + drop unstructured tile + cleanup dead CSS Dashboard stats were hardcoded zeros (columns: 0, size_display: '0 MB', unstructured_display: '0 MB') and the table counter pulled from sync_state (synced) instead of table_registry (registered). On a fresh deployment with 30 registered tables and 0 ever synced, the page rendered '0 / 0 / 0 / 0 MB / 0 MB' — useless. Now: - Tables: COUNT() FROM table_registry WHERE source_type != 'internal'. Matches the /catalog Core Business Data counter. - Columns: SUM(sync_state.columns). Zero only when nothing's synced yet. - Rows: unchanged (SUM(sync_state.rows), already correct). - Data Size: SUM(sync_state.file_size_bytes), human-formatted via inline _fmt_bytes helper (KB/MB/GB). - Unstructured: tile dropped — was always '0 MB' and had no source. - last_updated: now derived from sync_state max(last_sync), wasn't set before so the 'Synced …' tag never rendered. Dashboard.html cleanup: ~725 lines of orphan inline <style> removed — .section-title, .data-source, .toggle-switch, .catalog-cta, .memory-card / .memory-stat / .memory-description / .memory-footer / .btn-memory, .activity-card / .activity-stat / .activity-text / .btn-activity, .account-grid / .account-row / .account-scripts / .badge-role / .badge-group / .cron-line, .badge-included / .badge-beta / .badge-demo. All matched markup deleted in the previous commit; the CSS was dead code until now. * ui(catalog): rename page heading 'Data Catalog' → 'Data Packages' The top-nav entry says 'Data Packages' but the page itself said 'Data Catalog' — confusing two-name product. Aligns the heading and <title> with the nav label. Subtitle trimmed too: 'manage your subscriptions' was a vestige of the toggle UI that just got removed, replaced with a one-liner describing what the page is for. Two other 'Data Catalog' strings stay: they live inside the table- profiler overlay JS and refer to an EXTERNAL catalog system (e.g. OpenMetadata / Atlan) that an operator may link to per table — that is a generic term for any external data-catalog product, not our page name. * fix(nav): dropdown clicks always work + mutual-exclusion close Two bugs in the wireDropdown helper: 1. Clicking trigger B while trigger A's menu was open left both open. e.stopPropagation() in trigger.click prevented the document-click handler from firing, so trigger A's open menu had no way to learn that something else was clicked. Net effect: state diverged across the two dropdowns the more you clicked. 2. The target-vs-trigger equality check (e.target !== trigger) was strict. Clicking the chevron <svg> inside the button reports the svg or its <path> child as e.target — not the button — so removing stopPropagation alone would trip the close branch in the same click that just opened the panel. Fix both at once: drop e.stopPropagation() AND switch the doc-handler guard to trigger.contains(e.target). Now any click outside both the trigger subtree and the panel subtree closes; any click on another trigger closes via the OTHER dropdown's doc handler; clicks inside the trigger (button OR svg child) are fully ignored by the doc handler and only the trigger's own toggle handler fires. * feat(ui): canonical blue-gradient hero on every admin page The UI had a per-page hero pattern on ~10 onboarding/marketing pages (admin_tokens / profile / install / setup_advanced / marketplace / my_tokens / store_upload / home_), each with its own ad-hoc CSS (.tokens-hero, .profile-hero, .install-hero, .upload-hero, …). The admin section's index + detail pages had plain H1/H2 with their own .users-title / .gp-title / .obs-title / .cfg-title / … inline styling. Net effect: half the app felt like a product, half felt like a spreadsheet. Now: - .page-header--hero CSS upgraded to match the look analysts already liked from admin_tokens: 28px/32px/24px padding, 14px radius, soft primary-tinted box-shadow (0 4px 16px rgba(0,115,209,0.2)), 28px semibold title, optional uppercase eyebrow + 13.5px subtitle. Narrow-viewport breakpoint included. - New _page_hero.html partial wraps the boilerplate. Usage: {% set page_hero_eyebrow = "Users & Access" %} {% set page_hero_title = "Users" %} {% set page_hero_subtitle = "…" %} {% include "_page_hero.html" %} - 15 admin templates migrated to it: admin_users / admin_groups / admin_marketplaces / admin_access / admin_sessions / admin_session_detail / admin_store_submissions / admin_scheduler_runs / admin_usage / admin_user_detail / admin_welcome / admin_workspace_prompt / admin_server_config / activity_center / admin/news_editor. Each gets a grouped eyebrow (Users & Access / Data / Agent Experience / Activity Center / Server) matching the Admin dropdown sections so the page identity is unambiguous at a glance. Legacy -title H2/H1 + adjacent subtitle paragraphs deleted; their per-page CSS rules are dead now (harmless, retire in a follow-up sweep alongside other inline-style cleanup the reviewers flagged). admin_tables.html intentionally NOT migrated — it's a standalone HTML page that doesn't extend base.html; a separate refactor. Test: test_admin_users_page_renders_for_admin assertion updated from .users-title to .page-header__title + .page-header--hero (the canonical pair). All other web/template tests stay green. * refactor(ui): dedup _humanbytes, drop 267 lines of dead inline CSS (1) _humanbytes consolidation: - Add TB branch + optional precision param (default 2 preserves existing Store detail callers; dashboard uses precision=1 for headline tiles). - Delete inline _fmt_bytes from dashboard handler — was a copy of _humanbytes with different rounding. One canonical helper now. (2) Dead inline-CSS sweep across 17 migrated templates: - Conservative regex: a CSS rule is deleted only when its primary class matches one of the known-dead names AND that name is NOT referenced from any class= attribute in the same file's markup. - Per-file 'in-use' guard saved several false positives that the deny list would have nuked (e.g. .users-toolbar, .gp-search, .obs-subtitle, .marketplaces-toolbar are still in use; only .users-table, .users-search, .users-title, .modal-btn, etc. that have NO markup left went away). - Removed: -267 lines across admin_users (-42), admin_marketplaces (-45), admin_groups (-31), my_tokens (-38), admin_tokens (-29), admin_access (-9), admin_user_detail (-6), admin_welcome (-8), admin_workspace_prompt (-8), admin_server_config (-2), admin_sessions (-1), admin_session_detail (-1), admin_usage (-1), admin_store_submissions (-3), admin_scheduler_runs (-3), activity_center (-4), corporate_memory_admin (-36). Contract test stays green (9/9); all web/template/render/user_management tests pass. * feat(ui): canonical hero on /catalog (Data Packages) Same .page-header--hero treatment as the admin pages — Data eyebrow, Data Packages title, Browse-the-data-sources subtitle. Removes the ad-hoc .page-title block (h1 / p / wrapper-div) and its CSS rules (now dead, 3 rule blocks deleted). * fix(nav): load app.js from _app_header.html — works on standalone pages The previous nav-fix commit moved the inline dropdown script from _app_header.html into app/web/static/app.js + added <script src=…> to base.html. That broke EVERY page that includes _app_header.html WITHOUT extending base.html (catalog, corporate_memory, admin_tables, install). They got the nav markup but no JS → both Admin and AD dropdowns dead on those pages. Fix: emit the <script src=app.js defer> directly inside the _app_header.html partial. Any page that includes the header now gets the script automatically — base.html-extenders AND standalone HTML pages alike. base.html's duplicate <script> line removed. Also fixes the wide-hero on /catalog: .page-header--hero now sets its own max-width: var(--width-app) (1280px) so standalone pages without a .container parent don't render the gradient edge-to-edge. catalog's .source-cards bumped from 900px → 1280px to match the hero, otherwise the page reads two-tier (wide blue band, narrow content) which the user flagged. Verified locally via agent-browser: Admin + AD dropdowns now click through on /catalog, /admin/tables, /corporate-memory. docs(plan): standalone pages → base.html framework migration plan Plan + Plan-agent review (8 must-fix items applied) for converting the 5 templates that ship their own <html><head><body> scaffold (catalog, install, corporate_memory, corporate_memory_admin, admin_tables) to extend base.html. Root cause of yesterday's 'dropdown dead on /catalog' regression: shared infrastructure in base.html doesn't propagate to standalones. * feat(base): body_attrs block + migrate install.html to extend base base.html: new {% block body_attrs %}{% endblock %} slot so pages that need <body> attributes (admin_tables has data-source-type) can carry them through extends. install.html: convert from standalone <html><head><body> scaffold to {% extends "base.html" %} with title / body_attrs / head_extra / layout / scripts blocks. Drops: - <!DOCTYPE>, <html>, </html>, <head>, </head> - <meta charset>, <meta viewport> - Duplicate <link rel="stylesheet" href="...style-custom.css"> (base.html already provides one) - <body> opening + closing tags - Leading _app_header.html include + _version_badge.html include (base.html handles both) Preserves per-page CSS (in head_extra), per-page JS (in scripts), the Inter font preconnect (kept inline; not hoisted to base in this PR — separate decision). Pilots the migration recipe before the 4 larger pages. * refactor(memory): extend base.html Same recipe as install.html. corporate_memory.html now inherits <html>/<head>/<body> + nav + app.js script tag from base.html. Page-specific CSS and JS preserved in head_extra + scripts blocks. * refactor(memory-admin): extend base.html Same recipe as install/corporate_memory. Curation page now in the shared rendering pipeline. * refactor(catalog): extend base.html catalog.html had the most complexity: 7 head-level assets (chart.js, Prism, prism-sql, metric_modal.css link + 2 preconnects + Inter stylesheet), 5 body-level <script> blocks including a <script type= "module"> for the metric modal, 2 duplicate style-custom.css links in <head>. The migration script preserved all of them — head-level externals hoisted to {% block head_extra %} in source order, body scripts relocated to {% block scripts %} in source order (so chart.js loads before the IIFE that builds Chart instances), duplicate style-custom.css links dropped (base.html provides one). * refactor(admin-tables): extend base.html + carry data-source-type The biggest of the 5 standalones at 3563 lines. <body data-source- type="{{ data_source_type }}"> attribute carried through via the new {% block body_attrs %} slot (admin_tables JS reads document.body.dataset.sourceType to switch between keboola and bigquery rendering paths). * release: 0.54.10 — UI design system unification + homepage status frame + initial workspace override + store guardrails Co-Authored-By: zdenek.srotyr <zdenek.srotyr@keboola.com> * refactor(web): migrate remaining templates to canonical design primitives - admin_group_detail: .data-table, .btn family, appToast(), remove duplicate table/button/toast CSS - admin_store_submission_detail: .data-table, .btn family, appToast(), remove duplicate btn/toast CSS - profile_sessions: .data-table, _page_hero.html, remove duplicate table/title CSS - me_debug: .data-table, .btn family, remove duplicate table/button CSS - marketplace: .btn-primary/.btn-secondary, remove duplicate button CSS - store_edit: remove duplicate .btn-primary/.btn-link CSS, canonical button classes - store_upload: remove duplicate .btn-primary/.btn-secondary/.btn-link CSS Co-Authored-By: zdenek.srotyr <zdenek.srotyr@keboola.com> --------- Co-authored-by: Devin AI <158243242+devin-ai-integration[bot]@users.noreply.github.com>	2026-05-14 13:28:03 +02:00
ZdenekSrotyr	b4d3c576af	Activity Center: audit log + telemetry + sessions + agnes_* tables (#278 ) * docs(spec): admin observability spec + Activity Center MVP plan Parent spec (480 lines) + executable plan (2295 lines, 14 TDD tasks). Covers Activity Center rebuild (/admin/activity), with /admin/sessions and /admin/feedback deferred to follow-up plans. Already incorporates reviewer-pass revisions across three angles (security, production resilience, code architecture): - _get_db import path corrected to app.auth.dependencies - Test fixtures aligned with seeded_app / admin_user / get_system_db - All new audit writes wrapped in try/except + logger.exception - Filename sanitization on session uploads - DuckDB DESC index behavior documented; upgrade window flagged - Migration idempotency + evolved-DB test cases - reveal_raw + shared-cache multi-worker explicitly deferred Targets schema v40 (audit_log gains params_before, client_ip, client_kind, correlation_id + 3 indices). * feat(db): schema v40 — audit_log gains params_before, client_ip, client_kind, correlation_id + 3 indices * chore(test): clean up Task 1 — drop unused import, rename stale test * feat(audit): AuditRepository.log() accepts params_before/client_ip/client_kind/correlation_id * test(audit): strengthen params_before assertion to round-trip JSON content * feat(audit): AuditRepository.query() rich filters + keyset cursor pagination * feat(sync): SyncStateRepository.list_recent() cross-table feed * feat(audit): POST /api/sync/trigger writes audit_log row * feat(audit): POST /api/scripts/run-due writes audit_log row * feat(audit): POST /api/upload/sessions writes audit_log row + sanitizes filename * feat(audit): GET /api/data/{table_id}/download writes audit_log row * feat(activity): /api/admin/activity timeline + /health + /sync endpoints * feat(ui): /admin/activity rebuilt — health pulse, timeline, sync grid; /activity-center → 308 redirect BREAKING: removed demo executive-pulse / maturity-roadmap content from activity_center.html. The page now reflects real audit_log + sync_history data. * feat(ui): admin nav + dashboard widget point at /admin/activity * feat(activity): recursive-audit suppression for AC read endpoints (60s window per actor+filter) * feat(activity): emit PostHog events when integration enabled (no-op default) * fix(audit): move v40 indices out of _SYSTEM_SCHEMA + update test_repositories to unpack query() tuple _SYSTEM_SCHEMA CREATE INDEX on audit_log(timestamp) failed when migration tests hand-roll a bare audit_log (id, action) without the timestamp column. Fix: remove indices from _SYSTEM_SCHEMA; add ADD COLUMN IF NOT EXISTS guards for timestamp and other pre-v40 columns in _v39_to_v40() so the upgrade path is safe on any hand-rolled schema; call _v39_to_v40 explicitly in the fresh-install (current==0) path to restore index creation there. Also unpack the (rows, next_cursor) tuple from AuditRepository.query() in the three TestAuditRepository tests that still treated it as a list. * docs: CHANGELOG entry for Activity Center MVP * chore: refresh stale module docstring in app/api/activity.py * feat(cli): agnes admin activity — terminal access to Activity Center (timeline + health + sync) * fix(db): _v39_to_v40 — add IF NOT EXISTS guard for 'action' column The v39→v40 ladder step adds defensive ADD COLUMN IF NOT EXISTS for every audit_log column so a hand-rolled bare audit_log (id only) is safe through the ladder. 'action' was missing from the guard list, causing CREATE INDEX idx_audit_action_time to fail on tests that stub audit_log with only an id column (tests/test_e2e_extract.py:: TestSchemaMigration::test_migration_preserves_and_extends). Local 6/6 schema tests + the previously-failing CI test pass. * docs(spec): platform telemetry epic — Boss directive + Activity Monitoring plan rebased onto v40 (stacked on zs/spec-activity-center) * feat(db): schema v41 — 7 usage_* tables for telemetry (events, summary, rollups, attribution) * chore(db): tighten v41 — usage_session_summary.session_id NOT NULL + upgrade test asserts all 7 tables * feat(usage): UsageAttributionRepository — replace/delete/lookup over usage_attribution_* tables * refactor(marketplace): extract list_inner_skills/agents/commands to src/marketplace_listing.py for reuse * feat(usage): explode plugin attribution on marketplace sync + store entity write; backfill script * refactor(marketplace): finish src/marketplace_listing.py extraction — drop duplicate _list_inner_* + _parse_frontmatter from app/api/marketplace.py * feat(usage): promote attribution helpers to src/usage_attribution_helpers.py; hook update_entity rename + bundle-swap; clarify best-effort semantics * feat(usage): UsageProcessor real extraction + rollup rebuild + 10 fixture-driven tests * fix(usage): include tool_id in event hash + executemany + rollup transaction (critical multi-tool-turn drop fix) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * feat(marketplace): popularity stats — invocations_30d + trend + sort=most_used\|trending + Most Popular section * feat(admin): /admin/users/<id> Sessions section — list + single-file + bulk-zip downloads (audit-logged) * feat(usage): admin export endpoint + CLI — csv/json/parquet streaming, filters, audit-logged * feat(usage): agnes admin ask — LLM Text-to-SQL over usage_events with SELECT-only validator (audit-logged) * feat(usage): reprocess + prune endpoints + scheduler daily prune job + CLI * docs: PLATFORM_SETUP.md operator playbook + HOWTO/ cookbook (5 guides + index) Adds docs/PLATFORM_SETUP.md as a consolidated operator playbook covering bootstrap, TLS, marketplaces (curated + flea), scheduler env vars, telemetry extraction/export/ask/prune, privacy posture, and daily routine. Adds docs/HOWTO/ with 5 analyst cookbook guides: first query, snapshots for remote tables, private sessions, feedback + admin ask, and customizing skills. Existing setup docs (QUICKSTART, DEPLOYMENT, ONBOARDING, HEADLESS_USAGE) get a one-line cross-reference at the top pointing to PLATFORM_SETUP.md. * docs(changelog): platform telemetry epic — usage_* foundation + surfaces + admin access + docs Comprehensive [Unreleased] entry covering: usage_events/session_summary/ tool_daily/plugin_daily tables (v41), attribution lookup tables, backfill script, marketplace Most Popular + invocation chips + sort, admin Sessions section, export/ask/reprocess/prune endpoints + CLI mirrors, Activity Center (v40), PLATFORM_SETUP.md + HOWTO/ docs, and operations notes for v41 upgrade. * fix(security): block DuckDB read_/http_/glob functions in usage_ask validator + symlink escape guard in session zip + clarify mark-private semantics * fix(admin): parquet export tempfile cleanup on COPY failure + correct processed-first sort on /admin/users/<id>/sessions * feat(audit): close 8 production audit gaps — query (local/remote/hybrid), catalog/schema/sample, snapshot estimate/create, check-access * feat(ui): /admin/usage summary dashboard + per-user activity tab on /admin/users/<id> * fix(audit): cap error messages at 200 chars + audit user_activity reads + recursion guard on usage.summary * fix(audit): catalog.list audits on error path + clean up deferred json import * fix(ux): client_kind=cli for PAT auth + timeline empty state + email-instead-of-uuid + nav reorder + help text + loading indicators + ask doc * feat(observability): unify /admin/activity into single page with saved views - KPI cards (events, users, error rate, p95) clickable as quick-filters - Faceted filter dropdowns populated from audit_log in the current window - Sortable audit table, cursor pagination, per-row JSON side panel - Saved views (schema v43: user_observability_views) — per-user state - Top bar: window selector + 30s Live toggle + saved views dropdown - /admin/scheduler-runs → 308 redirect (source=scheduler filter) - New endpoints: /api/admin/observability/{facets,kpis,views} * test: update activity + scheduler-runs tests for unified page - test_admin_activity_page_renders asserts new structural anchors - test_admin_scheduler_runs_page_admin_only asserts 308 redirect * fix(observability): respect [hidden] on modal + side panel CSS `display: flex` on .obs-modal beat the [hidden] attribute's UA display:none, so the save-view modal rendered on page load and Cancel clicks couldn't dismiss it. Gate the modal's flex layout on :not([hidden]); add the same display:none guard prophylactically to .obs-panel and .obs-views-panel. * feat(observability): user enrichment in audit + interactive /admin/usage Activity: - /api/admin/activity now joins users for user_email + user_name per row - User column renders "name (id-prefix)" or "email (id-prefix)" instead of an opaque truncated UUID; falls back to id when the user record is missing Usage: - /admin/usage rewritten as the same filter/group-by/search pattern as /admin/activity. Faceted dropdowns (User / Tool / Source / Event type) populated from usage_events; debounced free-text search across tool_name / skill_name / subagent_type / command_name - New endpoints /api/admin/usage/{facets,kpis,query}; the query endpoint supports group_by in {day, username, tool_name, source, ref_id} with sort + offset pagination, plus an ungrouped raw-events mode - 4 KPI cards (events, distinct users, distinct tools, error rate) are clickable quick-filters; clicking a grouped row applies the bucket as a filter - Old static `?window=7d\|30d\|all` server preload removed; all state is client-side via since_minutes + group_by + filters in the URL * fix(observability): clearer labels, all-column sort, drop saved views UI - Rename page titles: "Activity" → "Server activity", "Usage" → "Tool usage" with a one-line subtitle on each explaining what the page covers and linking the other one. The two pages source different data (audit_log vs usage_events) and the previous labels conflated them. - Drop the saved-views dropdown + save modal from /admin/activity. The modal pop-open bug was the trigger; the value wasn't there yet. The /api/admin/observability/views CRUD + DuckDB table stay in place. - Rename "Live (30s)" to "Auto-refresh (30s)" with a tooltip clarifying that it's the re-fetch rate, not the time range. Time range now labeled "Time range" instead of "Window". - All audit-table columns are sortable (User, Source, Action, Resource, Result added); sort is page-local with a Jinja comment explaining the trade-off. Same for raw usage rows. - Fix duplicate sort-arrow bug — the literal "▼" in the Time th HTML was rendering alongside the CSS ::before arrow. Removed the literal; CSS is the single source of truth. * feat(observability): global Sessions browser + transcript viewer + CLI Web: - /admin/sessions — list every collected session JSONL across all users with time-range, user, model, errors-only and free-text filters. Default sort surfaces error-heavy sessions first. KPI cards (sessions, distinct users, sessions w/ errors, tool error rate) clickable as quick-filters. - /admin/sessions/<username>/<file> — transcript viewer rendering the JSONL chronologically: user prompts, assistant text, tool calls (with JSON input) and tool results (with flattened output). Errors get a red border + chip and a "Next error" navigation button at the top. - Admin dropdown gains a "Sessions" link. API: - GET /api/admin/sessions/{list,kpis,facets} — filtered cross-user reads off usage_session_summary - GET /api/admin/sessions/{username}/{file}/transcript — parses JSONL via the existing services.session_pipeline.lib, returns chronological events - GET /api/admin/sessions/{username}/{file}/download — JSONL stream, same path-safety guards as the per-user endpoint, audit-logged CLI: - `agnes admin sessions list [--user X] [--errors] [--since 7d]` — table output with `!` prefix on rows that hit a tool error - `agnes admin sessions show <username> <file>` — transcript dump, with `--errors` to print only the failed tool_result blocks - `agnes admin sessions download <username> <file> [-o path]` - `agnes admin sessions kpis` — top-level numbers * feat(internal): expose telemetry tables to agnes query with row-level RBAC Three new registered tables backed by system.duckdb, queryable through the same /api/query plumbing analysts use for Keboola / BigQuery / local sources: agnes_sessions → usage_session_summary (filter: username) agnes_usage → usage_events (filter: username) agnes_audit → audit_log (filter: user_id) RBAC is per-row, not per-table: admins see every user's rows; non-admins see only their own. The filter is built server-side from the auth user dict; non-admin filter values are regex-validated before SQL interpolation. Implementation: - new connector connectors/internal/ with access (filter+exec) + registry (idempotent table_registry seed at startup) - /api/query detects internal table refs and short-circuits to a CTE wrapper that prepends "WITH agnes_x AS (SELECT * FROM <src> WHERE …), …" then "SELECT * FROM (<user_sql>) AS _q". DuckDB cursor on the shared system.duckdb handle — opening parallel handles / ATTACH on the same file is blocked process-wide. - mixing internal + BQ / registered local tables in one SELECT is rejected (v1 limitation) - src.rbac.can_access_table waves internal tables through for all authenticated users; row scoping is the actual security control - /api/v2/schema and /api/v2/sample gained internal branches; sample intentionally skips its cache because rows are RBAC-scoped per caller - audit row written as action='query.internal' with is_admin flag Tests: connectors/internal/access — RBAC, filter clause, schema, CTE wrapper coexistence with user-supplied aggregations, unsafe-username rejection. 16/16 passing. Motivating queries this enables: SELECT tool_name, COUNT() FROM agnes_usage WHERE is_error GROUP BY 1 ORDER BY 2 DESC -- analyst self-introspection: which tools fail for me? SELECT user_id, COUNT() FROM agnes_audit WHERE action = 'session.transcript_view' GROUP BY 1 -- admin: who's been looking at whose session transcripts? * feat(admin): group dropdown into 5 named sections + internal tables in /catalog Admin dropdown gains section headers so admins can land on the right page without re-reading the full menu: Activity Center Server activity / Tool usage / Sessions Users & Access Users / Groups / Resource access / Tokens Data Tables Agent Experience Curated Marketplaces / Flea Submissions / Agent Setup Prompt / Agent Workspace Prompt Server Server config "Agent Experience" frames the curated content + prompts as one cluster — it's all admin-controlled material that shapes what an analyst's AI agent encounters. "Configuration" → "Server" since only one item lives there now. Renamed the section's first two items: "Activity" → "Server activity" (matches page H1) "Usage" → "Tool usage" Also fixes /catalog visibility of the internal tables (agnes_sessions / _usage / _audit) for non-admin users: ``app.auth.access.can_access`` short-circuits to True for resource_type='table' + an internal-table id. Without this, non-admins saw the tables in /api/v2/catalog (which uses the same RBAC bypass) but not on the /catalog HTML page (which calls can_access directly, requiring a resource_grants row internal tables don't have). CSS for `.app-nav-menu-section`: small caps, muted, non-clickable; first section trims top padding so the panel doesn't open with an awkward gap. * refactor(admin): move corporate memory into Admin > Agent Experience Memory link was the only admin-only entry in the primary nav (gated by session.user.is_admin). Moves it into the Admin dropdown under Agent Experience, alongside Curated Marketplaces / Flea Submissions / Prompts — all admin-curated content that shapes what an analyst's AI agent encounters. Renamed the nav label to "Shared Knowledge" to match what the page actually is (admin-curated organisational knowledge from session verification, surfaced to agents). URL stays at /corporate-memory; the route still gates on require_admin per the existing comment. Side effect: primary nav (Home / Marketplace / Data Packages) is now uniform for every authenticated user — no conditional admin-only entry. * ui: rename admin entries to Curated Knowledge / Init Prompt / Workspace Prompt - "Shared Knowledge" → "Curated Knowledge" (parallel with "Curated Marketplaces" in the same Agent Experience section; "curated" tells the admin what they do there — review + approve) - "Agent Setup Prompt" → "Init Prompt" (matches the `agnes init` flow it actually drives) - "Agent Workspace Prompt" → "Workspace Prompt" (the "Agent" prefix was redundant — every item in the section is agent-facing) Renames page titles + H1s on /admin/agent-prompt and /admin/workspace-prompt to match. * refactor: rename Usage → Telemetry across user-facing surfaces External surfaces all switch; internal Python module / file names and the physical DB tables (usage_events, usage_session_summary, usage_tool_daily, usage_plugin_daily) stay — renaming them would force a schema migration + a redo of the LLM Text-to-SQL prompt for no analyst-visible win. Changes: - Admin dropdown: "Tool usage" → "Telemetry" - Page H1 / <title>: same - URL: /admin/usage → /admin/telemetry; old URL 308-redirects - API prefix: /api/admin/usage/* → /api/admin/telemetry/* - CLI: primary command `agnes admin telemetry …`; `agnes admin usage` kept as a deprecated alias so existing operator scripts keep working - Internal data-source table id: agnes_usage → agnes_telemetry. The registry seed now evicts any stale internal-source row whose id no longer matches INTERNAL_TABLES, so the old `agnes_usage` row is removed from table_registry on next app boot - All tests + JS endpoint paths updated * test(rbac): include auto-appended internal tables in expectations get_accessible_tables now appends agnes_sessions / agnes_telemetry / agnes_audit to every authenticated user's accessible-tables list so the internal data source shows up in /catalog. The two existing rbac tests asserted hardcoded list shapes that pre-dated the change. Rewritten to assert "granted tables + the canonical internal-table set" instead of literal lists, so the test stays correct if the internal table roster changes again later. * ui: visual dividers between admin-dropdown sections Adds a 1px top border + 6px top margin to every section header except the first, so the five named groups (Activity Center, Users & Access, Data, Agent Experience, Server) read as visually separated clusters. The header itself stays small-caps + muted as before — the border is additive. * ui(memory): match obs-topbar visual on /corporate-memory The Curated Knowledge page (linked from the admin dropdown's Agent Experience section) opened straight into the stats bar — no title, no subtitle, no shared chrome with the other admin pages. Adds an obs-topbar-style header at the top of .container-memory: - H1 "Curated Knowledge" - subtitle explaining what the page is + how AI agents pull from it The `.ck-` class set duplicates the inline obs- styles from /admin/activity etc. for this one page; promoting the obs-* class set to style-custom.css for shared reuse is the obvious next step (4 pages already inline the same CSS), tracked as a follow-up. Page <title> also renamed from "Corporate Memory" → "Curated Knowledge". * ui(tables): list Agnes internal tables in /admin/tables + group in /catalog /admin/tables previously rendered three per-source-type listings (BQ / Keboola / Jira) and dropped any row whose source_type didn't match — so the agnes_sessions / agnes_telemetry / agnes_audit rows seeded into table_registry were invisible. Adds a fourth read-only section "Agnes internal tables" that filters source_type === 'internal' and renders the same registry-table layout the other sections use, with two changes: - no Register button (these rows are seeded on every app boot from connectors/internal/registry.py) - Edit + Delete actions hidden (any change would be reverted on the next start). Manage access stays so admins can still inspect. Mode badge picks up a new mode-internal CSS class (teal accent) so the display doesn't lie and call it "local". In /catalog, internal tables now group under an "agnes" accordion section (bucket="agnes" on seed) instead of falling into the catch-all "default". Single source of truth for which tables exist; admins find them where they expect. * ui(tables): Agnes internal as a 4th tab next to BQ/Keboola/Jira Previous iteration mounted the internal-table listing as a separate standalone card under the tab strip. Reshapes it to a proper tab-content section so admins switch between data sources via one consistent nav (BigQuery / Keboola / Jira / Agnes internal). - New tab button "Agnes internal" in the tab-nav. - The listing card becomes <section id="tab-content-internal" class="tab-content">; switchTab() already routes by id so no JS change beyond extending the hash allowlist for direct #internal links. - Tab content keeps the read-only treatment from the previous commit (no Register button, no Edit / Delete in renderRegistryListing). * ui: rename Curated Knowledge → Curated Memory Settles the naming back on "Curated Memory" — parallel structure with "Curated Marketplaces" in the same Agent Experience section, and zero rename ripple: URL (/corporate-memory), API (/api/memory/), CLI (agnes admin memory), and Python modules all stay on "memory" so the admin label finally lines up with the underlying surfaces. The "Curated" prefix still tells admins what they do on the page (review pending → approve / mandate / reject) and reads as a sibling of "Curated Marketplaces" right next to it in the dropdown. Touches: admin dropdown label, page <title>, page H1. DB tables stay on knowledge_ (already the canonical naming for the data shape). * ui: rename "Server activity" → "Audit log" "Audit log" is what the page actually is — server-side audit_log table rendered with KPI cards + filter bar + sortable table. The "Server activity" label confused the term with Claude Code session telemetry (Telemetry page) and didn't make the source/concept clear. Touches: - Admin dropdown nav label - /admin/activity page H1 + subtitle - /admin/telemetry subtitle cross-link - test_activity_api page-renders assertion URL (/admin/activity) and API (/api/admin/activity/) stay — the "activity" name has stuck at the route layer for a year; rerouting those would churn dashboards/bookmarks for zero analyst-visible win. ui(admin-nav): gray band on each section header for clearer separation Previous iteration used a 1px top border between section labels — the labels still blended into the items above/below at a glance. Switches to a light gray background band per section header, extended edge-to- edge inside the panel via negative horizontal margins. Bolder font-weight (700) reinforces the separation; bumping the font color isn't needed because the band itself does the work. First section's header tucks into the panel's top border-radius so the band reaches the corners without a gap. * ui(catalog): rename internal-table category to "Agnes Internal" `bucket` is what /catalog renders as the accordion category header verbatim — "agnes" lowercase didn't read as a real category name and got confused with a system identifier. Bumps to "Agnes Internal". Seed re-applies on every app boot so existing rows pick up the new bucket value via `ON CONFLICT (id) DO UPDATE`. * ui(catalog): split Agnes Internal into its own card on /catalog Previously the three internal tables landed inside the "Core Business Data" card under an "Agnes Internal" accordion alongside Keboola / BQ buckets — readers conflated system telemetry with business datasets, and the data_stats header counter ("3 tables · ~X rows total") only ever counted synced rows so internal tables looked invisible. Split the catalog page into two cards: - Core Business Data: only non-internal source_types (Keboola, BQ, Jira). Accordions group by bucket as before. Stats counter reflects this card's tables. - Agnes Internal: a dedicated card with its own visual treatment (teal accent matching the mode-internal badge in /admin/tables). Flat list (no accordion — only 3 rows, never grows here), each row carries the canonical `agnes query` snippet. Read-only — no profiler click, no In-stack toggle, no sync metadata. Route adds `internal_card` context object; template renders the new card only when it's non-None. * fix(rbac): hide internal tables from /admin/access + drop "my" framing Two related cleanups for the Agnes-internal tables: 1. /admin/access (resource grants) no longer lists them. The `can_access` check has a hardcoded internal-table bypass — security is row-level (per-request view filter), so a table-grain `resource_grants` row would do nothing. Surfacing them in the UI let admins set up grants that silently no-op. Filter at the `_table_blocks` projection so the UI tree never sees them. 2. Display names drop the analyst-perspective "my" framing: "Agnes — my sessions" → "Agnes sessions" "Agnes — my telemetry events" → "Agnes telemetry events" "Agnes — my audit log" → "Agnes audit log" The "my" only makes sense from the querying analyst's seat (`SELECT … FROM agnes_sessions` returns their rows); on /admin/* pages where admin sees / configures them across users, the pronoun was misleading. Description text now spells out the row-level RBAC contract explicitly. Display names update via TableRegistryRepository.register's ON CONFLICT UPDATE on next app boot; no manual cleanup needed. * ui: subtitle notes about agnes_* tables on each Activity Center page The recursive observability story — Agnes serves its own audit / telemetry / session data through the same `agnes query` plumbing analysts use for business data — wasn't surfaced anywhere on the admin pages that show that data. Three pages get a one-liner with the canonical `agnes query` snippet + the RBAC contract (analysts see their own rows, admin sees all): - /admin/activity (Audit log) → agnes_audit - /admin/telemetry (Tool usage) → agnes_telemetry - /admin/sessions → agnes_sessions Sets up the discovery moment for admins: they're reading the page, they see "you can query this from Claude Code", they remember it when an analyst asks "how do I find my own failed tool calls?". * ui(tables): explain "Show log" empty-state on /admin/tables Cache warmup log <pre> renders with a dark background and is only populated by the SSE stream during a Re-warm all run. Opening the page cold + clicking Show log just revealed a black bar with no context — admins couldn't tell what they were looking at. Adds an inline paragraph above the <pre> explaining what the log is, the row format, when it fills in, and where to find the historical audit trail (/admin/activity). The actual <pre> stays empty until SSE events arrive, but the surrounding copy carries the meaning. * ui(tables): auto-open cache-warmup log on Re-warm all click A Re-warm all run takes ~24s per remote BQ row. With the <details> collapsed by default, operators saw the button disable, watched a quiet ~24s pass, and assumed nothing had happened — the streaming log was hidden behind a closed disclosure. Two small JS tweaks: - cacheWarmupRun() opens the details on click, so streamed lines appear without an extra interaction - cacheWarmupOnStart() hides the inline hint paragraph the moment real log content lands, so the dark log block isn't competing with redundant context Hint paragraph also clarifies that only `query_mode='remote'` BQ rows are warmed — operators with only materialized/internal tables would see total=0 and the page would "do nothing" by spec. * ui: trim Agnes internal copy across surfaces Descriptions had grown to explain the extraction pipeline ("parsed out of session JSONLs"), the underlying table ("Backed by usage_session_summary"), the RBAC mechanic ("row-level RBAC at query time — analysts see their own; admin sees all"), and the SQL snippet. Every implementation detail meant another rewrite on the next iter. Strips to one stable line per surface: what the data is, plus "Also available locally for analysis". Mechanics live in code + docs; the page copy says what the user needs to know. Touched: - connectors/internal/access.py: INTERNAL_TABLES descriptions - activity_center.html / admin_usage.html / admin_sessions.html subtitles - catalog.html Agnes Internal card description + row strip - admin_tables.html "Agnes internal" tab hint * fix(internal): is_user_admin arity bugs + + saved-view payload cap Round-1 code review (PR #278) caught two blocking bugs and three nits. Blocking — both `is_user_admin(user)` (single dict arg) calls raised TypeError. is_user_admin signature is `(user_id, conn)`. Affected: - app/api/query.py:_run_internal_query — every POST /api/query that references agnes_sessions / agnes_telemetry / agnes_audit blew up with a 500. The headline analyst-facing feature of this PR was unusable through the API. - app/api/v2_sample.py — same shape; `GET /api/v2/sample/agnes_` returned 500. Both fixed to call `is_user_admin(user.get("id"), conn)`. Added two FastAPI-level tests in test_internal_data_source.py that go through the TestClient — the existing unit tests on `execute_internal_query` and `build_filter_clause` skipped the request-handler layer where the bugs lived, which is why this landed. Nits also closed: - connectors/internal/access.py: `+` allowed in _USERNAME_RE / _USER_ID_RE so RFC 5321 email local-parts (alice+test@x) resolve correctly without hitting InternalAccessError. - app/api/observability.py: saved-view payload capped at 64 KiB to prevent an admin from bloating system.duckdb with a malformed save. fix(security): close non-admin data-leak via underlying-table refs PR #278 R2 review surfaced a non-admin-exploitable bypass: SQL whose string literal contains 'agnes_sessions' routed into the privileged internal-query path, then queried the underlying physical table (usage_session_summary / usage_events / audit_log) directly, escaping the CTE wrapper's row filter. Two reinforcing defenses: 1. find_internal_refs() now strips single-quoted string literals before scanning for alias names — a literal alone no longer routes the request into the privileged code path. 2. execute_internal_query() rejects non-admin SQL that references the underlying physical tables (usage_, audit_log). The CTE wrapper only scopes the agnes_ aliases; a direct FROM on the base table — or a shadowing inner WITH that still has to read the base table — bypasses RBAC. Block before execution with an actionable error pointing to the agnes_* alias. Admins are unaffected (god-mode short-circuit on the filter clause). 3. tests/test_internal_data_source.py — three new negative tests covering literal-only matches, direct-table refs, and CTE shadow attempts. Also tightens usage_ask.py's SELECT-only validator: pragma_table_info, pragma_storage_info, pragma_database_, and duckdb_tables / columns / views / indexes / schemas are reflection functions that leak metadata the analyst question shouldn't reach. \bPRAGMA\b in _FORBIDDEN never matched the function-call form (word-boundary between `A` and `_`). fix(security): dynamic denylist for non-admin internal queries R3 review (PR #278) caught a wider data-leak than R2: the underlying- physical-table guard listed only the 7 usage_* + audit_log tables, but system.duckdb has 30+ other sensitive tables — users (emails + ids), personal_access_tokens, resource_grants, user_groups, user_observability_views, store_, marketplace_, knowledge_, etc. A non-admin SQL like SELECT FROM agnes_sessions UNION ALL SELECT email, id, … FROM users LIMIT 1 would leak every user's row. Replaces the hardcoded denylist with a dynamic allowlist — non-admin SQL may reference ONLY the registered agnes_* aliases. Every other table in `information_schema.tables` (main schema) is rejected. Future migrations that add a new sensitive table are automatically covered without re-editing this module. Also strips SQL comments (`/* /` and `--`) before the identifier scan so a comment-wrapped table name (`//users//`) can't slip past the regex. Four new negative tests pin: `users`, `personal_access_tokens`, block-comment wrap, line-comment wrap. Plus: per-user view-count cap (100) on /api/admin/observability/views so an admin can't fill system.duckdb with thousands of saved views. release: 0.54.0 — Activity Center + Telemetry + Sessions + internal datasource Cuts the work shipped across this PR (Activity Center build, recursive internal data source) into a versioned release. Bumps pyproject.toml to 0.54.0; renames the top of CHANGELOG.md from [Unreleased] to [0.54.0] — 2026-05-12 with a header summary; opens a fresh [Unreleased] section for the next round. --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-12 22:41:19 +02:00

Author

SHA1

Message

Date

Vojtech

001e5ce40e

feat(web): /home value-first redesign + unified page-shell across app (#366 )

* feat(web): value-first /home reskin (CEO mock palette + pillars + first-session)

Restructures `/home` to lead with product value instead of install steps,
matching the CEO mock proposed for the homepage:

- New intro hero on top — eyebrow `Welcome, {{ display_name }}`, H1
  `{{ instance_brand }} is your team's AI workspace`, lede framing the
  product as an "AI Chief of Staff", two CTAs (`Set up in ~15 min →`
  jumps to the wizard, `Just browse — no install needed` jumps to
  `#look-around`), and a four-pillar row (Data packages · Plugins ·
  Skills · Memory). Renders for both onboarded and not-onboarded users
  so the value framing is consistent across visits.
- New `first-session` narrative — five-beat walkthrough (launch → pick
  project → memory loads → ask → close) with mock terminal frames
  carrying traffic-light dots, prompts, and dimmed system output.
- Setup wizard chrome — progress chip (`Step 1 of N · ~15 min ·
  One-time · Reversible`), thin progress bar, and per-step number
  badges on each `.install-block` so the wizard reads as bounded
  instead of an open-ended scroll.
- Palette shift from blue to green/navy: `--hp-primary` aliases
  `#2ea877` (mint), `--hp-hero-bg` is navy `#0f1b3a`, code panels stay
  near-black `#0c1224` with warm-yellow `#ffd866` accents. The token
  alias is reused so downstream rules pick up the new accent
  automatically; instance theme overrides via
  `config.theme_overrides()` still win.
- VS Code surface tile carries a `Recommended` pill; the existing
  "Want to look around first?" section is renamed to `Explore your
  workspace` and gets the `#look-around` anchor.

All test-pinned class names and IDs (`install-hero`, `install-block`,
`home-mock`, `self-mark-btn`, `setupClaudeBtn`, `offboard-strip`,
`home-getting-started`, `home-gs-item`, `home-overview`,
`home-usage`) preserved as structural anchors; new visual language
overlays via additional classes. Existing onboarded/not-onboarded
branching, `/api/me/onboarded` POST, status frame gating, post-CTA
modal, and OS-tab switching JS unchanged. Stray `~/FoundryAI`
comment swapped for `~/{{ workspace_dir }}` to honor the
vendor-agnostic OSS rule.

51 home tests pass without modification.

* fix(web): /home palette inversion — dark intro hero on top, light setup card below

Previous reskin commit kept the install-hero as a dark navy gradient and
rendered the new intro hero as a light surface — opposite of what the CEO
mock specifies. Playwright comparison vs `data/ceo_home.html` confirmed:

- CEO mock: dark navy hero at TOP (with white pillars on navy), LIGHT
  white setup card BELOW with light step rows and dark code panels
  inset.
- Previous: light intro hero on top, dark setup card below. Inverted.

This patch flips both:

- `.home-hero-intro` now: dark navy gradient `#0f1b3a → #1a2a5f`, green
  radial glow in the corner, green eyebrow, white H1 (`accent` span
  green), rgba-white lede, green pill primary CTA, translucent-white
  secondary CTA, pillars row separated by hairline border-top with
  green square-dot bullets in front of each pillar header.
- `.install-hero` and `.install-block` now: white surface card with
  thin green accent strip across the top, light step rows split by
  hairline borders, green-tinted step-number circles (`#e6f9f0` bg,
  `#1f8a5e` ink), green progress chip + bar. Code panels
  (`.install-cmd`) and terminal frames stay dark — they're the "type
  this" surfaces.
- All previously-rgba-white descendants of `.install-hero`
  (close button, eyebrow, h1, lead, links, code chips, OS tabs,
  install notes, setup-CTA button, self-mark fallback, auto-detect
  badge, terminal-howto disclosure) re-skinned for light surface.

All 12 home page tests still pass (no markup changes, only CSS).

* fix(web): /home parity polish — system font + mock sizes + blue info hint + gray step-num

After v2 palette flip, user comparison vs CEO mock surfaced three
remaining gaps in the wizard area:

- Font stack mismatch: Agnes inherits Inter via `style-custom.css`,
  but the CEO mock uses the platform system stack (San Francisco on
  macOS, Segoe UI on Windows). The rendered weight/letterforms read
  noticeably different. `.home-mock` now declares
  `-apple-system, BlinkMacSystemFont, "Segoe UI", Roboto, sans-serif`
  for itself and all descendants, with the monospace stack reserved
  for `code`/`kbd`/`pre`, `.install-cmd`, and `.terminal-body`.
- Step number badges were green-tinted; mock uses neutral gray
  (`#f0f2f6` bg, `#4a5168` ink) — green is reserved for the "done"
  state. Switched to `--hp-surface-dim` + `--hp-text-secondary`.
- "Don't have a terminal open?" disclosure was an amber/yellow
  variant left over from the old dark-hero palette. Mock uses a
  blue info-hint vocabulary (`--info-bg: #eef3ff`,
  `--info-line: #4f7cf2`, `--info-ink: #1c3994`) with white kbd
  chips. Added the info-* tokens to the `:root` block and re-skinned
  `details.terminal-howto` (incl. summary, body, kbd) to match.

Step-body type sizes also brought in line with the mock spec —
`.install-block .label` (step h3 equivalent) is now 17px / 700 with
6px gap; `.install-note` body type is 14px / 1.55.

`--hp-info-bg / --hp-info-ink / --hp-info-line / --hp-warn-bg /
--hp-warn-ink / --hp-warn-line / --hp-surface-dim` added as
first-class tokens so future hint/warn callouts pick the same colors
without a duplicate vocabulary.

12/12 home tests pass.

* feat(web): centralize design tokens + reword /home wizard to 6 steps (CEO mock parity)

Two intertwined changes that touch both global design + /home structure:

GLOBAL TOKEN SHIFT (app/web/static/style-custom.css)
- `--primary` flipped from blue `#0073D1` to green `#2ea877` — same brand
  alias the rest of the app referenced, so every page picks up the new
  accent automatically. Old `--primary-dark` / `--primary-light` recolored
  to match.
- New tokens added: `--brand-accent`, `--hero-bg`, `--hero-ink`,
  `--surface-dim`, `--info-bg/ink/line`, `--warn-bg/ink/line`. Brings
  the global vocabulary in line with the CEO mock's `:root` block so
  callouts and hero surfaces don't have to invent local tokens.
- `--font-primary` switched from Inter-led stack to the system stack
  (`-apple-system, BlinkMacSystemFont, "Segoe UI", Roboto, "Inter",
  system-ui, sans-serif`) so weight/letterforms render identically on
  macOS (San Francisco) and Windows (Segoe UI) — matches the mock and
  avoids a font-loading flash for analysts without Inter installed.
- Shadow tints re-cast in navy `rgba(15,27,58,...)`; focus ring uses
  the new green `rgba(46,168,119,0.25)`.
- `.app-nav-link` font-size 13px → 14px, padding 6px 12px → 8px 14px,
  hover bg → `--primary-light` (mint), color → `--primary-dark`.
  `.app-nav-menu-item.is-active` re-tinted to the same green system.
- Sweep across 26 templates (style-custom.css + 25 template files)
  replacing every hardcoded `#0073D1` / `#005BA3` / `#E6F3FC` /
  `rgba(0,115,209,…)` / `rgba(0,86,163,…)` with token references or
  the new green hexes — 175 occurrences total. Pages that styled their
  own buttons / borders / shadows pick up the new brand color without
  per-page overrides.

/HOME WIZARD: 6 STEPS PER MOCK (app/web/templates/home_not_onboarded.html)
- Step 1 reworded `Install Claude Code on your computer` + `~3 min`
  subhead (mock copy).
- Step 2 renamed `Pick a folder for {{ instance_brand }}` (was
  `create your workspace folder`) — same `mkdir` command, mock-aligned
  framing.
- NEW Step 3 `Open a terminal inside that folder` — no shell command,
  just the "you are standing in the right directory" reassurance with
  a Finder/PowerShell/file-manager howto disclosure. Mirrors the CEO
  mock's Step 3.
- Step 4 (was Step 3, gated by `home_automode.show`) renamed
  `Launch Claude with auto-approve on`. Body copy lightly updated so
  it references "the next step" instead of "Step 4".
- Step 5 (was Step 4) renamed `Get the install script and paste it
  into Claude`. The setup-cta-lead now explicitly says
  "pasting the script into Claude Code will install {{ instance_brand
  }}…" so existing test assertions pinning the `install Agnes`
  substring still match.
- NEW Step 6 `Optional: create a one-word shortcut for next time` —
  prints an `echo 'alias {{workspace_dir|lower}}=…' >> ~/.zshrc`
  one-liner for Unix and an `Add-Content $PROFILE …` equivalent for
  Windows. OS tabs + copy buttons reuse the existing wizard chrome.
- Progress chip dynamic: `Step 1 of 6` when home_automode is on,
  `Step 1 of 5` when off. Progress bar fill `100 // total_steps` so
  the bar sits at 16-20 % on first paint.
- `.step-lede` token added for the new short body copy beneath each
  step label (14.5px / ink-soft).
- `macOS / Linux / WSL` tab labels changed to `macOS / Linux` per
  user instruction. Terminal-howto `WSL:` paragraph dropped; the
  paste-shortcut hint now reads `(Linux)` instead of `(Linux/WSL)`.
  Functional WSL handling in `connector_prompts.py` (it's a Linux
  detection fallback, not user-facing label) preserved.
- `setup_instructions.py` Claude Code install hint:
  `npm (Linux / WSL)` → `npm (Linux)`.

SURFACES — 4 CARDS PER MOCK
- Replaced the 3-tile `.home-usage-grid` with a 4-card grid:
  - VS Code (Recommended) — `.surface-card.feature`, green ring,
    DAILY USE eyebrow + 5-step numbered list + `Open VS Code setup
    guide →` link to `/setup-advanced#vscode`.
  - Terminal — QUICK ACCESS eyebrow + 4-step list.
  - Claude Code (Desktop app) — CONNECT IT eyebrow + 4-step list.
  - Cowork (claude.ai) — `.surface-card.incomplete`, warn-tinted
    border + `Instructions needed` badge + a TODO callout describing
    the missing content. The card is intentionally honest about the
    gap rather than hiding it.

TEST UPDATES
- `test_web_home_page.py` negative onboarded-state assertions
  rebased on the new step labels (6 entries instead of 4).
- `test_home_route_resolution.py` `test_home_renders_automode_block_by_default`
  + its `_when_env_off` counterpart now check the new
  `Step 4 — Launch Claude with auto-approve on` label.

* fix(web): /home section content + layout — verbatim mock match

User comparison flagged several remaining gaps; this patch rewrites
the three lower sections of /home to match the CEO mock spec exactly:

FIRST-SESSION (5 beats)
- h2 28px / 700 / -.5px tracking (was 19px / 600).
- lede 18px ink-soft (was 13.5px secondary).
- `.session-walk` wrapper, 36px gap between beats (mock spec).
- `.session-step` grid 48px / 1fr, gap 22px — number circle on
  the left, content on the right.
- `.session-num` 40 × 40 circle with SOLID GREEN bg (`--primary`)
  and WHITE text + soft green shadow (was 28px mint pill w/
  dark-green text).
- `.session-content h3` 18px / 600 (was 14.5px / 600).
- `.session-content > p` 15px.
- `.session-content .annotation` 13.5px ink-muted body type with
  `strong` for highlighting (replaces the upper-case "WHAT'S
  HAPPENING" eyebrow pattern that didn't match the mock).
- `.session-intro` callout card (white surface + mint icon block)
  framing the "five beats" tagline.
- `.session-tldr` summary box (brand-light bg + brand-dark left
  border) wrapping up the loop.
- Terminal frames re-skinned: `#0c1224` body / `#182241` bar /
  real macOS traffic-light colors `#ff5f57` / `#febc2e` / `#28c840`.
- Terminal body 13px / 1.65 line-height with mock-spec class
  vocabulary: `.you` (yellow input), `.ai-name` (brand bold),
  `.path` (light blue), `.dim` (translucent code-ink), `.caret`
  (blinking cursor).
- Five beats rewritten with mock's exact narrative flow (launch →
  menu → pick → ask → close), vendor-agnostic project names
  (`RevenueAnalysis`, `Onboarding`, etc.) replacing the customer-
  specific `GRPN_*` examples in the mock. Templated `{{
  instance_brand }}` / `{{ workspace_dir }}` / `{{ workspace_dir |
  lower }}` (the shortcut alias) everywhere.

SURFACES (4 cards)
- The section is no longer wrapped in a white rectangle; the
  `.home-usage` class loses its bg + border + padding (mock has the
  cards directly on the page bg).
- h2 28px (was 22px). Eyebrow 12px / 1.5px tracking / brand-dark.
- `.surface-card.feature` (VS Code) now uses 2px green border +
  vertical brand-light → white gradient (was 1px ring).
- `.surface-card.incomplete` (Cowork) uses 2px red border (`#e35e5e`)
  + vertical red-tint → white gradient (was yellow flat bg).
- `.surface-card .steps` panel: inner surface-dim bg + 8px radius
  + 13px font.
- `.surface-foot` top-border + ink-muted (mock spec).
- `.badge-warn` now a solid red box (`#e35e5e` bg + white ink + 4px
  radius) instead of a yellow pill, matching the mock.
- Header layout fixed: the global absorbed `header { display: flex;
  justify-content: space-between }` rule was making the h2 sit on
  the right of the eyebrow; explicit `display: block` override on
  `.home-mock section > header` puts the title on the LEFT under
  the eyebrow as the mock has.

BROWSE — Explore your workspace
- Wrapped in `<section class="browse-section">` with proper
  eyebrow + h2 + lede (was a bare `.section-label` div).
- `.browse-grid` 5-col grid (was responsive auto-fill, 4-card
  layout). Skills tile added as a 5th card linking to
  `/marketplace?type=skills`.
- `.browse-card` mock-spec: 22 20 padding, 28px icon, 15px title,
  12.5px ink-muted desc, hover lifts -2px with brand border +
  shadow-md.

Section wrappers (`.home-usage`, `.first-session`) no longer carry
the white card chrome — they sit directly on the page bg, matching
the mock. Only Getting Started + Overview keep their white cards.

GLOBAL eyebrow vocabulary (`.home-hero-intro .eyebrow`,
`.first-session > .eyebrow`, `.surfaces > header .eyebrow`,
`.browse-section .eyebrow`) all aligned to mock spec: 12px / 700 /
1.5px tracking / brand-dark color / 14px bottom margin.

Hero h1 bumped to 44px / 800 / -1px tracking (was 32px / 600).

51/51 home tests pass.

* fix(web): /home session-intro card + terminal-body verbatim mock match

User comparison flagged three remaining /home gaps; this patch
addresses each:

- `.session-intro` rule was missing — the "five beats" tagline
  rendered as a bare line with no card chrome. Added the mock-
  spec card: white surface, 14px radius, 20×24 padding, 1px
  border + shadow-sm, with a 44×44 brand-light icon block on the
  left.

- Beat 1 terminal-title was `~/{{ workspace_dir }} — zsh` (mock-
  style shell-pwd format), but the user wants every terminal
  frame across all 5 beats to read `claude — {{ instance_brand }}`.
  Updated.

- Terminal-body line structure for beats 2-5 rewritten verbatim
  from the CEO mock:
  - `<span class="prompt">&gt;</span><span class="you">…</span>`
    now has no space between the prompt and user input (mock
    pattern: zero gap, the .prompt's `margin-right: 8px` provides
    the visual separation).
  - Beat 2 menu items use `<strong>[N]</strong>` numbering with
    project entries on indented lines, each project name followed
    by a `<span class="dim">(N ago)</span>` timestamp at a fixed
    column — instead of my prior single-line concatenation.
  - Beat 3 narrative split into 4 stanzas separated by blank lines
    (matches mock): the "Switched to <strong>X</strong>" status,
    then dim Loaded/Last-session lines, then a stand-alone "One
    unprocessed input detected:" pair, then the "Want me to
    process …" question. My prior version dim-wrapped the entire
    block, which looked off.
  - Beat 4 narrative split into headline summary + risks section
    with <strong> heads + bullet lists separated by blank lines,
    matching the mock's "Q1 close summary" / "Open risks" rhythm.
    The Q1 question carries the mock's manual line-break + 2-
    space continuation indent inside the `.you` span — without
    that, terminal-body's `white-space: pre-wrap` would auto-wrap
    awkwardly at a different column than the mock.
  - Beat 5 exit narrative uses two separate dim lines + a
    standalone `.ai-name` "See you next time." line, then prompt
    + caret. My prior version collapsed everything into one dim
    block.
  - Project names changed from customer-specific (`GRPN_*`) to
    generic (RevenueAnalysis, WeeklyReview, Onboarding, OpsDb,
    HRHandShake) so the OSS distribution stays vendor-agnostic
    per CLAUDE.md.
  - `Marketing plan` examples replaced with `Q1 close` so the
    narrative stays plausible for an analyst audience.

12/12 home tests pass.

* fix(web): /home surfaces verbatim mock — VS Code thumb, Terminal expected-output, NEW badge

User comparison flagged three remaining surface-section gaps:

- VS Code surface card was rendering a generic "Screenshot pending"
  placeholder; the mock has a labeled inline mockup
  (`<a class="vscode-thumb">` w/ `.thumb-fallback`) showing the
  recommended 4-pane layout (EXPLORER yellow, TERMINAL 1 purple,
  TERMINAL 2 green, TERMINAL 3 orange) on a dark navy bg + a
  "Recommended layout" caption pill. CSS `.vscode-thumb` block
  added — uses gradient-strip backgrounds to draw the colored
  panel bars without needing a base64 image.

- "Recommended" badge was a pill (999px radius) with
  `--brand-accent` bg + navy text. Mock uses `.badge` instead of
  `.recommend-pill` — solid `--primary` (brand-dark green) bg
  with WHITE text and 4px radius. Replaced the class + CSS rule
  so the badge reads as a tag, not a pill.

- Terminal surface card was missing the "What you should see"
  subsection — mock has an `.expected-output` block showing a
  sample of the welcome menu inside a dim dashed panel. Added the
  block with the mock's exact rendered output (templated to
  `{{ instance_brand }}` + generic project names instead of
  customer-specific GRPN entries) plus the `.expected-output`
  CSS (surface-dim bg + dashed border + `::before` "WHAT YOU
  SHOULD SEE" eyebrow per mock spec).

Also addressed the explore-section feedback:

- Skills browse-card now carries the `new` class so it picks up
  the `.browse-card.new::after` corner badge ("NEW", green bg,
  white text, 10px / 700 / 0.5px tracking) per mock.
- Browse cards align same height via `align-self: stretch` (grid
  default) + `flex-grow: 1` on `.browse-desc` so descriptions
  fill remaining vertical space; previously the Skills tile sat
  shorter because its desc text was longer than others'.

Structural HTML changes to all four surface cards: dropped the
inner `<div class="surface-card-head">` wrapper + `<p
class="surface-pitch">` class in favor of mock's flat layout
(`.what` + `.steps` + `.when-to-use`). `<ol class="surface-steps">`
replaced with `<div class="steps"><strong
class="steps-eyebrow">DAILY USE / QUICK ACCESS / CONNECT IT</strong>
<ol>...</ol></div>` so the eyebrow + numbered list share the
mock's tinted surface-dim panel.

12/12 home tests pass.

* fix(web): align /home setup walkthrough to design spec

- Setup-section header (eyebrow + heading + lede) floats above the
  install hero; install card has no accent strip; step labels drop
  `Step N —` prefix; closing strip is single flex row.
- VS Code surface card renders recommended-layout screenshot from
  `/static/img/vscode-layout.png` with click-to-enlarge lightbox.
- Workspace install path cascades to `~/Desktop/{workspace_dir}` in
  every step, surface card, first-session annotation, and shortcut.
- Step 1 verify text restores Enterprise — Finance and Legal option.
- Step 6 shortcut installs a shell function with arg forwarding
  (`"$@"` unix / `@args` windows) and a user-facing Auto / YOLO
  permission-mode toggle.
- Step 5 manual-fallback details inline on the CTA row; description
  reads at step-lede size, not 13px chip.
- Setup-section heading no longer right-aligns (was inheriting
  `header { display: flex; justify-content: space-between }` from
  the legacy stylesheet; wrapper changed to `<div>`).
- Getting Started `<details>` block removed (duplicated links).

* test(web): align /home tests with restructured setup wizard

- Replace test_getting_started_card_renders_on_home with
  test_setup_section_renders_for_not_onboarded — asserts the new
  setup-section-header floats above the install hero and Getting
  Started markup is absent (block removed in the prior commit).
- Update automode-block test to match labels without the
  `Step N —` prefix.
- Update setup-CTA partial test to match the relabeled
  "Copy install script to clipboard" button.

Drop orphaned CSS for `.home-getting-started`, `.home-gs-summary*`,
and `.home-gs-item` — selectors had no matching markup after the
Getting Started block was removed.

Also: Step 3 `pwd` expected-output uses an absolute path
(`/Users/yourname/Desktop/{workspace_dir}`) instead of the
tilde-prefixed form, matching what the command actually prints.

* fix(web): repaint home_onboarded + setup_advanced; align CTA label

- home_onboarded + setup_advanced still carried the retired blue
  `#0056A3` as both `--hp-primary-dark` and the hero gradient
  endpoint. Both reference `var(--primary-dark)` now so the green
  palette cascades.
- setup_advanced YOLO snippet was the old `alias` form (no cd, no
  arg forwarding). Replaced with the shell function variant from
  /home Step 6 — drops into ~/Desktop/{workspace_dir} and forwards
  "\$@" (unix) / @args (Windows).
- setup_advanced ~/{workspace_dir} path references cascaded to
  ~/Desktop/{workspace_dir} so install story matches /home.
- Dashboard's "Setup a new Claude Code" button label aligned to the
  canonical "Copy install script to clipboard" — matches /home and
  the new docstring in _claude_setup_cta.jinja, which now mandates
  this label across consumers.

* fix(web): keep base brand blue; scope green palette to /home redesign

User noticed login + dashboard had turned green when the /home
redesign flipped --primary from blue (#0073D1) to green (#2ea877)
in commit 278f202e. The brand-wide flip went further than the
redesign needed — only /home, /home (onboarded), and /setup-advanced
intentionally use the green/navy spec; every other page (login,
dashboard, catalog, marketplace, admin, profile) was just inheriting
the green because --primary cascaded everywhere.

Revert the global brand colour to blue and lock the green into the
two outstanding redesign scopes:

- style-custom.css: --primary back to #0073D1, --primary-light back
  to rgba(0,115,209,0.1), --primary-dark back to #005BA3,
  --brand-accent back to a lighter blue.
- home_onboarded.html: .home-mock now sets --hp-primary,
  --hp-primary-dark, --hp-primary-light to explicit green hex
  (matching home_not_onboarded), so the hero stays green regardless
  of the global brand.
- setup_advanced.html: same lock — .advanced-mock pins the green
  palette in-scope.

Hero gradients on both pages now reference the local --hp-primary
chain (not the global --primary), so any future palette tweak inside
either scope cascades correctly without disturbing the rest of the app.

* refactor(web): hoist --hp-* into shared design-tokens.css (--ds-*)

PR 2 of the design-system extraction ladder. Pure mechanical rename
+ dedup; no visual diff on any rendered page (verified on /home,
/dashboard).

- New app/web/static/css/design-tokens.css declares the full token
  set on :root: brand surface (green primary, primary-dark, mint
  light, brand-accent), hero (navy bg + ink), code-panel (near-black
  bg + cool ink + warm-yellow), light surfaces (bg/surface/border),
  text (primary/secondary/muted), orange accent, info + warn
  callout vocabularies, navy-tinted elevation shadows, system font
  stack + mono.
- base.html loads it alongside style-custom.css so the tokens are
  globally available.
- Rename --hp-* -> --ds-* in home_not_onboarded (313 refs),
  home_onboarded (15), setup_advanced (39). 367 token references
  pointed at one of three local blocks; now all point at the
  global :root.
- Drop the three local token blocks. Each scope class
  (.home-mock / .advanced-mock) only keeps its base ink + font-size
  + line-height rules.

The legacy --primary family stays canonical for the blue base
brand — login, dashboard, catalog, marketplace, admin still read
blue. The design system is opt-in via the scope class.

* refactor(web): extract shared components.css; migrate /home markup

PR 3 of the design-system extraction ladder. First batch of
reusable components lifted out of home_not_onboarded.html into a
new shared stylesheet; markup migrated to consume them.

- New app/web/static/css/components.css with five components, all
  reusable on any page that loads design-tokens.css:
    .callout-rec        — amber lightbulb recommendation box
    .callout-hint       — blue info hint box
    .code-output        — "WHAT YOU SHOULD SEE" terminal output block
    .lightbox           — full-bleed image enlarge overlay
    .setup-section-header — wizard header (eyebrow + h2 + lede)
- base.html loads components.css after design-tokens.css.
- home_not_onboarded.html markup renamed:
    class="rec"             -> class="callout-rec"
    class="hint"            -> class="callout-hint"
    class="expected-output" -> class="code-output"
- Local CSS rules removed from home_not_onboarded.html for each of
  the extracted components — ~150 lines down to 5-line "extracted to
  components.css" comments. The bespoke wizard-specific styles
  (.install-cmd, .os-tabs, .mode-tabs, .terminal-frame) stay
  template-local for now since they only have one consumer.

Visual regression check: /home install hero renders the amber rec
callout, blue hint callout, dashed code-output block, green section
header, and click-to-enlarge VS Code thumb identically to the
pre-extraction render. 43 home tests pass.

* fix(web): unify page-headers — activity-center full-width, marketplace shares box

- /activity-center audit-log hero rendered as half-width because the
  _page_hero include was inside <header class="obs-topbar">, a flex
  row that pinned the time-range + auto-refresh controls next to it.
  The hero is now a sibling rendered before the <header>, so it
  spans the full container width like every other admin page; the
  controls keep their flex row underneath.
- Marketplace hero unified with .page-header--hero. Markup is now
  <section class="page-header page-header--hero mp-hero"> so the
  shared box drives padding/radius/gradient/max-width/shadow; the
  .mp-hero override block only carries the right-anchored cover
  image and the rules for the search row + scope checkboxes (which
  the canonical hero doesn't have). Inner text uses the canonical
  .page-header__eyebrow / __title / __subtitle classes.
- .page-header--hero shadow tint now follows the brand blue
  (rgba(0, 115, 209, 0.2)) instead of the leftover green from the
  prior palette flip; same depth highlight everywhere the gradient
  is blue.

* fix(web): unify remaining page heroes — admin, profile, install, store, stack

Sweep across pages that carried bespoke gradient hero markup so
every page-hero shares the canonical `.page-header--hero`
dimensions (padding 28/32/24, border-radius 14, max-width
var(--width-app), navy-tinted shadow, gradient with --primary →
--primary-dark). Inner text uses the .page-header__eyebrow /
__title / __subtitle classes so typography matches across the app.

- admin_tables: migrated to _page_hero.html include.
- admin_tokens: kept .tokens-hero wrapper for the counts-chip row
  but added the canonical class on the same element; stripped
  duplicate gradient + padding + typography rules.
- install: same pattern (kept hero-meta pill row).
- profile: migrated to _page_hero.html include.
- store_upload: kept .upload-hero wrapper for the .meta chip row;
  composite class with the canonical hero.
- setup_advanced: .advanced-mock .ad-hero now matches canonical
  dimensions; green palette retained via --ds-primary/dark.
- stack_card.css: .stack-hero (catalog + corporate-memory search
  hero) uses canonical gradient + padding + max-width.

The detail-page heroes (marketplace_plugin_detail,
marketplace_item_detail, catalog_*_detail, store_edit,
admin_group_detail, admin_store_submission_detail) stay bespoke
for now — they're rich detail headers with photos, badges, install
actions; converting them would lose contract context. Same applies
to dashboard.html env-setup-cta (it's a CTA card, not a page hero).

* fix(web): canonicalise .container — single page shell every page inherits

Previously each admin page set its own `.container:has(.<page>)
{max-width: none}` + `.<page>-page {max-width: 1400px}` override,
and per-page hero markup either nested inside flex toolbars (which
pinned the hero next to filter controls and squeezed it half-width)
or self-constrained with a different max-width than the page. /home,
/dashboard, /marketplace, and /admin/* ended up at different widths
with different nav-to-hero gaps.

- style-custom.css `.container` now carries the canonical 1280px
  max-width + `16px 32px 48px` padding so every page inherits the
  same nav-to-hero gap and side gutters. `.container > main` is
  margin/padding 0 so the container is the sole owner of gutters.
- `.page-header--hero` drops its self-constraining max-width and
  auto-centering margin — the container provides the width, so the
  hero sits flush with the table/toolbar below it.
- `.stack-hero` (catalog + corporate-memory) and `.advanced-mock
  .ad-hero` (/setup-advanced) follow the same pattern: container
  owns the width.
- Per-page max-width overrides stripped from admin_users,
  admin_access, admin_groups, admin_marketplaces, admin_welcome,
  admin_workspace_prompt.
- _page_hero include extracted from inside flex toolbars on
  admin_users, admin_access, admin_groups, admin_marketplaces,
  admin_server_config, admin_welcome, admin_workspace_prompt,
  admin_sessions, admin_session_detail, admin_usage,
  activity_center. The toolbar (`.users-toolbar`, `.gp-toolbar`,
  etc.) keeps only the filter + action controls; hero renders
  before it as a sibling.
- _page_chrome.html trimmed to just the page-background tint for
  the redesign scopes; the duplicate `.container` rules it carried
  are now redundant.

Verified: /home, /admin/marketplaces, /admin/users all render
container width 1280px with hero top at 88px (16px below the
72px-tall sticky nav). Same spacing as /home design spec.

* fix(web): admin_tables + admin_corporate_memory inherit canonical .container

Both pages were overriding `{% block layout %}` from base.html,
which bypasses the canonical `.container` wrapper. Result: hero
span the full viewport (1596px on a wide screen) while the inner
content sat at a narrower max-width — hero and content didn't
align, and the nav-to-hero gap differed from every other admin
page.

Switched both templates to `{% block content %}` so they render
inside the canonical `.container` from base.html — same path as
admin_groups, admin_users, admin_marketplaces, etc.

- admin_tables: dropped local `.page-title { max-width: 1600px }`
  + `.content { max-width: 1600px }` overrides (kept typography +
  inner gutter rules) and the mobile padding overrides that paired
  with them. Container now owns the gutters.
- admin_corporate_memory: only the block keyword needed changing;
  the template already had a clean inner structure (no max-width
  override on `.container-memory`).

Verified on /admin/tables and /admin/corporate-memory:
- .container width 1280, padding 16/32/48
- Hero top 88 (nav 72 + container padding-top 16)
- Hero + content both 1216px wide, both at left 190 — perfect
  alignment with /admin/groups.

* fix(web): drop .page-shell padding override + admin_tables stale :root

Two regressions discovered after the canonical-container unification:

1. `.container:has(.page-shell)` still set `padding: 28px 32px 48px`
   while the canonical `.container` had moved to `16px 32px 48px`.
   Every page-shell consumer (/admin/sessions, /admin/sessions/<id>,
   /admin/usage, /marketplace, /dashboard, marketplace detail pages,
   /me/activity, /store/*, /admin/store-submissions) was rendering
   with a 28px nav-to-hero gap while /admin/users + /admin/groups
   rendered with 16px. Same width, mismatched vertical rhythm.
   The opt-in rule is now a no-op marker: canonical container
   already provides 1280px + 16/32/48 + main margin/padding 0.

2. admin_tables.html had a stale `<style>` block that re-declared
   `:root { --primary: var(--primary); ... }`. The self-referential
   token resolved to empty, collapsing the page-header hero's
   `linear-gradient(135deg, var(--primary), var(--primary-dark))`
   to no background — the hero appeared as a pale ghost without
   colour. The entire shadow `:root` block was a stale copy of the
   design tokens that style-custom.css already provides. Dropped
   it; tokens now resolve from the global `:root`.

After both fixes /admin/sessions, /admin/tables, and every other
page-shell consumer match /admin/groups exactly: container 1280px,
container padding-top 16px, hero at top 88px / left 190px / width
1216px.

* fix(web): drop /admin/tokens .tokens-page width + padding override

`.tokens-page` carried its own `max-width: 1280px; margin: 0 auto;
padding: 28px 8px 48px` block — the canonical `.container` already
provides width + 16/32/48 padding, so the nested wrapper was
adding 28px on top of the container's 16px (= 44px nav-to-hero
gap, vs 16px on every other admin page) and shrinking the hero
sideways by 8px on each side (1200px vs the canonical 1216px).

After: container owns the layout; `.tokens-page` is just a
font-family scope. /admin/tokens hero now sits at top 88, left 190,
width 1216 — same numbers as /admin/groups / /admin/users.

* fix(web): hero links readable on blue; /admin/access Groups link href

- New `.page-header--hero a` rule in style-custom.css forces any
  anchor inside a gradient hero to render white + underlined so
  links stay readable on the blue background. Previously links
  inherited the global `var(--primary)` blue, which disappeared
  on top of the matching blue gradient. No per-page class needed —
  drop a plain `<a>` in any hero subtitle and it just works.
- /admin/access hero subtitle was Jinja-passing the inline link
  with HTML-entity-encoded quotes (`href=&quot;...&quot;`). The
  entities decoded to literal `"` characters inside the rendered
  href, producing `/admin/%22/admin/groups%22` — a 404. Switched
  the `set` to a block-set (`{% set page_hero_subtitle %}...{% endset %}`)
  so the inline `<a href="/admin/groups">Groups</a>` survives
  unescaped through `_page_hero.html`. Also stripped the now-redundant
  inline `style="color:#fff;text-decoration:underline;"` — the new
  shared rule handles it.

* fix(web): /dashboard top padding matches every other page

`.main` on /dashboard had `padding: 28px 32px 48px` while every
other page now uses `16px 32px 48px` via the canonical
`.container`. Dashboard bypasses `.container` (overrides
base.html's `layout` block to render a full-width `<main>`
directly), so the padding lives on `.main` itself — bumped the
top to 16px to match.

After: first child top = 88, left = 190, width = 1216 — same
numbers as /admin/groups / /admin/users / /admin/marketplaces.

* fix(web): green eyebrow + white title on .page-header--hero (matches /home)

`.page-header--hero .page-header__eyebrow` was faint white
(rgba(255,255,255,0.75)) — readable but unbranded against the blue
gradient. Changed to `var(--ds-brand-accent)` (mint green #54d3a0)
so every page hero pairs a green eyebrow with white title +
subtitle, echoing /home's setup-section header (green eyebrow,
dark heading combo). One CSS rule applies everywhere — no
per-page styling needed.

Also bumped the eyebrow to font-weight 700 / letter-spacing 1.2px
so the green stands out cleanly against the gradient.

* fix(web): page-header--hero + stack-hero use /home navy gradient

`.page-header--hero` and `.stack-hero` were on the brand-blue
gradient (`var(--primary)` → `var(--primary-dark)`) while
/home's hero (`.home-hero-intro`) sits on the deeper navy
gradient (`#0f1b3a` → `#1a2a5f`). Every other page-hero now
uses that same navy gradient so /home, /marketplace, /catalog,
/corporate-memory, /admin/*, /profile, /install, /dashboard,
/setup-advanced share one brand surface. Shadow tint adjusted
to the navy depth (rgba(15, 27, 58, 0.22)).

Brand blue stays the link/CTA colour everywhere else; only the
hero box itself is navy.

* fix(web): primary buttons green; marketplace tabs navy translucent

Two parity tweaks pulling the rest of the app toward /home's
visual language.

- `.btn-primary` (both rules in style-custom.css) now uses
  `var(--ds-primary)` / `var(--ds-primary-dark)` green fill,
  matching the "Copy install script to clipboard" button on
  /home. Brand-blue `--primary` still drives link colour and the
  accent surface; only the filled button background flipped to
  green. Every page with a `.btn-primary` (admin "+Add user",
  "+Add marketplace", catalog, marketplace actions, dashboard,
  modals) now reads as the same "do it" affordance.
- `.mp-tabs` (Curated Marketplace / Flea Market / My Stack tab
  group) now sits on the navy `--ds-hero-bg` with translucent
  white pills (rgba(255,255,255,0.10) inactive, 0.18 active) —
  same translucent-white-on-navy treatment as the "Just browse —
  no install needed" pill on /home. Icons render as soft white;
  per-tab colour-coding dropped in favour of the unified surface.

* fix(web): catalog/memory tabs + empty-state CTA + admin action buttons

Bring /catalog and /memory in line with /home + /marketplace:

- `.stack-tabs` (Browse / My Stack / Recipes on /catalog,
  Browse / My Stack on /memory) now uses the navy `--ds-hero-bg`
  container with translucent-white-on-navy pills, mirroring the
  `.mp-tabs` treatment and /home's "Just browse — no install
  needed" CTA pill. Per-tab icon colour-coding dropped — icons
  render as soft white on the navy fill.
- `.stack-tabs-row__actions .btn` (right-slot "+New Recipe",
  "+New Data Package" admin CTAs) now uses green primary fill
  (`--ds-primary`), matching `.btn-primary` and /home's
  "Copy install script to clipboard" button.
- `.stack-empty .cta a` (empty-state action button — the
  "Open /admin/tables →" CTA on /catalog and equivalent on
  /memory) flipped from blue `--primary` to green `--ds-primary`
  so the colour aligns with every other primary button in the app.

* fix(web): marketplace Search button green (--ds-primary) matching other CTAs

* fix(web): unify Search button + admin-action button across browse pages

- Added Search button (`<button class="stack-hero__search-btn">`)
  to /catalog and /memory heroes — same green pill as /marketplace.
  Wired to the existing live-filter pipeline (button click runs
  `applyFilters()` and refocuses the input). All three browse pages
  now wear the identical search bar UI.
- `.stack-hero__search-btn` shares `--ds-primary` fill with
  `.mp-hero .search-btn`.
- `.mp-actions .btn` ("Submit a skill or plugin" CTA on /marketplace)
  flipped from the legacy blue-outline to the same green primary
  fill + dimensions (`display: inline-flex; line-height: 1;
  padding: 9px 16px; gap: 6px`) as `.stack-tabs-row__actions .btn`
  on /catalog and /memory. All three right-slot action buttons
  render at identical height now.
- `.stack-tabs-row__actions .btn` got `inline-flex` + `line-height: 1`
  + `gap: 6px` so a `<button class="btn">` and a `<a class="btn">`
  both render at exactly 33px high — the embedded
  `.admin-only-hint` chip no longer pushes one variant taller
  than the other.

* fix(web): marketplace guide CTAs green (fastpath + primary); drop flea purple

* fix(web): dashboard CTA hero on navy; readable <code> chips in hero

- `.env-setup-cta` on /dashboard ("Set up a new Claude Code"
  card) flipped from the brand-blue gradient + green-tinted shadow
  to the canonical navy gradient (`--ds-hero-bg` → `#1a2a5f`) with
  navy-tinted shadow + 14px radius + 28/32/24 padding, matching
  `.page-header--hero` and /home's `.home-hero-intro`. Dashboard's
  top CTA now sits on the same brand surface as every other hero.
- Added `.page-header--hero code` rule — translucent white pill +
  warm-yellow ink (#ffd866) so `<code>` chips embedded in hero
  subtitles read as code samples against the navy gradient. The
  global `code` rule sets `color: var(--text-primary)` (dark),
  which turned in-hero chips into invisible dark-on-white-on-navy
  ghosts (e.g. the `-by-dev` suffix on /store/new).
- /store/new's `.page-header__subtitle code` dropped its inline
  style override — the shared rule handles it now.

* feat(web): two-theme switching via data-theme + admin toggle

Introduces a theme system that flips the entire UI palette between
"navy" (current design, default) and "blue" (pre-redesign palette)
via a single `<html data-theme="...">` attribute. Page markup, class
names, and component styles don't change — only the `--ds-*` token
values flip.

Backend
- New `app/instance_config.py::get_instance_theme()` resolves the
  active theme from `AGNES_INSTANCE_THEME` env > `instance.theme`
  in instance.yaml > default "navy". Unrecognised values clamp to
  "navy" so a typo doesn't break the page.
- `app/web/router.py::_build_context` injects `instance_theme`
  alongside `instance_brand` etc. so every template inherits it.
- `app/web/templates/base.html` renders
  `<html lang="en" data-theme="{{ instance_theme | default('navy') }}">`.

CSS
- `app/web/static/css/design-tokens.css` adds two new tokens to
  the default `:root` set: `--ds-hero-shadow` (drop-shadow tint
  on hero boxes) and `--ds-hero-eyebrow` (eyebrow accent colour).
  Plus a `:root[data-theme="blue"]` override block that flips
  seven tokens: `--ds-primary`, `--ds-primary-dark`,
  `--ds-primary-light`, `--ds-brand-accent`, `--ds-hero-bg`,
  `--ds-hero-bg-deep`, `--ds-hero-shadow`, `--ds-hero-eyebrow`.
  The blue theme aliases the brand surface tokens back to the
  legacy `--primary` family.
- `.page-header--hero`, `.stack-hero`, `.env-setup-cta`,
  `.home-mock .home-hero-intro` now reference the new
  `--ds-hero-shadow` and `--ds-hero-bg-deep` tokens instead of
  hard-coding `rgba(15, 27, 58, 0.22)` and `#1a2a5f` — gradient +
  shadow now flip with the theme.
- `.page-header--hero .page-header__eyebrow` uses
  `var(--ds-hero-eyebrow)` so the eyebrow goes mint-green on
  navy and translucent-white on blue (mint on blue reads poorly).

Admin
- `app/api/admin.py::_KNOWN_FIELDS["instance"]` now registers a
  `theme` field of kind `select` with options `["navy", "blue"]`
  and a `hint` explaining the trade-off. The existing
  /admin/server-config UI auto-renders a select for this — no
  template changes needed.

Defaults
- Default value is "navy" so existing instances see no visual
  change. Admins flip to "blue" via /admin/server-config to
  restore the pre-redesign look.

Restart note: uvicorn must reload to pick up the Python changes
(new getter, new template-context key, new known-field). CSS
changes hot-reload via browser refresh.

* fix(web): blue theme — home hero eyebrow + CTA contrast

`.home-hero-intro .eyebrow` and `.btn-intro-primary` referenced
`--ds-brand-accent` directly, which on the blue theme resolves to
the lighter brand-accent blue (#4F9DEB). Result: light-blue eyebrow
on the blue gradient ("WELCOME, ADMIN" barely readable) and a
light-blue button with darker-blue text ("Set up in ~15 min")
that all sat in the same hue range.

Introduces three new theme-aware tokens:
- `--ds-hero-eyebrow` already existed; blue theme bumped opacity
  to 0.92 so the eyebrow reads as full white.
- `--ds-hero-cta-bg` + `--ds-hero-cta-fg` + `--ds-hero-cta-bg-hover`
  flip the primary hero CTA: mint-green on navy (default), white-
  on-blue under `data-theme="blue"`.

`.home-hero-intro .eyebrow` now uses `--ds-hero-eyebrow` (mint on
navy / white on blue) and `.btn-intro-primary` uses the CTA token
trio.

Recommended palette on blue theme:
- Eyebrow: white at 92% opacity (clear on the blue gradient).
- Primary CTA pill: white background, brand-blue dark text
  (`--primary-dark` = #005BA3) for AAA-level contrast.
- Secondary CTA: translucent white pill (unchanged).

* fix(web): blue theme — callout-hint info bg/border/ink re-tinted to brand blue (was indigo, clashed with brand-blue hero)

2026-05-21 06:19:16 +00:00

ZdenekSrotyr

1b0329e8c5

UI design system unification — one stylesheet, canonical primitives, nav fix (#284 )

* docs(plan): design-system unification plan (post-review revisions)

Plan covers consolidating two CSS files into one, introducing
canonical primitives (.btn family, .search-input, .filter-bar,
.page-header, .data-table, .empty-state, .toast, .stat-card,
.tab-strip), unifying the top-nav Admin trigger with sibling
links, and migrating 41 templates that today carry inline
<style> blocks.

Post-review revisions: nav fix moved to first commit (user
complaint lands first); sticky-header and dark-mode skeleton
tasks dropped (defer to follow-up PRs); contract test class
detection tokenizes class="..." attributes properly; baseline
screenshot loop added to Task 0; vendor-token grep widened.

* fix(nav): unify Admin trigger with sibling nav links

The top-nav Admin entry is a <button class="app-nav-link
app-nav-menu-trigger">, siblings are <a class="app-nav-link">.
.app-nav-menu-trigger used to override .app-nav-link with
"color: inherit; font: inherit", resetting font-size from 13px
back to body default and color from --text-secondary to body
color. Active state diverged too: .is-active on links used
--primary blue, [aria-expanded=true] on the button used
--border-light grey.

Fix: expand .app-nav-link so it covers <button>-element resets
(font-family: inherit, border: 0, background: transparent,
cursor: pointer, display: inline-flex for chevron alignment).
Add [aria-expanded="true"] as another active-state selector
so the dropdown's open state highlights identically to .is-active
on links. Delete the now-redundant .app-nav-menu-trigger rules
that stripped button chrome.

Extract the inline <script> from _app_header.html into a new
app/web/static/app.js (loaded by base.html only — base_login.html
has no nav). Sets up window.appUI.wireDropdown for both the user
menu and the Admin dropdown via DOMContentLoaded.

* style(css): consolidate style.css into style-custom.css + add cache-bust

One stylesheet for the whole web UI:
- style.css (1086 lines, legacy Google-inspired tokens + components)
  absorbed into style-custom.css under a labeled block, placed after
  the modern :root + body so style-custom's component rules continue
  to override the legacy ones (preserves the original cascade order
  that came from loading style.css first).
- style.css deleted; <link> dropped from base.html + base_login.html.
- static_url() now appends ?v=<mtime> to /static/<path>. Cheap
  per-request os.stat — auto-invalidates browser + proxy caches on
  redeploy without operator intervention. Mtime survives across
  uvicorn restarts as long as the file content is unchanged.

Legacy classes (.btn, .card, .login-*, .badge, .code-block, .flash,
.form-group, .username-box, .btn-copy, .auth-tabs, .divider, etc.)
still render — they live in style-custom.css now. Login pages,
error page, password setup, and the dashboard's Claude Code Setup
card all kept working in browser smoke.

* test(design): contract test for design-system invariants

7 structural invariants enforced from this commit onwards:
- style.css must stay deleted
- no template links style.css via static_url
- exactly one bare :root block in style-custom.css
- canonical primitives declared (.btn, .btn-primary, .search-input,
  .filter-bar, .page-header, .data-table, .empty-state, .toast, …)
- no deprecated class names in templates (.users-table, .gp-table,
  .marketplaces-table, .audit-table, .users-search, .marketplaces-search,
  .modal-btn, .btn-primary-v2, …)
- app.js loaded by base.html, NOT by base_login.html
- 3 helper-level unit tests for the class-attribute tokenizer
  (multi-line attrs, Jinja-conditional fragments, false-positive prose)

Two of the assertions intentionally start FAILING after this commit
(missing primitives + legacy class refs in 7 admin templates) and
will turn green as Tasks 4–7 add primitives and Tasks 8–15 migrate
the templates.

* feat(css): canonical button family + legacy token aliases

Adds at top of :root: legacy token aliases (--bg, --card-bg, --text,
--text-light, --secondary, --radius) pointing at modern equivalents.
Absorbed style.css rules referenced these names; without aliases
they fell back to 'unset'. Aliases live until Task 16 alongside
their absorbed rules.

Appends canonical .btn variants at end of file (last cascade):
  .btn-primary + .btn-primary-v2 + .modal-btn.primary (alias group)
  .btn-secondary + .btn-secondary-v2 + .modal-btn:not(.primary):not(.danger)
  .btn-ghost + .btn-ghost-v2
  .btn-danger + .modal-btn.danger
  .btn-lg
  .btn:disabled + .btn:focus-visible (focus ring via --focus-ring)

Existing absorbed .btn, .btn-primary, .btn-secondary, .btn-sm rules
remain — the canonical block adds the missing variants + selector-list
aliases so .modal-btn and v2 markup keep rendering until migration
tasks swap them out.

Contract test: .btn-danger now declared (one less missing primitive).
Browser smoke: /admin/tokens hero + filter pills + empty state render
correctly with the absorbed style.css rules now backed by real tokens.

* feat(css): form-control primitives — .search-input + .filter-bar + .filter-pill + .form-input

Canonical filter bar shape: 36px-height inputs (matches button height
for vertical rhythm), 28px pills with .is-active state, consistent
focus ring via --focus-ring token.

Selector-list aliases for legacy per-page classes:
- .users-search / .marketplaces-search / .kb-search → .search-input
- .filters-card → .filter-bar
- .pill[aria-pressed="true"] also matches the .filter-pill active state

.form-input added as a sibling of .search-input for forms — same
baseline height + radius + focus treatment, with textarea.form-input
auto-sizing to min 96px and using the mono font (matches CSV/SQL
pasted-snippet patterns on /admin/agent-prompt + /admin/workspace-prompt).

Contract test: .search-input + .filter-bar + .filter-pill now declared.

* feat(css): .page-header primitive + variants + .tab-strip

Canonical page-header pattern with title (22px) + optional subtitle +
optional eyebrow + right-aligned actions slot. Two modifiers:
- .page-header--hero: gradient background (primary→primary-dark),
  28px white title, semi-transparent subtitle/eyebrow. For
  /marketplace, /store, /profile-style pages that already use this
  layout via per-page inline <style>. Migration tasks delete the
  duplicated rules.
- .page-header--compact: 18px title for dense admin index pages.

.tab-strip + .tab-strip__item — the secondary tab row pattern used by
/marketplace?tab=flea and similar. .is-active / [aria-selected=true]
both flip the active treatment (primary color + bottom border).

Contract test: .page-header / __title / __subtitle / __actions all
now declared (4 fewer missing primitives).

* feat(css+js): .data-table + .empty-state + .toast + .stat-card primitives

Last primitive batch. All 8 canonical-primitives invariants in
test_design_system_contract.py now green; only the template-migration
test fails (expected — Tasks 8–15).

.data-table (+ --compact modifier): selector-list aliases for legacy
per-page table classes (.users-table, .gp-table, .marketplaces-table,
.audit-table) so existing markup keeps rendering until migration.
Compact modifier shrinks padding + font for dense lists (audit log).

.empty-state with __icon / __title / __description / __actions —
replaces the ad-hoc 'no results' rendering scattered across pages
(corporate_memory, admin_users, admin_marketplaces, etc.).

.toast / .toast-container — paired with window.appToast({kind, msg,
timeout}) appended to app.js. Bottom-right stacked, click-to-dismiss,
auto-dismiss after 4s by default. Kind 'success' / 'warning' / 'error'
/ 'info' shows a 3px colored left border.

.stat-card (+ --accent variant) + .stat-row grid — for the dashboard
metric tile row.

* style(templates): migrate 8 templates off deprecated class names

Mechanical class-attribute rewrite via tokenizer (preserves Jinja
conditionals + multi-line attrs):

  modal-btn primary    -> btn btn-primary
  modal-btn danger     -> btn btn-danger
  modal-btn            -> btn btn-secondary
  users-table          -> data-table
  gp-table             -> data-table
  marketplaces-table   -> data-table
  audit-table          -> data-table
  users-search         -> search-input
  marketplaces-search  -> search-input

8 templates touched: admin_groups, admin_marketplaces, admin_tokens,
admin_users, admin_welcome, admin_workspace_prompt, my_tokens,
corporate_memory_admin. 43 lines updated total.

Inline <style> blocks in these templates still define rules for the
old class names — those rules no longer match anything and become
dead code, removed in Task 16's alias cleanup along with the
selector-list aliases in style-custom.css.

Contract test (tests/test_design_system_contract.py) now fully green:
9/9 invariants enforced from this commit onward.

* feat(css): extend .data-table selector list to 13 more bespoke -table classes

Visual unification of remaining tables across the codebase without
per-template edits. The .data-table baseline rules (uppercase header
tracking, 12px padding, hover state, border-radius) now apply to:

  .ad-table / .ea-table / .md-table / .members-table /
  .obs-table / .overview-stats-table / .registry-table /
  .sample-table / .sched-table / .sess-table / .sub-table /
  .subs-table / .ud-table

These class names live in 12 templates (activity_center, admin_access,
admin_group_detail, admin_scheduler_runs, admin_sessions,
admin_store_submissions, admin_tables, admin_usage, admin_user_detail,
catalog, me_debug, profile_sessions) that have their own per-page
<style> blocks. Per-page rules with higher specificity still win for
their custom needs (column widths, etc.) — this commit only sets a
shared baseline so every table renders with the same chrome.

Contract test stays green: 9/9 invariants enforced.

* style(css): remove now-unused legacy class aliases

Phase A renamed 8 templates off these names; no markup references
them any more, so the selector-list memberships are dead weight.
Removed from style-custom.css:

  .btn-primary-v2 / .btn-secondary-v2 / .btn-ghost-v2
  .modal-btn / .modal-btn.primary / .modal-btn.danger /
  .modal-btn:not(.primary):not(.danger)
  .users-search / .marketplaces-search / .kb-search
  .users-table / .gp-table / .marketplaces-table / .audit-table
  .filters-card

37 lines smaller. Contract test catches any reintroduction.

KEPT aliases (still in untouched template markup):
- .pill (marketplace_plugin_detail.html, marketplace.html — these
  pages weren't part of Phase A's deprecated-class sweep; their
  own .pill CSS rules still apply)
- All .data-table family extensions (.ad-table, .ea-table, .md-table,
  .members-table, .obs-table, .overview-stats-table, .registry-table,
  .sample-table, .sched-table, .sess-table, .sub-table, .subs-table,
  .ud-table) — these still render data tables in 12 templates;
  selector-list aliasing keeps them visually unified with .data-table
  baseline.
- Legacy token aliases (--bg / --text / --text-light / --secondary /
  --card-bg / --radius) — still resolve absorbed style.css rules.

Templates' inline <style> blocks still contain dead rules for the
renamed classes (.users-search, .modal-btn, etc.); harmless but
bloat. Optional follow-up: a separate sweep can drop those.

* docs(changelog): design-system unification under [Unreleased]

* feat(css): unify page-shell width — .container baseline 1280px + modifiers

Inventory found 30+ unique max-width values across templates (280px
login → 1600px admin/tables). The legacy .container default was 800px,
which made every admin page set its own wider inline override —
30+ ad-hoc widths drifted as a result.

Canonical: .container max-width = var(--width-app) (1280px). Pages
that need a different shape opt in via modifiers:

  .container--narrow → var(--width-narrow)  (800px) — long-form text,
                                                     setup wizards
  .container--wide   → var(--width-wide)    (1400px) — admin lists,
                                                     marketplace grids
  .container--full   → max-width: none — hero / landing

Pages that already set a NARROWER inline max-width (setup, login flows
inside .login-card, etc.) still render at their narrower size — the
inline override beats the new canonical 1280px. The visible change
hits the ~20 admin pages currently rendering at 800px via the legacy
default, which jump to 1280px and pick up consistent breathing room.

Spacing also normalized: padding 24px 20px → var(--space-6) var(--space-5).

* fix(home+catalog): gut dashboard sections + remove confusing toggle + fix table count

Dashboard /home cleanup:
- Remove 'Your Data' card — Data Packages is already a top-nav entry,
  so duplicating data sources on the landing page just adds noise.
- Remove 'Account' card — group memberships + scripts + last sync
  belong on /profile, not on the welcome screen.
- Remove entire right-column (Corporate Memory + Activity Center
  widgets) — both surfaces have dedicated admin pages reachable from
  the Admin dropdown.
- Keep stats row (Tables/Columns/Rows/Data Size/Unstructured),
  env-setup-CTA, and Notifications card.

/catalog cleanup:
- Strip the 'Always included' badge + the locked toggle-switch from
  Core Business Data and Business Metrics cards. The toggle was
  always 'checked disabled' — it visually looked like a switch but
  could not be toggled, which was confusing. The 'Always included'
  copy itself was redundant once the toggle was gone. Agnes Internal
  already rendered without these, so the three cards are now visually
  consistent.

Catalog data_stats fix:
- 'total_tables' was len(sync_state) — counted only tables that had
  ever synced, so a 30-row table_registry with 0 ever synced rendered
  as '0 tables'. Switched to len(tables) — the registered
  business-data table list — so the count reflects what's actually
  available, not what's been touched.

* fix(home): real stat numbers + drop unstructured tile + cleanup dead CSS

Dashboard stats were hardcoded zeros (columns: 0, size_display:
'0 MB', unstructured_display: '0 MB') and the table counter pulled
from sync_state (synced) instead of table_registry (registered).
On a fresh deployment with 30 registered tables and 0 ever synced,
the page rendered '0 / 0 / 0 / 0 MB / 0 MB' — useless.

Now:
- Tables: COUNT(*) FROM table_registry WHERE source_type != 'internal'.
  Matches the /catalog Core Business Data counter.
- Columns: SUM(sync_state.columns). Zero only when nothing's synced yet.
- Rows: unchanged (SUM(sync_state.rows), already correct).
- Data Size: SUM(sync_state.file_size_bytes), human-formatted via
  inline _fmt_bytes helper (KB/MB/GB).
- Unstructured: tile dropped — was always '0 MB' and had no source.
- last_updated: now derived from sync_state max(last_sync), wasn't set
  before so the 'Synced …' tag never rendered.

Dashboard.html cleanup: ~725 lines of orphan inline <style> removed —
.section-title, .data-source*, .toggle-switch*, .catalog-cta*,
.memory-card / .memory-stat / .memory-description / .memory-footer
/ .btn-memory, .activity-card / .activity-stat / .activity-text
/ .btn-activity, .account-grid / .account-row / .account-scripts
/ .badge-role / .badge-group / .cron-line, .badge-included /
.badge-beta / .badge-demo. All matched markup deleted in the
previous commit; the CSS was dead code until now.

* ui(catalog): rename page heading 'Data Catalog' → 'Data Packages'

The top-nav entry says 'Data Packages' but the page itself said
'Data Catalog' — confusing two-name product. Aligns the heading and
<title> with the nav label. Subtitle trimmed too: 'manage your
subscriptions' was a vestige of the toggle UI that just got removed,
replaced with a one-liner describing what the page is for.

Two other 'Data Catalog' strings stay: they live inside the table-
profiler overlay JS and refer to an EXTERNAL catalog system (e.g.
OpenMetadata / Atlan) that an operator may link to per table — that
is a generic term for any external data-catalog product, not our
page name.

* fix(nav): dropdown clicks always work + mutual-exclusion close

Two bugs in the wireDropdown helper:

1. Clicking trigger B while trigger A's menu was open left both open.
   e.stopPropagation() in trigger.click prevented the document-click
   handler from firing, so trigger A's open menu had no way to learn
   that something else was clicked. Net effect: state diverged across
   the two dropdowns the more you clicked.

2. The target-vs-trigger equality check (e.target !== trigger) was
   strict. Clicking the chevron <svg> inside the button reports the
   svg or its <path> child as e.target — not the button — so removing
   stopPropagation alone would trip the close branch in the same
   click that just opened the panel.

Fix both at once: drop e.stopPropagation() AND switch the doc-handler
guard to trigger.contains(e.target). Now any click outside both the
trigger subtree and the panel subtree closes; any click on another
trigger closes via the OTHER dropdown's doc handler; clicks inside
the trigger (button OR svg child) are fully ignored by the doc
handler and only the trigger's own toggle handler fires.

* feat(ui): canonical blue-gradient hero on every admin page

The UI had a per-page hero pattern on ~10 onboarding/marketing pages
(admin_tokens / profile / install / setup_advanced / marketplace /
my_tokens / store_upload / home_*), each with its own ad-hoc CSS
(.tokens-hero, .profile-hero, .install-hero, .upload-hero, …). The
admin section's index + detail pages had plain H1/H2 with their own
.users-title / .gp-title / .obs-title / .cfg-title / … inline styling.
Net effect: half the app felt like a product, half felt like a
spreadsheet.

Now:
- .page-header--hero CSS upgraded to match the look analysts already
  liked from admin_tokens: 28px/32px/24px padding, 14px radius, soft
  primary-tinted box-shadow (0 4px 16px rgba(0,115,209,0.2)), 28px
  semibold title, optional uppercase eyebrow + 13.5px subtitle.
  Narrow-viewport breakpoint included.
- New _page_hero.html partial wraps the boilerplate. Usage:
    {% set page_hero_eyebrow  = "Users & Access" %}
    {% set page_hero_title    = "Users" %}
    {% set page_hero_subtitle = "…" %}
    {% include "_page_hero.html" %}
- 15 admin templates migrated to it: admin_users / admin_groups /
  admin_marketplaces / admin_access / admin_sessions /
  admin_session_detail / admin_store_submissions /
  admin_scheduler_runs / admin_usage / admin_user_detail /
  admin_welcome / admin_workspace_prompt / admin_server_config /
  activity_center / admin/news_editor. Each gets a grouped eyebrow
  (Users & Access / Data / Agent Experience / Activity Center /
  Server) matching the Admin dropdown sections so the page identity
  is unambiguous at a glance.

Legacy *-title H2/H1 + adjacent subtitle paragraphs deleted; their
per-page CSS rules are dead now (harmless, retire in a follow-up
sweep alongside other inline-style cleanup the reviewers flagged).

admin_tables.html intentionally NOT migrated — it's a standalone
HTML page that doesn't extend base.html; a separate refactor.

Test: test_admin_users_page_renders_for_admin assertion updated
from .users-title to .page-header__title + .page-header--hero (the
canonical pair). All other web/template tests stay green.

* refactor(ui): dedup _humanbytes, drop 267 lines of dead inline CSS

(1) _humanbytes consolidation:
- Add TB branch + optional precision param (default 2 preserves existing
  Store detail callers; dashboard uses precision=1 for headline tiles).
- Delete inline _fmt_bytes from dashboard handler — was a copy of
  _humanbytes with different rounding. One canonical helper now.

(2) Dead inline-CSS sweep across 17 migrated templates:
- Conservative regex: a CSS rule is deleted only when its primary class
  matches one of the known-dead names AND that name is NOT referenced
  from any class= attribute in the same file's markup.
- Per-file 'in-use' guard saved several false positives that the deny
  list would have nuked (e.g. .users-toolbar, .gp-search, .obs-subtitle,
  .marketplaces-toolbar are still in use; only .users-table, .users-search,
  .users-title, .modal-btn, etc. that have NO markup left went away).
- Removed: -267 lines across admin_users (-42), admin_marketplaces (-45),
  admin_groups (-31), my_tokens (-38), admin_tokens (-29), admin_access
  (-9), admin_user_detail (-6), admin_welcome (-8), admin_workspace_prompt
  (-8), admin_server_config (-2), admin_sessions (-1), admin_session_detail
  (-1), admin_usage (-1), admin_store_submissions (-3), admin_scheduler_runs
  (-3), activity_center (-4), corporate_memory_admin (-36).

Contract test stays green (9/9); all web/template/render/user_management
tests pass.

* feat(ui): canonical hero on /catalog (Data Packages)

Same .page-header--hero treatment as the admin pages — Data eyebrow,
Data Packages title, Browse-the-data-sources subtitle. Removes the
ad-hoc .page-title block (h1 / p / wrapper-div) and its CSS rules
(now dead, 3 rule blocks deleted).

* fix(nav): load app.js from _app_header.html — works on standalone pages

The previous nav-fix commit moved the inline dropdown script from
_app_header.html into app/web/static/app.js + added <script src=…>
to base.html. That broke EVERY page that includes _app_header.html
WITHOUT extending base.html (catalog, corporate_memory*,
admin_tables, install). They got the nav markup but no JS → both
Admin and AD dropdowns dead on those pages.

Fix: emit the <script src=app.js defer> directly inside the
_app_header.html partial. Any page that includes the header now
gets the script automatically — base.html-extenders AND standalone
HTML pages alike. base.html's duplicate <script> line removed.

Also fixes the wide-hero on /catalog: .page-header--hero now sets
its own max-width: var(--width-app) (1280px) so standalone pages
without a .container parent don't render the gradient edge-to-edge.
catalog's .source-cards bumped from 900px → 1280px to match the
hero, otherwise the page reads two-tier (wide blue band, narrow
content) which the user flagged.

Verified locally via agent-browser: Admin + AD dropdowns now click
through on /catalog, /admin/tables, /corporate-memory.

* docs(plan): standalone pages → base.html framework migration plan

Plan + Plan-agent review (8 must-fix items applied) for converting
the 5 templates that ship their own <html><head><body> scaffold
(catalog, install, corporate_memory, corporate_memory_admin,
admin_tables) to extend base.html. Root cause of yesterday's
'dropdown dead on /catalog' regression: shared infrastructure in
base.html doesn't propagate to standalones.

* feat(base): body_attrs block + migrate install.html to extend base

base.html: new {% block body_attrs %}{% endblock %} slot so pages
that need <body> attributes (admin_tables has data-source-type)
can carry them through extends.

install.html: convert from standalone <html><head><body> scaffold
to {% extends "base.html" %} with title / body_attrs / head_extra
/ layout / scripts blocks. Drops:
- <!DOCTYPE>, <html>, </html>, <head>, </head>
- <meta charset>, <meta viewport>
- Duplicate <link rel="stylesheet" href="...style-custom.css">
  (base.html already provides one)
- <body> opening + closing tags
- Leading _app_header.html include + _version_badge.html include
  (base.html handles both)

Preserves per-page CSS (in head_extra), per-page JS (in scripts),
the Inter font preconnect (kept inline; not hoisted to base in
this PR — separate decision).

Pilots the migration recipe before the 4 larger pages.

* refactor(memory): extend base.html

Same recipe as install.html. corporate_memory.html now inherits
<html>/<head>/<body> + nav + app.js script tag from base.html.
Page-specific CSS and JS preserved in head_extra + scripts blocks.

* refactor(memory-admin): extend base.html

Same recipe as install/corporate_memory. Curation page now in the
shared rendering pipeline.

* refactor(catalog): extend base.html

catalog.html had the most complexity: 7 head-level assets (chart.js,
Prism, prism-sql, metric_modal.css link + 2 preconnects + Inter
stylesheet), 5 body-level <script> blocks including a <script type=
"module"> for the metric modal, 2 duplicate style-custom.css links
in <head>. The migration script preserved all of them — head-level
externals hoisted to {% block head_extra %} in source order, body
scripts relocated to {% block scripts %} in source order (so chart.js
loads before the IIFE that builds Chart instances), duplicate
style-custom.css links dropped (base.html provides one).

* refactor(admin-tables): extend base.html + carry data-source-type

The biggest of the 5 standalones at 3563 lines. <body data-source-
type="{{ data_source_type }}"> attribute carried through via the
new {% block body_attrs %} slot (admin_tables JS reads
document.body.dataset.sourceType to switch between keboola and
bigquery rendering paths).

* release: 0.54.10 — UI design system unification + homepage status frame + initial workspace override + store guardrails

Co-Authored-By: zdenek.srotyr <zdenek.srotyr@keboola.com>

* refactor(web): migrate remaining templates to canonical design primitives

- admin_group_detail: .data-table, .btn family, appToast(), remove duplicate table/button/toast CSS
- admin_store_submission_detail: .data-table, .btn family, appToast(), remove duplicate btn/toast CSS
- profile_sessions: .data-table, _page_hero.html, remove duplicate table/title CSS
- me_debug: .data-table, .btn family, remove duplicate table/button CSS
- marketplace: .btn-primary/.btn-secondary, remove duplicate button CSS
- store_edit: remove duplicate .btn-primary/.btn-link CSS, canonical button classes
- store_upload: remove duplicate .btn-primary/.btn-secondary/.btn-link CSS

Co-Authored-By: zdenek.srotyr <zdenek.srotyr@keboola.com>

---------

Co-authored-by: Devin AI <158243242+devin-ai-integration[bot]@users.noreply.github.com>

2026-05-14 13:28:03 +02:00

ZdenekSrotyr

b4d3c576af

Activity Center: audit log + telemetry + sessions + agnes_* tables (#278 )

* docs(spec): admin observability spec + Activity Center MVP plan

Parent spec (480 lines) + executable plan (2295 lines, 14 TDD tasks).
Covers Activity Center rebuild (/admin/activity), with /admin/sessions
and /admin/feedback deferred to follow-up plans.

Already incorporates reviewer-pass revisions across three angles
(security, production resilience, code architecture):
- _get_db import path corrected to app.auth.dependencies
- Test fixtures aligned with seeded_app / admin_user / get_system_db
- All new audit writes wrapped in try/except + logger.exception
- Filename sanitization on session uploads
- DuckDB DESC index behavior documented; upgrade window flagged
- Migration idempotency + evolved-DB test cases
- reveal_raw + shared-cache multi-worker explicitly deferred

Targets schema v40 (audit_log gains params_before, client_ip,
client_kind, correlation_id + 3 indices).

* feat(db): schema v40 — audit_log gains params_before, client_ip, client_kind, correlation_id + 3 indices

* chore(test): clean up Task 1 — drop unused import, rename stale test

* feat(audit): AuditRepository.log() accepts params_before/client_ip/client_kind/correlation_id

* test(audit): strengthen params_before assertion to round-trip JSON content

* feat(audit): AuditRepository.query() rich filters + keyset cursor pagination

* feat(sync): SyncStateRepository.list_recent() cross-table feed

* feat(audit): POST /api/sync/trigger writes audit_log row

* feat(audit): POST /api/scripts/run-due writes audit_log row

* feat(audit): POST /api/upload/sessions writes audit_log row + sanitizes filename

* feat(audit): GET /api/data/{table_id}/download writes audit_log row

* feat(activity): /api/admin/activity timeline + /health + /sync endpoints

* feat(ui): /admin/activity rebuilt — health pulse, timeline, sync grid; /activity-center → 308 redirect

BREAKING: removed demo executive-pulse / maturity-roadmap content from activity_center.html.
The page now reflects real audit_log + sync_history data.

* feat(ui): admin nav + dashboard widget point at /admin/activity

* feat(activity): recursive-audit suppression for AC read endpoints (60s window per actor+filter)

* feat(activity): emit PostHog events when integration enabled (no-op default)

* fix(audit): move v40 indices out of _SYSTEM_SCHEMA + update test_repositories to unpack query() tuple

_SYSTEM_SCHEMA CREATE INDEX on audit_log(timestamp) failed when migration
tests hand-roll a bare audit_log (id, action) without the timestamp column.
Fix: remove indices from _SYSTEM_SCHEMA; add ADD COLUMN IF NOT EXISTS guards
for timestamp and other pre-v40 columns in _v39_to_v40() so the upgrade path
is safe on any hand-rolled schema; call _v39_to_v40 explicitly in the
fresh-install (current==0) path to restore index creation there.

Also unpack the (rows, next_cursor) tuple from AuditRepository.query() in
the three TestAuditRepository tests that still treated it as a list.

* docs: CHANGELOG entry for Activity Center MVP

* chore: refresh stale module docstring in app/api/activity.py

* feat(cli): agnes admin activity — terminal access to Activity Center (timeline + health + sync)

* fix(db): _v39_to_v40 — add IF NOT EXISTS guard for 'action' column

The v39→v40 ladder step adds defensive ADD COLUMN IF NOT EXISTS for
every audit_log column so a hand-rolled bare audit_log (id only) is
safe through the ladder. 'action' was missing from the guard list,
causing CREATE INDEX idx_audit_action_time to fail on tests that
stub audit_log with only an id column (tests/test_e2e_extract.py::
TestSchemaMigration::test_migration_preserves_and_extends).

Local 6/6 schema tests + the previously-failing CI test pass.

* docs(spec): platform telemetry epic — Boss directive + Activity Monitoring plan rebased onto v40 (stacked on zs/spec-activity-center)

* feat(db): schema v41 — 7 usage_* tables for telemetry (events, summary, rollups, attribution)

* chore(db): tighten v41 — usage_session_summary.session_id NOT NULL + upgrade test asserts all 7 tables

* feat(usage): UsageAttributionRepository — replace/delete/lookup over usage_attribution_* tables

* refactor(marketplace): extract list_inner_skills/agents/commands to src/marketplace_listing.py for reuse

* feat(usage): explode plugin attribution on marketplace sync + store entity write; backfill script

* refactor(marketplace): finish src/marketplace_listing.py extraction — drop duplicate _list_inner_* + _parse_frontmatter from app/api/marketplace.py

* feat(usage): promote attribution helpers to src/usage_attribution_helpers.py; hook update_entity rename + bundle-swap; clarify best-effort semantics

* feat(usage): UsageProcessor real extraction + rollup rebuild + 10 fixture-driven tests

* fix(usage): include tool_id in event hash + executemany + rollup transaction (critical multi-tool-turn drop fix)

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(marketplace): popularity stats — invocations_30d + trend + sort=most_used|trending + Most Popular section

* feat(admin): /admin/users/<id> Sessions section — list + single-file + bulk-zip downloads (audit-logged)

* feat(usage): admin export endpoint + CLI — csv/json/parquet streaming, filters, audit-logged

* feat(usage): agnes admin ask — LLM Text-to-SQL over usage_events with SELECT-only validator (audit-logged)

* feat(usage): reprocess + prune endpoints + scheduler daily prune job + CLI

* docs: PLATFORM_SETUP.md operator playbook + HOWTO/ cookbook (5 guides + index)

Adds docs/PLATFORM_SETUP.md as a consolidated operator playbook covering
bootstrap, TLS, marketplaces (curated + flea), scheduler env vars, telemetry
extraction/export/ask/prune, privacy posture, and daily routine.

Adds docs/HOWTO/ with 5 analyst cookbook guides: first query, snapshots for
remote tables, private sessions, feedback + admin ask, and customizing skills.

Existing setup docs (QUICKSTART, DEPLOYMENT, ONBOARDING, HEADLESS_USAGE)
get a one-line cross-reference at the top pointing to PLATFORM_SETUP.md.

* docs(changelog): platform telemetry epic — usage_* foundation + surfaces + admin access + docs

Comprehensive [Unreleased] entry covering: usage_events/session_summary/
tool_daily/plugin_daily tables (v41), attribution lookup tables, backfill
script, marketplace Most Popular + invocation chips + sort, admin Sessions
section, export/ask/reprocess/prune endpoints + CLI mirrors, Activity Center
(v40), PLATFORM_SETUP.md + HOWTO/ docs, and operations notes for v41 upgrade.

* fix(security): block DuckDB read_*/http_*/glob functions in usage_ask validator + symlink escape guard in session zip + clarify mark-private semantics

* fix(admin): parquet export tempfile cleanup on COPY failure + correct processed-first sort on /admin/users/<id>/sessions

* feat(audit): close 8 production audit gaps — query (local/remote/hybrid), catalog/schema/sample, snapshot estimate/create, check-access

* feat(ui): /admin/usage summary dashboard + per-user activity tab on /admin/users/<id>

* fix(audit): cap error messages at 200 chars + audit user_activity reads + recursion guard on usage.summary

* fix(audit): catalog.list audits on error path + clean up deferred json import

* fix(ux): client_kind=cli for PAT auth + timeline empty state + email-instead-of-uuid + nav reorder + help text + loading indicators + ask doc

* feat(observability): unify /admin/activity into single page with saved views

- KPI cards (events, users, error rate, p95) clickable as quick-filters
- Faceted filter dropdowns populated from audit_log in the current window
- Sortable audit table, cursor pagination, per-row JSON side panel
- Saved views (schema v43: user_observability_views) — per-user state
- Top bar: window selector + 30s Live toggle + saved views dropdown
- /admin/scheduler-runs → 308 redirect (source=scheduler filter)
- New endpoints: /api/admin/observability/{facets,kpis,views}

* test: update activity + scheduler-runs tests for unified page

- test_admin_activity_page_renders asserts new structural anchors
- test_admin_scheduler_runs_page_admin_only asserts 308 redirect

* fix(observability): respect [hidden] on modal + side panel

CSS `display: flex` on .obs-modal beat the [hidden] attribute's UA
display:none, so the save-view modal rendered on page load and Cancel
clicks couldn't dismiss it. Gate the modal's flex layout on
:not([hidden]); add the same display:none guard prophylactically to
.obs-panel and .obs-views-panel.

* feat(observability): user enrichment in audit + interactive /admin/usage

Activity:
- /api/admin/activity now joins users for user_email + user_name per row
- User column renders "name (id-prefix)" or "email (id-prefix)" instead
  of an opaque truncated UUID; falls back to id when the user record is
  missing

Usage:
- /admin/usage rewritten as the same filter/group-by/search pattern as
  /admin/activity. Faceted dropdowns (User / Tool / Source / Event type)
  populated from usage_events; debounced free-text search across
  tool_name / skill_name / subagent_type / command_name
- New endpoints /api/admin/usage/{facets,kpis,query}; the query endpoint
  supports group_by in {day, username, tool_name, source, ref_id} with
  sort + offset pagination, plus an ungrouped raw-events mode
- 4 KPI cards (events, distinct users, distinct tools, error rate) are
  clickable quick-filters; clicking a grouped row applies the bucket as
  a filter
- Old static `?window=7d|30d|all` server preload removed; all state is
  client-side via since_minutes + group_by + filters in the URL

* fix(observability): clearer labels, all-column sort, drop saved views UI

- Rename page titles: "Activity" → "Server activity", "Usage" → "Tool usage"
  with a one-line subtitle on each explaining what the page covers and
  linking the other one. The two pages source different data (audit_log
  vs usage_events) and the previous labels conflated them.
- Drop the saved-views dropdown + save modal from /admin/activity. The
  modal pop-open bug was the trigger; the value wasn't there yet. The
  /api/admin/observability/views CRUD + DuckDB table stay in place.
- Rename "Live (30s)" to "Auto-refresh (30s)" with a tooltip clarifying
  that it's the re-fetch rate, not the time range. Time range now
  labeled "Time range" instead of "Window".
- All audit-table columns are sortable (User, Source, Action, Resource,
  Result added); sort is page-local with a Jinja comment explaining the
  trade-off. Same for raw usage rows.
- Fix duplicate sort-arrow bug — the literal "▼" in the Time th HTML was
  rendering alongside the CSS ::before arrow. Removed the literal; CSS
  is the single source of truth.

* feat(observability): global Sessions browser + transcript viewer + CLI

Web:
- /admin/sessions — list every collected session JSONL across all users
  with time-range, user, model, errors-only and free-text filters. Default
  sort surfaces error-heavy sessions first. KPI cards (sessions, distinct
  users, sessions w/ errors, tool error rate) clickable as quick-filters.
- /admin/sessions/<username>/<file> — transcript viewer rendering the
  JSONL chronologically: user prompts, assistant text, tool calls (with
  JSON input) and tool results (with flattened output). Errors get a red
  border + chip and a "Next error" navigation button at the top.
- Admin dropdown gains a "Sessions" link.

API:
- GET /api/admin/sessions/{list,kpis,facets} — filtered cross-user reads
  off usage_session_summary
- GET /api/admin/sessions/{username}/{file}/transcript — parses JSONL via
  the existing services.session_pipeline.lib, returns chronological events
- GET /api/admin/sessions/{username}/{file}/download — JSONL stream, same
  path-safety guards as the per-user endpoint, audit-logged

CLI:
- `agnes admin sessions list [--user X] [--errors] [--since 7d]` — table
  output with `!` prefix on rows that hit a tool error
- `agnes admin sessions show <username> <file>` — transcript dump, with
  `--errors` to print only the failed tool_result blocks
- `agnes admin sessions download <username> <file> [-o path]`
- `agnes admin sessions kpis` — top-level numbers

* feat(internal): expose telemetry tables to agnes query with row-level RBAC

Three new registered tables backed by system.duckdb, queryable through
the same /api/query plumbing analysts use for Keboola / BigQuery /
local sources:

  agnes_sessions  → usage_session_summary  (filter: username)
  agnes_usage     → usage_events           (filter: username)
  agnes_audit     → audit_log              (filter: user_id)

RBAC is per-row, not per-table: admins see every user's rows; non-admins
see only their own. The filter is built server-side from the auth user
dict; non-admin filter values are regex-validated before SQL interpolation.

Implementation:
- new connector connectors/internal/ with access (filter+exec) + registry
  (idempotent table_registry seed at startup)
- /api/query detects internal table refs and short-circuits to a CTE
  wrapper that prepends "WITH agnes_x AS (SELECT * FROM <src> WHERE …),
  …" then "SELECT * FROM (<user_sql>) AS _q". DuckDB cursor on the
  shared system.duckdb handle — opening parallel handles / ATTACH on the
  same file is blocked process-wide.
- mixing internal + BQ / registered local tables in one SELECT is
  rejected (v1 limitation)
- src.rbac.can_access_table waves internal tables through for all
  authenticated users; row scoping is the actual security control
- /api/v2/schema and /api/v2/sample gained internal branches; sample
  intentionally skips its cache because rows are RBAC-scoped per caller
- audit row written as action='query.internal' with is_admin flag

Tests: connectors/internal/access — RBAC, filter clause, schema, CTE
wrapper coexistence with user-supplied aggregations, unsafe-username
rejection. 16/16 passing.

Motivating queries this enables:
  SELECT tool_name, COUNT(*) FROM agnes_usage
   WHERE is_error GROUP BY 1 ORDER BY 2 DESC
   -- analyst self-introspection: which tools fail for me?

  SELECT user_id, COUNT(*) FROM agnes_audit
   WHERE action = 'session.transcript_view' GROUP BY 1
   -- admin: who's been looking at whose session transcripts?

* feat(admin): group dropdown into 5 named sections + internal tables in /catalog

Admin dropdown gains section headers so admins can land on the right
page without re-reading the full menu:

  Activity Center      Server activity / Tool usage / Sessions
  Users & Access       Users / Groups / Resource access / Tokens
  Data                 Tables
  Agent Experience     Curated Marketplaces / Flea Submissions /
                       Agent Setup Prompt / Agent Workspace Prompt
  Server               Server config

"Agent Experience" frames the curated content + prompts as one cluster
— it's all admin-controlled material that shapes what an analyst's AI
agent encounters. "Configuration" → "Server" since only one item lives
there now.

Renamed the section's first two items:
  "Activity" → "Server activity" (matches page H1)
  "Usage"    → "Tool usage"

Also fixes /catalog visibility of the internal tables (agnes_sessions /
_usage / _audit) for non-admin users: ``app.auth.access.can_access``
short-circuits to True for resource_type='table' + an internal-table id.
Without this, non-admins saw the tables in /api/v2/catalog (which uses
the same RBAC bypass) but not on the /catalog HTML page (which calls
can_access directly, requiring a resource_grants row internal tables
don't have).

CSS for `.app-nav-menu-section`: small caps, muted, non-clickable; first
section trims top padding so the panel doesn't open with an awkward gap.

* refactor(admin): move corporate memory into Admin > Agent Experience

Memory link was the only admin-only entry in the primary nav (gated by
session.user.is_admin). Moves it into the Admin dropdown under Agent
Experience, alongside Curated Marketplaces / Flea Submissions / Prompts
— all admin-curated content that shapes what an analyst's AI agent
encounters.

Renamed the nav label to "Shared Knowledge" to match what the page
actually is (admin-curated organisational knowledge from session
verification, surfaced to agents). URL stays at /corporate-memory; the
route still gates on require_admin per the existing comment.

Side effect: primary nav (Home / Marketplace / Data Packages) is now
uniform for every authenticated user — no conditional admin-only entry.

* ui: rename admin entries to Curated Knowledge / Init Prompt / Workspace Prompt

- "Shared Knowledge" → "Curated Knowledge" (parallel with "Curated
  Marketplaces" in the same Agent Experience section; "curated" tells
  the admin what they do there — review + approve)
- "Agent Setup Prompt" → "Init Prompt" (matches the `agnes init` flow
  it actually drives)
- "Agent Workspace Prompt" → "Workspace Prompt" (the "Agent" prefix
  was redundant — every item in the section is agent-facing)

Renames page titles + H1s on /admin/agent-prompt and
/admin/workspace-prompt to match.

* refactor: rename Usage → Telemetry across user-facing surfaces

External surfaces all switch; internal Python module / file names and the
physical DB tables (usage_events, usage_session_summary, usage_tool_daily,
usage_plugin_daily) stay — renaming them would force a schema migration
+ a redo of the LLM Text-to-SQL prompt for no analyst-visible win.

Changes:
- Admin dropdown: "Tool usage" → "Telemetry"
- Page H1 / <title>: same
- URL: /admin/usage → /admin/telemetry; old URL 308-redirects
- API prefix: /api/admin/usage/* → /api/admin/telemetry/*
- CLI: primary command `agnes admin telemetry …`; `agnes admin usage` kept
  as a deprecated alias so existing operator scripts keep working
- Internal data-source table id: agnes_usage → agnes_telemetry. The
  registry seed now evicts any stale internal-source row whose id no
  longer matches INTERNAL_TABLES, so the old `agnes_usage` row is
  removed from table_registry on next app boot
- All tests + JS endpoint paths updated

* test(rbac): include auto-appended internal tables in expectations

get_accessible_tables now appends agnes_sessions / agnes_telemetry /
agnes_audit to every authenticated user's accessible-tables list so the
internal data source shows up in /catalog. The two existing rbac tests
asserted hardcoded list shapes that pre-dated the change.

Rewritten to assert "granted tables + the canonical internal-table set"
instead of literal lists, so the test stays correct if the internal
table roster changes again later.

* ui: visual dividers between admin-dropdown sections

Adds a 1px top border + 6px top margin to every section header except
the first, so the five named groups (Activity Center, Users & Access,
Data, Agent Experience, Server) read as visually separated clusters.
The header itself stays small-caps + muted as before — the border is
additive.

* ui(memory): match obs-topbar visual on /corporate-memory

The Curated Knowledge page (linked from the admin dropdown's Agent
Experience section) opened straight into the stats bar — no title,
no subtitle, no shared chrome with the other admin pages. Adds an
obs-topbar-style header at the top of .container-memory:

  - H1 "Curated Knowledge"
  - subtitle explaining what the page is + how AI agents pull from it

The `.ck-*` class set duplicates the inline obs-* styles from
/admin/activity etc. for this one page; promoting the obs-* class set
to style-custom.css for shared reuse is the obvious next step (4 pages
already inline the same CSS), tracked as a follow-up.

Page <title> also renamed from "Corporate Memory" → "Curated Knowledge".

* ui(tables): list Agnes internal tables in /admin/tables + group in /catalog

/admin/tables previously rendered three per-source-type listings
(BQ / Keboola / Jira) and dropped any row whose source_type didn't
match — so the agnes_sessions / agnes_telemetry / agnes_audit rows
seeded into table_registry were invisible. Adds a fourth read-only
section "Agnes internal tables" that filters source_type === 'internal'
and renders the same registry-table layout the other sections use,
with two changes:

  - no Register button (these rows are seeded on every app boot from
    connectors/internal/registry.py)
  - Edit + Delete actions hidden (any change would be reverted on the
    next start). Manage access stays so admins can still inspect.

Mode badge picks up a new mode-internal CSS class (teal accent) so the
display doesn't lie and call it "local".

In /catalog, internal tables now group under an "agnes" accordion
section (bucket="agnes" on seed) instead of falling into the catch-all
"default". Single source of truth for which tables exist; admins find
them where they expect.

* ui(tables): Agnes internal as a 4th tab next to BQ/Keboola/Jira

Previous iteration mounted the internal-table listing as a separate
standalone card under the tab strip. Reshapes it to a proper
tab-content section so admins switch between data sources via one
consistent nav (BigQuery / Keboola / Jira / Agnes internal).

- New tab button "Agnes internal" in the tab-nav.
- The listing card becomes <section id="tab-content-internal"
  class="tab-content">; switchTab() already routes by id so no JS
  change beyond extending the hash allowlist for direct #internal
  links.
- Tab content keeps the read-only treatment from the previous commit
  (no Register button, no Edit / Delete in renderRegistryListing).

* ui: rename Curated Knowledge → Curated Memory

Settles the naming back on "Curated Memory" — parallel structure with
"Curated Marketplaces" in the same Agent Experience section, and zero
rename ripple: URL (/corporate-memory), API (/api/memory/*), CLI
(agnes admin memory), and Python modules all stay on "memory" so the
admin label finally lines up with the underlying surfaces.

The "Curated" prefix still tells admins what they do on the page
(review pending → approve / mandate / reject) and reads as a sibling
of "Curated Marketplaces" right next to it in the dropdown.

Touches: admin dropdown label, page <title>, page H1. DB tables stay
on knowledge_* (already the canonical naming for the data shape).

* ui: rename "Server activity" → "Audit log"

"Audit log" is what the page actually is — server-side audit_log table
rendered with KPI cards + filter bar + sortable table. The "Server
activity" label confused the term with Claude Code session telemetry
(Telemetry page) and didn't make the source/concept clear.

Touches:
- Admin dropdown nav label
- /admin/activity page H1 + subtitle
- /admin/telemetry subtitle cross-link
- test_activity_api page-renders assertion

URL (/admin/activity) and API (/api/admin/activity/*) stay — the
"activity" name has stuck at the route layer for a year; rerouting
those would churn dashboards/bookmarks for zero analyst-visible win.

* ui(admin-nav): gray band on each section header for clearer separation

Previous iteration used a 1px top border between section labels — the
labels still blended into the items above/below at a glance. Switches
to a light gray background band per section header, extended edge-to-
edge inside the panel via negative horizontal margins. Bolder
font-weight (700) reinforces the separation; bumping the font color
isn't needed because the band itself does the work.

First section's header tucks into the panel's top border-radius so the
band reaches the corners without a gap.

* ui(catalog): rename internal-table category to "Agnes Internal"

`bucket` is what /catalog renders as the accordion category header
verbatim — "agnes" lowercase didn't read as a real category name and
got confused with a system identifier. Bumps to "Agnes Internal".
Seed re-applies on every app boot so existing rows pick up the new
bucket value via `ON CONFLICT (id) DO UPDATE`.

* ui(catalog): split Agnes Internal into its own card on /catalog

Previously the three internal tables landed inside the "Core Business
Data" card under an "Agnes Internal" accordion alongside Keboola / BQ
buckets — readers conflated system telemetry with business datasets,
and the data_stats header counter ("3 tables · ~X rows total") only
ever counted synced rows so internal tables looked invisible.

Split the catalog page into two cards:
- Core Business Data: only non-internal source_types (Keboola, BQ,
  Jira). Accordions group by bucket as before. Stats counter reflects
  this card's tables.
- Agnes Internal: a dedicated card with its own visual treatment
  (teal accent matching the mode-internal badge in /admin/tables).
  Flat list (no accordion — only 3 rows, never grows here), each
  row carries the canonical `agnes query` snippet. Read-only — no
  profiler click, no In-stack toggle, no sync metadata.

Route adds `internal_card` context object; template renders the new
card only when it's non-None.

* fix(rbac): hide internal tables from /admin/access + drop "my" framing

Two related cleanups for the Agnes-internal tables:

1. /admin/access (resource grants) no longer lists them. The
   `can_access` check has a hardcoded internal-table bypass — security
   is row-level (per-request view filter), so a table-grain
   `resource_grants` row would do nothing. Surfacing them in the UI
   let admins set up grants that silently no-op. Filter at the
   `_table_blocks` projection so the UI tree never sees them.

2. Display names drop the analyst-perspective "my" framing:
     "Agnes — my sessions"          → "Agnes sessions"
     "Agnes — my telemetry events"  → "Agnes telemetry events"
     "Agnes — my audit log"         → "Agnes audit log"
   The "my" only makes sense from the querying analyst's seat
   (`SELECT … FROM agnes_sessions` returns *their* rows); on /admin/*
   pages where admin sees / configures them across users, the
   pronoun was misleading. Description text now spells out the
   row-level RBAC contract explicitly.

Display names update via TableRegistryRepository.register's ON CONFLICT
UPDATE on next app boot; no manual cleanup needed.

* ui: subtitle notes about agnes_* tables on each Activity Center page

The recursive observability story — Agnes serves its own audit /
telemetry / session data through the same `agnes query` plumbing
analysts use for business data — wasn't surfaced anywhere on the
admin pages that show that data. Three pages get a one-liner with
the canonical `agnes query` snippet + the RBAC contract (analysts
see their own rows, admin sees all):

- /admin/activity (Audit log)   → agnes_audit
- /admin/telemetry (Tool usage) → agnes_telemetry
- /admin/sessions               → agnes_sessions

Sets up the discovery moment for admins: they're reading the page,
they see "you can query this from Claude Code", they remember it
when an analyst asks "how do I find my own failed tool calls?".

* ui(tables): explain "Show log" empty-state on /admin/tables

Cache warmup log <pre> renders with a dark background and is only
populated by the SSE stream during a Re-warm all run. Opening the
page cold + clicking Show log just revealed a black bar with no
context — admins couldn't tell what they were looking at.

Adds an inline paragraph above the <pre> explaining what the log is,
the row format, when it fills in, and where to find the historical
audit trail (/admin/activity). The actual <pre> stays empty until
SSE events arrive, but the surrounding copy carries the meaning.

* ui(tables): auto-open cache-warmup log on Re-warm all click

A Re-warm all run takes ~24s per remote BQ row. With the <details>
collapsed by default, operators saw the button disable, watched a
quiet ~24s pass, and assumed nothing had happened — the streaming
log was hidden behind a closed disclosure.

Two small JS tweaks:
- cacheWarmupRun() opens the details on click, so streamed lines
  appear without an extra interaction
- cacheWarmupOnStart() hides the inline hint paragraph the moment
  real log content lands, so the dark log block isn't competing
  with redundant context

Hint paragraph also clarifies that only `query_mode='remote'` BQ
rows are warmed — operators with only materialized/internal tables
would see total=0 and the page would "do nothing" by spec.

* ui: trim Agnes internal copy across surfaces

Descriptions had grown to explain the extraction pipeline ("parsed
out of session JSONLs"), the underlying table ("Backed by
usage_session_summary"), the RBAC mechanic ("row-level RBAC at query
time — analysts see their own; admin sees all"), and the SQL snippet.
Every implementation detail meant another rewrite on the next iter.

Strips to one stable line per surface: what the data is, plus
"Also available locally for analysis". Mechanics live in code +
docs; the page copy says what the user needs to know.

Touched:
- connectors/internal/access.py: INTERNAL_TABLES descriptions
- activity_center.html / admin_usage.html / admin_sessions.html
  subtitles
- catalog.html Agnes Internal card description + row strip
- admin_tables.html "Agnes internal" tab hint

* fix(internal): is_user_admin arity bugs + + saved-view payload cap

Round-1 code review (PR #278) caught two blocking bugs and three nits.

Blocking — both `is_user_admin(user)` (single dict arg) calls raised
TypeError. is_user_admin signature is `(user_id, conn)`. Affected:

- app/api/query.py:_run_internal_query — every POST /api/query that
  references agnes_sessions / agnes_telemetry / agnes_audit blew up
  with a 500. The headline analyst-facing feature of this PR was
  unusable through the API.
- app/api/v2_sample.py — same shape; `GET /api/v2/sample/agnes_*`
  returned 500.

Both fixed to call `is_user_admin(user.get("id"), conn)`. Added two
FastAPI-level tests in test_internal_data_source.py that go through
the TestClient — the existing unit tests on `execute_internal_query`
and `build_filter_clause` skipped the request-handler layer where the
bugs lived, which is why this landed.

Nits also closed:
- connectors/internal/access.py: `+` allowed in _USERNAME_RE /
  _USER_ID_RE so RFC 5321 email local-parts (alice+test@x) resolve
  correctly without hitting InternalAccessError.
- app/api/observability.py: saved-view payload capped at 64 KiB to
  prevent an admin from bloating system.duckdb with a malformed save.

* fix(security): close non-admin data-leak via underlying-table refs

PR #278 R2 review surfaced a non-admin-exploitable bypass: SQL whose
string literal contains 'agnes_sessions' routed into the privileged
internal-query path, then queried the underlying physical table
(usage_session_summary / usage_events / audit_log) directly, escaping
the CTE wrapper's row filter. Two reinforcing defenses:

1. find_internal_refs() now strips single-quoted string literals
   before scanning for alias names — a literal alone no longer
   routes the request into the privileged code path.

2. execute_internal_query() rejects non-admin SQL that references
   the underlying physical tables (usage_*, audit_log). The CTE
   wrapper only scopes the agnes_* aliases; a direct FROM on the
   base table — or a shadowing inner WITH that still has to read
   the base table — bypasses RBAC. Block before execution with an
   actionable error pointing to the agnes_* alias. Admins are
   unaffected (god-mode short-circuit on the filter clause).

3. tests/test_internal_data_source.py — three new negative tests
   covering literal-only matches, direct-table refs, and CTE
   shadow attempts.

Also tightens usage_ask.py's SELECT-only validator: pragma_table_info,
pragma_storage_info, pragma_database_*, and duckdb_tables / columns /
views / indexes / schemas are reflection functions that leak metadata
the analyst question shouldn't reach. \bPRAGMA\b in _FORBIDDEN never
matched the function-call form (word-boundary between `A` and `_`).

* fix(security): dynamic denylist for non-admin internal queries

R3 review (PR #278) caught a wider data-leak than R2: the underlying-
physical-table guard listed only the 7 usage_* + audit_log tables,
but system.duckdb has 30+ other sensitive tables — users (emails +
ids), personal_access_tokens, resource_grants, user_groups,
user_observability_views, store_*, marketplace_*, knowledge_*, etc.
A non-admin SQL like

    SELECT * FROM agnes_sessions
    UNION ALL SELECT email, id, … FROM users LIMIT 1

would leak every user's row.

Replaces the hardcoded denylist with a **dynamic allowlist** —
non-admin SQL may reference ONLY the registered agnes_* aliases.
Every other table in `information_schema.tables` (main schema) is
rejected. Future migrations that add a new sensitive table are
automatically covered without re-editing this module.

Also strips SQL comments (`/* */` and `--`) before the identifier
scan so a comment-wrapped table name (`/**/users/**/`) can't slip
past the regex.

Four new negative tests pin: `users`, `personal_access_tokens`,
block-comment wrap, line-comment wrap.

Plus: per-user view-count cap (100) on /api/admin/observability/views
so an admin can't fill system.duckdb with thousands of saved views.

* release: 0.54.0 — Activity Center + Telemetry + Sessions + internal datasource

Cuts the work shipped across this PR (Activity Center build, recursive
internal data source) into a versioned release. Bumps pyproject.toml
to 0.54.0; renames the top of CHANGELOG.md from [Unreleased] to
[0.54.0] — 2026-05-12 with a header summary; opens a fresh
[Unreleased] section for the next round.

---------

Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-05-12 22:41:19 +02:00

3 commits