agnes-the-ai-analyst

Author	SHA1	Message	Date
ZdenekSrotyr	500db8cd3c	fix(query-guardrail): dry-run user SQL not synthetic SELECT * (#171 ) Closes #171. The /api/query cost guardrail used to dry-run a synthetic `SELECT * FROM <table>` for each registered remote-BQ row referenced by the user SQL — which made BigQuery estimate a full table scan, with column projection, predicate pushdown, and partition pruning all disabled. Narrow queries on big partitioned/clustered tables (the documented happy path for `agnes query --remote`) hit ~30,000× over-estimates and got rejected with 400 `remote_scan_too_large` even when BQ's own dry-run reported single-digit MB. Pavel's report on #171 traced the root cause and proposed the fix: rewrite the user SQL to BQ-native syntax and dry-run it as a single job, exactly the way `bq query --dry_run` works. Implementation: - New helper _rewrite_user_sql_for_bq_dry_run rewrites bare registered names (word-boundary, case-insensitive, longest-first to avoid prefix collisions) + bq."<ds>"."<tbl>" forms to backticked `<project>.<ds>.<tbl>` paths. - _bq_quota_and_cap_guard runs ONE dry-run on the rewritten SQL. Cap check uses the real estimate. - Fallback path: if BQ rejects with bq_bad_request (e.g. DuckDB-only syntax like ::INT casts), the guard falls back to the pre-fix per-table SELECT * approach so non-portable queries still get a (loose) cap estimate instead of fail-opening. Non-parse BQ errors (forbidden, upstream) still propagate as 502. - _bq_guardrail_inputs now also returns name_lookups so the rewriter has the (registered_name, bucket, source_table) mapping it needs. - Per-table breakdown is unavailable from a composite dry-run; total bytes are pinned to dry_run_set[0] for the post-flight record_bytes(sum(...)) call to keep returning the right total. Tests (7 new, 3 existing still pass): - dry-run receives rewritten user SQL with WHERE clause intact (the load-bearing assertion for #171) - single dry-run per request even with multiple registered tables (JOIN, UNION) referenced - fallback to per-table SELECT * on bq_bad_request - non-parse BQ errors (forbidden) still 502 - rewriter unit tests: bare + bq.path in same SQL, longest-name-wins on prefix collision, case-insensitive bare-name match	2026-05-04 21:08:21 +02:00
ZdenekSrotyr	bd462187e8	test(welcome-template): tighten default-rendered assertions to new agnes verbs The renderer no longer emits the legacy "da analyst setup" verb (the analyst flow uses `agnes init`, the admin flow uses `agnes auth import-token`). The disjunction assertions ("da analyst setup" OR "agnes auth" OR "curl") were permissive and would have silently kept passing even if the renderer regressed. Replace them with role-aware assertions that match the actual emitted markers and explicitly check that no legacy verb survives.	2026-05-04 21:07:51 +02:00
ZdenekSrotyr	8890b6f09b	fix(post-merge): clean up stale `da` verbs introduced via #174 merge Four call sites where #174 (branched from main before the agnes rename fully landed in some files) emitted or referenced `da fetch`. None are operator-visible runtime crashes — but `extractor.py` logs a stale verb to the operator log and `DATA_SOURCES.md` is current docs: - connectors/bigquery/extractor.py:431,434 (operator-facing log line on unverified BQ entity_type — was suggesting `da fetch`). - docs/DATA_SOURCES.md:77,85 (current public docs, two refs to `da fetch` in the workflow + the BQ scope description). - tests/test_cli_query_render.py:7 (module docstring listed `da fetch / agnes schema / etc.` — now `agnes snapshot create / agnes schema / etc.`). - tests/test_cli_snapshot_create.py:1 (docstring referenced `(folded from `da fetch`)` — historical, removed; no value once the rename landed). Pre-existing stale `da` references elsewhere in the branch (templates, operator runbooks, internal comments) are not touched by this commit — they live outside the merge surface and are a separate cleanup task. Verified: 10/10 across the affected test files pass.	2026-05-04 20:57:36 +02:00
ZdenekSrotyr	e438170ade	merge: pull #174 (BQ materialize view fix + concurrency, 0.33.0) into bootstrap branch Brings in zs/materialize-sync-fix (PR #174): - BigQuery view materialize works (wrap admin SQL in bigquery_query()) - Per-table mutex + fcntl.flock for concurrent COPY corruption - Cost guardrail dry-run engages on materialized rows - Schema v23 -> v24 migration: rewrite source_query to BQ-native - Server-generated trivial source_query from bucket+source_table - Validator backtick relaxation for materialized rows - 0.33.0 release cut Conflict resolution: - CHANGELOG.md: keep our [Unreleased] (bootstrap rewrite content) ABOVE the new [0.33.0] section from #174. The bootstrap rewrite remains unreleased; it'll cut 0.34.0 (or later) when this PR merges to main. - tests/conftest.py: union — keep our analyst-bootstrap fixture re-export AND #174's bq_instance / stub_bq_extractor fixtures. - pyproject.toml auto-merged to 0.33.0 (matches the cut), correct. - src/db.py auto-merged: SCHEMA_VERSION = 24, _v23_to_v24_finalize added — no overlap with our work which left schema at v23. - CLAUDE.md auto-merged: schema-history paragraph extended with v24. Verified: 79/79 across CLI bootstrap suite + materialize suite + schema v24 migration tests pass locally on Python 3.13/macOS.	2026-05-04 20:53:00 +02:00
ZdenekSrotyr	ee83cebbda	fix(cli): Windows console crash on cs-CZ codepage (port + broaden #172 ) Ports Minas's PR #172 (against pre-rename `da` CLI on main) and applies the principle to the post-rename `agnes` CLI. Two distinct failure modes on Windows consoles whose default codepage is cp1250 (cs-CZ) / cp1252 (en-US): 1. `agnes pull` and other Rich-progress codepaths UnicodeEncodeError on Braille spinner glyphs. Fix: `cli/main.py` reconfigures stdout/stderr to UTF-8 with errors='replace' at import time on `sys.platform == 'win32'` so Rich's legacy-Windows render path emits decodable bytes. Wrapped in try/except so pytest's captured streams (which aren't TextIOWrapper) don't break. 2. `agnes skills list` and `agnes skills show` UnicodeDecodeError when reading skill markdown containing em-dashes / accented chars. Default `Path.read_text()` uses locale.getpreferredencoding(False), which is the broken codepage on Windows. Fix: every call site passes encoding='utf-8' explicitly. Broader scope than #172 because: - The bootstrap rewrite renamed/removed several files Minas's PR patched (`cli/commands/analyst.py` -> rolled into init.py; `cli/commands/sync.py` -> split into pull/push). Those targets no longer exist; the equivalent code lives in init.py. - Other call sites Minas didn't touch (still bare in his branch) are patched here too — config.py / update_check.py / snapshot_meta.py / setup.py / skills.py — so the codebase has zero locale-default text I/O in cli/. Side cleanup: stale `Run `da`` reference in snapshot_meta.py:88 fixed to `agnes` while touching the file.	2026-05-04 20:45:29 +02:00
ZdenekSrotyr	e6a2c4c51d	tests: rename 'prj-grp' placeholder to 'my-project' for vendor-agnostic OSS The dashed identifier is what the test exercises (backticks required for dashed BQ project IDs); the literal string can be any synthetic value. 'prj-grp' is too close to a real customer-prefix pattern that the OSS vendor-scrub regex flags. 'my-project' matches placeholders used elsewhere in the project.	2026-05-04 20:38:47 +02:00
ZdenekSrotyr	e323ab76cc	fix(snapshot): catch httpx transport errors in --estimate path CI failure: test_readers_in_pre_init_dir asserted no Traceback in stderr when running `agnes snapshot create x --as y --estimate` in a folder that never saw `agnes init`. The estimate-guard fix in `3d587681` let `--estimate` skip the local_db check and reach `api_post_json`, but the existing `except V2ClientError` doesn't cover transport-layer failures. With no server configured the URL defaults to http://localhost:8000; httpx raises ConnectError → ConnectError isn't a V2ClientError → the exception bubbles up through Typer/rich as a full traceback. Add `except httpx.HTTPError` next to V2ClientError so connection / DNS / TLS / timeout failures all render the friendly hint `Run `agnes init …` first` instead of leaking transport noise.	2026-05-04 20:36:30 +02:00
ZdenekSrotyr	cd3293b994	release: 0.33.0 — BQ materialize view fix + concurrency control	2026-05-04 20:30:50 +02:00
ZdenekSrotyr	08e4959185	fix(push): read sessions from ~/.claude/projects/<encoded-cwd>/ Real bug: `agnes push` was reading `<workspace>/user/sessions/`, but Claude Code writes session jsonls to `~/.claude/projects/<encoded-cwd>/` and nothing on the analyst side ever copies them across. The SessionEnd hook ran `agnes push` happily and uploaded zero sessions every time. `cli/lib/claude_sessions.py` probes both Claude Code encoding variants (older `/`→`-` keeping spaces+tildes; newer all-non-alphanumeric→`-` with collapsed runs) and unions whichever exist. Users who upgraded Claude Code mid-project end up with both encoded dirs side-by-side on disk; the union ensures no session is left behind. Same-named jsonl in both dirs → newest mtime wins. `<workspace>/user/sessions/` survives as a fallback for any setup that explicitly mirrors sessions there. Verified on real disk: helper returns 2 dirs + 8 unioned session files for the Agnes-test workspace where the previous code returned 0.	2026-05-04 20:29:59 +02:00
ZdenekSrotyr	92d477e422	fix(setup): default /setup to analyst, hide admin tile from non-admins Three coupled UX fixes for the analyst-onboarding flow: 1. Dashboard "Setup a new Claude Code" CTA was rendering admin paste prompt for everyone (analysts couldn't actually execute the marketplace plugin install / skills setup steps). render_agent_prompt_banner now picks role based on user.is_admin — analysts get the analyst flow. 2. /setup default role changed from admin to analyst. Most visitors are analysts; admin layout is opt-in via the admin tile or ?role=admin. 3. Admin tile is admin-only on the role-tile nav. Non-admins see only the analyst tile. Server-side: non-admin requesting ?role=admin is silently downgraded to analyst (otherwise they'd see admin paste prompt despite no tile). Tests: - New: test_setup_page_admin_tile_hidden_for_non_admin (anonymous client can't see "Admin CLI" or role=admin link) - New: test_setup_page_admin_role_downgraded_for_non_admin (anonymous ?role=admin → analyst layout, no marketplace step in clipboard) - New: test_install_preview_default_role_is_analyst (admin signing in to bare /setup gets analyst clipboard by default) - Renamed: test_setup_page_default_role_is_admin → ..._is_analyst - Updated: test_setup_page_admin_clipboard_renders_admin_layout uses FastAPI dependency_overrides to inject admin user (admin layout is now admin-gated) - Updated: test_install_preview_visible_for_signed_in_user explicitly passes ?role=admin to exercise admin layout	2026-05-04 20:20:37 +02:00
ZdenekSrotyr	d8dc7c7799	fix: update legacy-string assertions in tests + onboarding template Caught by my own broader test scope after Devin fixes — three test files asserted on user-visible strings that were renamed by the bootstrap PR but the assertions weren't updated: - tests/test_api_query_guardrail.py:110 — asserted `da fetch in suggestion` on /api/query 400 response. Renamed to `agnes snapshot create`. - tests/test_query_materialized_error_message.py:56 — asserted `da sync` in materialized-not-yet error detail. Renamed to `agnes pull`. - tests/test_cli_error_render.py:71 — fixture data + assertion both carried `da fetch`. Updated to `agnes snapshot create`. Plus an actual content miss: docs/setup/claude_settings.json (a template shipped to operators) still installed `da sync` / `da sync --upload-only` hooks. The companion test file (tests/test_setup_hooks_template.py) was asserting that legacy state. Updated both: - Template hooks: `agnes pull --quiet` / `agnes push --quiet` - Test assertions + function name match the new commands	2026-05-04 20:08:07 +02:00
ZdenekSrotyr	3d58768143	fix: address Devin Review findings — incomplete renames + estimate guard 13 Devin findings across 10 files: 🔴 Critical: - app/api/v2_catalog.py:42 — `_fetch_hint` returns `da fetch` in /api/v2/catalog responses (user-visible in every catalog list) - cli/skills/agnes-data-querying.md — 11 stale `da fetch`/`da sync` refs in the bundled skill markdown - config/claude_md_template.txt:38 — referenced `agnes pull --docs-only` flag that does NOT exist in agnes pull (removed; spec only ships --quiet/--json/ --dry-run) 🟡 Important: - app/api/admin.py:252 — `da fetch` in bq_max_scan_bytes hint - cli/commands/auth.py:119 — `da sync` in import-token docstring (--help text) - cli/commands/tokens.py:48 — "Export it so `da` can use it" prose - ARCHITECTURE.md — 4 stale rows in CLI commands table - README.md — stale paragraphs for analysts (da sync, da analyst setup) 🚩 Substantive observations addressed: - app/api/query.py:249,302,489 — server-side error/help strings still said `da sync`/`da fetch` (returned in API responses to clients) - cli/commands/snapshot.py:235-241 — DuckDB existence guard incorrectly blocked `--estimate` (server-side dry-run that never opens local DB). Added test ensuring estimate path skips the guard. Skipped (intentionally historical): - app/api/admin.py:2377,2429,2437 — historical comments describing past manifest-vs-sync_state bug; past tense, accurate to keep as `da sync`.	2026-05-04 20:05:06 +02:00
ZdenekSrotyr	cd8dd9508c	docs(testing): add coverage honesty + prerequisites to E2E plan Adds three sections to the E2E plan: - "Coverage honesty" — explicit list of what the plan reveals (✅) and what it does NOT (❌, with reasoning per gap) - "Recommended additional coverage layers" — Tier 1/2/3 with realistic coverage estimates (~70 % / ~80 % / ~95 % / ~98 %) - "Prerequisites" table — what's needed on the VM, with fallback behavior per missing item The plan is intentionally not exhaustive. Goal is to surface the worst contract violations fast, not to prove correctness across all real-world environments. Documenting the gap explicitly so operators don't ship on a false sense of "tests passed = production-ready."	2026-05-04 19:59:47 +02:00
ZdenekSrotyr	5fa1c94b5c	fix(tests): smoke matrix asserts no-traceback only (per-command rc varies)	2026-05-04 19:47:18 +02:00
ZdenekSrotyr	5162c488bb	fix(tests): strip ANSI escapes from --help output before substring asserts Typer/rich emits ANSI styling in CI's --help output (e.g. `--metrics` becomes `-\x1b[0m\x1b[1;36m-metrics`), so literal substring asserts like `assert "--metrics" in result.output` fail. Locally the test runner auto-detects no-TTY and produces plain text, masking the issue. Add a small `_clean()` helper per test file that strips ANSI escape codes (`\x1b\[[0-9;]*m`) before substring containment checks.	2026-05-04 19:43:47 +02:00
ZdenekSrotyr	d311b07d5d	docs(testing): E2E verification plan for clean-analyst-bootstrap (PR #173 )	2026-05-04 19:41:50 +02:00
ZdenekSrotyr	5bffec641f	chore(lint): final ruff fixes	2026-05-04 19:32:52 +02:00
ZdenekSrotyr	675f8e1909	chore(lint): drop unused imports from new test files (ruff F401)	2026-05-04 19:32:31 +02:00
ZdenekSrotyr	ce108d4c6d	fix(schema): code-review follow-ups for `fac10b29` - _v23_to_v24_finalize: wrap row-update loop in BEGIN/COMMIT/ROLLBACK to match the project's transactional-finalizer pattern (compare _v12_to_v13_finalize, _v17_to_v18_finalize, _v18_to_v19_finalize). Pre-fix a process crash mid-loop left the schema_version unchanged but partially-converted rows persisted across restart — idempotent overall but inconsistent with project convention. - _v23_to_v24_finalize: re.sub replacement now uses a function-form (lambda) instead of an f-string, so any future project_id with a backslash sequence isn't misinterpreted as a group reference. - tests: add a Keboola-source materialized row case asserting the SELECT's source_type filter prevents non-BQ rewrites.	2026-05-04 19:32:24 +02:00
ZdenekSrotyr	20bb9efc0e	chore(lint): drop unused os import from init.py	2026-05-04 19:32:18 +02:00
ZdenekSrotyr	d44cace17c	docs(changelog): clean-analyst-bootstrap rewrite (BREAKING)	2026-05-04 19:25:38 +02:00
ZdenekSrotyr	cc84222216	docs: clean-install manual protocol in release checklist	2026-05-04 19:23:01 +02:00
ZdenekSrotyr	8403529fcd	test: clean-install integration suite (minimal/zero grants, force, pre-init)	2026-05-04 19:22:24 +02:00
ZdenekSrotyr	fac10b29e4	feat(schema): v24 — rewrite materialized BQ source_query to BQ-native Materialize now wraps admin SQL into bigquery_query('<billing>', '<inner>') which requires the inner SQL to be BigQuery-flavor (backticked identifiers, native function syntax). v24 migrates existing rows from DuckDB-flavor (bq."ds"."tbl") to (`<project>.ds.tbl`) using the configured BQ project. Idempotent on already-converted rows; logs a warning and skips when the project isn't configured (operator can configure + restart for retry).	2026-05-04 19:15:54 +02:00
ZdenekSrotyr	42e108ae5e	test: reader smoke matrix on zero-grants workspace	2026-05-04 19:15:39 +02:00
ZdenekSrotyr	a47c2be282	test: clean-bootstrap fixtures (fastapi_test_server, test_pat, zero_grants_workspace) Task 20: reusable pytest fixtures for the clean-bootstrap test suite. Tasks 21 and 22 (reader smoke matrix + init smoke matrix) consume them. - fastapi_test_server boots a real uvicorn subprocess against a tmp DATA_DIR, pre-seeded with admin@example.com (Admin group), analyst@example.com (Everyone group), and three tables (one per query_mode: local / materialized / remote). - web_session: cookie-authenticated httpx.Client for the admin user. - test_pat: minted JWT for the analyst with table grants on local + materialized. - test_pat_no_grants: same shape, zero resource_grants. - zero_grants_workspace: subprocess invocation of `agnes init` against the no-grants PAT; returns the bootstrapped workspace path. - NONEXISTENT_TABLE: module-level sentinel for the upcoming reader matrix. Subprocess uvicorn (mirrors tests/test_e2e_corporate_memory.py) instead of in-thread so DATA_DIR + module-level singletons in src.db don't bleed across tests. agnes CLI invoked via `python -m cli.main` instead of the .venv/bin/agnes shim, which depends on .pth file visibility that iCloud Drive intermittently re-hides on macOS.	2026-05-04 19:11:54 +02:00
ZdenekSrotyr	8d9323c99e	docs(claude-md): sweep surviving-verb da X references (Task 19 follow-up)	2026-05-04 19:01:27 +02:00
ZdenekSrotyr	3990fb0d85	docs(claude-md): rewrite verbs + paths for new CLI surface	2026-05-04 19:00:31 +02:00
ZdenekSrotyr	77eb3244c2	fix(config): uncomment materialize block in instance.yaml.example Spec-review note on `6c0846fd`: every other section in the example file with a default appears as live YAML, not commented. Match that convention so operators see the documented default rendered.	2026-05-04 18:59:56 +02:00
ZdenekSrotyr	7e1dd1adba	refactor(cli): drop sync/fetch/analyst/metrics; register init/pull/push (BREAKING)	2026-05-04 18:59:51 +02:00
ZdenekSrotyr	6c0846fd17	feat(config): expose materialize.lock_ttl_seconds in server-config New top-level 'materialize' section, single field (lock_ttl_seconds). Default 86400 (24h). Backs the file-lock TTL reclaim added in the per-table-mutex change. Editable via PUT /api/admin/server-config and the /admin/server-config UI.	2026-05-04 18:52:54 +02:00
ZdenekSrotyr	5551f12bb0	fix(cli): hint text 'Run: da sync' → 'Run: agnes pull'	2026-05-04 18:42:21 +02:00
ZdenekSrotyr	ff5da0af90	feat(cli): agnes admin metrics {import,export,validate}	2026-05-04 18:39:05 +02:00
ZdenekSrotyr	3871d5320a	feat(admin): server-generate materialized source_query, allow BQ backticks When admin registers a materialized BQ row with bucket+source_table but no source_query, the server generates 'SELECT * FROM `<project>.<ds>.<tbl>`' from instance.yaml's configured BQ project. Same fallback fires on PUT when flipping to materialized. The backtick rejection guard, which was appropriate for DuckDB-flavor source_query, is relaxed for materialized rows since the new wrapping path (Task 2) runs admin SQL through BQ jobs API which uses BQ-native syntax (backticks for dashed identifiers).	2026-05-04 18:37:27 +02:00
ZdenekSrotyr	42b8d0309b	feat(cli): agnes catalog --metrics replaces da metrics list/show	2026-05-04 18:33:17 +02:00
ZdenekSrotyr	8309141705	feat(cli): agnes snapshot create (folded from da fetch); friendly exit if no DuckDB	2026-05-04 18:32:30 +02:00
ZdenekSrotyr	5e1e8c4e14	feat(cli): agnes status = workspace state; old health check moves to agnes diagnose system	2026-05-04 18:29:15 +02:00
ZdenekSrotyr	b799aa534a	fix(cli): I1+I2 review — surface manifest_unauthorized + add 3 typed-error tests	2026-05-04 18:19:35 +02:00
ZdenekSrotyr	9b70ca3069	feat(cli): agnes init orchestrator + AGNES_WORKSPACE.md template	2026-05-04 18:15:08 +02:00
ZdenekSrotyr	c7c42de0f0	feat(sync): treat MaterializeInFlightError as 'skipped, in_flight' _run_materialized_pass distinguishes due-check skips from in-flight skips and never calls state.set_error for either. summary['skipped'] becomes a list of {table, reason} dicts; the end-of-pass log line breaks out the in_flight subcount. Hoists is_table_due to module-level import so test monkeypatching of the symbol intercepts the call (the previous local import made patches a no-op).	2026-05-04 18:11:38 +02:00
ZdenekSrotyr	60b6fbed97	feat(cli): agnes push command (extracted from sync --upload-only)	2026-05-04 18:09:57 +02:00
ZdenekSrotyr	7f89e1d594	feat(cli): agnes pull command (Typer wrapper around lib.pull.run_pull)	2026-05-04 18:07:28 +02:00
ZdenekSrotyr	15004126de	fix(cli-lib): I1+I2+I3 review fixes — token-precedence note, sync-state TODO, dry-run hermeticity test	2026-05-04 18:04:56 +02:00
ZdenekSrotyr	37da602060	feat(cli-lib): cli/lib/pull.py:run_pull primitive with lazy mkdir	2026-05-04 18:00:57 +02:00
ZdenekSrotyr	dc7e27082d	fix(bq-materialize): code-review follow-ups for `16eaf7a3` - extractor._try_acquire_file_lock: close fd and re-raise on non- BlockingIOError from fcntl.flock (read-only fs, unsupported flock, fd exhaustion). Pre-fix the fd leaked silently and the underlying OSError still propagated past the caller. - extractor: reorder module-level layout so logger is bound before the new lock-related helpers reference it. Deferred import of app.instance_config inside _get_lock_ttl_seconds documented inline. - extractor: comment _table_locks unbounded-by-design rationale. - tests: docstring + monkeypatch-target rationale for the two concurrency tests where the contract isn't obvious from the body.	2026-05-04 17:59:21 +02:00
ZdenekSrotyr	2b3d62fbf5	chore(.gitignore): allowlist cli/lib/ from generic lib/ rule (Task 7 follow-up)	2026-05-04 17:54:00 +02:00
ZdenekSrotyr	5aebeabf23	feat(cli-lib): cli/lib/hooks.py:install_claude_hooks	2026-05-04 17:53:20 +02:00
ZdenekSrotyr	d25d075ed2	docs(claude-md-template): rewrite verbs + paths for new CLI surface (Task 6) - Verb renames (da X -> agnes X for surviving verbs; legacy verbs already absent from this default template — admin overrides with legacy verbs are caught by Task 2's _LEGACY_STRINGS scan + Task 5's admin banner). - Path renames: data/parquet/ -> server/parquet/, data/duckdb/ -> user/duckdb/, data/metadata/ removed entirely (no longer exists per spec). - Drop user/artifacts/ from directory structure (spec workspace layout drops it; surviving paths: server/parquet/, user/duckdb/, user/snapshots/, user/sessions/). - Add AGNES_WORKSPACE.md pointer near top-of-template so analysts know where to find human-readable docs. Cleans Task 0.5's missed sweep on this file (was not in cli/ tree but is user-visible via /api/welcome). 81 claude_md/welcome_template tests pass.	2026-05-04 17:51:14 +02:00
ZdenekSrotyr	a92c624dba	feat(admin): yellow banner for legacy CLI verbs in workspace-prompt override	2026-05-04 17:46:50 +02:00
ZdenekSrotyr	8091620d33	fix(setup): role-aware clipboard render + JSON-escape ROLE injection Two Task 4 review fixes for app/web/templates/install.html: 1. JSON-escape `ROLE` JS const via `{{ role \| tojson }}` (defense in depth — removes the dependency on Jinja autoescape semantics for JS contexts; FastAPI's Literal validator already constrains role values). 2. Verify the analyst tile's clipboard payload is the analyst layout. The pre-existing role-aware plumbing (compute_default_agent_prompt threading role into setup_instructions_lines, picked up by the JS SETUP_INSTRUCTIONS_TEMPLATE array) was correct; adding regression tests that pin to the JS clipboard block specifically so a future inversion would fail loudly. Tests: analyst clipboard contains `agnes init` + `agnes catalog` and NOT `agnes auth import-token` / `agnes skills`; admin clipboard is the inverse. Plus an explicit assertion that ROLE is rendered via tojson.	2026-05-04 17:43:46 +02:00

... 3 4 5 6 7 ...

831 commits