agnes-the-ai-analyst

Author	SHA1	Message	Date
ZdenekSrotyr	c5d67faad2	feat(memory): DuckDB FTS BM25 search for knowledge items (#121 ) (#326 ) * feat(memory): DuckDB FTS BM25 search for knowledge items (#121) Replaces `title ILIKE '%q%' OR content ILIKE '%q%'` ranked by insertion order with BM25 relevance ranking via the DuckDB `fts` extension. Czech queries like `cesky` match documents containing `česky` (`strip_accents=1` + `lower=1`). Architecture: - src/fts.py — ensure_fts_loaded / ensure_knowledge_fts_index helpers. The extension is per-connection (INSTALL persisted at engine level, LOAD per-conn). Both helpers are idempotent and soft-fail on unavailability with a logged WARNING. - Schema v47 (_v46_to_v47) — builds the initial BM25 index over knowledge_items(title, content) keyed by id. Migration is best-effort against ANY exception (not just duckdb.Error) so the schema bump cannot get stuck on v46 if a non-DuckDB error escapes the helper. - KnowledgeRepository.search — FTS-or-ILIKE dichotomy with execute- time fallback. Same filter surface (statuses / category / domain / source_type / personal / audience / dismissed) either way. ensure_fts_loaded() returning True only guarantees the extension is loadable, NOT that the index exists — migration soft-fail or a concurrent overwrite=1 rebuild's drop-then-create window leaves the extension loaded but the index missing. The BM25 execute is wrapped in try/except duckdb.Error → ILIKE retry so transient failures cannot 500 the /api/memory?search= endpoint. - KnowledgeRepository.count_items — mirrors the same FTS-or-ILIKE decision tree plus the execute-time fallback so the count always matches the paginated result set. - Per-mutation rebuild — create and title-or-content update rebuild the index via overwrite=1 PRAGMA. Status flips skip (token stream unchanged). - app/main.py lifespan rebuilds once at boot as a safety net for instances already on v47 across restarts. - bm25_score column shape: ILIKE fallback now selects `NULL AS bm25_score` so the result column set matches the FTS path. Consumers can read the score uniformly; absence of relevance ranking is signalled by the column being None everywhere, not missing. Tests in tests/test_knowledge_fts_search.py (9 tests): - BM25 multi-term match set + adversarial-review fix asserting higher-density doc ranks first (skipped if extension unavailable). - bm25_score column attached when extension available. - ILIKE fallback path on search + count_items via patched ensure_fts_loaded → False; bm25_score is None on this path. - Adversarial-review fix: search and count_items also fall back when the extension is loaded but the index is missing (simulated via drop_fts_index PRAGMA — the exact production failure mode the fallback guards against). - Index rebuild on create (new item searchable immediately). - Title update re-surfaces row under new term, drops old. - Czech-diacritic round-trip (cesky query → česky doc). Pinned schema-version asserts bumped 46 → 47 (test_db_schema_version, test_home_stats, test_schema_v42_migration, test_schema_v46_migration). Closes #121. * release: 0.54.20 — Corporate Memory BM25 search + All-Items bulk-edit batch bar	2026-05-15 20:10:59 +02:00
ZdenekSrotyr	9e948abc9c	release(0.54.18): Curated Memory restructure + per-user Dismiss + bundled adversarial-review fixes (#316/#320/#322) (#324 ) * feat(web): Curated Memory restructure + per-user Dismiss + filter-state utility Squashed from cvrysanek/zsrotyr's 4-commit PR branch + rebased onto current main + CHANGELOG bullets spliced into [Unreleased] (preserves existing #316/#320/#322 entries that landed on main since the branch was authored). Routes + access: - /corporate-memory now user-facing (get_current_user), in primary nav next to "Data Packages" — same gate as /api/memory/. - /admin/corporate-memory is the new admin review queue location (was /corporate-memory/admin); reached via Admin dropdown. Template renamed: corporate_memory_admin.html → admin_corporate_memory.html. Visual chrome: - Both pages migrate to shared _page_hero.html blue hero band. Per-user Dismiss (new feature, schema v46): - knowledge_item_user_dismissed(user_id, item_id, dismissed_at) + index. - POST /api/memory/{id}/dismiss + DELETE (idempotent). - Mandatory items can never be dismissed — enforced at 2 layers. - GET /api/memory: hide_dismissed=false default + dismissed_by_me flag. - GET /api/memory/bundle: always excludes dismissed for the caller. - UI: Dismiss/Undismiss button per item (hidden for mandatory), gray-out + line-through for dismissed rows, Hide-dismissed toggle. Admin edit modal: - Category as <select> + "Add new category…" reveal. - Audience as <select> with (unset)/all/group:<name> from RBAC. - Tags: full tag-input widget (pills, ×-remove, Backspace pop, Enter/comma to add, ↑/↓ typeahead from EXISTING_TAGS). Bulk-edit modal pickers (closes #128): - Move-to-category / Add-tag: <select> + add-new. - Set-audience: <select> (no more typo-able 'gourp:eng'). - Remove-tag: closed-set picker. FilterState utility: - app/web/static/js/filter-state.js — save/load/clear/bindInputs for per-page localStorage filter state. Adopted on /corporate-memory. E2E verified live on a real VM through the API + browser flow. release: 0.54.18 — Curated Memory restructure + 4 adversarial-review fixes Bundles together: - #316 fix(store): surface review failures + harden publish gate (BREAKING fail-CLOSED guardrail, override v2+ promote, restore guard, retry/rescan staged-bundle, banner widening, LLM truncation retry) - #320 fix(store): C2 bundle export RBAC + H2 per-entity write lock + H3 update_status compare-and-swap with bg_verdict_skipped audit - #322 fix(store): M1 prompt sentinel filename escape + M2 atomic promote_to_version helper + L1 admin forensic download per-version - #324 Curated Memory restructure + per-user Dismiss + FilterState utility Bump from 0.54.17 → 0.54.18 (patch — pre-1.0 policy: every cycle is patch).	2026-05-15 18:51:05 +02:00
ZdenekSrotyr	3e19caa975	fix(security): RBAC filter uses stable user_id instead of mutable email local-part (#293 ) (#299 ) * fix(security): RBAC filter for agnes_sessions matches both email local-part and user_id The upload API (POST /api/upload/sessions) stores session files under user_sessions/{user_id}/ (UUID), while the session collector uses the OS username (email local-part). The session pipeline writes the directory name verbatim into usage_session_summary.username, so the column can contain either value depending on the ingestion path. The RBAC filter in build_filter_clause previously only matched the email local-part, missing sessions uploaded via the API. The fix adds an OR condition so non-admin users see rows where username matches either their email local-part or their user_id. Closes #293 Co-Authored-By: zdenek.srotyr <zdenek.srotyr@keboola.com> * fix(security): RBAC filter uses stable user_id instead of mutable email local-part Closes #293 Previous fix used OR condition matching both email local-part and user_id in the username column. This was fragile: email changes would break filtering. This commit introduces a dedicated user_id column populated by the session pipeline via resolve_user_id(), and switches the RBAC filter to use it exclusively. Changes: - Schema v45: add user_id column to usage_session_summary and usage_events - UsageProcessor: accept and store user_id in both tables - runner.py: resolve_user_id() maps directory name to users.id UUID (exact match for UUID dirs, email LIKE for local-part dirs) - INTERNAL_TABLES: agnes_sessions/agnes_telemetry filter on user_id column - build_filter_clause: simplified to WHERE user_id = '<uuid>' (no OR) - me.py/admin_user_sessions.py: query by user_id OR username for backward compatibility during transition - USAGE_PROCESSOR_VERSION bumped 2→3 to trigger reprocessing/backfill - Tests updated: 27 pass including new email-change resilience test Co-Authored-By: zdenek.srotyr <zdenek.srotyr@keboola.com> * fix(tests): bump schema version assertions 44→45 Co-Authored-By: zdenek.srotyr <zdenek.srotyr@keboola.com> * fix(docs): correct resolve_user_id docstring, add TypeError comment Co-Authored-By: zdenek.srotyr <zdenek.srotyr@keboola.com> * fix(security): address review — backward-compat OR, LIKE escaping, narrower TypeError Co-Authored-By: zdenek.srotyr <zdenek.srotyr@keboola.com> * fix(security): address code review — eliminate TypeError hack, add resolve_user_id tests Co-Authored-By: zdenek.srotyr <zdenek.srotyr@keboola.com> * fix(db): create user_id indexes in _v44_to_v45, not _SYSTEM_SCHEMA _SYSTEM_SCHEMA runs before the migration ladder. On an upgrade from v42/v43/v44, usage_events / usage_session_summary already exist without the user_id column (CREATE TABLE IF NOT EXISTS is a no-op), so the CREATE INDEX ... (user_id) lines in _SYSTEM_SCHEMA failed to bind and aborted _ensure_schema — the app would not start post-upgrade. Move the index creation to _v44_to_v45, which ADDs the column first. Same pattern as the v41 audit_log indices. * fix(usage): bump USAGE_PROCESSOR_VERSION 3→4 for user_id backfill #303 shipped USAGE_PROCESSOR_VERSION=3 (release 0.54.12) for its <command-name> slash extraction. This PR's 2→3 bump collided with it on rebase, so the reprocess loop would not re-trigger to backfill the new user_id column on deployments already running v3. Bump to 4. * release: 0.54.13 — RBAC filter uses stable user_id (#293) --------- Co-authored-by: Devin AI <158243242+devin-ai-integration[bot]@users.noreply.github.com>	2026-05-14 14:12:54 +00:00
Vojtech	37ad39c8a3	feat(home): status frame on /home (operator-gated, onboarded-only) (#297 ) * feat(home): status frame on /home — last sync, sessions, prompts, tokens, projects Adds the homepage status frame: a 5-card row above the install-hero / offboard-strip on /home showing the calling user's Last sync (their last `agnes pull`), Sessions, Prompts, Tokens used, and Projects worked on, with a 24h/7d pill toggle. Backed by `GET /api/me/home-stats?window=` (one DuckDB CTE joining `users` + `usage_session_summary` + `usage_events`) and SSR'd from the same `compute_home_stats` helper on initial paint so there's no spinner. The window toggle is the only JS-driven path. Side surfaces: - `GET /api/sync/manifest` now stamps `users.last_pull_at` so `agnes pull` (and the Claude Code SessionStart hook that wraps it) imprints the analyst's last sync time for the new card. - `usage_session_summary` gains four BIGINT token counters (input_tokens, output_tokens, cache_read_tokens, cache_creation_tokens) summed from JSONL `message.usage.` per assistant turn. - `USAGE_PROCESSOR_VERSION` bumps 1 → 2 so the session-pipeline reprocess loop invalidates stale summaries and backfills tokens on the next tick. Schema migration v43 → v44 is idempotent ALTERs (last_pull_at + 4 token columns) — fresh installs receive them from `_SYSTEM_SCHEMA`, upgrade path runs `_v43_to_v44`. Defaults (NULL / 0) backfill existing rows cleanly. 9 new tests in tests/test_home_stats.py cover the migration, endpoint shapes (24h/7d/unknown/empty/missing-user), and the manifest-side last_pull_at bump. docs(CHANGELOG): homepage status frame entries under [Unreleased] The post-rebase release-cut now belongs to whichever PR lands next after main rolled to 0.54.9. This PR logs its bullets under [Unreleased] (Added: homepage status frame, per-user pull tracking, token counters; Changed: schema v43 → v44 migration) so they ride out with the next release-cut. * fix(tests): bump test_schema_v42_migration asserts to v44 CI failed because tests/test_schema_v42_migration.py hardcoded `assert SCHEMA_VERSION == 43` and `assert v == 43` after init. v44 (homepage stats frame backing columns) was introduced in the preceding feat commit; this aligns the existing v42-era migration tests with the new schema version. * feat(home): gate status frame on operator flag + user.onboarded Two gates on the homepage status frame: 1. Operator master switch — `get_home_status_frame_visibility()` in app/instance_config.py mirrors the existing `get_home_automode_visibility()` shape: env var `AGNES_HOME_SHOW_STATUS_FRAME` > yaml `instance.home.show_status_frame` > default `True`. Cautious-rollout instances can disable the frame without forking; the yaml example documents both knobs. 2. Onboarded gate — the template only renders the frame when the caller's `users.onboarded` is true. First-day users see a clean install-hero before all-zero stat cards; the frame appears automatically on the next render after `agnes init` POSTs `/api/me/onboarded`. Router skips the `compute_home_stats` DB read entirely when either gate is closed; `home_stats` arrives at the template as None in that branch and the `{% if %}` shortcuts the include. Why both gates: PostHog feature flags evaluated and rejected — this codebase uses PostHog for analytics capture only, not feature gating; adding a per-user feature_enabled() call on the /home critical path would couple the homepage render to a remote eval and still require an admin master switch. The onboarded gate is a UX coherence rule layered on top of the operator switch, not an A/B test signal. 3 new tests in test_home_stats.py cover the env-var resolution (falsey values + default-true). The yaml example gets a `home:` block documenting both `show_automode` (pre-existing flag, was undocumented in the example) and `show_status_frame`.	2026-05-14 09:28:47 +00:00

4 commits