agnes-the-ai-analyst/docs/setup
Petr d180b2014e Step 28: Remote query architecture for local+remote table JOINs
Add src/remote_query.py CLI module enabling the AI agent to run SQL
queries spanning local Parquet tables and remote BigQuery tables in a
single DuckDB session on the server. Two-phase protocol: BQ sub-queries
(--register-bq) fetch filtered/aggregated data, then DuckDB SQL (--sql)
joins everything.

Safety: COUNT(*) pre-check, memory estimation (2GB cap), row limits
(500K per BQ sub-query, 100K final result).

Changes:
- New src/remote_query.py with CLI, BQ registration, output formatting
- Add bq_entity_type field to TableConfig (view vs table routing)
- Extract create_local_views() from duckdb_manager.py for reuse
- Update claude_md_template.txt with remote query agent instructions
- Update example configs with remote_query section and docs
- 52 new tests (42 remote_query + 10 bq_entity_type), all passing
2026-03-21 11:39:15 +01:00
..
bootstrap.yaml Move server venv setup to bootstrap, remove cross-platform pip freeze sync 2026-03-15 00:59:45 +01:00
claude_md_template.txt Step 28: Remote query architecture for local+remote table JOINs 2026-03-21 11:39:15 +01:00
claude_settings.json Initial commit: OSS data distribution platform 2026-03-08 23:31:28 +01:00