agnes-the-ai-analyst/tests
Petr d180b2014e Step 28: Remote query architecture for local+remote table JOINs
Add src/remote_query.py CLI module enabling the AI agent to run SQL
queries spanning local Parquet tables and remote BigQuery tables in a
single DuckDB session on the server. Two-phase protocol: BQ sub-queries
(--register-bq) fetch filtered/aggregated data, then DuckDB SQL (--sql)
joins everything.

Safety: COUNT(*) pre-check, memory estimation (2GB cap), row limits
(500K per BQ sub-query, 100K final result).

Changes:
- New src/remote_query.py with CLI, BQ registration, output formatting
- Add bq_entity_type field to TableConfig (view vs table routing)
- Extract create_local_views() from duckdb_manager.py for reuse
- Update claude_md_template.txt with remote query agent instructions
- Update example configs with remote_query section and docs
- 52 new tests (42 remote_query + 10 bq_entity_type), all passing
2026-03-21 11:39:15 +01:00
..
__init__.py Initial commit: OSS data distribution platform 2026-03-08 23:31:28 +01:00
test_account_service.py OSS cleanup: remove internal references, harden deployment, add config env interpolation 2026-03-09 07:59:57 +01:00
test_auto_profiling.py Add self-service data onboarding system 2026-03-09 14:25:37 +01:00
test_bigquery_adapter.py Add per-partition streaming sync and hybrid query architecture 2026-03-12 13:20:41 +01:00
test_bigquery_client.py Add per-partition streaming sync and hybrid query architecture 2026-03-12 13:20:41 +01:00
test_catalog_export.py Filter catalog metrics by configurable tag (e.g., AIAgent.FoundryAI) 2026-03-16 22:03:53 +01:00
test_config_bq_entity_type.py Step 28: Remote query architecture for local+remote table JOINs 2026-03-21 11:39:15 +01:00
test_config_query_mode.py Add per-partition streaming sync and hybrid query architecture 2026-03-12 13:20:41 +01:00
test_config_sync_schedule.py Fix sync_schedule validation to accept multi-time daily format 2026-03-17 13:21:14 +01:00
test_data_sync_query_mode.py Add per-partition streaming sync and hybrid query architecture 2026-03-12 13:20:41 +01:00
test_deploy_guard.py OSS cleanup: remove internal references, harden deployment, add config env interpolation 2026-03-09 07:59:57 +01:00
test_duckdb_manager.py Add per-partition streaming sync and hybrid query architecture 2026-03-12 13:20:41 +01:00
test_email_auth.py Add multi-domain support and full-email username generation 2026-03-10 10:50:01 +01:00
test_generate_sample_data.py Add --format parquet using project's ParquetManager 2026-03-10 21:46:20 +01:00
test_metrics.py Phase 2: Replace demo YAML metrics with OpenMetadata catalog data 2026-03-12 15:10:42 +01:00
test_openmetadata_client.py Implement OpenMetadata catalog integration (Phase 1) 2026-03-12 14:07:13 +01:00
test_openmetadata_enricher.py Add catalog export: generate YAML metrics and tables from OpenMetadata 2026-03-15 01:15:30 +01:00
test_openmetadata_transformer.py Filter catalog metrics by configurable tag (e.g., AIAgent.FoundryAI) 2026-03-16 22:03:53 +01:00
test_profiler.py Initial commit: OSS data distribution platform 2026-03-08 23:31:28 +01:00
test_remote_query.py Step 28: Remote query architecture for local+remote table JOINs 2026-03-21 11:39:15 +01:00
test_scheduler.py Support multiple daily sync times (e.g., "daily 07:00,13:00,18:00") 2026-03-16 23:09:48 +01:00
test_sync_data.py Extract Jira into connectors/jira module 2026-03-09 11:17:50 +01:00
test_table_registry.py Add self-service data onboarding system 2026-03-09 14:25:37 +01:00
test_user_service.py Initial commit: OSS data distribution platform 2026-03-08 23:31:28 +01:00
test_username_generation.py Fix: correct OpenMetadata catalog URL path and add debug logging 2026-03-12 14:34:12 +01:00