Open-source AI data analyst platform extracted from internal repo. Includes data sync engine, Keboola adapter, Flask web portal, server deployment scripts, and configuration templates.
1.5 KiB
1.5 KiB
Quick Start Guide
Prerequisites
- Python 3.10+
- SSH access to a Linux server (for production deployment)
- Data source credentials (Keboola token, BigQuery service account, etc.)
Local Development Setup
-
Clone the repository:
git clone <repo-url> cd ai-data-analyst -
Run the initialization script:
bash scripts/init.sh -
Configure your instance:
cp config/instance.yaml.example config/instance.yaml # Edit config/instance.yaml with your settings -
Set up environment variables:
# Edit .env with your data source credentials -
Create your data description:
cp config/data_description.md.example docs/data_description.md # Edit docs/data_description.md to define your tables -
Sync data:
source .venv/bin/activate python -m src.data_sync
Server Deployment
See DEPLOYMENT.md for full server setup instructions.
Using with Claude Code
Open the project in Claude Code. The CLAUDE.md file will guide the AI assistant through setup and analysis workflows.
Analyst Setup
- Visit your instance URL (e.g., https://data.yourcompany.com)
- Sign in with your company email
- Register your SSH key
- Follow the setup instructions to sync data locally
Analysis Workflow
- Sync latest data:
bash server/scripts/sync_data.sh - Open Claude Code in your project directory
- Ask Claude to analyze your data using DuckDB