Petr c56905d34f Initial commit: OSS data distribution platform

Open-source AI data analyst platform extracted from internal repo.
Includes data sync engine, Keboola adapter, Flask web portal,
server deployment scripts, and configuration templates.

2026-03-08 23:31:28 +01:00

1.5 KiB

Raw Blame History

Quick Start Guide

Prerequisites

Python 3.10+
SSH access to a Linux server (for production deployment)
Data source credentials (Keboola token, BigQuery service account, etc.)

Local Development Setup

Clone the repository:

git clone <repo-url>
cd ai-data-analyst

Run the initialization script:
```
bash scripts/init.sh
```

Configure your instance:

cp config/instance.yaml.example config/instance.yaml
# Edit config/instance.yaml with your settings

Set up environment variables:

# Edit .env with your data source credentials

Create your data description:

cp config/data_description.md.example docs/data_description.md
# Edit docs/data_description.md to define your tables

Sync data:

source .venv/bin/activate
python -m src.data_sync

Server Deployment

See DEPLOYMENT.md for full server setup instructions.

Using with Claude Code

Open the project in Claude Code. The CLAUDE.md file will guide the AI assistant through setup and analysis workflows.

Analyst Setup

Visit your instance URL (e.g., https://data.yourcompany.com)
Sign in with your company email
Register your SSH key
Follow the setup instructions to sync data locally

Analysis Workflow

Sync latest data: bash server/scripts/sync_data.sh
Open Claude Code in your project directory
Ask Claude to analyze your data using DuckDB

1.5 KiB Raw Blame History