Add standalone DuckDB-based data profiler script
Zero-dependency profiler for Parquet/CSV files producing JSON profiles with column statistics, histograms, alerts, and sample data. Supports single files, directories, composite primary keys, and optional HTML report generation.
This commit is contained in:
parent
c77a6f6c2e
commit
468f56092b
1 changed files with 1271 additions and 0 deletions
1271
scripts/standalone_profiler.py
Normal file
1271
scripts/standalone_profiler.py
Normal file
File diff suppressed because it is too large
Load diff
Loading…
Reference in a new issue