Profile Named Local IBGE Parquet Views
ibge_microdata_profile_parquet_views
Profile local Parquet files as DuckDB views for bounded exploratory statistics. Get row counts, column types, null counts, numeric summaries, and frequent values without custom SQL.
Instructions
Profile local Parquet files as named DuckDB views and return bounded exploratory statistics.
Use this after converting IBGE fixed-width microdata to Parquet and before writing custom SQL. The tool reports row counts, column types, null/non-null counts, optional numeric min/max/mean, frequent values, and optional sample rows. By default it profiles the first 25 columns to keep exploration bounded; pass columns for a precise subset.
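To make the reported statistics concrete, here is a minimal plain-Python sketch of a per-column profile over in-memory rows. It only illustrates what the fields mean and is not the tool's implementation, which runs in DuckDB over Parquet. The function name `profile_column`, the output keys, and the sample columns `UF` and `idade` are hypothetical.

```python
from collections import Counter
from numbers import Number


def profile_column(rows, column, top_k=5):
    """Summarize one column of a list of dict rows (illustrative only)."""
    values = [row.get(column) for row in rows]
    non_null = [v for v in values if v is not None]
    profile = {
        "column": column,
        "row_count": len(values),
        "null_count": len(values) - len(non_null),
        "non_null_count": len(non_null),
        # Most frequent non-null values, most common first.
        "top_values": Counter(non_null).most_common(top_k),
    }
    # Numeric summaries only apply when every non-null value is a number.
    is_numeric = non_null and all(
        isinstance(v, Number) and not isinstance(v, bool) for v in non_null
    )
    if is_numeric:
        profile["min"] = min(non_null)
        profile["max"] = max(non_null)
        profile["mean"] = sum(non_null) / len(non_null)
    return profile


# Made-up sample rows, not real IBGE data.
rows = [
    {"UF": "SP", "idade": 34},
    {"UF": "RJ", "idade": None},
    {"UF": "SP", "idade": 52},
]
print(profile_column(rows, "idade"))
print(profile_column(rows, "UF", top_k=1))
```

Text columns get counts and frequent values only, while numeric columns also get min, max, and mean, which mirrors the "optional numeric" wording above.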
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| topK | No | Number of most frequent values to return per profiled column. Capped at 50. | 5 |
| views | Yes | Named local Parquet views to profile. | |
| columns | No | Optional specific column names to profile. If omitted, the first maxColumns columns are profiled. | |
| maxColumns | No | Maximum columns to profile when columns is omitted. Capped at 200. | 25 |
| sampleRows | No | Number of sample rows to return per view. Capped at 100. | 0 |
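
As a rough illustration of how these parameters fit together, the sketch below builds an arguments object and applies the documented defaults and caps on the client side. Several things here are assumptions rather than documented behavior: the helper name `build_profile_args` is hypothetical, the shape of each `views` entry (a `name` plus a `path`) is a guess because the schema above does not spell it out, the file path is a placeholder, and the tool itself may reject rather than clamp out-of-range values.

```python
import json


def build_profile_args(views, columns=None, top_k=5, max_columns=25, sample_rows=0):
    """Build a tool-arguments dict, clamping values to the documented caps."""
    # Assumption: an empty views list is treated as missing the required field.
    if not views:
        raise ValueError("views is required and must not be empty")
    args = {
        "views": views,
        "topK": min(top_k, 50),              # documented cap: 50
        "maxColumns": min(max_columns, 200),  # documented cap: 200
        "sampleRows": min(sample_rows, 100),  # documented cap: 100
    }
    # Only send columns when a precise subset is wanted.
    if columns:
        args["columns"] = list(columns)
    return args


# Assumed view shape and placeholder path; adjust to the tool's real schema.
example_views = [{"name": "pnad", "path": "data/pnad.parquet"}]
print(json.dumps(build_profile_args(example_views, top_k=500, sample_rows=3), indent=2))
```

When `columns` is supplied, `maxColumns` no longer decides which columns are profiled, so a call that names its columns can leave `maxColumns` at its default.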