qc_maf_filter
Identify markers with low minor-allele frequency or high missing rate, outputting filtering statistics to CSV without modifying the dataset.
Instructions
Report markers that would be filtered by MAF / missingness (no changes applied).
Computes per-marker minor-allele frequency and missing rate, and counts how many
markers are monomorphic, below maf_threshold, or above max_missing missing.
Writes marker_filter_stats.csv. This is a report only — it does not modify Gigwa.
For large sets pass method="allelematrix" + max_markers to sample server-side.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| method | No | Genotype source: 'vcf' (full export, cached) or 'allelematrix' (paged, server-side subset). | vcf |
| region | No | Restrict analysis to a genomic window: 'chrom' or 'chrom:start-end' (1-based). | |
| output_dir | No | Directory for the output CSV(s) (default ./gigwa_results/<module>/). | |
| max_markers | No | Cap the number of markers analysed (evenly-spaced subsample); omit to use all. | |
| max_missing | No | Maximum per-marker missing-data fraction (0-1) before a marker is flagged. | |
| maf_threshold | No | Minor-allele-frequency threshold below which markers are flagged. | |
| variant_set_db_id | Yes | BrAPI variantSetDbId identifying the run (MODULE§project§run); from list_variant_sets / list_content. |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |