qc_heterozygosity
Calculates per-sample observed heterozygosity and flags outliers beyond a standard deviation threshold to detect contamination or inbreeding.
Instructions
Per-sample observed heterozygosity QC, flagging outliers.
High Ho relative to the cohort suggests contamination or off-types; very low
Ho suggests selfed/inbred or duplicated material. Flags samples more than
outlier_sd standard deviations from the mean. Writes
heterozygosity_samples.csv. For large sets pass method="allelematrix" +
max_markers to avoid a full VCF export.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| method | No | Genotype source: 'vcf' (full export, cached) or 'allelematrix' (paged, server-side subset). | vcf |
| region | No | Restrict analysis to a genomic window: 'chrom' or 'chrom:start-end' (1-based). | |
| outlier_sd | No | Flag points more than this many standard deviations from the mean. | |
| output_dir | No | Directory for the output CSV(s) (default ./gigwa_results/<module>/). | |
| max_markers | No | Cap the number of markers analysed (evenly-spaced subsample); omit to use all. | |
| variant_set_db_id | Yes | BrAPI variantSetDbId identifying the run (MODULE§project§run); from list_variant_sets / list_content. |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |