VCF Processing
This section describes how to process VCF files using the MEA-Pipeline.
Query Notation
Notation |
Remark |
|---|---|
|
Select the variants based on a bed file or a position file that is
provided in the config.yaml with parameter |
|
Select the samples based on a headerless text file that is provided
in the config.yaml with parameter |
|
Set the minimum depth to [INT] for variant calls. |
|
Set alleles to het calls if the minimum alternate read count > INT and minimum alternate read ratio (out of total reads) > FLOAT. |
|
Set het alleles to OPT, where OPT can be REF (reference alleles), ALT (alternate alleles), MISS (missing alleles) or MAJ (major alleles). |
|
Annotate variants using snpEff software, with provided snpEff database
set in the config.yaml with parameter |
|
Annotate using the |
|
Select the variants that have sample missingness up to FLOAT value. |
|
Select the samples that have variant missingness up to FLOAT value. |
|
Select variants with minimum allele count (MAC) of INT. |
|
Select variants with minimum allele frequency (MAF) of FLOAT. |
|
Decompose complex variants using the |
|
Split multi-allelec variants into individual variants. |
|
Select single-nucleotide variants. |
|
Deduplicate variants that have identical position by selecting those with the lowest sample missingness. |
|
Select samples with Fws > FLOAT, as calculated by moimix. |