Indicates whether region is heterodisomic or isodisomic/deletion. See caveats
LOW_SIZE
Lower bound of the estimated size of the called region. Distance between first and last UPD site of the region
INF_SITES
Number of UPD informative sites in the called regions
SNPS
Total number of SNPs in the region
HET_HOM
Number of heterozygous and homozygous sites in the region. Used to indicate if region has LOH
OPP_SITES
Number of sites in the region that have the opposite parental origin. So no other number than 0 have been observed…
START_LOW
Earliest possible start positon of the UPD region (position of last anti-UPD site prior to the region)
END_HIGH
Last possible end position of the UPD region (position of first anti-UPD site prior to the region)
HIGH_SIZE
Upper bound of the estimated size of the called region. Distance between the surrounding anti-UPD sites.
Informative sites (upd sites)
Fourth field indicates what type of field:
Type
Description
ANTI_UPD
Site where proband has inherited alleles from both parents (e.g. AA x BB = AB)
UPD_MATERNAL_ORIGIN
Site where both alleles are inherited from the mother, or a fathers allele was deleted. (e.g. AA x BB = AA)
UPD_PATERNAL_ORIGIN
Site where both alleles are inherited from the father, or a mothers allele was deleted. (e.g. AA x BB = BB)
PB_HETEROZYGOUS
Heterozygous site in the proband (used only to call hetero/isodisomy)
PB_HOMOZYGOUS
Homozygous site in the proband (used only to call hetero/isodisomy)
UNINFORMATIVE
Various sites excluded from the analysis. Ignore these…
Caveats
The isodisomic/heterodisomic calling depends on estimating if there is a run of homozygousity in the called region. This is called using presence of heterozygous sites within the call, and should only be seen as a rough estimate for smaller calls. For more statistically sound detection of this, use e.g. bcftools roh to detect regions of homozygozity and combine the results.
Isodisomies cannot be differentiated from deletions using this software. Combine the calls with SV/CNV calls to determine if there is an overlapping deletion. If overlapping deletion, the “UPD” call can be used to detemine which parental allele was deleted.
UPD.py
Simple software to call UPD regions from germline exome/wgs trios.
Documentation
Installation
pip install --editable .orpython setup.py installThe only dependencies are click and coloredlogs.
Input VCF requirements
--vepRunning
Where PB_ID/MOTHER_ID/FATHER_ID are the sample IDs from the vcf header.
Optional parameters
af-tagaf-tagOutput
Called regions (udp regions)
BED file of all called regions fulfilling the –min-sites and –min-size filter criteria, including some annotation.
Example call:
Informative sites (upd sites)
Fourth field indicates what type of field:
Caveats
The isodisomic/heterodisomic calling depends on estimating if there is a run of homozygousity in the called region. This is called using presence of heterozygous sites within the call, and should only be seen as a rough estimate for smaller calls. For more statistically sound detection of this, use e.g. bcftools roh to detect regions of homozygozity and combine the results.
Isodisomies cannot be differentiated from deletions using this software. Combine the calls with SV/CNV calls to determine if there is an overlapping deletion. If overlapping deletion, the “UPD” call can be used to detemine which parental allele was deleted.