Published February 28, 2022 | Version v4
Dataset Open

HPRC PanGenie results

Creators

  • 1. Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Düsseldorf

Description

HPRC PanGenie results

Input VCF used for PanGenie for the HPRC experiments as well as genotyping results, statistics and filters computed. All experiments are based on the Minigraph-Cactus (MC) graph.

Experiments were run at Heinrich-Heine University Düsseldorf by Jana Ebler ([email protected]). Pipelines used to produce these results are here: http://bitbucket.org/jana_ebler/hprc-experiments/src/master/genotyping-experiments/

 

How to run PanGenie on the MC variants

We ran PanGenie using the file "cactus_filtered_ids.vcf.gz" as input (contained in this repository). It was produced from the file "hprc-v1.0-mc-grch38.vcf.gz" generated from the MC graph using vg decompose. The output VCF generated by PanGenie can be converted into a bi-allelic VCF containing a single record for each (nested) variant allele, i.e. after decomposing large bubbles into their nested variants using the script: https://bitbucket.org/jana_ebler/hprc-experiments/src/master/genotyping-experiments/workflow/scripts/convert-to-biallelic.py

# run PanGenie, produces genotyped VCF "pangenie_genotyping.vcf"
PanGenie -i <input-reads> -v cactus_filtered_ids.vcf -r <reference-genome> -o pangenie -j 24 -t 24

# decompose bubbles and produce a bi-allelic VCF with genotypes for each (nested) allele
cat pangenie_genotyping.vcf | python3 convert-to-biallelic.py cactus_filtered_ids_biallelic.vcf > pangenie_genotyping_biallelic.vcf

 

 

Files

README-HPRC-PanGenie.txt

Files (46.0 GB)

Name Size Download all
md5:bda48e8f41d8db1a8d36f894a3706667
40.3 GB Download
md5:340b55a56fa1dde2d4f9da408114844e
2.7 MB Download
md5:882538e67d3251492493cdd8a09f267a
393.4 MB Download
md5:8e96a349b8341c18090de1fcc271a5ab
1.6 GB Download
md5:abf8f3114e7328c1ce374ec67adaaaa1
2.1 MB Download
md5:1ca2cd03316077152394c603fc87293f
1.3 GB Download
md5:18ee443dfb1bdc21d883390d0876d80f
2.1 MB Download
md5:c46c04180c892ff9cde2c65448079bb6
749.7 MB Download
md5:18562b8bce6bf60faa8428b660fce915
1.8 MB Download
md5:dd40f1b611688d54a87293f51f1e45b7
2.2 kB Preview Download
md5:ea6c5c441f6b0a4c0bf6112e9a049998
1.3 kB Download
md5:fb57857fea5d42c31e31733f39ecaf36
1.7 GB Download