For more information contact us at bina.rd@roche.com
Latest version: https://github.com/bioinform/metasv/archive/0.5.2.tar.gz
For other versions, see "releases". https://github.com/bioinform/metasv/releases
The following Python packages must be installed:
OpenMP
support using make OMP=no
.MetaSV is a python package and can be installed using pip
. To install type pip install https://github.com/bioinform/metasv/archive/0.5.2.tar.gz
. The current version
of MetaSV is 0.5.2. In general, the install source would be https://github.com/bioinform/metasv/archive/version.tar.gz
Type run_metasv.py -h
for help.
cd test
./test_run.sh
run_metasv.py --reference reference.fasta --boost_ins --breakdancer_native breakdancer.out --breakseq_native breakseq.gff --cnvnator_native cnvnator.call --pindel_native pindel_D pindel_LI pindel_SI pindel_TD pindel_INV --sample HG005 --bam alignments.bam --spades SPAdes/spades.py --age AGE/age_align --num_threads 15 --workdir work --outdir out --min_ins_support 2 --max_ins_intervals 500000 --isize_mean 500 --isize_sd 150
run_metasv.py --reference reference.fasta --breakdancer_native breakdancer.out --breakseq_native breakseq.gff --cnvnator_native cnvnator.call --pindel_native pindel_D pindel_LI pindel_SI pindel_TD pindel_INV --outdir out --sample NA12878 --disable_assembly --filter_gaps --keep_standard_contigs
run_metasv.py --reference reference.fasta --boost_ins --sample HG005 --bam alignments.bam --spades SPAdes/spades.py --age AGE/age_align --num_threads 15 --workdir work --outdir out --min_ins_support 2 --max_ins_intervals 500000 --isize_mean 500 --isize_sd 150
run_metasv.py --reference reference.fasta --breakdancer_native breakdancer.out --breakseq_native breakseq.gff --cnvnator_native cnvnator.call --pindel_native pindel_D --sample HG005 --bam alignments.bam --spades SPAdes/spades.py --age AGE/age_align --num_threads 15 --workdir work --outdir out --isize_mean 500 --isize_sd 150 --svs_to_assemble DEL --svs_to_report DEL
Option | Definition | Use |
---|---|---|
--sample STRING |
Sample name (default: None) | -- |
--reference STRING |
Reference file | -- |
--gaps STRING |
Gap bed file (default: None) | -- |
--boost_ins |
Use soft-clips for improving insertion detection (default: False) | Enable for soft-clip analysis |
--disable_assembly |
Disable assembly (default: False) | -- |
--bam STRING |
BAM file (default: None) | Include for assembly and genotyping |
--svs_to_assemble {DEL,INS} [{DEL,INS} ...] |
SVs to assemble (default: set(['DEL', 'INS'])) | Include for assembly |
--spades STRING |
Path to SPAdes executable (default: None) | Include for assembly | --age STRING |
Path to AGE executable (default: None) | Include for assembly | --num_threads INT |
Number of threads to use (default: 1) | -- |
There are different factors that contribute in balancing sensitivity/specificity trade-off:
NOTE: In the following tables:
INC: Increasing/Enabling will increase sensitivity (and thus decrease specificity)
DEC: Decreasing/Disabling will increase sensitivity (and thus decrease specificity)
Option | Definition | Impact |
---|---|---|
--filter_gaps |
Filter out gaps (default: False) | DEC |
--keep_standard_contigs |
Keep only the major contigs + MT (default: False) | DEC |
Option | Definition | Impact |
---|---|---|
--isize_mean NUM |
Insert size mean (default: 350.0) | - |
--isize_sd NUM |
Insert size standard deviation (default: 50.0) | - |
Option | Definition | Impact |
---|---|---|
--wiggle INT |
Wiggle for interval overlap (default: 100) | - |
--inswiggle INT |
Wiggle for insertions, overides wiggle (default: 100) | - |
--minsvlen INT |
Minimum length acceptable to be an SV (default: 50) | DEC |
--maxsvlen INT |
Maximum length SV to report (default: 1000000) | INC |
--overlap_ratio NUM |
Reciprocal overlap ratio (default: 0.5) | - |
Option | Definition | Impact |
---|---|---|
--min_avg_base_qual NUM |
Minimum average base quality (default: 20) | DEC |
--min_mapq NUM |
Minimum MAPQ (default: 5) | DEC |
--min_soft_clip INT |
Minimum soft-clip (default: 20) | DEC |
--max_nm INT |
Maximum number of edits (default: 10) | INC |
--min_matches INT |
Mininum number of matches (default: 50) | DEC |
--min_ins_support INT |
Minimum read support for calling insertions using soft-clips (default: 5) | DEC |
--min_ins_support_frac NUM |
Minimum fraction of reads supporting insertion using soft-clips (default: 0) | DEC |
--max_ins_intervals INT |
Maximum number of insertion intervals to generate (default: 10000) | INC |
Option | Definition | Impact |
---|---|---|
--extraction_max_read_pairs INT |
Maximum number of pairs to extract for assembly (default: 10000) | INC |
--spades_max_interval_size INT |
Maximum SV length for assembly (default: 50000) | INC |
Option | Definition | Impact |
---|---|---|
--gt_window INT |
Window for genotyping (default: 100) | - |
--gt_normal_frac NUM |
Min. fraction of reads supporting reference for genotyping (default: 0.05) | - |
Abyzov,A. et al. (2015) Analysis of deletion breakpoints from 1,092 humans reveals details of mutation mechanisms. Nat. Commun., 6, 7256.
Lam,H.Y. et al. (2010) Nucleotide-resolution analysis of structural variants using BreakSeq and a breakpoint library. Nat. Biotechnol., 28, 47–55.