BioMate automatically selects the optimal variant caller for your sequencing data, runs nf-core/sarek on AWS Batch, and returns annotated, audit-ready VCF results with Gold/Silver/Bronze QC grading — no bioinformatics infrastructure required.
The best variant caller depends on your sequencing technology, cohort size, and research context. BioMate reads your data characteristics and routes automatically — you never need to pick manually.
BioMate covers germline and somatic calling, targeted panels, and structural variant detection — all within nf-core/sarek.
SNP and indel calling from whole-genome sequencing. GATK4 GVCF mode for joint genotyping. ACMG/AMP variant classification with ClinVar annotation.
Tumor-normal matched or tumor-only somatic SNV and indel detection with GATK4 Mutect2. Somatic CNV calling with CNVKit or GATK4 ModelSegments.
Hybrid-capture and amplicon panel sequencing. BED-file-based target restriction, higher coverage QC thresholds, and panel-specific hard filters for small cohorts.
Copy number variant detection via GATK4 gCNV and Manta. Structural variants (deletions, inversions, translocations) via TIDDIT. Results visualized as circos plots.
nf-core/sarek is the peer-reviewed, community-maintained Nextflow pipeline for germline and somatic variant analysis — trusted by cancer genome atlases, biobanks, and clinical research programs worldwide.
BioMate orchestrates sarek on AWS Batch, handles reference genome download, supplies the correct dbSNP and Mills-Gold-Standard indel resources for BQSR, and streams step-by-step progress back to the UI in real time.
Yes. GATK4 HaplotypeCaller and DeepVariant both call SNPs and indels from WGS or targeted panel data. BioMate applies VQSR or hard filtering as appropriate based on cohort size and data type. Structural variants are detected by TIDDIT and Manta within the nf-core/sarek pipeline.
GRCh38 (hg38), GRCh37 (hg19), and GRCm39 (mouse mm39) are supported out of the box. Custom reference genomes can be uploaded for non-model organisms or assembly versions not in the standard index. BioMate auto-downloads the relevant dbSNP and gnomAD annotation databases for the selected genome.
BioMate routes somatic samples to GATK4 Mutect2 within nf-core/sarek, with matched tumor-normal pairs supported. Somatic VCFs are filtered with FilterMutectCalls and annotated with Funcotator or VEP. Tumor-only mode is available when no matched normal is available, with appropriate caveats noted in the QC report.
GATK VQSR tranche thresholds (99% sensitivity for SNPs, 99% for indels) and ENCODE-standard coverage requirements (30x minimum for WGS germline, 100x for somatic). Results are graded Gold (meets all thresholds), Silver (meets minimum coverage but borderline VQSR), or Bronze (below recommended coverage). Every threshold links to the GATK Best Practices documentation.
Describe your sequencing data in plain English. BioMate selects GATK4 or DeepVariant, runs nf-core/sarek, and returns annotated VCF results with QC grades.
Try free →