Documentation for the methylphase toolkit
Use this minimal workflow to go from an indexed, mod-tagged BAM to phased methylation labels.
sample.mod.bam and sample.mod.bam.bai.This drives split-reads internally, merges haplotype and methylation features, selects the best latent variant model, and imputes read labels.
methylphase phase-variants \
--floria floria/contig/contig.haploset \
--bam sample.mod.bam \
--motif GATC_6mA_1 \
--out results/phase_variants
Key outputs land under results/phase_variants: split_reads/*, dataset.tsv, categories.toml, best_model.json, best_responsibilities.tsv, imputed.tsv, and imputed_labels.tsv.
split_reads/read_clustering_raw.tsvimputed_labels.tsv, summary.jsonfits/If you just need read clustering and per-read methylation tables:
methylphase extract \
--bam sample.mod.bam \
--motif CG_5 \
--output-dir results/extract
methylphase split-reads \
--bam sample.mod.bam \
--motif-file motifs.tsv \
--output-dir results/split \
--cluster-algorithm hdbscan \
--emit-fastq \
--threads 8