Skip to main content

bactopia_assembler

Tags: bacteria assembly hybrid shovill dragonflye unicycler illumina nanopore sample-scope

Assemble bacterial genomes using short read, long read, or hybrid strategies.

Automatically selects the appropriate assembler based on input read types:

  • Short Paired-End Reads: Uses Shovill (SKESA/SPAdes wrapper).
  • Short Single-End Reads: Uses Shovill (SKESA/SPAdes wrapper).
  • Long Reads: Uses Dragonflye (Flye/Miniasm wrapper).
  • Hybrid: Uses Unicycler or Dragonflye (with polishing).

Summary statistics for each assembly are generated using assembly-scan.

Uses named record input with explicit read slots (r1, r2, se, lr, assembly) as Path?.

When runtype is 'assembly' or 'assembly_accession' and --reassemble is not set,

the original assembly is used without re-assembly.

Inputs

record (
meta: Record,
r1: Path?,
r2: Path?,
se: Path?,
lr: Path?,
fna: Path?
)
FieldTypeDescription
metaRecordGroovy Record containing sample information
r1Path?Illumina R1 reads (paired-end)
r2Path?Illumina R2 reads (paired-end)
sePath?Single-end Illumina reads
lrPath?Long reads (ONT/PacBio) for long-read or hybrid assembly
fnaPath?Assembly file (FASTA) for assembly-based runtypes

Outputs

record (
meta: Record,
fna: Path?,
r1: Path?,
r2: Path?,
se: Path?,
lr: Path?,
tsv: Path?,
results: Set<Path>,
logs: Set<Path?>,
nf_logs: Set<Path>,
versions: Set<Path>
)
FieldTypeDescription
metaRecordSample information record
fnaPath?Assembled contigs in FASTA format
r1Path?Passthrough Illumina R1 reads
r2Path?Passthrough Illumina R2 reads
sePath?Passthrough single-end reads
lrPath?Passthrough long reads
tsvPath?Tab-delimited report of assembly statistics (N50, length, coverage)
resultsSet<Path>All output files to be published
logsSet<Path?>Optional program specific log files
nf_logsSet<Path>Nextflow-specific log files (e.g. .command.{begin
versionsSet<Path>A YAML formatted file with program versions

Parameters

Assembler Parameters

ParameterTypeDefaultDescription
--shovill_assemblerstringskesaAssembler to be used by Shovill (choices: skesa, megahit, spades, velvet)
--dragonflye_assemblerstringflyeAssembler to be used by Dragonflye (choices: flye, miniasm, raven)
--use_unicyclerbooleanUse unicycler for paired end assembly
--min_contig_leninteger500Minimum contig length <0=AUTO>
--min_contig_covinteger2Minimum contig coverage <0=AUTO>
--contig_namefmtstringFormat of contig FASTA IDs in 'printf' style
--shovill_optsstringExtra assembler options in quotes for Shovill
--shovill_kmersstringK-mers to use <blank=AUTO>
--dragonflye_optsstringExtra assembler options in quotes for Dragonflye
--trimbooleanEnable adaptor trimming
--no_stitchbooleanDisable read stitching for paired-end reads
--no_corrbooleanDisable post-assembly correction
--unicycler_modestringnormalBridging mode used by Unicycler (choices: conservative, normal, bold)
--min_component_sizeinteger1000Graph dead ends smaller than this size (bp) will be removed from the final graph
--min_dead_end_sizeinteger1000Graph dead ends smaller than this size (bp) will be removed from the final graph
--nanohqbooleanfalseFor Flye, use '--nano-hq' instead of --nano-raw
--medaka_modelstringThe model to use for Medaka polishing
--medaka_roundsinteger0The number of Medaka polishing rounds to conduct
--racon_roundsinteger1The number of Racon polishing rounds to conduct
--no_polishbooleanSkip the assembly polishing step
--no_miniasmbooleanSkip miniasm+Racon bridging
--no_rotatebooleanDo not rotate completed replicons to start at a standard gene
--reassemblebooleanfalseIf reads were simulated, they will be used to create a new assembly.
--polypolish_roundsinteger1Number of polishing rounds to conduct with Polypolish for short read polishing
--pilon_roundsinteger0Number of polishing rounds to conduct with Pilon for short read polishing

Used By

Subworkflows

Workflows

  • bactopia - Comprehensive bacterial analysis pipeline for complete genomic characterization.
  • staphopia - Comprehensive analysis pipeline for Staphylococcus aureus isolates.

Citations

If you use this in your analysis, please cite the following.

Source

View source on GitHub

Version

BACTOPIA_ASSEMBLER:
- bactopia-assembler: 1.0.5