nf-core/bacass
Simple bacterial assembly and annotation pipeline
Define where the pipeline should find input data and save output data.
Path to tab-separated sample sheet
string
The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.
string
Email address for completion summary.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Parameters for QC and trim short-reads
save trimmed files
boolean
save files that failed to pass trimming thresholds ending in *.fail.fastq.gz
boolean
Skip FastQC
boolean
Skip Nanoplot
boolean
Skip ToulligQC
boolean
Skip FastP
boolean
Specify true to not write any reads that pass trimming thresholds.This can be used to use fastp for the output report only.
boolean
Tool for ONT read filtering
string
porechop
Arguments for running Filtlong
string
Minimun length for Filtlong to work
integer
10
Path to Kraken2 database.
string
Path to the Kmerfinder bacteria database (ie: ‘/path_to/kmerfinder/bacteria/ or ‘/path_to/kmerfinder/bacteria.tar.gz’). Only taxonomic group ‘bacteria’ is allowed. See database installation (guidelines)[https://bitbucket.org/genomicepidemiology/kmerfinder_db/src/master/] for details. The latest version can be downloaded from the KmerFinder Database FTP site or directly during pipeline execution with. Example:
--kmerfinderdb 'ftp://ftp.cbs.dtu.dk/public/CGE/databases/KmerFinder/version/latest/bacteria.tar.gz'
(Note: Downloading the KmerFinder database (~30GB) via FTP can be time-consuming. It’s recommended to pre-download the database and provide the *.tar.gz file using this parameter instead of URL site.).
An older version is available on (Zenodo)[https://zenodo.org/records/13447056/files/20190108_kmerfinder_stable_dirs.tar.gz].
string
Reference FASTA file.
string
Reference GFF file.
string
Parameters for the assembly
The assembler to use for assembly. Use the appropriate assembler according to the chosen assembly_type. Refer to the README.md for further clarification.
string
Which type of assembly to perform.
string
Extra arguments for Unicycler
string
Allowed technologies for long read assembly.
string
This can be used to supply extra options to the Canu assembler. Will be ignored when other assemblers are used.
string
Options for running BUSCO
BUSCO lineage to use, you can use other BUSCO lineages, these are available in: https://busco.ezlab.org/list_of_lineages.html
string
bacteria_odb10
BUSCO mode to use
string
Path to BUSCO database
string,null
Path to BUSCO config file
string,null
Clean intermediate BUSCO files
boolean
Which assembly polishing method to use.
string
Parameters for the annotation
The annotation method to annotate the final assembly.
string
Extra arguments for prokka annotation tool.
string
FASTA file with protein sequences to be used as reference by PROKKA.
string
Path to Bakta database
string
Download Bakta database
boolean
Specifies a configuration file for the DFAST annotation method.
string
assets/test_config_dfast.py
Use the reference files (.fasta, .gff) from kmerfinder step for LIFTOFF.
boolean
Skip running Kraken2 classifier on reads.
boolean
Skip annotating the assembly with Prokka /DFAST.
boolean
Skip running PycoQC
on long read input.
boolean
Skip BUSCO
boolean
Skip polishing the long-read assembly with fast5 input. Will not affect short/hybrid assemblies.
boolean
Skip MultiQC
boolean
Parameters used to describe centralised config profiles. These should not be edited.
Git commit id for Institutional configs.
string
master
Base directory for Institutional configs.
string
https://raw.githubusercontent.com/nf-core/configs/master
Institutional config name.
string
Institutional config description.
string
Institutional config contact information.
string
Institutional config URL link.
string
Less common options for the pipeline, typically set in a config file.
Display version and exit.
boolean
Method used to save pipeline results to output directory.
string
MultiQC report title. Printed as page header, used for filename if not otherwise specified.
string
Email address for completion summary, only when pipeline fails.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Send plain-text email instead of HTML.
boolean
File size limit when attaching MultiQC reports to summary emails.
string
25.MB
^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$
Do not use coloured log outputs.
boolean
Incoming hook URL for messaging service
string
Custom config file to supply to MultiQC.
string
Custom logo file to supply to MultiQC. File name must also be set in the MultiQC config file
string
Custom MultiQC yaml file containing HTML including a methods description.
string
Boolean whether to validate parameters against the schema at runtime
boolean
true
Base URL or local path to location of pipeline test dataset files
string
https://raw.githubusercontent.com/nf-core/test-datasets/
Suffix to add to the trace report filename. Default is the date and time in the format yyyy-MM-dd_HH-mm-ss.
string