Meta-AMR-Plus is a NextFlow pipeline designed for analyzing long-read Nanopore metagenomic data. It detects antimicrobial resistance (AMR) genes, identifies plasmids, and performs taxonomic classification using multiple tools and reference databases. The pipeline processes sequencing data efficiently and generates standardized output tables, making it easier to compare results across different tools and datasets.
- Read QC (
FastQC) - Performs read pre-processing
- Adapter clipping and merging (
porechop) - Low complexity and quality filtering (
Filtlong) - Host-read removal (
Minimap2)
- Generates statistics for host-read removal using
Samtools. - Performs optional genome assembly with
Flyeand assesses assembly quality usingQUAST. - Optionally polishes the assembly using
Racon. - Optionally downloads databases for AMR detection tools and PlasmidFinder if not provided by the user.
- Performs AMR detection on assembled data using one or more of:
ResFinderAMRFinderPlusCARD-RGIAbricateandResFinderon unassembled reads.
- Optionally performs
hAMRonizationto generate a comprehensive report integrating results from multiple AMR detection tools. - Performs plasmid detection assembled data using:
PlasmidFinderPlasClass
- Performs taxonomic classification using
CentrifugeandKaiju. - Generates visualizations for Centrifuge and Kaiju results using
Krona. - Presents quality control and summary statistics for preprocessing, assembly, taxonomic classification, host-read removal, and AMR detection using ResFinder, AMRFinderPlus, CARD-RGI, and Abricate (
MultiQC).
First, prepare a samplesheet with your input data that looks as follows:
samplesheet.csv:
| sample | fastq_1 |
|---|---|
| sg17 | sample17.fastq.gz |
| sg18 | sample18.fastq.gz |
| sg19 | sample19.fastq.gz |
Each row represents a fastq file (single-end).
Additionally, you will need a database sheet that looks as follows:
samplesheet.csv:
| tool | db_name | db_params | db_path |
|---|---|---|---|
| kaiju | kaiju_db | /<path>/<to>/kaiju_db |
|
| centrifuge | centrifuge_db | /<path>/<to>/centrifuge_database |
The db_path column should point to directories or .tar.gz archives containing the databases required for the selected tools. For Kaiju and Centrifuge, pre-existing databases must be provided. For other tools, you can either provide the database path, or the pipeline will automatically generate the required database if not supplied.
Note
Abricate and PlasClass come with built-in databases, so no external database input is required for them.
Now, you can run the pipeline using:
nextflow run ./main.nf \
-profile docker \
--input samplesheet.csv \
--databases database.csv \
--outdir results \
--perform_assembly \
--perform_polish_assembly \
--run_resfinder \
--download_resfinder_db \To see the results of an example test run with a full size dataset refer to the results tab on the nf-core website pipeline page.
For more details about the output files and reports, please refer to the output documentation.
Meta-AMR-Plus was originally written by Leila Nasirzadeh.
We thank the following people for their contributions to the development of this pipeline: \