NO-MISS Bacterial isolate dataset

Published in Data Releases
June 04, 2026
2 min read
NO-MISS Bacterial isolate dataset

This dataset describes the sequencing of 24 bacterial isolates from US FDA CFSAN in quadruplicate on a single PromethION flowcell using the latest Oxford Nanopore Technologies’ NO-MISS Isolate sequencing rapid barcoding kit and protocol, incorporating the universal bead-beating extraction method for robust DNA recovery across diverse bacterial species. The 24-strain dataset contains 15 distinct species representing the following genera: Bacillus, Cronobacter, Citrobacter, Enterobacter, Klebsiella, Listeria, Pseudomonas, Salmonella, Shigella, Staphylococcus, and Vibrio. This includes strains containing multiple plasmids and strains with a variety of antimicrobial resistance (AMR)-associated genes and mutations. Libraries were prepared using the Rapid Barcoding Kit 96 V14 (SQK-RBK114.96) and sequenced on a single PromethION 2 Integrated (P2i) device.

Raw signal data were basecalled on the P2i using the SUP@5.2.0 model. Resulting reads were analysed using the EPI2ME Bacterial & Fungal Genomes (wf-bacterial-genomes) workflow to assess genome assembly and characterisation performance across replicate samples. This dataset provides a benchmark for evaluating the reproducibility and throughput of the NO-MISS workflow when applied to bacterial isolate sequencing.

Sample

DetailDescription
Sample Namesample_1-sample_24
Organismbacteria
Molecule TypeDNA
Sample Typeliquid cultures
Technical replicates4
Flow Cell replicates1
Link to sample sourceNot publicly available

Preparation

Sample preparation was performed according to NO-MISS, the Nanopore-only Microbial Isolate Sequencing Solution using universal bead-beating and the 96 sample Rapid Barcoding Kit (RBK).

DetailDescription
ExtractionUniversal bead-beating
Library PrepNO-MISS
KitSQK-RBK114.96

Further preparation information such as sample storage suggestions can be found on the Oxford Nanopore Website.

Sequencing

Sequence data were generated using the following configuration:

DetailDescription
Flow CellFLO-PRO114M
DeviceP2i
ChemistryR10.4.1
Basecall Modelv5.2.0 SUP
MinKNOW Versionv25.11.0

Data Download

The dataset is available for anonymous download, without login, from a public Amazon Web Services S3 bucket. The bucket is part of the Open Data on AWS project enabling sharing and analysis of a wide range of data. The data can be downloaded with the AWS CLI command:

aws s3 sync --no-sign-request s3://ont-open-data/nomiss_96BC_P2I_SUP_2026 nomiss_96BC_P2I_SUP_2026

See the tutorials page for information on downloading the dataset.

You can also browse and download the files in your web browser courtesy of 42basepairs.

Folder nameSizeDescription
raw1.6 TBPOD5 files
basecalls74 GBBAM files
analysis157 GBWorkflow outputs

Analysis

The Bacterial & Fungal Genomes workflow was used for genome assembly and isolate characterisation. This analysis includes de novo assembly performed by Flye followed by polishing using medaka. Genome assemblies are subsequently annotated with Bakta, and plasmid contigs are identified using MOB-suite. As part of isolate analysis, the workflow also performs MLST based species typing, Salmonella serotyping using SeqSero2, genome identity based species assignment using Sourmash, and AMR gene annotation using ResFinder.

The workflow was run with the default parameters, additionally supplying the Flye assembler with target genome coverage at 50x (--flye_asm_coverage 50) and setting the genome size for coverage estimation to 10Mb (--flye_genome_size 10000000).

Analysis outputs are available. The analysis results are located in the S3 bucket under the prefix:

s3://ont-open-data/nomiss_96BC_P2I_SUP_2026/analysis

The analysis outputs include complete workflow results, including HTML reports for interactive data exploration and polished genome assemblies in FASTA format.

wf-bacterial-genomes-report.html provides an overall summary report containing read and assembly statistics, plasmid analysis, gene annotations, and a summary of AMR genes identified across all analysed samples.

The workflow also generates output files from individual analysis tools to support downstream analyses and reuse. These include annotated genome files in GFF3 and GBFF formats, detailed plasmid analysis results reporting identified chromosome and plasmid markers, and plasmid FASTA files. Additionally, the full ResFinder output provides detailed alignment statistics for any identified AMR genes.

  • Poster: Rapid whole-genome sequencing, de novo assembly, and characterisation of bacterial isolates - Learn how to perform rapid whole-genome sequencing, de novo assembly, and characterisation of bacterial isolates.

  • Poster: Rapid and scalable whole-genome microbial isolate sequencing - Another poster highlighting the end-to-end, scalable workflow for whole-genome sequencing of microbial isolates.

  • How to sequence microbial isolates with the NO-MISS workflow - Step-by-step guidance for sequencing microbial isolates using the NO-MISS workflow, from sample preparation through to data analysis.


Tags

#datasets

Share

Table Of Contents

1
Sample
2
Preparation
3
Sequencing
4
Data Download
5
Analysis
6
Related Materials

Related Posts

Metagenomic Assembly Sheds Light on Microbial Diversity in Compost
April 17, 2026
2 min

Quick Links

WorkflowsOpen DataContact

Social Media

© 2020 - 2026 Oxford Nanopore Technologies plc. All rights reserved. Registered Office: Gosling Building, Edmund Halley Road, Oxford Science Park, OX4 4DQ, UK | Registered No. 05386273 | VAT No 336942382. Oxford Nanopore Technologies, the Wheel icon, EPI2ME, Flongle, GridION, Metrichor, MinION, MinIT, MinKNOW, Plongle, PromethION, SmidgION, Ubik and VolTRAX are registered trademarks of Oxford Nanopore Technologies plc in various countries. Oxford Nanopore Technologies products are not intended for use for health assessment or to diagnose, treat, mitigate, cure, or prevent any disease or condition.