The Oxford Nanopore Technologies Open Data project (ont-open-data) provides reference sequencing data from Oxford Nanopore sequencing devices. Data access is provided through the Registry of Open Data on AWS.
The Oxford Nanopore Technologies Open Data project aims to provide exemplar datasets from state of the art of Oxford Nanopore sequencing. Datasets are provided without restriction on availability or use to aid researchers, primarily in the field of genomics and transcriptomics. Our previous data releases have been used to aid the development of new algorithms for small variant and structural variant calling, and in the creation of new single-cell transcriptomics analyses. Data has also been used together with datasets from other sources for the complete telomere to telomere assembly of the Genome In A Bottle sample HG002.
The list below represents the most recent dataset or analysis of its class. As new datasets, basecallers, and analysis methods are developed this list will be updated.
Dataset | Flow Cell | Kit | Basecall model | EPI2ME workflows | Date |
---|---|---|---|---|---|
Genome in a Bottle Data Release 2025.01 | FLO-PRO114 | SQK-LSK114 | v5.0.0 SUP and HAC | wf-basecalling, wf-human-variation | 2025-01-26 |
Modified Base Best Practices and Benchmarking | FLO-PRO114 | SQK-LSK114 | v5.0.0 SUP and HAC | 2024-10-22 | |
Nanopore-only T2T assembly of a human genome | FLO-PRO114 | SQK-LSK114 | v4.3.0 SUP and HAC | 2024-05-22 | |
Telomere sequencing | FLO-PRO114M | SQK-LSK114 | v5.0.0 SUP and HAC | wf-teloseq | 2024-05-21 |
Updated Tumor Normal Pair Benchmark Dataset | FLO-PRO114 | SQK-LSK114 | v4.2.0 SUP and HAC | wf-basecalling, wf-somatic-variation | 2024-03-07 |
An experimental extremely high-accuracy, ultra-long sequencing kit | FLO-PRO114 | SQK-ULK114 | Bespoke dorado model | 2023-12-06 | |
Reduced Representation Methylation Sequencing (RRMS) | FLO-MIN106 | SQK-LSK110 | v3.3 | 2022-07-27 | |
CliveOME cfDNA dataset | FLO-PRO114 | SQK-LSK114 | v3.5.1 | 2022-05-18 |
Dataset | Flow Cell | Kit | Basecall model | EPI2ME workflows | Date |
---|---|---|---|---|---|
293T and jurkat 10x single cell transcriptomics | FLO-PRO114M | SQK-LSK114 | v5.0.0 SUP and HAC | wf-single-cell | 2025-02-17 |
Dataset | Flow Cell | Kit | Basecall model | EPI2ME workflows | Date |
---|---|---|---|---|---|
Plasmid Validation | FLO-MIN114 | SQK-RBK114 | v5.0.0 SUP and HAC | wf-clone-validation | 2025-04-28 |
Dataset | Flow Cell | Kit | Basecall model | EPI2ME workflows | Date |
---|---|---|---|---|---|
Community contributions to Oxford Nanopore Open Data project | FLO-MIN114 | SQK-LSK114 | RAW data only | 2023-01-23 |
Dataset | Flow Cell | Kit | Basecall model | EPI2ME workflows | Date |
---|---|---|---|---|---|
Zymo Fecal Metagenome | FLO-PRO114M | SQK-LSK114 | v5.0.0 SUP and HAC | wf-metagenomics | 2025-05-06 |
We strive to keep all analyses current, though some may fall behind the more current sequencing runs and basecalling results.
The datasets are freely available for download and could be used for:
The data deposited showcases sequences from a representative subset of sequencing chemistries. The datasets correspond to publicly-available reference samples including the widely available Genome In A Bottle human reference samples. Raw data are provided with metadata and scripts to describe sample and data provenance.
Historical datasets can also be found in the repository. Our previous data release blog posts are archived under the data-releases category.
All data is available from the Registry of Open Data on AWS. See the Oxford Nanopore Open Data Tutorials page for more information.
Related Links