The Oxford Nanopore Technologies Open Data project (ont-open-data) provides reference sequencing data from Oxford Nanopore sequencing devices. Data access is provided through the Registry of Open Data on AWS.
The Oxford Nanopore Technologies Open Data project aims to provide exemplar datasets that showcase state-of-the-art Oxford Nanopore sequencing. Datasets are provided without restriction on availability or use to aid researchers, primarily in the fields of genomics and transcriptomics. Our previous data releases have been used to aid the development of new algorithms for small variant and structural variant calling, and in the creation of new single-cell transcriptomics analyses. Data has also been used together with datasets from other sources for the complete telomere-to-telomere assembly of the Genome in a Bottle sample HG002.
The lists below show the most recent dataset or analysis in each class. As new datasets, basecallers, and analysis methods are developed, these lists will be updated.
Dataset | Flow Cell | Kit | Basecall model | EPI2ME workflows | Date |
---|---|---|---|---|---|
Chromatin accessibility | FLO-PRO114M | SQK-LSK114 | v5.2.0 HAC | | 2025-06-13 |
Telomere sequencing | FLO-PRO114M | SQK-LSK114 | v5.0.0 SUP/HAC | wf-teloseq | 2025-05-21 |
Genome in a Bottle Data Release 2025.01 | FLO-PRO114 | SQK-LSK114 | v5.0.0 SUP/HAC | wf-basecalling, wf-human-variation | 2025-01-26 |
Modified Base Best Practices and Benchmarking | FLO-PRO114 | SQK-LSK114 | v5.0.0 SUP/HAC | | 2024-10-22 |
Nanopore-only T2T assembly of a human genome | FLO-PRO114 | SQK-LSK114 | v4.3.0 SUP/HAC | | 2024-05-22 |
Updated Tumor Normal Pair Benchmark Dataset | FLO-PRO114 | SQK-LSK114 | v4.2.0 SUP/HAC | wf-basecalling, wf-somatic-variation | 2024-03-07 |
Reduced Representation Methylation Sequencing (RRMS) | FLO-MIN106 | SQK-LSK110 | v3.3 | | 2022-07-27 |
CliveOME cfDNA dataset | FLO-PRO114 | SQK-LSK114 | v3.5.1 | | 2022-05-18 |

Dataset | Flow Cell | Kit | Basecall model | EPI2ME workflows | Date |
---|---|---|---|---|---|
Visium HD 3′ mouse brain | FLO-PRO114M | SQK-LSK114 | v5.2.0 SUP/HAC | wf-single-cell | 2025-06-25 |
293T and Jurkat 10x single cell transcriptomics | FLO-PRO114M | SQK-LSK114 | v5.0.0 SUP/HAC | wf-single-cell | 2025-02-17 |

Dataset | Flow Cell | Kit | Basecall model | EPI2ME workflows | Date |
---|---|---|---|---|---|
Plasmid Validation | FLO-MIN114 | SQK-RBK114 | v5.0.0 SUP/HAC | wf-clone-validation | 2025-04-28 |

Dataset | Flow Cell | Kit | Basecall model | EPI2ME workflows | Date |
---|---|---|---|---|---|
Community contributions to Oxford Nanopore Open Data project | FLO-MIN114 | SQK-LSK114 | RAW data only | | 2023-01-23 |

Dataset | Flow Cell | Kit | Basecall model | EPI2ME workflows | Date |
---|---|---|---|---|---|
Zymo Fecal Metagenome | FLO-PRO114M | SQK-LSK114 | v5.0.0 SUP/HAC | wf-metagenomics | 2025-05-06 |
We strive to keep all analyses current, though some may lag behind the most recent sequencing runs and basecalling results.
The datasets are freely available for download and could be used for:
The deposited data showcases sequences from a representative subset of sequencing chemistries. The datasets correspond to publicly available reference samples, including the widely available Genome in a Bottle human reference samples. Raw data are provided with metadata and scripts to describe sample and data provenance.
Historical datasets can also be found in the repository. Our previous data release blog posts are archived under the data-releases category.
All data is available from the Registry of Open Data on AWS. See the Oxford Nanopore Open Data Tutorials page for more information.
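As a rough illustration of programmatic access, the sketch below lists one level of the data bucket over the public S3 REST interface using only the Python standard library. It rests on several assumptions that are not stated on this page: that the bucket is named `ont-open-data`, that it lives in the `eu-west-1` region, and that it allows anonymous listing. The function names are hypothetical helpers written for this example, and the tutorials page remains the authoritative guide.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

# Assumptions (not taken from this page): bucket name and AWS region.
BUCKET = "ont-open-data"
REGION = "eu-west-1"

# XML namespace used by S3 ListObjectsV2 responses.
S3_NS = {"s3": "http://s3.amazonaws.com/doc/2006-03-01/"}


def build_list_url(prefix="", bucket=BUCKET, region=REGION):
    """Return an unsigned ListObjectsV2 URL for one 'directory' level."""
    query = urllib.parse.urlencode(
        {"list-type": "2", "delimiter": "/", "prefix": prefix}
    )
    return f"https://{bucket}.s3.{region}.amazonaws.com/?{query}"


def parse_listing(xml_bytes):
    """Split a ListObjectsV2 XML response into (sub-prefixes, object keys)."""
    root = ET.fromstring(xml_bytes)
    prefixes = [e.text for e in root.findall("s3:CommonPrefixes/s3:Prefix", S3_NS)]
    keys = [e.text for e in root.findall("s3:Contents/s3:Key", S3_NS)]
    return prefixes, keys


def list_prefix(prefix=""):
    """Fetch and parse the first page (at most 1000 entries) of a listing.

    Requires network access; pagination via continuation tokens is omitted.
    """
    with urllib.request.urlopen(build_list_url(prefix), timeout=30) as resp:
        return parse_listing(resp.read())


# Example (requires network access):
#   top_level, _ = list_prefix("")
#   print(top_level)
```

Calling `list_prefix("")` would return the top-level prefixes, each of which can be passed back in to descend one level. For bulk downloads, a dedicated client such as the AWS CLI is likely a better fit than this minimal listing helper.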
Related Links