Genomics Data Lake



The Genomics Data Lake provides various public datasets that you can access for free and integrate into your genomics analysis workflows and applications. The datasets include genome sequences, variant info, and subject/sample metadata in BAM, FASTA, VCF, CSV file formats.

The Genomics Data Lake is hosted in the West US 2 and West Central US Azure region. Allocating compute resources in West US 2 and West Central US is recommended for affinity.

To Access the Genomic Data Lake: https://docs.microsoft.com/en-us/azure/open-datasets/dataset-genomics-data-lake

Leave a Reply

Your email address will not be published. Required fields are marked *