AWS Public Sector Blog

Tag: datasets

AWS branded background with text "Interactive access and visualization of geospatial data from the AWS Open Data Program"

Interactive access and visualization of geospatial data from the AWS Open Data Program

Open data is reshaping how we understand and respond to global challenges. From climate change to disaster recovery, the ability to access and analyze large-scale geospatial datasets is critical for scientific research, policy-making, and real-world applications. Leading the charge are several open data initiatives designed to lower access barriers and accelerate innovation: Open Data on AWS, the Amazon Sustainability Data Initiative (ASDI), and the Maxar Open Data Program on AWS. Together, these programs demonstrate the power of cloud-enabled open data to democratize access to geospatial information, promote global collaboration, and drive real-world impact. In this post, we demonstrate how to explore and visualize these datasets using interactive web applications and Jupyter notebooks.

AWS Branded Background with text "How to use life sciences data from AWS Open Data program in Amazon Bedrock"

How to use life sciences data from AWS Open Data program in Amazon Bedrock

In this post, we discuss how to use datasets in the Registry of Open Data on AWS with Amazon Bedrock Knowledge Bases. With Amazon Bedrock Knowledge Bases, you can give foundation models (FMs) and agents contextual information from private and public data sources to deliver more relevant, accurate, and customized responses.

AWS Branded Background with text "66 new or updated datasets available on the Registry of Open Data on AWS"

66 new or updated datasets available on the Registry of Open Data on AWS

Through the AWS Open Data Sponsorship Program, customers are making over 300 PB of high-value, cloud-optimized data available for public use. All publicly available datasets can be found in the Registry of Open Data on AWS and are now also discoverable on Exchange. This quarter, AWS released 66 new or updated datasets. Read this post to learn more.

AWS Branded Background with text "How to use data from AWS Open Data program in Amazon Bedrock"

How to use data from the AWS Open Data program in Amazon Bedrock

Many government agencies, like the National Oceanic and Atmospheric Administration (NOAA), participate in the AWS Open Data Sponsorship Program. In this post, we discuss how to use NOAA datasets in the Registry of Open Data on AWS using Amazon Bedrock Knowledge Bases.

AWS branded background design with text overlay that says "53 new or updated datasets available on the Registry of Open Data on AWS"

53 new or updated datasets available on the Registry of Open Data on AWS

The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on Amazon Web Services (AWS). AWS works with data providers to democratize access to data by making it available to the public for analysis on AWS; develop new cloud-based techniques, formats, and tools that lower the cost of working with data; and encourage the development of communities that benefit from access to shared datasets. The full list of publicly available datasets are on the Registry of Open Data on AWS and are now also discoverable on AWS Data Exchange. This quarter, AWS released 53 new or updated datasets and you can learn more about them in this post.

AWS branded background design with text overlay that says "39 new or updated datasets available on the Registry of Open Data on AWS"

39 new or updated datasets available on the Registry of Open Data on AWS

The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on Amazon Web Services (AWS). AWS works with data providers to democratize access to data by making it available to the public for analysis on AWS; develop new cloud-based techniques, formats, and tools that lower the cost of working with data; and encourage the development of communities that benefit from access to shared datasets. Through this program, customers are making over 100 petabytes (PB) of high-value, cloud-optimized data available for public use. This quarter, AWS released 39 new or updated datasets.

AWS branded background design with text overlay that says "Accelerating data processing for IRCC with Amazon EC2 instances"

Accelerating data processing for IRCC with Amazon EC2 instances

Immigration, Refugees, and Citizenship Canada (IRCC) faced a significant challenge: the need to perform complex fuzzy string matching across two different datasets. IRCC embarked on a transformative project that redefined its data processing capabilities and showcased the power of cloud computing in overcoming substantial data challenges. Instead of dealing with months of undifferentiated heavy lifting activities, IRCC successfully used Amazon Elastic Compute Cloud (Amazon EC2) instances to complete the work in a only few days. Read this post to learn more.

AWS branded background design with text overlay that says "21 new or updated datasets available on the Registry of Open Data on AWS"

21 new or updated datasets available on the Registry of Open Data on AWS

The Amazon Web Services (AWS) Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on AWS. Through this program, customers are making more than 100 petabytes (PB) of high-value, cloud-optimized data available for public use. This past quarter, AWS released 21 new or updated datasets. What will you build with these datasets?

AWS branded background design with text overlay that says "Brain Data Science Platform increases EEG accessibility with open data and research enabled by AWS"

Brain Data Science Platform increases EEG accessibility with open data and research enabled by AWS

About 4.5 million electroencephalogram (EEG) tests are performed in the US each year. That’s more than if every person in Oregon, Connecticut, or Iowa got an EEG. Because they provide insights into brain activity and not just structure, EEGs are one of the most common tests ordered by doctors to help make a diagnosis for people with brain problems. The Brain Data Science Platform (BDSP), hosted on Amazon Web Services (AWS), is increasing EEG accessibility through cooperative data sharing and research enabled by the cloud. Read this post to learn more.

AWS branded background design with text overlay that says "34 new or updated datasets available on the Registry of Open Data on AWS"

34 new or updated datasets available on the Registry of Open Data on AWS

The Amazon Web Services (AWS) Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on AWS. Through this program, customers are making more than 100 petabytes (PB) of high-value, cloud-optimized data available for public use. Read this blog post to learn about the 34 new or updated datasets that were released in the first quarter.