🌍 Finding Environmental Data Online 🌱

Key Sources for Ecosystem, Water, Climate, and Social Data

Author

Mike Johnson

Published

March 29, 2025

πŸ›οΈ U.S. Government Data Portals

1. Data.gov
- Central hub for open government data πŸ“Š
- Datasets from agencies like NOAA, NASA, and USGS
- APIs and download options

2. USGS ScienceBase
- Geospatial and environmental science data 🌎

3. USGS Earth Explorer
- Satellite imagery and aerial photos πŸ“Έ

4. National Map
- Topographic and land cover data πŸ—ΊοΈ

🌧️ Hydrology and Water Resources πŸ’§

1. USGS NWIS
- Streamflow, groundwater, and water quality data 🌊

2. NOAA National Water Model
- Hydrologic modeling and forecasts πŸ’¦

3. NASA Global Hydrology Resource Center
- Satellite-based hydrology data 🌐

4. EPA Water Data
- Water quality and watershed datasets 🌱

5. Internet of Water
- Water data infrastructure tools πŸŒŠπŸ”§

β˜€οΈ Climate and Weather Data

1. NOAA NCEI
- Historical weather and climate records 🌀️

2. NASA Earthdata
- Remote sensing data from NASA’s satellites πŸš€

3. PRISM Climate Group
- High-resolution climate datasets 🌞

4. GridMET
- Gridded meteorological data 🌧️

5. GHCN
- Historical station-based climate observations πŸ§‘β€πŸ”¬

🌱 Land Cover and Remote Sensing Data

1. USGS NLCD
- Land cover classification 🏞️

2. NASA MODIS & Landsat
- Satellite imagery for land cover and vegetation 🌳

3. Copernicus Open Access Hub
- Free access to Sentinel data 🌍

4. Google Earth Engine
- Cloud-based geospatial analysis πŸŒ₯️

5. Microsoft Planetary Computer
- Earth observation and AI-based analytics 🌐

🌍 GitHub - Awesome Public Datasets πŸ€–

Awesome Public Datasets
- A curated list of high-quality public datasets 🌟
- Covers climate, geospatial, hydrology, social sciences, economics, health, and more πŸ’‘
- Frequently updated with new sources ⏳
- Great for machine learning, research, and open data applications πŸš€

🌾 Colorado-Specific Environmental Data πŸ”οΈ

1. Colorado Information Marketplace
- Open datasets on environment, energy, and demographics πŸ“ˆ

2. Colorado Division of Water Resources
- Water rights and streamflow data πŸ’§

3. Colorado State Forest Service
- Forestry and wildfire data 🌲πŸ”₯

4. Colorado Department of Public Health & Environment
- Air, water, and public health data πŸ₯

🌐 UN & Global Environmental Data 🌏

1. UN Data Portal
- Global statistics on environment, economics, and population 🌍

2. FAO AQUASTAT
- Global water resources data 🌊

3. UN Environment Programme (UNEP) Data
- Climate, pollution, and biodiversity 🌿

4. World Bank Open Data
- Economic and environmental data πŸ’Ό

5. WHO Global Health Observatory
- Health and environmental risk factors πŸ₯

πŸ™οΈ Demographic and Social Data

πŸ—ΊοΈ U.S. Census Bureau
- Official demographics, housing, and income data πŸ“ŠπŸ‘

πŸ“‹ American Community Survey (ACS)
- Annual detailed social and economic statistics for communities πŸ“ˆ

⚠️ Social Vulnerability Index (SVI)
- Measures community vulnerability to disasters πŸŒͺ️🚨
- Amazing data, really crappy new disclosure

πŸš€ Opportunity Atlas
- Tracks economic mobility based on childhood conditions πŸ’΅πŸ“

🌍 EPA Environmental and Social Data

βš–οΈ EPA EJScreen
- Environmental justice mapping for disadvantaged communities πŸ“Œ
- Currently been taken down

🌎 EPA Envirofacts
- Access air, water, land, and hazardous waste data πŸ­πŸ’¨

🌬️ EPA National Emissions Inventory (NEI)
- Nationwide air pollution emissions database πŸ“ŠπŸŒ«οΈ

☣️ Toxic Release Inventory (TRI)
- Tracks chemical emissions from industrial facilities 🏭☒️

🌱 Environmental Resilience Tools
- Climate adaptation and resilience datasets 🌑️🌊

πŸ’» R Packages πŸ§‘β€πŸ’»

1. tidycensus – U.S. Census and ACS data πŸ“Š

2. dataRetrieval – USGS & EPA hydrology data (Worlds largest water API) πŸ’§

3. climateR – Climate datasets like PRISM & GridMET 🌑️

3. elevatr - Access Elevation Data from Various APIs

4. rnoaa – NOAA weather and climate data 🌦️

5. neonUtilities – NEON ecological data 🌱

6. riem – Allows to get weather data from Automated Surface Observing System (ASOS) stations (airports) in the whole world thanks to the Iowa Environment Mesonet websi 🌬️

7. nasapower - API client for NASA POWER global meteorology, surface solar energy and climatology data API β˜€οΈ

8. FedData - Automate Downloading Geospatial Data Available from Several Federated Data Sources 🌧️

9. GHCNr - fast and friendly interface with the Global Historical Climatology Network daily (GHCNd) database, which contains daily summaries of weather station data worldwide 🌧️

10. osmdata - Download and import of β€˜OpenStreetMap’ (β€˜OSM’) data 🚧

11. tidyUSDA: A Minimal Tool Set for Gathering USDA Quick Stat Data for Analysis and Visualization 🚜

🌐 Key Academic Databases

Almost all Journals now have an open data policy. In these, data is either part of the paper, or, archived at a central database:

1. Nature Scientific Data
- Open access journal dedicated to data, publishing descriptions of research datasets and articles on research data sharing from all areas of natural sciences, medicine, engineering and social sciences. πŸ“‘

2. Google Scholar
- A free, widely used academic search engine πŸ“š
- Access research papers, theses, and patents πŸ”

3. PubMed
- Biomedical literature and health-related datasets 🧬
- Useful for health, epidemiology, and environmental data πŸ“Š

4. IEEE Xplore
- Engineering, computer science, and technology research πŸ–₯️
- Often includes datasets related to environmental modeling 🌍

5. DataCite
- Repository for datasets linked to scholarly articles πŸ—‚οΈ
- Includes data across a variety of disciplines, including earth sciences 🌎

πŸ“‘ Research Gateways & Repositories

1. arXiv
- Preprints for physics, math, computer science, and environmental sciences πŸ§‘β€πŸ”¬

2. Dryad
- Open data repository with a focus on life sciences and ecology 🌱

3. OpenICPSR
- Social science and environmental datasets πŸ™οΈ

4. Zenodo
- Open-access repository with datasets and research outputs πŸ“Š

5. HydroShare
- Open data repository with a focus on hydrologic data resoruces πŸ’§

🌟 Searching for Datasets in Published Articles

  • Use keywords such as β€œopen data,” β€œenvironmental datasets,” β€œclimate data,” etc. πŸ”‘
  • Look for datasets linked directly in the article or listed in supplemental material πŸ“‚
  • Check for articles published in high-impact journals like Nature, Science, and PNAS for reliable datasets πŸ“°

Introduction to Google Data Commons

Google Data Commons is an open data platform that integrates public datasets across various domains, making them easily accessible through a structured knowledge graph. It allows users to explore, analyze, and visualize data without needing to download large datasets or manage complex data infrastructures.

Why Use Google Data Commons?

  • Access to Multiple Datasets: Google Data Commons aggregates datasets from sources like the U.S. Census Bureau, NOAA, World Bank, and NASA.

  • Simplified Data Exploration: Users can query datasets using the online interface or the R package without prior database management experience.

  • Interoperability: Combines socioeconomic, environmental, and hydrologic data for cross-domain analysis.

  • Visualization Tools: Provides built-in charts, tables, and maps for data representation.

How Google Data Commons Works

  • Knowledge Graph Structure: Data is organized as a knowledge graph, enabling relationships between datasets.

  • Querying via Online Interface: Users can explore datasets interactively through the website without needing programming knowledge.

  • Integration with Google Cloud: Allows seamless use in cloud-based analysis.

Getting Started with Google Data Commons

Visit Google Data Commons to browse datasets and visualizations.

Regrade Policy:

  • All daily exercises are eligible for a regrade with credit up to 80%.

  • Lab 1 is eligible for a regrade for up to 50% of the lost points

  • Remember, there is Extra Credit available in the form of a final in which you resubmit your Personal Website with all class labs and projects linked. (Get started now 🏎️)

Assigment:

  • Identify 5 data sources that you might be interested in.

  • They can come from this list but should be specific (e.g. β€œWater Quality Data about Bicarbonate” not β€œdataRetrivial”).

  • You can use these to jump start your group discussions in lab so try to have this done prior to lab (although the due date is Friday at 5pm)

  • Enjoy Spring Break 🌴 ⛰️ β˜€ !

Project Introduction

Final Project