Temporal Dataset of Emerging and Legacy PFAS Concentrations (ng/L) in 11 Inland Watersheds (Lakes, Rivers, Islands) and Median PFAS Concentrations Across 14 Seas and Oceans Over Multiple Time Periods

Published: 8 July 2025| Version 1 | DOI: 10.17632/zs77m26gb2.1
Contributors:
,

Description

This dataset presents a comprehensive compilation of Per- and Polyfluoroalkyl Substances (PFAS) concentration data (ng/L), a group of persistent environmental contaminants detected across diverse ecosystems, gathered from multiple freshwater and marine environments with a focus on distinguishing emerging and legacy compounds. The aim of this project was to systematically collect, organize, and differentiate PFAS concentration data from published literature to support future research, environmental monitoring, and policy development. The freshwater dataset covers PFAS concentrations from 11 inland watersheds, including lakes, rivers, and islands. Key water bodies featured include the Cape Fear River, Lake Ontario, Rhine River, Okinawa Island, Livingstone Island, Longyearbyen, Lake Victoria, Jiulong River, Ganga River, the Faroe Islands, and the Greater Melbourne Area. In addition to freshwater data, the dataset includes median PFAS concentrations from 14 seas and oceans, such as the North Atlantic Ocean, Arctic Ocean, North Sea, and others. By compiling PFAS data from both marine and freshwater systems across different time periods, this dataset provides a clearer picture of global PFAS distribution patterns. The distinction between emerging PFAS (such as GenX and other replacement compounds) and legacy PFAS (such as PFOA, PFOS) is explicitly defined and consistently maintained. The data were extracted from peer-reviewed studies, technical reports, and environmental monitoring programs. Each data point is accompanied by information on the location, water body type, PFAS classification, and where available, the sampling year(s). This structured dataset enables users to explore spatial and temporal PFAS trends and supports comparative analysis across diverse aquatic environments. By clearly distinguishing emerging and legacy compounds and presenting their concentration patterns over time, this dataset serves as a valuable foundation for ecological risk assessment, pollution management, and scientific research.

Files

Steps to reproduce

The data for this project were collected through a systematic review and extraction process from multiple peer-reviewed scientific articles, technical reports, and environmental monitoring datasets focused on Per- and Polyfluoroalkyl Substances (PFAS) in freshwater and marine environments. The goal was to compile PFAS concentration data (ng/L) across diverse geographic regions and time periods while clearly differentiating between emerging and legacy compounds. To identify relevant studies, I conducted comprehensive searches using academic databases including Web of Science, ScienceDirect, Scopus, and Google Scholar. Search terms included combinations of keywords such as “PFAS,” “perfluoroalkyl substances,” “emerging PFAS,” “legacy PFAS,” “water contamination,” “lakes,” “rivers,” “oceans,” “marine,” and specific site names like “Cape Fear River” or “North Atlantic Ocean.” I carefully screened the literature to select studies that reported quantitative PFAS concentration values in either freshwater bodies (such as lakes, rivers, islands) or marine environments (seas and oceans) with two or more clearly identified sampling year. The extraction process focused on collecting the following information: 1. Sampling location and water body name 2. Water body type (lake, river, island, sea, ocean) 3. Individual PFAS compound names 4. Classification as emerging (e.g., GenX, PFBS) or legacy (e.g., PFOA, PFOS) compounds 5. Concentration values (median, mean, or individual measurements) in ng/L 6. Year(s) of data collection or sample reporting To ensure consistency, I standardized all concentration values into the same unit (ng/L) and applied a clear definition for emerging and legacy PFAS compounds based on widely accepted classifications in the literature. This distinction was maintained consistently across all datasets. The data were manually compiled, verified, and organized using Microsoft Excel. Each entry was cross-checked against original study figures, tables, or supplementary materials to ensure accuracy. In cases where data were presented graphically, I used careful estimation or consulted supplementary datasets where possible to retrieve exact values. No laboratory instruments, reagents, or experimental procedures were used, as the project was entirely literature-based. To maintain transparency and reproducibility, we documented the source reference (including DOI or report links) for each dataset entry, allowing users to trace the information back to its original publication. The dataset also includes the temporal dimension by noting the year(s) of sampling to support trend analysis over time. This structured and transparent workflow allows other researchers to reproduce the dataset by following a similar literature review and extraction process, and it provides a reliable foundation for further analysis of PFAS distribution trends across both inland watersheds and global marine environments.

Institutions

University of North Carolina at Charlotte

Categories

Oceanography, Environmental Science, Environmental Chemistry, Ecosystem Toxicology, Pacific Ocean, Indian Ocean, Atlantic Ocean, Arctic Ocean, Water Quality, Coastal Water, Aquatic Toxicology, Ocean, Sea, Southern Ocean, Inland Waters of Africa, Inland Waters of Antarctica, Inland Waters of Asia, Inland Waters of Europe, Inland Waters of North America, Mediterranean Sea, Marine Ecotoxicology, Groundwater Contamination, North Sea, Per- and Polyfluoroalkyl Substances

Funding

U.S. National Science Foundation

2444939

Licence