This page documents how PollutionLookup.com collects, joins, and serves EPA pollution data. If you are citing this data in academic research, journalism, or policy work, please reference this page and the original federal sources described below.
The Superfund program, created by the Comprehensive Environmental Response, Compensation, and Liability Act of 1980 (CERCLA), gives EPA authority to clean up the nation's worst contaminated sites. The National Priorities List is the subset of sites eligible for long-term federal-led cleanup. Our database includes 1,380 sites drawn from EPA's Superfund Site Information portal and the SEMS (Superfund Enterprise Management System) public extracts.
Source: epa.gov/superfund, SEMS public data
TRI was established by the Emergency Planning and Community Right-to-Know Act of 1986. Facilities in covered industries that exceed reporting thresholds must annually disclose releases and off-site transfers of about 770 listed chemicals. Our database includes 23,331 facilities and 233,079 individual chemical-year release records pulled from the TRI Basic Data Files.
Source: TRI Basic Data Files
ECHO is EPA's flagship enforcement database. It covers facilities regulated under the Clean Air Act, Clean Water Act (NPDES permits), and RCRA (hazardous waste), and tracks inspections, formal enforcement actions, penalties, and compliance status. Our database includes 285,063 ECHO facilities pulled from the ECHO Exporter files.
Source: ECHO Data Downloads
EPA publishes these datasets separately and never fully joins them. We use two mechanisms to unify records so one search can return Superfund, TRI, and ECHO data together:
/api/nearby endpoint queries the raw lat/lng indexes rather than relying on the join tables.This is intentionally conservative: we do not merge records without high-confidence linkage. You may occasionally see the same physical site listed more than once if EPA has not reconciled its own identifiers.
state_summary, county_summary, and chemical_summary tables are built via SQL aggregation for fast per-page queries without in-memory caching.ECHO publishes updated data weekly. TRI is updated once per year (typically summer) with the prior calendar year's releases. Superfund information is updated as EPA changes site status. Our database is rebuilt from the latest sources on a regular basis. Most recent build: 2026-04-09.
If you use this data in research or publications, we suggest:
PollutionLookup.com. (2026). U.S. pollution lookup database: combined EPA Superfund, TRI, and ECHO enforcement data. Retrieved from https://pollutionlookup.com/methodology
We also encourage citing the original EPA source datasets directly — Superfund NPL, TRI Basic Data Files, and ECHO Exporter — alongside this page.
State-level CSV downloads are linked from each state page. Raw endpoints:
/api/download/{state}/echo.csv — e.g. /api/download/california/echo.csv/api/download/{state}/superfund.csv — e.g. /api/download/california/superfund.csvFor programmatic access see the JSON API documentation.