This web application allows the public to search and find information about contaminated sites caused by a release of hazardous materials or hazardous waste products and print out results.
This is a list of sites from the “SASU (Site Assessment and Support Unit) Case Management” System, which is also known as the LUST Database. Sites on this list include, but are not limited to: sites with known releases from regulated Underground Storage Tank (UST) systems, sites with releases from non-regulated UST system sources, releases from non-UST related sources, sites with potential releases for which impacts or sources of release were not confirmed at time of entry into the system. The “SASU Case Management” system was initially developed in the late 1980’s/early 1990’s to track sites that the Site Assessment and Support Unit (SASU) worked on. At the time SASU contained the LUST Program. Overtime the “SASU Case Management” System evolved to primarily track releases from regulated Leaking Underground Storage Tanks (LUST) and is currently maintained by the DEEP’s Corrective Action Unit (CAU). This dataset does not replace a full review of other environmental datasets available through CT Open Data, HazConnect (https://connecticut.hazconnect.com/listincidentpublic.aspx), files publicly available either using the DEEP on-line Public Portal (https://filings.deep.ct.gov/DEEPDocumentSearchPortal/) and/or at the DEEP Record Center File Room (https://portal.ct.gov/DEEP/About/Environmental-Quality-Records-Records-Center-File-Room). We know there may be errors in the data although we strive to minimize them. Examples of errors may include: misspelled or incomplete addresses and/or missing data. A separate dataset ( https://data.ct.gov/Environment-and-Natural-Resources/List-of-Contaminated-or-Potentially-Contaminated-S/u76p-weqj/data ) is also published for: List of Contaminated or Potentially Contaminated Sites - Remediation Division. The two database systems are maintained by different Divisions within the agency. There may be sites in both databases due to an overlap in responsibilities of the two Divisions.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Federal Superfund sites are some of the most polluted in the United States. This dataset contains a multifaceted view of Superfunds, including free-form text descriptions, geography, demographics and socioeconomics.
The core data was scraped from the National Priorities List (NPL) provided by the U.S. Environmental Protection Agency (EPA). This table provides basic information such as site name, site score, date added, and links to a site description and current status. Apache Tika was used to extract text from the site description pdfs. The addresses were scraped from site status pages, and used to geocode to latitude and longitude and Census block group. The block group assignment was used to join with the Census Bureau's planning database, a rich source of nationwide demographic and socioeconomic data. The full source code used to generate the data can be found here, on github.
I have provided three separate downloads to explore:
Some caveats:
I would like to thank the EPA and the Census Bureau for making such detailed information publicly available. For relevant academic work, please see Burwell-Naney et al. (2013) and references, both to and therein.
Please let me know if you have any suggestions for improving the dataset!
This asset includes Superfund site-specific sampling information including location of samples, types of samples, and analytical chemistry characteristics of samples. Information is associated with a particular contaminated sate as there is no national database of this information.
Geoheritage is a generic term which lies at the intersection of science, society, and sustainability and is applied to significant geologic features and landforms that have scientific, educational, cultural, economic, and aesthetic value. Many geologic sites have enriched society through the geoheritage values: scientific research and education, cultural significance, economic opportunities, and aesthetic appeal. The Geoheritage Sites of the Nation geodatabase (GDB) provides an initial inventory of geoheritage sites to showcase the geodiversity and natural heritage throughout the United States (U.S.) and its territories. Sites included in this inventory were selected from compiled geosite references of in situ geologic features that have been previously identified as geologic points of interest on Federal land designations, U.S. National Park Service Geoheritage Site Examples on Public Lands, Unofficial National Register of Geoheritage Sites, stratotype inventory, National Natural Landmarks, Bureau of Land Management National Conservation Areas, U.S. Forest Service Geologic Features of our National Forests and Grasslands, and the International Union for Geological Sciences First 100 Geological Heritage Sites. For this initial inventory, one geoheritage site on Federal public lands within each U.S. state and territory were selected to provide a geographically distributed dataset that highlights the geodiversity of the nation (n=55). Sites were further characterized by the scientific significance of the geologic phenomena, educational opportunity for both formal and informal learning, cultural connections to the landscape, historic and present-day examples of economic opportunities, and the aesthetic features within our Federal public lands. The GDB also includes direct linkages to the USGS National Geologic Map Database to allow users to discover spatially related geologic mapping publications and information products within the expansive database. Connecting geoheritage values to scientifically significant geologic features can inspire the public to better understand the landforms and their related geologic processes to foster a deeper understanding of the role geology plays in society.
Below is an explanation of the data along with some features that are available on this map application (description is also provided in the "Getting Started" widget of the application).A variety of different colored circles appear throughout the map. They represent sites that are associated with the following programs:1) Department of Toxic Substances Control (DTSC) sites:a) Historical Inactive - Identifies sites from an older database that are non-active sites where, through a Preliminary Endangerment Assessment (PEA) or other evaluation, DTSC has determined that a removal or remedial action or further extensive investigation is required.b) School Cleanup - Identifies proposed and existing school sites that are being evaluated by DTSC for possible hazardous materials contamination. School sites are further defined as “Cleanup”, where remedial actions are or have occurred.c) School Evaluation - Identifies proposed and existing school sites that are being evaluated by DTSC for possible hazardous materials contamination. School sites are further defined as “Evaluation”, where further investigation is needed.d) Corrective Action - Investigation or cleanup activities at Resource Conservation and Recovery Act (RCRA) or state-only hazardous waste facilities (that were required to obtain a permit or have received a hazardous waste facility permit from DTSC or U.S. EPA).e) State Response - Identifies confirmed release sites where DTSC is involved in remediation, either in a lead or oversight capacity. These confirmed release sites are generally high-priority and high potential risk.f) Evaluation - Identifies suspected, but unconfirmed, contaminated sites that need or have gone through a limited investigation and assessment process.g) Tiered Permit - A corrective action cleanup project on a hazardous waste facility that either was eligible to treat or permitted to treat waste under the Tiered Permitting system.2) State Water Board or DTSC sites:a) Leaking Underground Storage Tank (LUST) Cleanup - Includes all Underground Storage Tank (UST) sites that have had an unauthorized release (i.e. leak or spill) of a hazardous substance, usually fuel hydrocarbons, and are being (or have been) cleaned up. These sites are regulated under the State Water Board's UST Cleanup Program and/or similar programs conducted by each of the nine Regional Water Boards or Local Oversight Programs.b) Cleanup Program - Includes all "non-federally owned" sites that are regulated under the State Water Board's Site Cleanup Program and/or similar programs conducted by each of the nine Regional Water Boards. Cleanup Program Sites are also commonly referred to as "Site Cleanup Program sites".c) Voluntary Cleanup - Identifies sites with either confirmed or unconfirmed releases, and the project proponents have requested that the State Water Board or DTSC oversee evaluation, investigation, and/or cleanup activities and have agreed to provide coverage for the lead agency’s costs.3) Othera) Permitted Tanks - The "Permitted Tanks" data set includes Facilities that are associated with permitted underground storage tanks from the California Environmental Reporting System (CERS) database. The CERS data consists of current and recently closed permitted underground storage tank (UST) facilities information provided to CERS by Certified Unified Program Agencies (CUPAs).*Note: Underground Storage Tank Cleanup and Cleanup Program project records are pulled from the State Water Board's GeoTracker database. The Permitted Tanks information was obtained from California EPA’s California Environmental Reporting System (CERS) database. All other project records were obtained from DTSC's EnviroStor database. Program descriptions come from DTSC’s EnviroStor Glossary of Terms and the State Water Board’s GeoTracker Site/Facility Type Definitions. The information associated with these records was last updated in the application on 4/24/2023.Boundary layers in this map application include: a) CA Senate District Boundaries Outline (2011)b) CA Senate District Boundaries (2011)c) CA Senate District Boundaries Outline (2022)d) CA Senate District Boundaries (2022)e) CA Assembly District Boundaries Outline (2022)f) CA Assembly District Boundaries (2022)g) CalEnviroScreen 3.0h) CalEnviroScreen 4.0<!--How to use the map: 1) Click on the map layers icon to view all the map’s available layers. Check or uncheck the checkbox next to each layer to display or hide it.<!--2) Click on the legend icon to view what the elements in each layer represents. Note: a layer must be turned on in order for it to be displayed on the legend.<!--3) Click on the district filters icon if you are interested in filtering for sites within a specified district. In this feature, select the type of district and then enter the district number within the “Type in a Value” box. Once ready, click the “Apply” button.<!--4) Click on the select icon (found on the top left corner of the map) to select multiple sites within a boundary that you set. Note: if you would like to view one site record, you can click on the site’s icon on the map.<!--5) Click the arrow icon (located at the bottom of the map) to view the attribute table of the sites and district boundary layers. Note: this feature will automatically update based on what is displayed on the map or selected.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Each point is a representation of an U.S. Department of Energy Office of Legacy Management (LM) site. The coordinates of each site point are either the location of a site monument, the centroid of the site boundary, or a general location of the site. This dataset is automatic synchronized with the LM environmental database on a weekly basis. The location and attributes of each site are maintained within the LM environmental database.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The RIDB is an API accessible database of US Gov't Recreation site data contributed by twelve participating agencies in the Recreation One Stop program. This data is used on Recreation.gov and is available to the public for various other uses.This record was taken from the USDA Enterprise Data Inventory that feeds into the https://data.gov catalog. Data for this record includes the following resources: ISO-19139 metadata ArcGIS Hub Dataset ArcGIS GeoService For complete information, please visit https://data.gov.
It is a database that details the interactions of extruded, unpaired RNA nucleotide bases. It presents and classifies the protein binding pockets that accommodate them, and also allows the recognition of similar protein binding patters involved in interactions with different RNA molecules. Given an unbound structure of a target protein, it allows the prediction of its RNA nucleotide binding sites. The goal of this database is to describe, classify, and predict the interactions between protein binding sites and single-stranded RNA bases. Specifically, RsiteDB describes the protein binding pockets that accommodate extruded nucleotides not involved in RNA base pairing. RsiteDB has two modes of operation. Analysis and classification of protein-RNA interactions: Given a protein-RNA complex RsiteDB analyzes its nucleotide and dinucleotide binding sites. It details the properties of the protein binding pockets that accommodate these extruded nucleotides and presents a list of proteins with similar binding pockets. These proteins may have a totally different overall sequences and structural folds. RsiteDB details and visualizes the features shared by all the binding sites classified to the same cluster. Prediction of RNA dinucleotide binding sites: Given a target, potentially unbound, protein structure we search its surface for regions similar to the created 3-D consensus binding patterns of RNA dinucleotides. The recognized regions are predicted to serve as binding sites. Using leave-one-out tests, the success rate of these predictions was estimated to be about 80%. It must be noted that currently we do not aim to predict whether a protein can bind RNA; rather, given an unbound RNA binding protein, our goal is to predict its binding sites and their modes of interaction. In addition, due to a low number of single nucleotide clusters, currently, we do not use them for the prediction.
Provides information on all items designated or under consideration for designation (i.e. calendared) by the New York City Landmarks Preservation Commission (LPC). This dataset contains information on all items designated or under consideration for designation (i.e. calendared) by the New York City Landmarks Preservation Commission (LPC). The dataset contains records for each individual, scenic, or interior landmark, as well as properties or sites located within the boundaries of historic districts. Please note that points in this dataset represent individual buildings in addition to non-building sites (such as vacant lots or monuments) regulated by LPC. It is possible for a single property to have multiple designations (such as individual and interior designations, or individual and historic district). For this reason, it is not uncommon to see multiple points on a single tax lot and multiple records for a single property within the database. Please pay close attention to the "MOST_CURRENT," "BBL_STATUS and "LAST_ACTION_ON_LP" fields, which together denote the designation status of a site/property (see Attribute Definitions for more information). The geographic locations of the points in this dataset are derived primarily from the Department of City Planning's PLUTO data in combination with the Department of Information Technology & Telecommunication's building footprint information. Because this dataset is not automatically updated when changes occur in the underlying dataset, BIN numbers and tax lot information are potentially out of date. Please pay close attention to the field descriptions present in the file's metadata to understand how to use this data set. Time values are auto-generated and do not reflect the official time of any LPC action, including designation or calendaring.
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
To determine the impact of the intron-exon architecture and splice site strength on splice site selection, we created a database of alternative splice sites (ALTssDB) using the Human Exon Splicing Event Database HEXEvent, the Intron DB, and GeneBase. MaxEntScan, a computational tool was used to assign splice site scores. To minimize variability, we focused on competing alternative 5′ or 3′ splice site pairs of internal exons with only one alternative splice pattern. Thus, two alternative 5′ splice sites compete for a common 3′ss, or two alternative 3′ splice sites compete for a common 5′ss. As a result, ALTssDB reports the location of the major splice site and its competing alternative 3′ or 5′ splice site, corresponding exon sizes, usage levels, splice site scores and flanking intron lengths. Methods ALTssDB was created using EST data from the Human Exon splicing Events (HEXEvent) database. HEXEvent contains information regarding the location of competing splice sites, the resulting exon sizes, alternative splice site usage levels and the gene associated with each mRNA. The HEXEvent data was filtered to obtain a dataset comprising of only pairs of competing 5' and 3' splice sites separately. This database was subsequently modified to include splice site junction information and MaxENT scores using MaxEntScan. Using an R script and IntronDB dataset, (a database detailing eukaryotic intron features) flanking intron lengths were added to the database. Alternative splicing events were further filtered to include only events that have 10 or more EST counts.
National Priorities List (NPL) Sites - The Comprehensive Environmental Response, Compensation and Liability Information System (CERCLIS) (Superfund) Public Access Database contains a selected set of non-enforcement confidential information and is updated by the regions every 90 days. The data describes what has happened at Superfund sites prior to this quarter (updated quarterly). This database includes lists of involved parties (other Federal Agencies, states, and tribes), Human Exposure and Ground Water Migration, and Site Wide Ready for Reuse, Construction Completion, and Final Assessment Decision (GPRA-like measures) for fund lead sites. Other information that is included has been included only as a service to allow public evaluations utilizing this data. EPA does not have specific Data Quality Objectives for use of the data. Independent Quality Assessments may be made of this data by reviewing the QAPP
Comprehensive dataset of 15 Archaeological sites in Pennsylvania, United States as of July, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
Comprehensive dataset of 34 Superfund sites in Colombia as of June, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
Contaminants at Comprehensive Environmental Response, Compensation and Liability Information System (CERCLIS) (Superfund) Sites - The CERCLIS Public Access Database contains a selected set of non-enforcement confidential information and is updated by the regions every 90 days. The data describes what has happened at Superfund sites prior to this quarter (updated quarterly). This database includes lists of involved parties (other Federal Agencies, states, and tribes), Human Exposure and Ground Water Migration, and Site Wide Ready for Reuse, Construction Completion, and Final Assessment Decision (GPRA-like measures) for fund lead sites. Other information that is included has been included only as a service to allow public evaluations utilizing this data. EPA does not have specific Data Quality Objectives for use of the data. Independent Quality Assessments may be made of this data by reviewing the Quality Assurance Action Plan (QAPP ).
Comprehensive dataset of 5 Archaeological sites in Alabama, United States as of July, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
List of mines and quarries in the UK including information about operational status, products, lithostratigraphy, chronostratigraphy, pit and operator addresses, minerals planning authority. Digital data has been sold from the BritPits database, since 1994, this has been customised to suit purchasers. Use is also made of sets of operational workings data by Bureau Services who pay royalties and get updates. Older data on operators tends to be incomplete as it was not recorded. Updating is ongoing to update litho- and chronostrat data. Originally, only details of currently active sites were included in the database but, because of the importance of former workings for waste disposal and as Sites of Special Scientific Interest, information is now collected on both inactive and closed operations. The data is held in a relational database using an Oracle server and a Microsoft Access front-end. The database can be used for many purposes: mailing lists, route planning, market intelligence/analysis, and resource planning, and data has been supplied to a wide range of customers.
https://edg.epa.gov/EPA_Data_License.htmhttps://edg.epa.gov/EPA_Data_License.htm
THIS DATA ASSET NO LONGER ACTIVE: This is metadata documentation for the National Priorities List (NPL) Publication Assistance Databsae (PAD), a Lotus Notes application that holds Region 7's universe of NPL site information such as site description, threats and contaminants, cleanup approach, environmental process, community involvement, site repository, and regional contacts. This database used to be updated annually, at different times for different NPLs, but it is currently no longer being used. This work fell under objectives for EPA's 2003-2008 Strategic Plan (Goal 3) for Land Preservation & Restoration, which are to clean up and reuse contaminated land.
This dataset holds information on qualifying and non-qualifying features (species and habitats) on Sites of Special Scientific Interest (SSSIs), Special Areas of Conservation (SACs) and Special Protection Areas (SPAs). The purpose of this data capture is to maintain and have access to information on Wales' qualifying, non-qualifying and notified features (species and habitats) on SSSIs, SACs and SPAs. Data entry began in 2005 and is ongoing as new sites are designated and existing sites redesignated.
This web application allows the public to search and find information about contaminated sites caused by a release of hazardous materials or hazardous waste products and print out results.