100+ datasets found

d
Addresses (Open Data)
catalog.data.gov
data-academy.tempe.gov
+11more
Updated Nov 22, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Tempe (2025). Addresses (Open Data) [Dataset]. https://catalog.data.gov/dataset/addresses-open-data
Explore at:
Dataset updated
Nov 22, 2025
Dataset provided by
City of Tempe
Description
This dataset is a compilation of address point data for the City of Tempe. The dataset contains a point location, the official address (as defined by The Building Safety Division of Community Development) for all occupiable units and any other official addresses in the City. There are several additional attributes that may be populated for an address, but they may not be populated for every address. Contact: Lynn Flaaen-Hanna, Development Services Specialist Contact E-mail Link: Map that Lets You Explore and Export Address Data Data Source: The initial dataset was created by combining several datasets and then reviewing the information to remove duplicates and identify errors. This published dataset is the system of record for Tempe addresses going forward, with the address information being created and maintained by The Building Safety Division of Community Development.Data Source Type: ESRI ArcGIS Enterprise GeodatabasePreparation Method: N/APublish Frequency: WeeklyPublish Method: AutomaticData Dictionary
SWAMP Data Dashboard
data.cnra.ca.gov
data.ca.gov
+2more
csv, pdf
Updated Nov 17, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
California State Water Resources Control Board (2025). SWAMP Data Dashboard [Dataset]. https://data.cnra.ca.gov/dataset/swamp-data-dashboard
Explore at:
csv, pdfAvailable download formats
Dataset updated
Nov 17, 2025
Dataset authored and provided by
California State Water Resources Control Board
Description
This dataset supports the SWAMP Data Dashboard, a public-facing tool developed by the Surface Water Ambient Monitoring Program (SWAMP) to provide accessible, user-friendly access to water quality monitoring data across California. The dashboard and its associated datasets are designed to help the public, researchers, and decision-makers explore and download monitoring data collected from California’s surface waters.

This dataset includes five distinct resources:

SWAMP Stations – Geospatial and descriptive information about SWAMP monitoring sites.

Water Quality Results – Field and lab analysis results for chemical and physical parameters measured in water samples.

Toxicity Summary Results – Summarized results from aquatic toxicity tests. Summary records are entries in the database that summarize the results from multiple replicate toxicity tests of the same sample water.

Habitat Results – Data on physical habitat conditions typically collected alongside biological monitoring to provide context for interpreting water quality conditions. Includes scores for the California Stream Condition Index (CSCI) and Algal Stream Condition Index (ASCI).

Tissue Summary Results – Annual summary statistics of contaminant concentrations in aquatic organism tissue samples. The data are derived from raw individual and composite tissue sample results.

These data are collected by SWAMP and its partners to support water quality assessments, identify trends, and inform water resource management. The SWAMP Data Dashboard provides interactive visualizations and filtering tools to explore this data by region, parameter, and more.

The SWAMP dataset is sourced from the California Environmental Data Exchange Network (CEDEN), which serves as the central repository for water quality data collected by various monitoring programs throughout the state. As such, there is some overlap between this dataset and the broader CEDEN datasets also published on the California Open Data Portal (see Related Resources). This SWAMP dataset represents a curated subset of CEDEN data, specifically tailored for use in the SWAMP Data Dashboard.

Access the SWAMP Data Dashboard: https://gispublic.waterboards.ca.gov/swamp-data/

*This dataset is provisional and subject to revision. It should not be used for regulatory purposes.
NYS Substance Use Disorder Data
kaggle.com
zip
Updated Jan 1, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
State of New York (2021). NYS Substance Use Disorder Data [Dataset]. https://www.kaggle.com/datasets/new-york-state/nys-substance-use-disorder-data/discussion
Explore at:
zip(798133 bytes)Available download formats
Dataset updated
Jan 1, 2021
Dataset authored and provided by
State of New York
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Area covered
New York
Description
Content

More details about each file are in the individual file descriptions.

Context

This is a dataset hosted by the State of New York. The state has an open data platform found here and they update their information according the amount of data that is brought in. Explore New York State using Kaggle and all of the data sources available through the State of New York organization page!

Update Frequency: This dataset is updated annually.

Acknowledgements

This dataset is maintained using Socrata's API and Kaggle's API. Socrata has assisted countless organizations with hosting their open data and has been an integral part of the process of bringing more data to the public.

This dataset is distributed under the following licenses: Public Domain
D
History of work (all graph datasets)
druid.datalegend.net
api.druid.datalegend.net
+1more
application/n-quads +5
Updated Nov 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
History of Work (2025). History of work (all graph datasets) [Dataset]. https://druid.datalegend.net/HistoryOfWork/historyOfWork-all-latest
Explore at:
application/n-quads, application/n-triples, application/trig, ttl, jsonld, application/sparql-results+jsonAvailable download formats
Dataset updated
Nov 4, 2025
Dataset authored and provided by
History of Work
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
History of Work

Here you find the History of Work resources as Linked Open Data. It enables you to look ups for HISCO and HISCAM scores for an incredible amount of occupational titles in numerous languages.

Data can be queried (obtained) via the SPARQL endpoint or via the example queries. If the Linked Open Data format is new to you, you might enjoy these data stories on History of Work as Linked Open Data and this user question on Is there a list of female occupations?.

NEW version - CHANGE notes

This version is dated Apr 2025 and is not backwards compatible with the previous version (Feb 2021). The major changes are: - incredible simplification of graph representation (from 81 to 12); - use of sdo (https://schema.org/) rather than schema (http://schema.org); - replacement of prov:wasDerivedFrom with sdo:isPartOf to link occupational titles to originating datasets; - etl files (used for conversion to Linked Data) now publicly available via https://github.com/rlzijdeman/rdf-hisco; - update of issues with language tags; - specfication of language tags for english (eg. @en-gb, instead of @en); - new preferred API: https://api.druid.datalegend.net/datasets/HistoryOfWork/historyOfWork-all-latest/sparql (old API will be deprecated at some point: https://api.druid.datalegend.net/datasets/HistoryOfWork/historyOfWork-all-latest/services/historyOfWork-all-latest/sparql ) .

There are bound to be some issues. Please leave report them here.

Figure 1. Part of model illustrating the basic relation between occupations, schema.org and HISCO. https://druid.datalegend.net/HistoryOfWork/historyOfWork-all-latest/assets/601beed0f7d371035bca5521" alt="hisco-basic">

Figure 2. Part of model illustrating the relation between occupation, provenance and HISCO auxiliary variables. https://druid.datalegend.net/HistoryOfWork/historyOfWork-all-latest/assets/601beed0f7d371035bca551e" alt="hisco-aux">
World Bank: Education Data
kaggle.com
zip
Updated Mar 20, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
World Bank (2019). World Bank: Education Data [Dataset]. https://www.kaggle.com/datasets/theworldbank/world-bank-intl-education
Explore at:
zip(0 bytes)Available download formats
Dataset updated
Mar 20, 2019
Dataset provided by
World Bank Grouphttp://www.worldbank.org/
Authors
World Bank
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

The World Bank is an international financial institution that provides loans to countries of the world for capital projects. The World Bank's stated goal is the reduction of poverty. Source: https://en.wikipedia.org/wiki/World_Bank

Content

This dataset combines key education statistics from a variety of sources to provide a look at global literacy, spending, and access.

For more information, see the World Bank website.

Fork this kernel to get started with this dataset.

Acknowledgements

https://bigquery.cloud.google.com/dataset/bigquery-public-data:world_bank_health_population

http://data.worldbank.org/data-catalog/ed-stats

https://cloud.google.com/bigquery/public-data/world-bank-education

Citation: The World Bank: Education Statistics

Dataset Source: World Bank. This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - http://www.data.gov/privacy-policy#data_policy - and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.

Banner Photo by @till_indeman from Unplash.

Inspiration

Of total government spending, what percentage is spent on education?
MHS Dashboard Children and Youth Demographic Datasets
data.chhs.ca.gov
data.ca.gov
+1more
csv, zip
Updated Nov 7, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Health Care Services (2025). MHS Dashboard Children and Youth Demographic Datasets [Dataset]. https://data.chhs.ca.gov/dataset/child-youth-ab470-datasets
Explore at:
csv(1358269), csv(430905), csv(461467), csv(44757018), csv(31283542), csv(374496), csv(116973), csv(2298761), csv(1072808), csv(270327), csv(191127), csv(18869990), csv(43150), csv(1396290), csv(268395), csv(35041649), csv(32085), csv(11599), csv(998465), csv(1324593), zipAvailable download formats
Dataset updated
Nov 7, 2025
Dataset provided by
California Department of Health Care Serviceshttp://www.dhcs.ca.gov/
Authors
Department of Health Care Services
Description
The following datasets are based on the children and youth (under age 21) beneficiary population and consist of aggregate Mental Health Service data derived from Medi-Cal claims, encounter, and eligibility systems. These datasets were developed in accordance with California Welfare and Institutions Code (WIC) § 14707.5 (added as part of Assembly Bill 470 on 10/7/17). Please contact BHData@dhcs.ca.gov for any questions or to request previous years’ versions of these datasets. Note: The Performance Dashboard AB 470 Report Application Excel tool development has been discontinued. Please see the Behavioral Health reporting data hub at https://behavioralhealth-data.dhcs.ca.gov/ for access to dashboards utilizing these datasets and other behavioral health data.
O
Department of Community Resources & Services Online Data Sources
opendata.howardcountymd.gov
data.wu.ac.at
csv, xlsx, xml
Updated Oct 28, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Community Resources & Services (2019). Department of Community Resources & Services Online Data Sources [Dataset]. https://opendata.howardcountymd.gov/w/kdeq-r7qc/j72c-n6z5?cur=LdI0ncE4AfX&from=n10jJ2BVdMM
Explore at:
xml, csv, xlsxAvailable download formats
Dataset updated
Oct 28, 2019
Dataset authored and provided by
Department of Community Resources & Services
Description
This dataset lists various data sources used within the Department of Community Resources & Services for various internal and external reports. This dataset allows individuals and organizations to identify the type of data they are looking for and to which geographical level they are trying to get the data for (i.e. National, State, County, etc.). This dataset will be updated every quarter and should be utilized for research purposes
A
Privately Owned Public Spaces (POPS)
data.amerigeoss.org
data.cityofnewyork.us
+5more
csv, json, rdf, xml
Updated Jul 9, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
United States (2019). Privately Owned Public Spaces (POPS) [Dataset]. https://data.amerigeoss.org/de/dataset/privately-owned-public-spaces-pops
Explore at:
json, rdf, xml, csvAvailable download formats
Dataset updated
Jul 9, 2019
Dataset provided by
United States
Description
Privately owned public spaces, also known by the acronym POPS, are outdoor and indoor spaces provided for public enjoyment by private owners in exchange for bonus floor area or waivers, an incentive first introduced into New York City’s zoning regulations in 1961. To find out more about POPS, visit the Department of City Planning's website at http://nyc.gov/pops. This database contains detailed information about each privately owned public space in New York City.

Data Source: Privately Owned Public Space Database (2018), owned and maintained by the New York City Department of City Planning and created in collaboration with Jerold S. Kayden and The Municipal Art Society of New York.
Z
Data from: A Large-scale Dataset of (Open Source) License Text Variants
data.niaid.nih.gov
Updated Mar 31, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stefano Zacchiroli (2022). A Large-scale Dataset of (Open Source) License Text Variants [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6379163
Explore at:
Dataset updated
Mar 31, 2022
Dataset provided by
LTCI, Télécom Paris, Institut Polytechnique de Paris
Authors
Stefano Zacchiroli
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
We introduce a large-scale dataset of the complete texts of free/open source software (FOSS) license variants. To assemble it we have collected from the Software Heritage archive—the largest publicly available archive of FOSS source code with accompanying development history—all versions of files whose names are commonly used to convey licensing terms to software users and developers. The dataset consists of 6.5 million unique license files that can be used to conduct empirical studies on open source licensing, training of automated license classifiers, natural language processing (NLP) analyses of legal texts, as well as historical and phylogenetic studies on FOSS licensing. Additional metadata about shipped license files are also provided, making the dataset ready to use in various contexts; they include: file length measures, detected MIME type, detected SPDX license (using ScanCode), example origin (e.g., GitHub repository), oldest public commit in which the license appeared. The dataset is released as open data as an archive file containing all deduplicated license blobs, plus several portable CSV files for metadata, referencing blobs via cryptographic checksums.

For more details see the included README file and companion paper:

Stefano Zacchiroli. A Large-scale Dataset of (Open Source) License Text Variants. In proceedings of the 2022 Mining Software Repositories Conference (MSR 2022). 23-24 May 2022 Pittsburgh, Pennsylvania, United States. ACM 2022.

If you use this dataset for research purposes, please acknowledge its use by citing the above paper.
COVID19_datasets
kaggle.com
zip
Updated Apr 2, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Suradech Kongkiatpaiboon (2022). COVID19_datasets [Dataset]. https://www.kaggle.com/datasets/suradechk/covid19-datasets/discussion
Explore at:
zip(136322570 bytes)Available download formats
Dataset updated
Apr 2, 2022
Authors
Suradech Kongkiatpaiboon
Description
Collected COVID-19 datasets from various sources as part of DAAN-888 course, Penn State, Spring 2022. Collaborators: Mohamed Abdelgayed, Heather Beckwith, Mayank Sharma, Suradech Kongkiatpaiboon, and Alex Stroud

**1 - COVID-19 Data in the United States ** Source: The data is collected from multiple public health official sources by NY Times journalists and compiled in one single file. Description: Daily count of new COVID-19 cases and deaths for each state. Data is updated daily and runs from 1/21/2020 to 2/4/2022. URL: https://github.com/nytimes/covid-19-data/blob/master/us-states.csv Data size: 38,814 row and 5 columns.

**2 - Mask-Wearing Survey Data ** Source: The New York Times is releasing estimates of mask usage by county in the United States. Description: This data comes from a large number of interviews conducted online by the global data and survey firm Dynata, at the request of The New York Times. The firm asked a question about mask usage to obtain 250,000 survey responses between July 2 and July 14, enough data to provide estimates more detailed than the state level. URL: https://github.com/nytimes/covid-19-data/blob/master/mask-use/mask-use-by-county.csv Data size: 3,142 rows and 6 columns

**3a - Vaccine Data – Global ** Source: This data comes from the US Centers for Disease Control and Prevention (CDC), Our World in Data (OWiD) and the World Health Organization (WHO). Description: Time series data of vaccine doses administered and the number of fully and partially vaccinated people by country. This data was last updated on February 3, 2022 URL: https://github.com/govex/COVID-19/blob/master/data_tables/vaccine_data/global_data/time_series_covid19_vaccine_global.csv
Data Size: 162,521 rows and 8 columns

**3b -Vaccine Data – United States ** Source: The data is comprised of individual State's public dashboards and data from the US Centers for Disease Control and Prevention (CDC). Description: Time series data of the total vaccine doses shipped and administered by manufacturer, the dose number (first or second) by state. This data was last updated on February 3, 2022. URL: https://github.com/govex/COVID-19/blob/master/data_tables/vaccine_data/us_data/time_series/vaccine_data_us_timeline.csv
Data Size: 141,503 rows and 13 columns

**4 - Testing Data ** Source: The data is comprised of individual State's public dashboards and data from the U.S. Department of Health & Human Services. Description: Time series data of total tests administered by county and state. This data was last updated on January 25, 2022. URL: https://github.com/govex/COVID-19/blob/master/data_tables/testing_data/county_time_series_covid19_US.csv
Data size: 322,154 rows and 8 columns

**5 – US State and Territorial Public Mask Mandates ** Source: Data from state and territory executive orders, administrative orders, resolutions, and proclamations is gathered from government websites and cataloged and coded by one coder using Microsoft Excel, with quality checking provided by one or more other coders. Description: US State and Territorial Public Mask Mandates from April 10, 2020 through August 15, 2021 by County by Day URL: https://data.cdc.gov/Policy-Surveillance/U-S-State-and-Territorial-Public-Mask-Mandates-Fro/62d6-pm5i Data Size: 1,593,869 rows and 10 columns

**6 – Case Counts & Transmission Level ** Source: This open-source dataset contains seven data items that describe community transmission levels across all counties. This dataset provides the same numbers used to show transmission maps on the COVID Data Tracker and contains reported daily transmission levels at the county level. The dataset is updated every day to include the most current day's data. The calculating procedures below are used to adjust the transmission level to low, moderate, considerable, or high.
Description: US State and County case counts and transmission level from 16-Aug-2021 to 03-Feb-2022 URL: https://data.cdc.gov/Public-Health-Surveillance/United-States-COVID-19-County-Level-of-Community-T/8396-v7yb Data Size: 550,702 rows and 7 columns

**7 - World Cases & Vaccination Counts ** Source: This is an open-source dataset collected and maintained by Our World in Data. OWID provides research and data to help against the world’s largest problems.
Description: This dataset includes vaccinations, tests & positivity, hospital & ICU, confirmed cases, confirmed deaths, reproduction rate, policy responses and other variables of interest. URL: https://github.com/owid/covid-19-data/tree/master/public/data Data Size: 67 columns and 157,000 rows

**8 - COVID-19 Data in the European Union ** Source: This is an open-source dataset collected and maintained by ECDC. It is an EU agency aimed at strengthening Europe's defenses against infectious diseases.
Description: This dataset co...
Emission probabilities.
plos.figshare.com
xls
Updated Oct 4, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mark J. Panaggio; Mike Fang; Hyunseung Bang; Paige A. Armstrong; Alison M. Binder; Julian E. Grass; Jake Magid; Marc Papazian; Carrie K. Shapiro-Mendoza; Sharyn E. Parks (2023). Emission probabilities. [Dataset]. http://doi.org/10.1371/journal.pone.0292354.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0292354.t003
Dataset updated
Oct 4, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Mark J. Panaggio; Mike Fang; Hyunseung Bang; Paige A. Armstrong; Alison M. Binder; Julian E. Grass; Jake Magid; Marc Papazian; Carrie K. Shapiro-Mendoza; Sharyn E. Parks
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
During the COVID-19 pandemic, many public schools across the United States shifted from fully in-person learning to alternative learning modalities such as hybrid and fully remote learning. In this study, data from 14,688 unique school districts from August 2020 to June 2021 were collected to track changes in the proportion of schools offering fully in-person, hybrid and fully remote learning over time. These data were provided by Burbio, MCH Strategic Data, the American Enterprise Institute’s Return to Learn Tracker and individual state dashboards. Because the modalities reported by these sources were incomplete and occasionally misaligned, a model was needed to combine and deconflict these data to provide a more comprehensive description of modalities nationwide. A hidden Markov model (HMM) was used to infer the most likely learning modality for each district on a weekly basis. This method yielded higher spatiotemporal coverage than any individual data source and higher agreement with three of the four data sources than any other single source. The model output revealed that the percentage of districts offering fully in-person learning rose from 40.3% in September 2020 to 54.7% in June of 2021 with increases across 45 states and in both urban and rural districts. This type of probabilistic model can serve as a tool for fusion of incomplete and contradictory data sources in order to obtain more reliable data in support of public health surveillance and research efforts.
California Public Schools 2024-25
catalog.data.gov
data.ca.gov
+4more
Updated Oct 23, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
California Department of Education (2025). California Public Schools 2024-25 [Dataset]. https://catalog.data.gov/dataset/california-public-schools-2024-25
Explore at:
Dataset updated
Oct 23, 2025
Dataset provided by
California Department of Educationhttps://www.cde.ca.gov/
Area covered
California
Description
This layer serves as the authoritative geographic data source for California's K-12 public school locations during the 2024-25 academic year. Schools are mapped as point locations and assigned coordinates based on the physical address of the school facility. The school records are enriched with additional demographic and performance variables from the California Department of Education's data collections. These data elements can be visualized and examined geographically to uncover patterns, solve problems and inform education policy decisions.The schools in this file represent a subset of all records contained in the CDE's public school directory database. This subset is restricted to TK-12 public schools that were open in October 2024 to coincide with the official 2024-25 student enrollment counts collected on Fall Census Day in 2024 (first Wednesday in October). This layer also excludes nonpublic nonsectarian schools and district office schools.The CDE's California School Directory provides school location other basic school characteristics found in the layer's attribute table. The school enrollment, demographic and program data are collected by the CDE through the California Longitudinal Achievement System (CALPADS) and can be accessed as publicly downloadable files from the Data & Statistics web page on the CDE website. Schools are assigned X, Y coordinates using a quality controlled geocoding and validation process to optimize positional accuracy. Most schools are mapped to the school structure or centroid of the school property parcel and are individually verified using aerial imagery or assessor's parcels databases. Schools are assigned various geographic area values based on their mapped locations including state and federal legislative district identifiers and National Center for Education Statistics (NCES) locale codes.
Classification of Mars Terrain Using Multiple Data Sources - Dataset - NASA...
data.nasa.gov
Updated Mar 31, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
nasa.gov (2025). Classification of Mars Terrain Using Multiple Data Sources - Dataset - NASA Open Data Portal [Dataset]. https://data.nasa.gov/dataset/classification-of-mars-terrain-using-multiple-data-sources
Explore at:
Dataset updated
Mar 31, 2025
Dataset provided by
NASAhttp://nasa.gov/
Description
Classification of Mars Terrain Using Multiple Data Sources Alan Kraut1, David Wettergreen1 ABSTRACT. Images of Mars are being collected faster than they can be analyzed by planetary scientists. Automatic analysis of images would enable more rapid and more consistent image interpretation and could draft geologic maps where none yet exist. In this work we develop a method for incorporating images from multiple instruments to classify Martian terrain into multiple types. Each image is segmented into contiguous groups of similar pixels, called superpixels, with an associated vector of discriminative features. We have developed and tested several classification algorithms to associate a best class to each superpixel. These classifiers are trained using three different manual classifications with between 2 and 6 classes. Automatic classification accuracies of 50 to 80% are achieved in leave-one-out cross-validation across 20 scenes using a multi-class boosting classifier.
w
State of California - Data
data.wu.ac.at
Updated Oct 11, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Global (2013). State of California - Data [Dataset]. https://data.wu.ac.at/odso/datahub_io/NDZlMmFjNWEtMGY1ZS00ZWVhLTgzZWEtMmY5ZmFhMGQyMjEx
Explore at:
Dataset updated
Oct 11, 2013
Dataset provided by
Global
Description
About

Data from the State of California. From website:

Access raw State data files, databases, geographic data, and other data sources. Raw State data files can be reused by citizens and organizations for their own web applications and mashups.

Openness

Open. Effectively in the public domain. Terms of use page says:

In general, information presented on this web site, unless otherwise indicated, is considered in the public domain. It may be distributed or copied as permitted by law. However, the State does make use of copyrighted data (e.g., photographs) which may require additional permissions prior to your use. In order to use any information on this web site not owned or created by the State, you must seek permission directly from the owning (or holding) sources. The State shall have the unlimited right to use for any purpose, free of any charge, all information submitted via this site except those submissions made under separate legal contract. The State shall be free to use, for any purpose, any ideas, concepts, or techniques contained in information provided through this site.
d
Education - Arlington Public Schools Students Per Teacher
datasets.ai
s.cnmilf.com
+1more
23, 53
Updated Aug 7, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Arlington County, VA (2021). Education - Arlington Public Schools Students Per Teacher [Dataset]. https://datasets.ai/datasets/education-arlington-public-schools-students-per-teacher-78c3c
Explore at:
53, 23Available download formats
Dataset updated
Aug 7, 2021
Dataset authored and provided by
Arlington County, VA
Area covered
Arlington County, Arlington County Public Schools
Description
The Arlington Profile combines countywide data sources and provides a comprehensive outlook of the most current data on population, housing, employment, development, transportation, and community services. These datasets are used to obtain an understanding of community, plan future services/needs, guide policy decisions, and secure grant funding. A PDF Version of the Arlington Profile can be accessed on the Arlington County website.
d
Open Data Portal Tutorial for Maryland State Agencies
datasets.ai
opendata.maryland.gov
+1more
33
Updated Nov 10, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
State of Maryland (2020). Open Data Portal Tutorial for Maryland State Agencies [Dataset]. https://datasets.ai/datasets/open-data-portal-tutorial-for-maryland-state-agencies
Explore at:
33Available download formats
Dataset updated
Nov 10, 2020
Dataset authored and provided by
State of Maryland
Area covered
Maryland
Description
This is a PDF document created by the Department of Information Technology (DoIT) and the Governor's Office of Performance Improvement to assist training Maryland state employees on use of the Open Data Portal, https://opendata.maryland.gov. This document covers direct data entry, uploading Excel spreadsheets, connecting source databases, and transposing data. Please note that this tutorial is intended for use by state employees, as non-state users cannot upload datasets to the Open Data Portal.
Public Safety
alaska-critical-infrastructure-akdot.hub.arcgis.com
Updated Mar 14, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alaska Department of Transportation & Public Facilities (2024). Public Safety [Dataset]. https://alaska-critical-infrastructure-akdot.hub.arcgis.com/datasets/public-safety
Explore at:
Dataset updated
Mar 14, 2024
Dataset authored and provided by
Alaska Department of Transportation & Public Facilitieshttps://dot.alaska.gov/
Area covered
Description
All datasets have been downloaded from other sources. Below is a table of each dataset, with the organization is was sourced from along with the URL for the original data source. To learn more about a specific dataset, please use the source URL and reach out to the organization it was sourced from. Fire Stations are a combination of local government sources when possible, otherwise the DCRA data set was used. See the lower table for full details. Layer NameCategoryData SourceSource LinkCall Routing 911Public SafetyDCRALinkPSAP AreasPublic SafetyMatcomLinkStateTrooper DetatchmentsPublic SafetyDCRALinkAlaska_AddressesPublic SafetyDewberry, State of AlaskaLinkFire_DepartmentsPublic SafetyMultipleSee BelowFBI_Uniform_Crime_ReportingPublic SafetyDCRALinkState_TroopersPublic SafetyDCRALinkVillage_Public_Safety_OfficerPublic SafetyDCRALinkPublic Safety Answering PointsPublic SafetyDCRALink Local GovernmentSource URLHainesLinkKenaiLinkKetchikanLink JuneauLink Kodiak IslandLink Matanuska SustinaLink Municipality of AnchorageLink North StarLink North SlopeLink UnalaskaLink WrangellLink Bristol BayLink Denali Link Northwest ArcticLink PetersburgLink SitkaLink SkagwayLink YakutatLink Remaining State Fire Stations (HIFLD)Link
National Information Infrastructure - Dataset - data.gov.uk
ckan.publishing.service.gov.uk
Updated Oct 31, 2013
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ckan.publishing.service.gov.uk (2013). National Information Infrastructure - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/national-information-infrastructure
Explore at:
Dataset updated
Oct 31, 2013
Dataset provided by
CKANhttps://ckan.org/
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
Over the summer of 2013, the Cabinet Office started to develop the processes to support the maintenance of a dynamic NII. We can now launch a first iteration which will be the basis for user feedback and the identification of additional datasets. The processes for defining the NII can be broadly outlined as follows: a) Identifying and maintaining an inventory of data held by government; b) Prioritising data to be included in the NII; and c) Supporting organisations to release data, where possible. The Cabinet Office has developed an over-arching framework for the NII to be used as a “thinking tool” in engaging with the NII. Without this framework it will be hard to communicate the function and benefits of the NII. The framework combines a high-level categorisation of government data and characteristics of different types of data to provide a framework for the processes and identify early candidates for inclusion in the NII. The data themes in the framework for the NII relate primarily to characteristics of the organisation which hold the data and also reflect the high level categories of data in the G8 Open Data Charter. Transparency was one of the key three priorities of the recent G8, chaired by the UK where all G8 Leaders signed up to a set of principles specified in an Open Data Charter. G8 members identified 14 high-value areas, jointly regarded as data that will help unlock the economic potential of open data, support and encourage innovation, and provide greater accountability to improve our democracies. The UK has aligned these categories to inform the creation of its NII. Datasets listed against Transport and Infrastructure include datasets owned and held by government agencies, ALBs and the wider transport industry, reflecting the organisation of information in the sector. Overlaying these data themes, we have analysed user feedback, ODUG benefits cases, applications and services which successfully use government data, and expert feedback to develop 4 primary uses of data. These are: a) Location: Geospatial data which can inform mapping and planning. b) Performance and Delivery: Data which shows how effectively public bodies and services are fulfilling their public tasks and the delivery of policy. c) Fiscal: Government spend, procurement and contractual data as well as data about the financial management of public sector activities. This also includes data that government holds about companies which may be of value to users. d) Operational: Data about the operational structure, placement of public service delivery points and the nature of the resources available within each of them.
Data from: NICHE: A Curated Dataset of Engineered Machine Learning Projects...
figshare.com
txt
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ratnadira Widyasari; Zhou YANG; Ferdian Thung; Sheng Qin Sim; Fiona Wee; Camellia Lok; Jack Phan; Haodi Qi; Constance Tan; Qijin Tay; David LO (2023). NICHE: A Curated Dataset of Engineered Machine Learning Projects in Python [Dataset]. http://doi.org/10.6084/m9.figshare.21967265.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.21967265.v1
Dataset updated
May 30, 2023
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Ratnadira Widyasari; Zhou YANG; Ferdian Thung; Sheng Qin Sim; Fiona Wee; Camellia Lok; Jack Phan; Haodi Qi; Constance Tan; Qijin Tay; David LO
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Machine learning (ML) has gained much attention and has been incorporated into our daily lives. While there are numerous publicly available ML projects on open source platforms such as GitHub, there have been limited attempts in filtering those projects to curate ML projects of high quality. The limited availability of such high-quality dataset poses an obstacle to understanding ML projects. To help clear this obstacle, we present NICHE, a manually labelled dataset consisting of 572 ML projects. Based on evidences of good software engineering practices, we label 441 of these projects as engineered and 131 as non-engineered. In this repository we provide "NICHE.csv" file that contains the list of the project names along with their labels, descriptive information for every dimension, and several basic statistics, such as the number of stars and commits. This dataset can help researchers understand the practices that are followed in high-quality ML projects. It can also be used as a benchmark for classifiers designed to identify engineered ML projects.

GitHub page: https://github.com/soarsmu/NICHE
O*NET Database
onetcenter.org
excel, mysql, oracle +2
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Center for O*NET Development, O*NET Database [Dataset]. https://www.onetcenter.org/database.html
Explore at:
oracle, sql server, text, mysql, excelAvailable download formats
Dataset provided by
Occupational Information Network
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
United States
Dataset funded by
US Department of Labor, Employment and Training Administration
Description
The O*NET Database contains hundreds of standardized and occupation-specific descriptors on almost 1,000 occupations covering the entire U.S. economy. The database, which is available to the public at no cost, is continually updated by a multi-method data collection program. Sources of data include: job incumbents, occupational experts, occupational analysts, employer job postings, and customer/professional association input.
Data content areas include:
Worker Characteristics (e.g., Abilities, Interests, Work Styles)
Worker Requirements (e.g., Education, Knowledge, Skills)
Experience Requirements (e.g., On-the-Job Training, Work Experience)
Occupational Requirements (e.g., Detailed Work Activities, Work Context)
Occupation-Specific Information (e.g., Job Titles, Tasks, Technology Skills)

Facebook

Twitter

Click to copy link

Link copied

Cite

City of Tempe (2025). Addresses (Open Data) [Dataset]. https://catalog.data.gov/dataset/addresses-open-data

Addresses (Open Data)

Explore at:

19 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Nov 22, 2025

Dataset provided by

City of Tempe

Description

This dataset is a compilation of address point data for the City of Tempe. The dataset contains a point location, the official address (as defined by The Building Safety Division of Community Development) for all occupiable units and any other official addresses in the City. There are several additional attributes that may be populated for an address, but they may not be populated for every address. Contact: Lynn Flaaen-Hanna, Development Services Specialist Contact E-mail Link: Map that Lets You Explore and Export Address Data Data Source: The initial dataset was created by combining several datasets and then reviewing the information to remove duplicates and identify errors. This published dataset is the system of record for Tempe addresses going forward, with the address information being created and maintained by The Building Safety Division of Community Development.Data Source Type: ESRI ArcGIS Enterprise GeodatabasePreparation Method: N/APublish Frequency: WeeklyPublish Method: AutomaticData Dictionary

Clear search

Close search

Google apps

Main menu

Addresses (Open Data)

SWAMP Data Dashboard

NYS Substance Use Disorder Data

Content

Context

Acknowledgements

History of work (all graph datasets)

History of Work

NEW version - CHANGE notes

World Bank: Education Data

Context

Content

Acknowledgements

Inspiration

MHS Dashboard Children and Youth Demographic Datasets

Department of Community Resources & Services Online Data Sources

Privately Owned Public Spaces (POPS)

Data from: A Large-scale Dataset of (Open Source) License Text Variants

COVID19_datasets

Emission probabilities.

California Public Schools 2024-25

Classification of Mars Terrain Using Multiple Data Sources - Dataset - NASA...

State of California - Data

About

Openness

Education - Arlington Public Schools Students Per Teacher

Open Data Portal Tutorial for Maryland State Agencies

Public Safety

National Information Infrastructure - Dataset - data.gov.uk

Data from: NICHE: A Curated Dataset of Engineered Machine Learning Projects...

O*NET Database

Addresses (Open Data)See More Versions

Addresses (Open Data)