Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Wastewater-based epidemiology is a promising public health tool that can yield a more representative view of the population than case reporting. However, only about 80% of the U.S. population is connected to public sewers, and the characteristics of populations missed by wastewater-based epidemiology are unclear. To address this gap, we used publicly available datasets to assess sewer connectivity in the U.S. by location, demographic groups, and economic groups. Data from the U.S. Census’ American Housing Survey revealed that sewer connectivity was lower than average when the head of household was American Indian and Alaskan Native, White, non-Hispanic, older, and for larger households and those with higher income, but smaller geographic scales revealed local variations from this national connectivity pattern. For example, data from the U.S. Environmental Protection Agency showed that sewer connectivity was positively correlated with income in Minnesota, Florida, and California. Data from the U.S. Census’ American Community Survey and Environmental Protection Agency also revealed geographic areas with low sewer connectivity, such as Alaska, the Navajo Nation, Minnesota, Michigan, and Florida. However, with the exception of the U.S. Census data, there were inconsistencies across datasets. Using mathematical modeling to assess the impact of wastewater sampling inequities on inferences about epidemic trajectory at a local scale, we found that in some situations, even weak connections between communities may allow wastewater monitoring in one community to serve as a reliable proxy for an interacting community with no wastewater monitoring, when cases are widespread. A systematic, rigorous assessment of sewer connectivity will be important for ensuring an equitable and informed implementation of wastewater-based epidemiology as a public health monitoring system.
Facebook
TwitterThird grade English Language Arts (ELA) and Math test results for the 2016-2017 school year by census tract for the state of Michigan. Data Driven Detroit obtained these datasets from MI School Data, for the State of the Detroit Child tool in July 2017. Test results were originally obtained on a school level and aggregated to census tract by Data Driven Detroit. Student data was suppressed when less than five students were tested per school.Click here for metadata (descriptions of the fields).
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This dataset gives the complete list of all 205,822 exceptional Dehn fillings on the 1-cusped hyperbolic 3-manifolds that have ideal triangulations with at most 9 ideal tetrahedra.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Correlation of the percentage of a Florida county subdivision not connected to septic tanks with different demographic or economic variables.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This is a hybrid gridded dataset of demographic data for China from 1979 to 2100, given as 21 five-year age groups of population divided by gender every year at a 0.5-degree grid resolution.
The historical period (1979-2020) part of this dataset combines the NASA SEDAC Gridded Population of the World version 4 (GPWv4, UN WPP-Adjusted Population Count) with gridded population from the Inter-Sectoral Impact Model Intercomparison Project (ISIMIP, Histsoc gridded population data).
The projection (2010-2100) part of this dataset is resampled directly from Chen et al.’s data published in Scientific Data.
This dataset includes 31 provincial administrative districts of China, including 22 provinces, 5 autonomous regions, and 4 municipalities directly under the control of the central government (Taiwan, Hong Kong, and Macao were excluded due to missing data).
Method - demographic fractions by age and gender in 1979-2020
Age- and gender-specific demographic data by grid cell for each province in China are derived by combining historical demographic data in 1979-2020 with the national population census data provided by the National Statistics Bureau of China.
To combine the national population census data with the historical demographics, we constructed the provincial fractions of demographic in each age groups and each gender according to the fourth, fifth and sixth national population census, which cover the year of 1979-1990, 1991-2000 and 2001-2020, respectively. The provincial fractions can be computed as:
\(\begin{align*} \begin{split} f_{year,province,age,gender}= \left \{ \begin{array}{lr} POP_{1990,province,age,gender}^{4^{th}census}/POP_{1990,province}^{4^{th}census} & 1979\le\mathrm{year}\le1990\\ POP_{2000,province,age,gender}^{5^{th}census}/POP_{2000,province}^{5^{th}census} & 1991\le\mathrm{year}\le2000\\ POP_{2010,province,age,gender}^{6^{th}census}/POP_{2010,province}^{6^{th}census}, & 2001\le\mathrm{year}\le2020 \end{array} \right. \end{split} \end{align*}\)
Where:
- \( f_{\mathrm{year,province,age,gender}}\)is the fraction of population for a given age, a given gender in each province from the national census from 1979-2020.
- \(\mathrm{PO}\mathrm{P}_{\mathrm{year,province,age,gender}}^{X^{\mathrm{th}}\mathrm{census} }\) is the total population for a given age, a given gender in each province from the Xth national census.
- \(\mathrm{PO}\mathrm{P}_{\mathrm{year,province}}^{X^{\mathrm{th}}\mathrm{census} }\) is the total population for all ages and both genders in each province from the Xth national census.
Method - demographic totals by age and gender in 1979-2020
The yearly grid population for 1979-1999 are from ISIMIP Histsoc gridded population data, and for 2000-2020 are from the GPWv4 demographic data adjusted by the UN WPP (UN WPP-Adjusted Population Count, v4.11, https://beta.sedac.ciesin.columbia.edu/data/set/gpw-v4-population-count-adjusted-to-2015-unwpp-country-totals-rev11), which combines the spatial distribution of demographics from GPWv4 with the temporal trends from the UN WPP to improve accuracy. These two gridded time series are simply joined at the cut-over date to give a single dataset - historical demographic data covering 1979-2020.
Next, historical demographic data are mapped onto the grid scale to obtain provincial data by using gridded provincial code lookup data and name lookup table. The age- and gender-specific fraction were multiplied by the historical demographic data at the provincial level to obtain the total population by age and gender for per grid cell for china in 1979-2020.
Method - demographic totals and fractions by age and gender in 2010-2100
The grid population count data in 2010-2100 under different shared socioeconomic pathway (SSP) scenarios are drawn from Chen et al. published in Scientific Data with a resolution of 1km (~ 0.008333 degree). We resampled the data to 0.5 degree by aggregating the population count together to obtain the future population data per cell.
This previously published dataset also provided age- and gender-specific population of each provinces, so we calculated the fraction of each age and gender group at provincial level. Then, we multiply the fractions with grid population count to get the total population per age group per cell for each gender.
Note that the projected population data from Chen’s dataset covers 2010-2020, while the historical population in our dataset also covers 2010-2020. The two datasets of that same period may vary because the original population data come from different sources and are calculated based on different methods.
Disclaimer
This dataset is a hybrid of different datasets with independent methodologies. Spatial or temporal consistency across dataset boundaries cannot be guaranteed.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
All population characteristics in the table were identical for the synthetic microdata and the American Community Survey data.
Facebook
TwitterThis dataset uses trajectory data from a large set of drifters to extract and analyze displacement signals associated with coherent eddies in the Gulf of Mexico, using a multivariate wavelet ridge analysis as presented in Lilly and Pérez-Brunius (2021). The data includes eddy displacement signals for all ridges, as well as the time-varying ellipse parameters and estimated ellipse center location. The instantaneous frequency is also included, as is the instantaneous bias estimate derived by Lilly and Olhede (2012). The data are organized as appended trajectory data that can be readily separated through the use of the "ids" field. The ridge length (\(L\)),and ridge-averaged circularity (\(\overline{\xi}\)) are also included, as is measure of statistical significance denoted by (\(\rho\)). The dataset is available for download as a NetCDF file.
Lilly, J. M. and P. Pérez-Brunius (2021). Extracting statistically significant eddy signals from large Lagrangian datasets using wavelet ridge analysis, with application to the Gulf of Mexico. Nonlinear Processes in Geophysics, 28: 181–212. https://doi.org/10.5194/npg-28-181-2021.
Lilly, J. M. and Olhede, S. C.: Analysis of modulated multivariate oscillations, IEEE T. Signal Proces., 60, 600–612, 2012. 10.1109/TSP.2011.2173681
Facebook
TwitterThe key stage 2 datasets combine information from the following two data sources: - prior attainment records (key stage 1 results) - school census records Key stage assessment data received from the Standard Testing Agency (STA) is matched to school census records to identify pupils’ ethnicities and free school meal eligibility.
Location: England Time period: 2017/18
This data measures the percentage of eligible pupils who met the expected and higher standards in reading, writing and maths at the end of key stage 2 when children are 10 or 11 years old.
Facebook
Twitterhttps://www.ontario.ca/page/open-government-licence-ontariohttps://www.ontario.ca/page/open-government-licence-ontario
Data includes: board and school information, grade 3 and 6 EQAO student achievements for reading, writing and mathematics, and grade 9 mathematics EQAO and OSSLT. Data excludes private schools, Education and Community Partnership Programs (ECPP), summer, night and continuing education schools.
How Are We Protecting Privacy?
Results for OnSIS and Statistics Canada variables are suppressed based on school population size to better protect student privacy. In order to achieve this additional level of protection, the Ministry has used a methodology that randomly rounds a percentage either up or down depending on school enrolment. In order to protect privacy, the ministry does not publicly report on data when there are fewer than 10 individuals represented.
The information in the School Information Finder is the most current available to the Ministry of Education at this time, as reported by schools, school boards, EQAO and Statistics Canada. The information is updated as frequently as possible.
This information is also available on the Ministry of Education's School Information Finder website by individual school.
Descriptions for some of the data types can be found in our glossary.
School/school board and school authority contact information are updated and maintained by school boards and may not be the most current version. For the most recent information please visit: https://data.ontario.ca/dataset/ontario-public-school-contact-information.
Facebook
TwitterCONTEXT
Practice Scenario: The UIW School of Engineering wants to recruit more students into their program. They will recruit students with great math scores. Also, to increase the chances of recruitment, the department will look for students who qualify for financial aid. Students who qualify for financial aid more than likely come from low socio-economic backgrounds. One way to indicate this is to view how much federal revenue a school district receives through its state. High federal revenue for a school indicates that a large portion of the student base comes from low incomes families.
The question we wish to ask is as follows: Name the school districts across the nation where their Child Nutrition Programs(c25) are federally funded between the amounts $30,000 and $50,000. And where the average math score for the school districts corresponding state is greater than or equal to the nations average score of 282.
The SQL query below in 'Top5MathTarget.sql' can be used to answer this question in MySQL. To execute this process, one would need to install MySQL to their local system and load the attached datasets below from Kaggle into their MySQL schema. The SQL query below will then join the separate tables on various key identifiers.
DATA SOURCE Data is sourced from The U.S Census Bureau and The Nations Report Card (using the NAEP Data Explorer).
Finance: https://www.census.gov/programs-surveys/school-finances/data/tables.html
Math Scores: https://www.nationsreportcard.gov/ndecore/xplore/NDE
COLUMN NOTES
All data comes from the school year 2017. Individual schools are not represented, only school districts within each state.
FEDERAL FINANCE DATA DEFINITIONS
t_fed_rev: Total federal revenue through the state to each school district.
C14- Federal revenue through the state- Title 1 (no child left behind act).
C25- Federal revenue through the state- Child Nutrition Act.
Title 1 is a program implemented in schools to help raise academic achievement for all students. The program is available to schools where at least 40% of the students come from low inccome families.
Child Nutrition Programs ensure the children are getting the food they need to grow and learn. Schools with high federal revenue to these programs indicate students that also come from low income families.
MATH SCORES DATA DEFINITIONS
Note: Mathematics, Grade 8, 2017, All Students (Total)
average_scale_score - The state's average score for eighth graders taking the NAEP math exam.
Facebook
TwitterThis dataset contains college readiness information, by U.S. Census Tract, for the state of Michigan. This dataset is reporting score information from the 2017-2018 school year. Students were considered ready for college if they scored at or above the benchmark scores. The SAT Benchmarks represent the likelihood of success in entry-level college courses. The benchmark for Evidenced-Based Reading and Writing (EBRW) is 480 and 530 for Math. The SAT total score reported for Michigan is the combined Evidenced-Based Reading and Writing, and Math Student Score. The Total Score range is 400 – 1600. Data Driven Detroit obtained this data from MiSchoolData.org in October 2018 at a building level and aggregated the data to a tract level.Click here for metadata (descriptions of the fields).
Facebook
TwitterAttribution 2.5 (CC BY 2.5)https://creativecommons.org/licenses/by/2.5/
License information was derived automatically
Estimated Resident Population (ERP) is the official measure of the Australian population. ERP for sub-state regions (including SA2s and LGAs) is published annually, with a reference date of 30 June. ERP is the official measure of the Australian population, based on the concept of usual residence. It refers to all people, regardless of nationality, citizenship or legal status, who usually live in Australia, with the exception of foreign diplomatic personnel and their families. Note, years 2012-2016 describe preliminary rebased (PR) data. For more information about PR refer to the dataset's Explanatory Notes. This dataset has been compiled using Census data, mathematical models and a range of indicator data. Current indicators include building approvals, Medicare enrolments (provided by the Department of Human Services) and electoral enrolments (provided by the Australian Electoral Commission). Data is sourced from: ABS.Stat and further information is available at http://stat.data.abs.gov.au/Index.aspx?DataSetCode=ABS_ERP_LGA2016. For additional information about this dataset and other related statistics, contact the National Information and Referral Service on 1300 135 070.
Facebook
Twitterhttps://www.icpsr.umich.edu/web/ICPSR/studies/9270/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/9270/terms
This dataset provides annual population projections for the 50 states and the District of Columbia by age, sex, and race for the years 1986 through 2010. The projections were made using a mathematical projection model called the cohort-component method. This method allows separate assumptions to be made for each of the components of population change: births, deaths, internal migration, and international migration. The projections are consistent with the July 1, 1986 population estimates for states. In general, the projections assume a slight increase in the national levels of fertility, an increasing level of life expectancy, and a decreasing level of net international migration. Internal migration assumptions are based on the annual state-to-state migration data for the years 1975-1986.
Facebook
TwitterThis dataset contains college readiness information, by U.S. Census Block Group, for the state of Michigan. This dataset is reporting score information from the 2017-2018 school year. Students were considered ready for college if they scored at or above the benchmark scores. The SAT Benchmarks represent the likelihood of success in entry-level college courses. The benchmark for Evidenced-Based Reading and Writing (EBRW) is 480 and 530 for Math. The SAT total score reported for Michigan is the combined Evidenced-Based Reading and Writing, and Math Student Score. The Total Score range is 400 – 1600. Data Driven Detroit obtained this data from MiSchoolData.org in October 2018 at a building level and aggregated the data to a block group level.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
The number of pupils making 2 levels progress between Key Stages where prior attainment data exists against the number of eligible pupils in cohort with matched valid results at KS1, expressed as a percentage. Source: Department for Children Schools and Families (DCSF) Publisher: DCLG Floor Targets Interactive Geographies: County/Unitary Authority, Government Office Region (GOR), National Geographic coverage: England Time coverage: 2003/04 to 2007/08 Type of data: Survey (census)
Facebook
TwitterOpen Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Number and percentage of Indigenous people with a postsecondary credential in STEM (science, technology, engineering and math and computer science) and BHASE (non-STEM) fields of study.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Wastewater-based epidemiology is a promising public health tool that can yield a more representative view of the population than case reporting. However, only about 80% of the U.S. population is connected to public sewers, and the characteristics of populations missed by wastewater-based epidemiology are unclear. To address this gap, we used publicly available datasets to assess sewer connectivity in the U.S. by location, demographic groups, and economic groups. Data from the U.S. Census’ American Housing Survey revealed that sewer connectivity was lower than average when the head of household was American Indian and Alaskan Native, White, non-Hispanic, older, and for larger households and those with higher income, but smaller geographic scales revealed local variations from this national connectivity pattern. For example, data from the U.S. Environmental Protection Agency showed that sewer connectivity was positively correlated with income in Minnesota, Florida, and California. Data from the U.S. Census’ American Community Survey and Environmental Protection Agency also revealed geographic areas with low sewer connectivity, such as Alaska, the Navajo Nation, Minnesota, Michigan, and Florida. However, with the exception of the U.S. Census data, there were inconsistencies across datasets. Using mathematical modeling to assess the impact of wastewater sampling inequities on inferences about epidemic trajectory at a local scale, we found that in some situations, even weak connections between communities may allow wastewater monitoring in one community to serve as a reliable proxy for an interacting community with no wastewater monitoring, when cases are widespread. A systematic, rigorous assessment of sewer connectivity will be important for ensuring an equitable and informed implementation of wastewater-based epidemiology as a public health monitoring system.