46 datasets found
  1. N

    Five Points, AL Median Household Income Trends (2010-2021, in 2022...

    • neilsberg.com
    csv, json
    Updated Jan 11, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2024). Five Points, AL Median Household Income Trends (2010-2021, in 2022 inflation-adjusted dollars) [Dataset]. https://www.neilsberg.com/research/datasets/90cd2e14-73f0-11ee-949f-3860777c1fe6/
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Jan 11, 2024
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Alabama, Five Points
    Variables measured
    Median Household Income, Median Household Income Year on Year Change, Median Household Income Year on Year Percent Change
    Measurement technique
    The data presented in this dataset is derived from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. It presents the median household income from the years 2010 to 2021 following an initial analysis and categorization of the census data. Subsequently, we adjusted these figures for inflation using the Consumer Price Index retroactive series via current methods (R-CPI-U-RS). For additional information about these estimations, please contact us via email at research@neilsberg.com
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset illustrates the median household income in Five Points, spanning the years from 2010 to 2021, with all figures adjusted to 2022 inflation-adjusted dollars. Based on the latest 2017-2021 5-Year Estimates from the American Community Survey, it displays how income varied over the last decade. The dataset can be utilized to gain insights into median household income trends and explore income variations.

    Key observations:

    From 2010 to 2021, the median household income for Five Points increased by $3,119 (5.39%), as per the American Community Survey estimates. In comparison, median household income for the United States increased by $4,559 (6.51%) between 2010 and 2021.

    Analyzing the trend in median household income between the years 2010 and 2021, spanning 11 annual cycles, we observed that median household income, when adjusted for 2022 inflation using the Consumer Price Index retroactive series (R-CPI-U-RS), experienced growth year by year for 5 years and declined for 6 years.

    https://i.neilsberg.com/ch/five-points-al-median-household-income-trend.jpeg" alt="Five Points, AL median household income trend (2010-2021, in 2022 inflation-adjusted dollars)">

    Content

    When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. All incomes have been adjusting for inflation and are presented in 2022-inflation-adjusted dollars.

    Years for which data is available:

    • 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021

    Variables / Data Columns

    • Year: This column presents the data year from 2010 to 2021
    • Median Household Income: Median household income, in 2022 inflation-adjusted dollars for the specific year
    • YOY Change($): Change in median household income between the current and the previous year, in 2022 inflation-adjusted dollars
    • YOY Change(%): Percent change in median household income between current and the previous year

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

    Recommended for further research

    This dataset is a part of the main dataset for Five Points median household income. You can refer the same here

  2. o

    Health Record Hiccups - 5526 real-world time series with change points...

    • explore.openaire.eu
    • data.niaid.nih.gov
    Updated Nov 17, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    T. Phuong Quan (2022). Health Record Hiccups - 5526 real-world time series with change points labelled by crowd-sourced visual inspection [Dataset]. http://doi.org/10.5281/zenodo.7331161
    Explore at:
    Dataset updated
    Nov 17, 2022
    Authors
    T. Phuong Quan
    Area covered
    World
    Description

    This work uses data generated via the Zooniverse.org platform. All research publications using data derived from Zooniverse approved projects are required to acknowledge the Zooniverse and the Project Builder platform. Please use the text: "This publication uses data generated via the Zooniverse.org platform." We would like to thank the Zooniverse team and all the Zooniverse volunteers who donated their time freely and generously. This work uses data provided by patients and collected by the NHS as part of their care and support. We thank all the people of Oxfordshire who contribute to the Infections in Oxfordshire Research Database. Research Database Team: L Butcher, H Boseley, C Crichton, DW Crook, DW Eyre, O Freeman, J Gearing (community), R Harrington, K Jeffery, M Landray, A Pal, TEA Peto, TP Quan, J Robinson (community), J Sellors, B Shine, AS Walker, D Waller. Patient and Public Panel: G Blower, C Mancey, P McLoughlin, B Nichols. This work was supported by the National Institute for Health Research Health Protection Research Unit (NIHR HPRU) in Healthcare Associated Infections and Antimicrobial Resistance at the University of Oxford in partnership with Public Health England (PHE) (NIHR200915), and by the NIHR Oxford Biomedical Research Centre. 5526 real-world time series with labels for the location of all abrupt changes in level, variability, trend, presence/absence of data points, and irregular outliers. The time series were produced from a range of electronic health record data extracts from a large UK hospital group. Values in each data field were aggregated by day/week/month, and numeric summary values calculated for each timepoint from the (often non-numeric) data by applying simple functions (e.g. number of values present, percentage of missing values, number of distinct values, median value). Labels were produced by visual inspection of time series plots from ~2000 volunteers, via the Health Record Hiccups project on the Zooniverse platform (https://www.zooniverse.org/projects/phuongquan/health-record-hiccups). Volunteers drew a vertical line on the image wherever they saw a change point (green line if they were certain, yellow line if they were unsure). Consensus labels per image were calculated using density based clustering with noise (R v3.6.3, dbscan v1.1-5), and converted back to a date.

  3. Data from: Public Housing Developments

    • data.lojic.org
    • opendata.atlantaregional.com
    • +1more
    Updated Nov 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Housing and Urban Development (2024). Public Housing Developments [Dataset]. https://data.lojic.org/datasets/HUD::public-housing-developments-1
    Explore at:
    Dataset updated
    Nov 12, 2024
    Dataset provided by
    United States Department of Housing and Urban Developmenthttp://www.hud.gov/
    Authors
    Department of Housing and Urban Development
    Area covered
    Description

    HUD furnishes technical and professional assistance in planning, developing and managing these developments. Public Housing Developments are depicted as a distinct address chosen to represent the general location of an entire Public Housing Development, which may be comprised of several buildings scattered across a community. The building with the largest number of units is selected to represent the location of the development. Location data for HUD-related properties and facilities are derived from HUD's enterprise geocoding service. While not all addresses are able to be geocoded and mapped to 100% accuracy, we are continuously working to improve address data quality and enhance coverage. Please consider this issue when using any datasets provided by HUD. When using this data, take note of the field titled “LVL2KX” which indicates the overall accuracy of the geocoded address using the following return codes: ‘R’ - Interpolated rooftop (high degree of accuracy, symbolized as green) ‘4’ - ZIP+4 centroid (high degree of accuracy, symbolized as green) ‘B’ - Block group centroid (medium degree of accuracy, symbolized as yellow) ‘T’ - Census tract centroid (low degree of accuracy, symbolized as red) ‘2’ - ZIP+2 centroid (low degree of accuracy, symbolized as red) ‘Z’ - ZIP5 centroid (low degree of accuracy, symbolized as red) ‘5’ - ZIP5 centroid (same as above, low degree of accuracy, symbolized as red) Null - Could not be geocoded (does not appear on the map) For the purposes of displaying the location of an address on a map only use addresses and their associated lat/long coordinates where the LVL2KX field is coded ‘R’ or ‘4’. These codes ensure that the address is displayed on the correct street segment and in the correct census block. The remaining LVL2KX codes provide a cascading indication of the most granular level geography for which an address can be confirmed. For example, if an address cannot be accurately interpolated to a rooftop (‘R’), or ZIP+4 centroid (‘4’), then the address will be mapped to the centroid of the next nearest confirmed geography: block group, tract, and so on. When performing any point-in polygon analysis it is important to note that points mapped to the centroids of larger geographies will be less likely to map accurately to the smaller geographies of the same area. For instance, a point coded as ‘5’ in the correct ZIP Code will be less likely to map to the correct block group or census tract for that address. In an effort to protect Personally Identifiable Information (PII), the characteristics for each building are suppressed with a -4 value when the “Number_Reported” is equal to, or less than 10. To learn more about Public Housing visit: https://www.hud.gov/program_offices/public_indian_housing/programs/ph/, for questions about the spatial attribution of this dataset, please reach out to us at GISHelpdesk@hud.gov. Data Dictionary: DD_Public Housing Developments Date Updated: Q1 2025

  4. r

    Data and code for: Better self-care through co-care? A latent profile...

    • researchdata.se
    • demo.researchdata.se
    Updated Aug 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Carolina Wannheden; Marta Roczniewska; Henna Hasson; Klas Karlgren; Ulrica von Thiele Schwarz (2024). Data and code for: Better self-care through co-care? A latent profile analysis of primary care patients’ experiences of e-health-supported chronic care management [Dataset]. http://doi.org/10.48723/kzja-5k21
    Explore at:
    (19302), (61550), (4737), (3482), (12552), (38887), (3082)Available download formats
    Dataset updated
    Aug 19, 2024
    Dataset provided by
    Karolinska Institutet
    Authors
    Carolina Wannheden; Marta Roczniewska; Henna Hasson; Klas Karlgren; Ulrica von Thiele Schwarz
    Time period covered
    Oct 2018 - Jun 2019
    Area covered
    Stockholm County
    Description

    This data description contains code (written in the R programming language), as well as processed data and results presented in a research article (see references). No raw data are provided and the data that are made available cannot be linked to study participants. The sample consists of 180 of 308 eligible participants (adult primary care patients in Sweden, living with chronic illness) who responded to a Swedish web-based questionnaire at two time points. Using a confirmatory factor analysis, we calculated latent factor scores for 9 constructs, based on 34 questionnaire items. In this dataset, we share the latent factor scores and the latent profile analysis results. Although raw data are not shared, we provide the questionnaire item, including response scales. The code that was used to produce the latent factor scores and latent profile analysis results is also provided.

    The study was performed as part of a research project exploring how the use of eHealth services in chronic care influence interaction and collaboration between patients and healthcare. The purpose of the study was to identify subgroups of primary care patients who are similar with respect to their experiences of co-care, as measured by the DoCCA scale (von Thiele Schwarz, 2021). Baseline data were collected after patients had been introduced to an eHealth service that aimed to support them in their self-care and digital communication with healthcare; follow-up data were collected 7 months later. All patients were treated at the same primary care center, located in the Stockholm Region in Sweden.

    Cited reference: von Thiele Schwarz U, Roczniewska M, Pukk Härenstam K, Karlgren K, Hasson H, Menczel S, Wannheden C. The work of having a chronic condition: Development and psychometric evaluation of the Distribution of Co-Care Activities (DoCCA) Scale. BMC Health Services Research (2021) 21:480. doi: 10.1186/s12913-021-06455-8

    The DATASET consists of two files: factorscores_docca.csv and latent-profile-analysis-results_docca.csv.

    • factorscores_docca.csv: This file contains 18 variables (columns) and 180 cases (rows). The variables represent latent factors (measured at two time points, T1 and T2) and the values are latent factor scores. The questionnaire data that were used to produce the latent factor scores consist of 20 items that measure experiences of collaboration with healthcare, based on the DoCCA scale. These items were included in the latent profile analysis. Additionally, latent factor scores reflecting perceived self-efficacy in self-care (6 items), satisfaction with healthcare (2 items), self-rated health (2 items), and perceived impact of e-health (4 items) were calculated. These items were used to make comparisons between profiles resulting from the latent profile analysis. Variable definitions are provided in a separate file (see below).

    • latent-profile-analysis-results_docca.csv: This file contains 14 variables (columns) and 180 cases (rows). The variables represent profile classifications (numbers and labels) and posterior classification probabilities for each of the identified profiles, 4 profiles at T1 and 5 profiles at T2. Transition probabilities (from T1 to T2 profiles) were not calculated due to lacking configural similarity of profiles at T1 and T2; hence no transition probabilities are provided.

    The ASSOCIATED DOCUMENTATION consists of one file with variable definitions in English and Swedish, and four script files (written in the R programming language):

    • variable-definitions_swe-eng.xlsx: This file consists of four sheets. Sheet 1 (scale-items_original_swedish) specifies the questionnaire items (in Swedish) that were used to calculate the latent factor scores; response scales are included. Sheet 2 (scale-items_translated_english) provides an English translation of the questionnaire items and response scales provided in Sheet 1. Sheet 3 (factorscores_docca) defines the variables in the factorscores_docca.csv dataset. Sheet 4 (latent-profile-analysis-results) defines the variables in the latent-profile-analysis-results_docca.csv dataset.

    • R-script_Step-0_Factor-scores.R: R script file with the code that was used to calculate the latent factor scores. This script can only be run with access to the raw data file which is not publicly shared due to ethical constraints. Hence, the purpose of the script file is code transparency. Also, the script shows the model specification that was used in the confirmatory factor analysis (CFA). Missingness in data was accounted for by using Full Information Maximum Likelihood (FIML).

    • R-script_Step-1_Latent-profile-analysis.R: R script file with the code that was used to run the latent profile analyses at T1 and T2 and produce profile plots. This code can be run with the provided dataset factorscores_docca.csv. Note that the script generates the results that are provided in the latent-profile-analysis-results_docca.csv dataset.

    • R-script_Step-2_Non-parametric-tests.R: R script file with the code that was used to run non-parametric tests for comparing exogenous variables between profiles at T1 and T2. This script uses the following datasets: factorscores_docca.csv and latent-profile-analysis-results_docca.csv.

    • R-script_Step-3_Class-transitions.R: R script file with the code that was used to create a sankey diagram for illustrating class transitions. This script uses the following dataset: latent-profile-analysis-results_docca.csv.

    Software requirements: To run the code, the R software environment and R packages specified in the script files need to be installed (open source). The scripts were produced in R version 4.2.1.

  5. n

    Data for: A modified Michaelis-Menten equation estimates growth from birth...

    • data.niaid.nih.gov
    • dataone.org
    • +1more
    zip
    Updated Jan 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Catherine Ley; William Walters (2024). Data for: A modified Michaelis-Menten equation estimates growth from birth to 3 years in healthy babies in the US [Dataset]. http://doi.org/10.5061/dryad.4j0zpc8jf
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 22, 2024
    Dataset provided by
    Max Planck Institute for Biology
    Stanford University School of Medicine
    Authors
    Catherine Ley; William Walters
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description

    Background: Standard pediatric growth curves cannot be used to impute missing height or weight measurements in individual children. The Michaelis-Menten equation, used for characterizing substrate-enzyme saturation curves, has been shown to model growth in many organisms including nonhuman vertebrates. We investigated whether this equation could be used to interpolate missing growth data in children in the first three years of life and compared this interpolation to several common interpolation methods and pediatric growth models. Methods: We developed a modified Michaelis-Menten equation and compared expected to actual growth, first in a local birth cohort (N=97) and then in a large, outpatient, pediatric sample (N=14,695). Results: The modified Michaelis-Menten equation showed excellent fit for both infant weight (median RMSE: boys: 0.22kg [IQR:0.19; 90%<0.43]; girls: 0.20kg [IQR:0.17; 90%<0.39]) and height (median RMSE: boys: 0.93cm [IQR:0.53; 90%<1.0]; girls: 0.91cm [IQR:0.50;90%<1.0]). Growth data were modeled accurately with as few as four values from routine well-baby visits in year 1 and seven values in years 1-3; birth weight or length was essential for best fit. Interpolation with this equation had comparable (for weight) or lower (for height) mean RMSE compared to the best-performing alternative models. Conclusions: A modified Michaelis-Menten equation accurately describes growth in healthy babies aged 0–36 months, allowing interpolation of missing weight and height values in individual longitudinal measurement series. The growth pattern in healthy babies in resource-rich environments mirrors an enzymatic saturation curve. Methods Sources of data: Information on infants was ascertained from two sources: the STORK birth cohort and the STARR research registry. (1) Detailed methods for the STORK birth cohort have been described previously. In brief, a multiethnic cohort of mothers and babies was followed from the second trimester of pregnancy to the babies’ third birthday. Healthy women aged 18–42 years with a single-fetus pregnancy were enrolled. Households were visited every four months until the baby’s third birthday (nine baby visits), with the weight of the baby at each visit recorded in pounds. Medical charts were abstracted for birth weight and length. (2) STARR (starr.stanford.edu) contains electronic medical record information from all pediatric and adult patients seen at Stanford Health Care (Stanford, CA). STARR staff provided anonymized information (weight, height and age in days for each visit through age three years; sex; race/ethnicity) for all babies during the period 03/2013–01/2022 followed from birth to at least 36 months of age with at least five well-baby care visits over the first year of life.
    Inclusion of data for modeling: All observed weight and height values were evaluated in kilograms (kg) and centimeters (cm), respectively. Any values assessed beyond 1,125 days (roughly 36 months) and values for height and weight deemed implausible by at least two reviewers (e.g., significant losses in height, or marked outliers for weight and height) were excluded from the analysis. Additionally, weights assessed between birth and 19 days were excluded. At least five observations across the 36-month period were required: babies with fewer than five weight or height values after the previous criteria were excluded from analyses. Model: We developed our weight model using values from STORK babies and then replicated it with values from the STARR babies. Height models were evaluated in STARR babies only because STORK data on height were scant. The Michaelis-Menten equation is described as follows: v = Vmax ([S]/(Km + [S]) , where v is the rate of product formation, Vmax is the maximum rate of the system, [S] is the substrate concentration, and Km is a constant based upon the enzyme’s affinity for the particular substrate. For this study the equation became: P = a1 (Age/(b1+ Age)) + c1, where P was the predicted value of weight (kg) or height (cm), Age was the age of the infant in days, and c1 was an additional constant over the original Michaelis-Menten equation that accounted for the infant’s non-zero weight or length at birth. Each of the parameters a1, b1 and c1 was unique to each child and was calculated using the nonlinear least squares (nls) method. In our case, weight data were fitted to a model using the statistical language R, by calling the formula nls() with the following parameters: fitted_model <-nls(weights~(c1+(a1*ages)/(b1+ages)), start = list(a1 = 5, b1 = 20, c1=2.5)), where weights and ages were vectors of each subject’s weight in kg and age in days. The default Gauss-Newton algorithm was used. The optimization objective is not convex in the parameters and can suffer from local optima and boundary conditions. In such cases good starting values are essential: the starting parameter values (a1=5, b1=20, c1=2.5) were adjusted manually using the STORK dataset to minimize model failures; these tended to occur when the parameter values, particularly a1 and b1, increased without bound during the iterative steps required to optimize the model. These same parameter values were used for the larger STARR dataset. The starting height parameter values for height modeling were higher than those for weight modeling, due to the different units involved (cm vs. kg) (a1=60, b1=530, c1=50). Because this was a non-linear model, goodness of fit was assessed primarily via root mean squared error (RMSE) for both weight and height. Imputation tests: To test for the influence of specific time points on the models, we limited our analysis to STARR babies with all recommended well-baby visits (12 over three years). Each scheduled visit except day 1 occurred in a time window around the expected well-baby visit (Visit1: Day 1, Visit2: days 20–44, Visit3: 46–90, Visit4: 95–148, Visit5: 158–225, Visit6: 250–298, Visit7: 310–399, Visit8: 410–490, Visit9: 500–600, Visit10: 640–800, Visit11: 842–982, Visit12: 1024–1125). We considered two different sets: infants with all scheduled visits in the first year of life (seven total visits) and those with all scheduled visits over the full three-year timeframe (12 total visits). We fit these two sets to the model, identifying baseline RMSE. Then, every visit, and every combination of two to five visits were dropped, so that the RMSE or model failures for a combination of visits could be compared to baseline. Prediction: We sought to predict weight or height at 36 months (Y3) from growth measures assessed only up to 12 months (Y1) or to 24 months (Y1+Y2), utilizing the “last value” approach. In brief, the last observation for each child (here, growth measures at 36 months) is used to assess overall model fit, by focusing on how accurately the model can extrapolate the measure at this time point. We identified all STARR infants with at least five time points in Y1 and at least two time points in both Y2 and Y3, with the selection of these time points based on maximizing the number of later time points within the constraints of the well-baby visit schedule for Y2 and Y3. The per-subject set of time points (Y1-Y3) was fitted using the modified Michaelis-Menten equation and the mean squared error was calculated, acting as the “baseline” error. The model was then run on the subset of Y1 only and of Y1+Y2 only. To test predictive accuracy of these subsets, the RMSE was calculated using the actual weights or heights versus the predicted weights or heights of the three time series. Comparison with other models: We examined how well the modified Michaelis-Menten equation performed interpolation in STARR babies compared to ten other commonly used interpolation methods and pediatric growth models including: (1) the ‘last observation carried forward’ model; (2) the linear model; (3) the robust linear model (RLM method, base R MASS package); (4) the Laird and Ware linear model (LWMOD method); (5) the generalized additive model (GAM method); (6) locally estimated scatterplot smoothing (LOESS method, base R stats package); (7) the smooth spline model (smooth.spline method, base R stats package); (8) the multilevel spline model (Wand method); (9) the SITAR (superimposition by translation and rotation) model and (10) fast covariance estimation (FACE method). Model fit used the holdout approach: a single datapoint (other than birth weight or birth length) was randomly removed from each subject, and the RMSE of the removed datapoint was calculated as the model fitted to the remaining data. The hbgd package was used to fit all models except the ‘last observation carried forward’ model, the linear model and the SITAR model. For the ‘last observation carried forward’ model, the holdout data point was interpolated by the last observation by converting the random holdout value to NA and then using the function na.locf() from the zoo R package. For the simple linear model, the holdout-filtered data were used to determine the slope and intercept via R’s lm() function, which were then used to calculate the holdout value. For the SITAR model, each subject was fitted by calling the sitar() function with df=2 to minimize failures, and the RMSE of the random holdout point was subsequently calculated with the predict() function. For this analysis, set.seed(1234) was used to initialize the pseudorandom generator.

  6. E

    Buoy Telemetry: Automated Quality Control

    • pricaimcit.services.brown.edu
    Updated May 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rhode Island Data Discovery Center (2022). Buoy Telemetry: Automated Quality Control [Dataset]. https://pricaimcit.services.brown.edu/erddap/info/buoy_telemetry_0ffe_2dc0_916e/index.html
    Explore at:
    Dataset updated
    May 11, 2022
    Dataset authored and provided by
    Rhode Island Data Discovery Center
    Time period covered
    Dec 4, 2024
    Area covered
    Variables measured
    FDOM, time, AirTemp, latitude, O2Surface, longitude, pHSurface, PARSurface, AirPressure, FDOMDespike, and 79 more
    Description

    The near real-time data presented here is intended to provide a 'snapshot' of current conditions within Narragansett Bay and has been subjected to automated QC pipelines. QA of data is performed following manufacturer guidelines for calibration and servicing of each sensor. QC'd datasets that have undergone additional manual inspection by researchers is provided in a 3 month lagging interval. Following the publication of human QC'd data, automated QC'd data from the previous 3 month window will be removed. See the 'Buoy Telemetry: Manually Quality Controlled' dataset for the full quality controlled dataset.The Automated QC of measurements collected from buoy platforms is performed following guidelines established by the Ocean Observatories Initiative (https://oceanobservatories.org/quality-control/) and implemented in R. Spike Test: To identify spikes within collected measurements, data points are assessed for deviation against a 'reference' window of measurement generated in a sliding window (k=7) . If a data point exceeds the deviation threshold (N=2), the spike is replaced with the 'reference' data point, which is determined using a median smoothing approach in the R package 'oce'. Despiked data is then written into the instrument table as 'Measurement_Despike'. Global Range Test: Data points are checked against the maximum and minimum measurements using a dataset of global measurements provided by IOOC (https://github.com/oceanobservatories/qc-lookup). QC Flags from global range tests are stored in the instrument table as 'Measurement_Global_Range_QC'. QC Flags: Measurement within global threshold= 0, Below minimum global threshold =1, Above maximum global threshold =2. Local Range Test: Data point values are checked against historical seasonal ranges for each parameter, using data provided by URI GSO's Narragansett Bay Long-Term Plankton Time Series (https://web.uri.edu/gso/research/plankton/). QC Flags from local range tests are stored in the instrument table as 'Measurement_Local_Range_QC'. Local Range QC Flags: Measurement within local seasonal threshold= 0, Below minimum local seasonal threshold =1, Above maximum local seasonal threshold =2. Stuck Value Test: To identify potential stuck values from a sensor, each data point is compared to subsequent values using sliding 3 and 5 frame windows. QC Flags from stuck value tests are stored in the instrument table as 'Measurement_Stuck_Value_QC' QC Flags: No stuck value detected= 0, Suspect Stuck Sensor (3 consecutive identical values) =1, Stuck Sensor (5 consecutive identical values) =2. Instrument Range Test: Data point values for meterological measurements are checked against the manufacturer's specified measurement ranges. QC Flags: Measurement within instrument range= 0, Measurement below instrument range =1, Measurement above instrument range =2. cdm_data_type=Other Conventions=COARDS, CF-1.6, ACDD-1.3 Easternmost_Easting=0.0 geospatial_lat_max=0.0 geospatial_lat_min=0.0 geospatial_lat_units=degrees_north geospatial_lon_max=0.0 geospatial_lon_min=0.0 geospatial_lon_units=degrees_east infoUrl=riddc.brown.edu institution=Rhode Island Data Discovery Center keywords_vocabulary=GCMD Science Keywords Northernmost_Northing=0.0 sourceUrl=(local files) Southernmost_Northing=0.0 standard_name_vocabulary=CF Standard Name Table v55 subsetVariables=station_name testOutOfDate=now-26days time_coverage_end=2024-12-04T06:00Z time_coverage_start=2024-12-04T05:40Z Westernmost_Easting=0.0

  7. o

    Status of River Breakages Along Juba and Shabelle Rivers - Issued February...

    • sodma-dev.okfn.org
    Updated Jun 4, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    sodma-dev.okfn.org (2025). Status of River Breakages Along Juba and Shabelle Rivers - Issued February 2022 - Dataset - SODMA Open Data Portal [Dataset]. https://sodma-dev.okfn.org/dataset/status-of-river-breakages-along-juba-and-shabelle-rivers-issued-february-20221
    Explore at:
    Dataset updated
    Jun 4, 2025
    Dataset provided by
    Somali Disaster Management Agencyhttps://sodma.gov.so/
    Description

    Three consecutive poor rainy seasons within the Juba and Shabelle River basins inside Somalia and the Ethiopian highlands have led to the current serious hydrological drought along the two rivers. The river levels in the upper sections are currently at their historical minimum, while the mid and lower sections of the Shabelle River having run dry. With no rains expected in February and most of March, the river flow will continue to decline. The reduced river flow along the two rivers has negatively impacted agriculture production, domestic and livestock water use. This has also led to an increase of new river breakages as the riverine communities attempt to extract the limited resource to support livelihood activities.\r \r SWALIM has completed analysis and mapping of the river breakages along the two rivers using very high resolution satellite images acquired thanks to the kind contribution of the Embassy of France. The study has identified 101 open points along the Shabelle, out of which 24 points are new and the rest have remained open since the last survey in August 2021. Along the Juba River, 35 open points were identified out of which 5 are new points. During this drought period, it is expected that the riverine communities will continue to extract water from the rivers by breaching the banks and this will only see an increase of the open river bank points.\r \r Several other weak points which are not necessarily open but have the potential to overflow or break were identified during the analysis. The summary of the different types of points mapped.

  8. d

    Health and Retirement Study (HRS)

    • search.dataone.org
    Updated Nov 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Damico, Anthony (2023). Health and Retirement Study (HRS) [Dataset]. http://doi.org/10.7910/DVN/ELEKOY
    Explore at:
    Dataset updated
    Nov 21, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Damico, Anthony
    Description

    analyze the health and retirement study (hrs) with r the hrs is the one and only longitudinal survey of american seniors. with a panel starting its third decade, the current pool of respondents includes older folks who have been interviewed every two years as far back as 1992. unlike cross-sectional or shorter panel surveys, respondents keep responding until, well, death d o us part. paid for by the national institute on aging and administered by the university of michigan's institute for social research, if you apply for an interviewer job with them, i hope you like werther's original. figuring out how to analyze this data set might trigger your fight-or-flight synapses if you just start clicking arou nd on michigan's website. instead, read pages numbered 10-17 (pdf pages 12-19) of this introduction pdf and don't touch the data until you understand figure a-3 on that last page. if you start enjoying yourself, here's the whole book. after that, it's time to register for access to the (free) data. keep your username and password handy, you'll need it for the top of the download automation r script. next, look at this data flowchart to get an idea of why the data download page is such a righteous jungle. but wait, good news: umich recently farmed out its data management to the rand corporation, who promptly constructed a giant consolidated file with one record per respondent across the whole panel. oh so beautiful. the rand hrs files make much of the older data and syntax examples obsolete, so when you come across stuff like instructions on how to merge years, you can happily ignore them - rand has done it for you. the health and retirement study only includes noninstitutionalized adults when new respondents get added to the panel (as they were in 1992, 1993, 1998, 2004, and 2010) but once they're in, they're in - respondents have a weight of zero for interview waves when they were nursing home residents; but they're still responding and will continue to contribute to your statistics so long as you're generalizing about a population from a previous wave (for example: it's possible to compute "among all americans who were 50+ years old in 1998, x% lived in nursing homes by 2010"). my source for that 411? page 13 of the design doc. wicked. this new github repository contains five scripts: 1992 - 2010 download HRS microdata.R loop through every year and every file, download, then unzip everything in one big party impor t longitudinal RAND contributed files.R create a SQLite database (.db) on the local disk load the rand, rand-cams, and both rand-family files into the database (.db) in chunks (to prevent overloading ram) longitudinal RAND - analysis examples.R connect to the sql database created by the 'import longitudinal RAND contributed files' program create tw o database-backed complex sample survey object, using a taylor-series linearization design perform a mountain of analysis examples with wave weights from two different points in the panel import example HRS file.R load a fixed-width file using only the sas importation script directly into ram with < a href="http://blog.revolutionanalytics.com/2012/07/importing-public-data-with-sas-instructions-into-r.html">SAScii parse through the IF block at the bottom of the sas importation script, blank out a number of variables save the file as an R data file (.rda) for fast loading later replicate 2002 regression.R connect to the sql database created by the 'import longitudinal RAND contributed files' program create a database-backed complex sample survey object, using a taylor-series linearization design exactly match the final regression shown in this document provided by analysts at RAND as an update of the regression on pdf page B76 of this document . click here to view these five scripts for more detail about the health and retirement study (hrs), visit: michigan's hrs homepage rand's hrs homepage the hrs wikipedia page a running list of publications using hrs notes: exemplary work making it this far. as a reward, here's the detailed codebook for the main rand hrs file. note that rand also creates 'flat files' for every survey wave, but really, most every analysis you c an think of is possible using just the four files imported with the rand importation script above. if you must work with the non-rand files, there's an example of how to import a single hrs (umich-created) file, but if you wish to import more than one, you'll have to write some for loops yourself. confidential to sas, spss, stata, and sudaan users: a tidal wave is coming. you can get water up your nose and be dragged out to sea, or you can grab a surf board. time to transition to r. :D

  9. d

    Data from: Light quality and spatial variability influences on seedling...

    • search.dataone.org
    • data.niaid.nih.gov
    • +1more
    Updated May 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rebecca Ostertag; Jodie Rosam; Laura Warman; Ryan Perroy; Susan Cordell (2024). Light quality and spatial variability influences on seedling regeneration in Hawaiian lowland wet forests [Dataset]. http://doi.org/10.5061/dryad.x95x69pt4
    Explore at:
    Dataset updated
    May 25, 2024
    Dataset provided by
    Dryad Digital Repository
    Authors
    Rebecca Ostertag; Jodie Rosam; Laura Warman; Ryan Perroy; Susan Cordell
    Description

    Tropical forest understories tend to be light-limited. The red-to-far-red ratio (R:FR) is a useful and reliable index of light quality and its spatial variability can influence competition between native and non-native seedlings. While percent light transmittance has been quantified in some Hawaiian lowland wet forests (HLWF), no information exists on how the spatial distribution of understory light varies in relation to species invasion, or if patterns of seedling regeneration and light are linked. We measured the R:FR of light in the understory to assess light quality in three HLWF forest types: native-dominated, partially-invaded, and Psidium cattleyanum- (strawberry guava) dominated to quantify light quality in the understory (0-50 cm height). We also identified relationships between light quality and native and non-native seedling presence, diversity, and abundance. Together, these data can help to determine the restoration potential of HLWF. Linear mixed-effect modeling showed tha..., STUDY SITES See Table 1 of corresponding publication for list of the 9 lowland wet forest sites.  LIGHT QUALITY DATA COLLECTION AND SPATIAL PATTERNS  At each site (n=9), we established a 20 x 80 m plot with an internal grid system measuring 5 x 5 m. This led to 85 points for data collection per site. Light quality measurements taken with a SKR 110 Red / Far-red sensor (SKYE Instruments, London, UK), at five height categories. In a few cases a fallen log or other barrier precluded sampling at a given point/height combination (n = 20 missing points). The collected R:FR light data were assigned local coordinates (x,y,z origin for each plot arbitrarily set at 100/100/0m) and imported into a GIS software program (ArcGIS Pro 3.2, ESRI, Redlands, CA). See corresponding publication for more details  SEEDLING PATTERNS  All woody seedlings were identified to species, and fern and grass species were grouped together as native or non-native morphospecies. See corresponding publication for deta..., , # Reference Information

    Provenance for this README

    • File name: README_Dataset-LightQualitySeedlingRegeneration_v0.1.0.txt
    • Authors: Rebecca Ostertag
    • Other contributors: Jodie Rosam, Laura Warman, Ryan Perroy, Susan Cordell
    • Date created: 2024-05-21
    • Date modified: 2024-05-22

    Dataset Version and Release History

    • Current Version:
    • Embargo Provenance: n/a
      • Scope of embargo: n/a
      • Embargo period: n/a

    Dataset Attribution and Usage

    • Dataset Title: Data for the article "Light quality and spatial variability influences on seedling regeneration in Hawaiian lowland wet forests"
    • Persistent Identifier: https://doi.org/10.5061/dryad.x95x69pt4
    • Dataset Contributors:
      • Creators: Jodie R Rosam, Laura Warma...
  10. Fractional Abundance Datasets for Salt Patches and Marshes Across the...

    • zenodo.org
    tiff
    Updated Jul 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Manan Sarupria; Manan Sarupria; Pinki Mondal; Pinki Mondal; Rodrigo Vargas; Rodrigo Vargas; Matthew Walter; Matthew Walter; Jarrod Miller; Jarrod Miller (2025). Fractional Abundance Datasets for Salt Patches and Marshes Across the Delmarva Peninsula, v2 [Dataset]. http://doi.org/10.5281/zenodo.15866496
    Explore at:
    tiffAvailable download formats
    Dataset updated
    Jul 11, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Manan Sarupria; Manan Sarupria; Pinki Mondal; Pinki Mondal; Rodrigo Vargas; Rodrigo Vargas; Matthew Walter; Matthew Walter; Jarrod Miller; Jarrod Miller
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Delmarva Peninsula
    Description

    Abstract:

    Coastal agricultural lands in the eastern USA are increasingly plagued by escalating soil salinity, rendering them unsuitable for profitable farming. Saltwater intrusion into groundwater or soil salinization can lead to alterations in land cover, such as diminished plant growth, or complete land cover transformation. Two notable instances of such transformations include the conversion of farmland to marshland or to barren salt patches devoid of vegetation. However, quantifying these land cover changes across vast geographic areas poses a significant challenge due to their varying spatial granularity. To tackle this issue, a non-linear spectral unmixing approach utilizing a Random Forest (RF) algorithm was employed to quantify the fractional abundance of salt patches and marshes. Using 2022 Sentinel-2 imagery, gridded datasets for salt patches and marshes were generated across the Delmarva Peninsula (14 coastal counties in Delaware, Maryland and Virginia, USA), along with the associated uncertainty. The RF models were constructed using 100 trees and 27,437 reference data points, resulting in two sets of ten models: one for salt patches and another for marshes. Validation metrics for sub-pixel fractional abundances revealed a moderate R-squared value of 0.50 for the salt model ensemble and a high R-squared value of 0.90 for the marsh model ensemble. These models predicted a total area of 16.34 sq. km. for salt patches and 1,256.71 sq. km. for marshes. In these datasets, we only report fractional abundance values ranging from 0.4 to 1 for salt patches and 0.25 to 1 for marshes, along with the standard deviation associated with each value.


    --------------------------------------------


    This collection of gridded data layers provides fractional abundance of salt patches and marshes for the year 2022 for 14 counties in the Delmarva Peninsula in the United States of America (USA). This collection is comprised of 10 files in the form of a single band raster:

    1. Five files for Fractional abundance mean: Salt patch – Mean of per-pixel fractional abundance from an ensemble of 10 RF models. Only pixels with salt patch fraction ≥ 0.40 were retained in this layer.

    2. Five files for Fractional abundance mean: Marsh – Mean of per-pixel fractional abundance from an ensemble of 10 RF models. Only pixels with marsh fraction ≥ 0.25 were retained in this layer.

    Input Data:

    This approach integrated Sentinel-2 Level 2 A surface reflectance imagery (June, July, and August - 2022), a global land use/land cover dataset from ESRI (Karra et al., 2021), a NAIP-derived Delmarva land cover dataset (Mondal et al., 2022), high-resolution PlanetScope true color images (Planet Team, 2017), very high-resolution Unoccupied Aerial Vehicle (UAV) imagery, and ground truth data.

    We derived several spectral indices (see table below) from the Sentinel-2 Level 2 A bands and then used those as inputs into a Random Forest (RF) classifier in python.

    Method:

    The research utilized Sentinel-2 Level 2 A surface reflectance imagery for spectral unmixing. This multispectral dataset, corrected for atmospheric and radiometric effects, encompasses 13 spectral bands from visible to near-infrared wavelengths (0.443–2.190 micrometers). The imagery offers spatial resolutions ranging from 10 m to 60 m and is captured every 5 days. To aid in selecting reference points for model training and testing, high-resolution (60 cm) UAV images of specific farmlands in Dorchester and Somerset counties, Maryland, were acquired under optimal weather conditions.

    The study incorporated multiple datasets to refine the analysis. The Sentinel-2 derived global land use/land cover dataset from ESRI was employed to isolate relevant land cover classes such as 'Crops' and 'Rangeland'. A NAIP-derived Delmarva land cover dataset with eight classes helped exclude non-agricultural land cover types. High-resolution PlanetScope true color images with 3 m spatial resolution were used as reference data for model validation.

    A composite image was generated from Sentinel-2 Level 2 A images using a maximum Normalized Difference Vegetation Index (NDVI) filter. This composite was created from Sentinel-2 images captured between June 1 and August 30, 2022, retaining pixels with the highest NDVI values. This approach effectively highlighted areas of reduced crop cover due to high salinity levels, even during peak growing season. Cloud masking was performed using Sentinel-2 cloud probability imagery, applying a 20% threshold for maximum cloud probability. The pre-processing of Sentinel-2 imagery was conducted on Google Earth Engine (GEE), a cloud-based geospatial data processing platform.

    NDVI = (Near infrared – Red) / (Near infrared + Red)

    The NDVI maximum composite incorporated seven original Sentinel-2 bands (R, G, B, Red-Edge 1 & 2, NIR, SWIR) and five additional indices. These indices included the Enhanced Vegetation Index (EVI), Moisture Stress Index (MSI), and Modified Soil Adjusted Vegetation Index (MSAVI). Furthermore, two new indices were developed for this study: the Normalized Difference Salt Patch Index (NDSPI) and Modified Salt Patch Index (MSPI). These novel indices were designed to enhance the spectral separability between salt patches and bare soil, maximizing the difference in values between these two land cover types.

    Spectral Index

    Equation

    EVI: Enhanced Vegetation Index

    2.5 × ((NIR - RED)) / ((NIR + 6 × RED – 7.5 × BLUE + 1) )

    MSAVI: Modified soil-adjusted vegetation index

    (2 × NIR + 1 - √(((2 × NIR + 1)^2 – 8 × (NIR - RED)) )) /2

    MSI: Moisture Stress Index

    SWIR / NIR

    NDSPI: Normalized Difference Salt Patch Index

    (SWIR - B) / (SWIR + B)

    MSPI: Modified Salt Patch Index

    (R + G + B + NIR - SWIR) / (R + G + B + NIR + SWIR)

    For the training process, we identified five common endmembers: salt patch, bare soil, crop, water, and marsh, which were present in and around the selected farmlands. Reference points for bare soil were defined as pixels of soil in farmlands that did not contain salt patches or crops. For salt, reference points were identified as pixels representing salt patches with little to no vegetation. These reference points were gathered using Sentinel-2 imagery, primarily captured on June 29, 2022, and were supplemented by additional UAV imagery from various dates. Farm locations were chosen based on the visibility of significant salt patches, with the imagery dates being as close as possible to the UAV flight dates. Additional ground truth data for land cover was collected during the summer of 2022 to enhance the remotely gathered points. In total, 27,437 reference points were collected for model training and testing: 239 for salt, 1,096 for bare soil, 5,198 for crops, 20,131 for water, and 773 for marsh. Out of these reference points, 142 (69 for salt, 23 for bare soil, and 50 for crops) were collected during field visits; the remainder was obtained digitally with visual support from PlanetLabs data.

    In this study, we applied a Random Forest (RF) classifier for nonlinear spectral unmixing. The RF classifier functions by utilizing an ensemble of decision trees that are independently trained on random subsets of training data through bootstrap aggregation. The final classification is determined by aggregating votes from all trees, with the endmember receiving the highest total votes being selected as the final output. To access soft voting information from the RF classifier, we used its probability prediction function called ‘predict_proba’. This function enables each decision tree to produce a probability distribution for each endmember instead of making a single class decision. The probability distribution from a decision tree indicates how likely it is that an input pixel belongs to each endmember. The final predicted probabilities are calculated by averaging these distributions across all decision trees for each of the five endmembers. As a result, each pixel in the final output is represented by five probability values that indicate the fractional abundance of each corresponding endmember within that pixel. These probabilities sum to one, effectively illustrating the spectral unmixing of a mixed pixel. A pixel value of 0 signifies the absence of a specific endmember, while a value of 1 indicates a pure pixel. Values between 0 and 1 reflect varying levels of mixed endmembers.

    The RF model used for salt patch unmixing included a total of 4,302 reference points: 239 for salt, 1,195 for crops, and 956 points each for bare soil, water, and marshes. The RF model for marsh unmixing utilized a total of 27,437 reference points: 239 for salt patches, 5,198 for crops, 1,096 for bare soil, 20,131 for water, and 773 for marshes. For both models, the input data was divided into 80% for training purposes and 20% for testing.

    Accuracy assessment:

    Visual validation of the salt patch model's predictions show low Mean Squared Error (MSE) and Mean Absolute Error (MAE) values of 0.035 and 0.059, respectively (See table below). However, the model does not explain all the variability in the data, as evidenced by the moderate R-squared value of 0.50.

    Parameter

    Salt (227 points)

    Marsh (761

  11. Data from: Public Housing Authorities

    • data.lojic.org
    • hudgis-hud.opendata.arcgis.com
    • +1more
    Updated Nov 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Housing and Urban Development (2024). Public Housing Authorities [Dataset]. https://data.lojic.org/maps/HUD::public-housing-authorities-1
    Explore at:
    Dataset updated
    Nov 12, 2024
    Dataset provided by
    United States Department of Housing and Urban Developmenthttp://www.hud.gov/
    Authors
    Department of Housing and Urban Development
    Area covered
    Description

    Public Housing was established to provide decent and safe rental housing for eligible low-income families, the elderly, and persons with disabilities. Public housing comes in all sizes and types, from scattered single family houses to high-rise apartments for elderly families. There are approximately 1.2 million households living in public housing units, managed by over 3,300 housing agencies (HAs). HUD administers Federal aid to local housing agencies (HAs) that manage the housing for low-income residents at rents they can afford. HUD furnishes technical and professional assistance in planning, developing and managing these developments. Location data for HUD-related properties and facilities are derived from HUD's enterprise geocoding service. While not all addresses are able to be geocoded and mapped to 100% accuracy, we are continuously working to improve address data quality and enhance coverage. Please consider this issue when using any datasets provided by HUD. When using this data, take note of the field titled “LVL2KX” which indicates the overall accuracy of the geocoded address using the following return codes: ‘R’ - Interpolated rooftop (high degree of accuracy, symbolized as green) ‘4’ - ZIP+4 centroid (high degree of accuracy, symbolized as green) ‘B’ - Block group centroid (medium degree of accuracy, symbolized as yellow) ‘T’ - Census tract centroid (low degree of accuracy, symbolized as red) ‘2’ - ZIP+2 centroid (low degree of accuracy, symbolized as red) ‘Z’ - ZIP5 centroid (low degree of accuracy, symbolized as red) ‘5’ - ZIP5 centroid (same as above, low degree of accuracy, symbolized as red) Null - Could not be geocoded (does not appear on the map) For the purposes of displaying the location of an address on a map only use addresses and their associated lat/long coordinates where the LVL2KX field is coded ‘R’ or ‘4’. These codes ensure that the address is displayed on the correct street segment and in the correct census block. The remaining LVL2KX codes provide a cascading indication of the most granular level geography for which an address can be confirmed. For example, if an address cannot be accurately interpolated to a rooftop (‘R’), or ZIP+4 centroid (‘4’), then the address will be mapped to the centroid of the next nearest confirmed geography: block group, tract, and so on. When performing any point-in polygon analysis it is important to note that points mapped to the centroids of larger geographies will be less likely to map accurately to the smaller geographies of the same area. For instance, a point coded as ‘5’ in the correct ZIP Code will be less likely to map to the correct block group or census tract for that address. To learn more about Public Housing visit: https://www.hud.gov/program_offices/public_indian_housing/programs/ph/, for questions about the spatial attribution of this dataset, please reach out to us at GISHelpdesk@hud.gov. Data Dictionary: DD_Public Housing Authorities Date Updated: Q1 2025

  12. w

    Appalachian Basin Play Fairway Analysis: Thermal Quality Analysis in...

    • data.wu.ac.at
    zip
    Updated Mar 6, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HarvestMaster (2018). Appalachian Basin Play Fairway Analysis: Thermal Quality Analysis in Low-Temperature Geothermal Play Fairway Analysis (GPFA-AB) CrossSections.zip [Dataset]. https://data.wu.ac.at/schema/geothermaldata_org/NTY0NDVkYTQtMWFmMi00MDM5LTk4ODYtODFjOTE4MzllNWI4
    Explore at:
    zipAvailable download formats
    Dataset updated
    Mar 6, 2018
    Dataset provided by
    HarvestMaster
    Area covered
    569054e73cc2da6fc04023c3ec2173c2f26f519a
    Description

    This collection of files are part of a larger dataset uploaded in support of Low Temperature Geothermal Play Fairway Analysis for the Appalachian Basin (GPFA-AB, DOE Project DE-EE0006726). Phase 1 of the GPFA-AB project identified potential Geothermal Play Fairways within the Appalachian basin of Pennsylvania, West Virginia and New York. This was accomplished through analysis of 4 key criteria: thermal quality, natural reservoir productivity, risk of seismicity, and heat utilization. Each of these analyses represent a distinct project task, with the fifth task encompassing combination of the 4 risks factors. Supporting data for all five tasks has been uploaded into the Geothermal Data Repository node of the National Geothermal Data System (NGDS).

    This submission comprises the data for Thermal Quality Analysis (project task 1) and includes all of the necessary shapefiles, rasters, datasets, code, and references to code repositories that were used to create the thermal resource and risk factor maps as part of the GPFA-AB project. The identified Geothermal Play Fairways are also provided with the larger dataset. Figures (.png) are provided as examples of the shapefiles and rasters. The regional standardized 1 square km grid used in the project is also provided as points (cell centers), polygons, and as a raster. Two ArcGIS toolboxes are available: 1) RegionalGridModels.tbx for creating resource and risk factor maps on the standardized grid, and 2) ThermalRiskFactorModels.tbx for use in making the thermal resource maps and cross sections. These toolboxes contain item description documentation for each model within the toolbox, and for the toolbox itself. This submission also contains three R scripts: 1) AddNewSeisFields.R to add seismic risk data to attribute tables of seismic risk, 2) StratifiedKrigingInterpolation.R for the interpolations used in the thermal resource analysis, and 3) LeaveOneOutCrossValidation.R for the cross validations used in the thermal interpolations.

    Some file descriptions make reference to various 'memos'. These are contained within the final report submitted October 16, 2015.

    Each zipped file in the submission contains an 'about' document describing the full Thermal Quality Analysis content available, along with key sources, authors, citation, use guidelines, and assumptions, with the specific file(s) contained within the .zip file highlighted.

    UPDATE: Newer version of the Thermal Quality Analysis has been added here: https://gdr.openei.org/submissions/879 (Also linked below) Newer version of the Combined Risk Factor Analysis has been added here: https://gdr.openei.org/submissions/880 (Also linked below) This is one of three associated .zip files relating to ‘favorable counties results’ within the Thermal Quality Analysis task of the Low Temperature Geothermal Play Fairway Analysis for the Appalachian Basin. This .zip file contains the Cross Section points and lines shapefiles as well as plots of results for depth to 80 degrees C and the thermal play fairway metric (PFM) on a 0 to 5 point scale. Image files (.png) are included. The cross validation results as image files (.png) for the predicted depth to 80 degrees C are included for each of the 5 cross section lines as well.

    The three associated .zip files contain raster for thermal resource predicted mean, standard error of predicted mean, and cross validation results on the county level for the 5 county maps made in the GPFA-AB project. Thermal resource cross sections made using the models in ThermalRiskFactorModels toolbox are also provided. Cross validation results are only shown for the Depth to 80 °C variable, but other variables can be made using the provided functions and cross validation data.

    Details about the selected of the favorable counties are provided in 10_GPFA-AB_SelectBestThermalResourcesCounties.pdf (Smith, 2015), contained in the final report.

    The favorable counties referenced here were selected on the basis of the thermal quality analysis portion of the project. Four counties were selected from each of the three states in the study area (New York, Pennsylvania and West Virginia), for a total of twelve. Because some counties are adjacent, there are 5 county level maps. The image MapOfBestCounties.png within the collection of images shows all twelve highlighted on one map.
    • ChauErie refers to Erie County, PA and Chautauqua County, NY.
    • FayettePreston refers to Fayette County, PA and Preston county, WV • Gilmer refers to Gilmer County, WV • KnawLinc refers to Kanawha County, WV and Lincoln County, WV • FingerLakes refers to Stuben, Tomkins, and Chemung Counties of NY and Potter and Tiaga Counties of PA

  13. o

    Data from: Where did the finch go? Insights from radio telemetry of the...

    • explore.openaire.eu
    • data.niaid.nih.gov
    • +2more
    Updated Apr 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Marc-Olivier Beausoleil; Carlos Camacho; Julio Rabadán-González; Kristen Lalla; Roxanne Richard; Paola Carrion-Avilés; Andrew P. Hendry; Rowan D. H. Barrett (2022). Where did the finch go? Insights from radio telemetry of the medium ground finch (Geospiza fortis) [Dataset]. http://doi.org/10.5061/dryad.qbzkh18kc
    Explore at:
    Dataset updated
    Apr 4, 2022
    Authors
    Marc-Olivier Beausoleil; Carlos Camacho; Julio Rabadán-González; Kristen Lalla; Roxanne Richard; Paola Carrion-Avilés; Andrew P. Hendry; Rowan D. H. Barrett
    Description

    Data from: Where did the finch go? Insights from radio telemetry of the medium ground finch (Geospiza fortis) VERSION v.00001.2022-03-31 Author Information Corresponding Investigator Name: Marc-Olivier Beausoleil Institution: McGill University Email: marc-olivier.beausoleil@mail.mcgill.ca Co-investigator 1 Name: Carlos Camacho Institution 1: Instituto Pirenaico de Ecología—CSIC Institution 2: Lund University Co-investigator 2 Name: Julio Rabadán-González Institution: Observation.org Co-investigator 3 Name: Kristen Lalla Institution: McGill University Co-investigator 4 Name: Roxanne Richard Institution: McGill University Co-investigator 5 Name: Paola Carrion-Avilés Institution: McGill University Co-investigator 6 Name: Andrew P. Hendry Institution: McGill University Co-investigator 7 Name: Rowan D. H. Barrett Institution: McGill University Date of data collection: 2019 Geographic location of data collection: El Garrapatero, Santa Cruz Island, Galápagos, Ecuador (0°41'22.9" S, 90°13'19.7" W) Funding sources that supported the collection of the data: Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery and Canada Research Chair grant (A.P.H.) NSERC Discovery Grant and Canada Research Chair grant (R.D.H.B.) NSERC Canada Graduate Scholarship, Biodiversity, Ecosystem Services and Sustainability (BESS) NSERC CREATE (M.-O.B) Fonds de recherche du Québec, Nature et technologies Scholarship (M.-O.B) Keywords behaviour, communal roosting, Geospiza fortis, habitat selection, home range, spatial ecology Recommended citation for this dataset: Beausoleil, M.-O. et al. (2022), Where did the finch go? Insights from radio telemetry of the medium ground finch (Geospiza fortis), Dryad, Dataset, https://doi.org/10.5061/dryad.qbzkh18kc ## DATA & FILE OVERVIEW 1. Description of dataset and scripts The dataset and script were developed to 1. Explore Darwin’s finch movement and space use ; 2. Ascertain data quantity and quality for a 3-week data collection period; 3. Identify limitations of radio-telemetry methods in a volcanic terrain. To fulfil these aims, we deployed VHF radio-telemetry tags on a focal sample of five medium ground finches (Geospiza fortis) on Santa Cruz in the Galápagos, Ecuador. We then estimated the home range and core area of these birds in the arid coastal zone and characterized their habitat selection patterns and movement behaviour. The scripts and data and for the R language (R Core Team 2022; R version, 4.0.4 (Lost Library Book)). 2. File List: File 1 Name: Beausoleil_2022_G.fortis.telemetry.data.RData Description: Data to generate the main figures. File 2 Name: Beausoleil_2022_Main.figures.R Description: The main script generates the main figures. Dependencies: requires "Beausoleil_2022_initialize.R" and "Beausoleil_2022_add_legend_to_empty_facet.R". File 3 Name: Beausoleil_2022_initialize.R Description: Load packages and custom functions for the analyses Dependencies: Installation of the packages File 4 Name: Beausoleil_2022_add_legend_to_empty_facet.R Description: Function to plot legend in ggplot. ## METHODOLOGICAL INFORMATION DATA-SPECIFIC INFORMATION FOR: Beausoleil_2022_G.fortis.telemetry.data.RData 1. Number of variables: See below. 2. Number of cases/rows: See below. 3. Variable List: - `elgar.hab` (6 rows): - Polygon of habitat (sf object), - geometry (a polygon or multipolygon) for the habitats, "Beach", "Inland water", "Manzanillo Forest", "Opuntia Forest", "Pacific Sea", and "Road Paved". - `data.birds` (143 rows): - Locations of all birds (sf object with a data frame) - pid (point ID) - band (band number of each bird) - PointType (the bird's activity: "Diurnal activity", "Nest", and "Roosting") - date (of the observed point), and - geometry (containing the coordinates of the points). - `razimuth.data` (219 rows): - Razimuth data.frame that was input in the model (raw data) - indiv (number of each bird) - obs_id (identification number for the point that was recorded) - utm_x (x coordinate) - utm_y (y coordinate) - azimuth (in angle) - date (of the observed point) - PointType (the bird's activity: "Diurnal activity", "Nest", and "Roosting"), - prior_r (prior for the model) - `model.matrix` (116 rows): Locations of the finches in data frame format estimated with the azimuthal telemetry model - `model.output.list` (length 5): List of the 5 MCMC models output (from razimuth package), one for each bird. 4. Missing data codes: None 5. Abbreviations used: None 6. Other relevant information: Please refer to the article. ## REFERENCES R Core Team. 2022. R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria. Movement patterns and habitat selection of animals have important implications for ecology and evolution. Darwin's finches are a classic model system for ecological a...

  14. Public Housing Buildings

    • hudgis-hud.opendata.arcgis.com
    • data.lojic.org
    • +2more
    Updated Nov 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Housing and Urban Development (2024). Public Housing Buildings [Dataset]. https://hudgis-hud.opendata.arcgis.com/datasets/public-housing-buildings-2
    Explore at:
    Dataset updated
    Nov 12, 2024
    Dataset provided by
    United States Department of Housing and Urban Developmenthttp://www.hud.gov/
    Authors
    Department of Housing and Urban Development
    Area covered
    Description

    HUD administers Federal aid to local Housing Agencies (HAs) that manage housing for low-income residents at rents they can afford. Likewise, HUD furnishes technical and professional assistance in planning, developing, and managing the buildings that comprise low-income housing developments. This dataset provides the location and resident characteristics of public housing development buildings. Location data for HUD-related properties and facilities are derived from HUD's enterprise geocoding service. While not all addresses are able to be geocoded and mapped to 100% accuracy, we are continuously working to improve address data quality and enhance coverage. Please consider this issue when using any datasets provided by HUD. When using this data, take note of the field titled “LVL2KX” which indicates the overall accuracy of the geocoded address using the following return codes: ‘R’ - Interpolated rooftop (high degree of accuracy, symbolized as green) ‘4’ - ZIP+4 centroid (high degree of accuracy, symbolized as green) ‘B’ - Block group centroid (medium degree of accuracy, symbolized as yellow) ‘T’ - Census tract centroid (low degree of accuracy, symbolized as red) ‘2’ - ZIP+2 centroid (low degree of accuracy, symbolized as red) ‘Z’ - ZIP5 centroid (low degree of accuracy, symbolized as red) ‘5’ - ZIP5 centroid (same as above, low degree of accuracy, symbolized as red) Null - Could not be geocoded (does not appear on the map) For the purposes of displaying the location of an address on a map only use addresses and their associated lat/long coordinates where the LVL2KX field is coded ‘R’ or ‘4’. These codes ensure that the address is displayed on the correct street segment and in the correct census block. The remaining LVL2KX codes provide a cascading indication of the most granular level geography for which an address can be confirmed. For example, if an address cannot be accurately interpolated to a rooftop (‘R’), or ZIP+4 centroid (‘4’), then the address will be mapped to the centroid of the next nearest confirmed geography: block group, tract, and so on. When performing any point-in polygon analysis it is important to note that points mapped to the centroids of larger geographies will be less likely to map accurately to the smaller geographies of the same area. For instance, a point coded as ‘5’ in the correct ZIP Code will be less likely to map to the correct block group or census tract for that address. In an effort to protect Personally Identifiable Information (PII), the characteristics for each building are suppressed with a -4 value when the “Number_Reported” is equal to, or less than 10. To learn more about Public Housing visit: https://www.hud.gov/program_offices/public_indian_housing/programs/ph/ Development FAQs - IMS/PIC | HUD.gov / U.S. Department of Housing and Urban Development (HUD), for questions about the spatial attribution of this dataset, please reach out to us at GISHelpdesk@hud.gov. Data Dictionary: DD_Public Housing Buildings Date Updated: Q1 2025

  15. Mountain Lion Habitat Model for NSNF Connectivity - CDFW [ds1045]

    • catalog.data.gov
    • data.cnra.ca.gov
    • +4more
    Updated Nov 27, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    California Department of Fish and Wildlife (2024). Mountain Lion Habitat Model for NSNF Connectivity - CDFW [ds1045] [Dataset]. https://catalog.data.gov/dataset/mountain-lion-habitat-model-for-nsnf-connectivity-cdfw-ds1045
    Explore at:
    Dataset updated
    Nov 27, 2024
    Dataset provided by
    California Department of Fish and Wildlifehttps://wildlife.ca.gov/
    Description

    The Maxent modeling algorithm was used to build the species distribution model at 270 m spatial resolution using species occurrence points and environmental layers as predictors (Phillips et al. 2006). Species occurrence points were primarily obtained from CNDDB (California Natural Diversity Database) and other CDFW sources, GBIF (Global Biodiversity Information Facility), PRBO (Point Blue Conservation Science) and Arctos museum databases. Vegetation, distance to water, elevation, and bioclimatic variables (Franklin et al. 2013) were used as predictor variables. The models were run at 270 m spatial resolution with five replications using cross-validation as a method of sample evaluation. Cross-validation involved the partitioning of the sample data into n subsets, fitting the models to n-1subsets, and testing the model on the one subset not used in fitting the model. Initial model runs showed that our models converged around 2,000 iterations and for this reason we ran all models with 2,500 maximum iterations. Maxent was implemented in R using the ‘dismo''package (Hijmans et al. 2011). Model evaluation was carried out using the ‘PresenceAbsence''package in R (Freeman and Moisen 2008). We used AUC as a metric to evaluate model performance. The package also computes threshold values using several accuracy metrics to translate predicted probability maps into binary suitable and unsuitable habitats. We selected the MeanProb, a threshold set based on the mean predicted probability of species occurrences. The output from Maxent are grid datasets in a multiband ‘tif''format with one band for each replication. We averaged the five replicated maps and created a mean grid for each species. The grid was then symbolized to represent low (threshold-50), medium (50-75) and high (75-100) habitat suitability, with pixel values that are below the threshold excluded. Models were reviewed by CDFW species experts; please review the use limitations.For more information see the project report at [https://nrm.dfg.ca.gov/FileHandler.ashx?DocumentID=85358].

  16. HUD Insured Multifamily Properties

    • anrgeodata.vermont.gov
    • data.lojic.org
    • +3more
    Updated Nov 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Housing and Urban Development (2024). HUD Insured Multifamily Properties [Dataset]. https://anrgeodata.vermont.gov/maps/HUD::hud-insured-multifamily-properties-1/about
    Explore at:
    Dataset updated
    Nov 12, 2024
    Dataset provided by
    United States Department of Housing and Urban Developmenthttp://www.hud.gov/
    Authors
    Department of Housing and Urban Development
    Area covered
    Description

    The FHA insured Multifamily Housing portfolio consists primarily of rental housing properties with five or more dwelling units such as apartments or town houses, but can also be nursing homes, hospitals, elderly housing, mobile home parks, retirement service centers, and occasionally vacant land. Please note that this dataset overlaps the Multifamily Properties Assisted layer. The Multifamily property locations represent the approximate location of the property. Location data for HUD-related properties and facilities are derived from HUD's enterprise geocoding service. While not all addresses are able to be geocoded and mapped to 100% accuracy, we are continuously working to improve address data quality and enhance coverage. Please consider this issue when using any datasets provided by HUD. When using this data, take note of the field titled “LVL2KX” which indicates the overall accuracy of the geocoded address using the following return codes: ‘R’ - Interpolated rooftop (high degree of accuracy, symbolized as green) ‘4’ - ZIP+4 centroid (high degree of accuracy, symbolized as green) ‘B’ - Block group centroid (medium degree of accuracy, symbolized as yellow) ‘T’ - Census tract centroid (low degree of accuracy, symbolized as red) ‘2’ - ZIP+2 centroid (low degree of accuracy, symbolized as red) ‘Z’ - ZIP5 centroid (low degree of accuracy, symbolized as red) ‘5’ - ZIP5 centroid (same as above, low degree of accuracy, symbolized as red) Null - Could not be geocoded (does not appear on the map) For the purposes of displaying the location of an address on a map only use addresses and their associated lat/long coordinates where the LVL2KX field is coded ‘R’ or ‘4’. These codes ensure that the address is displayed on the correct street segment and in the correct census block. The remaining LVL2KX codes provide a cascading indication of the most granular level geography for which an address can be confirmed. For example, if an address cannot be accurately interpolated to a rooftop (‘R’), or ZIP+4 centroid (‘4’), then the address will be mapped to the centroid of the next nearest confirmed geography: block group, tract, and so on. When performing any point-in polygon analysis it is important to note that points mapped to the centroids of larger geographies will be less likely to map accurately to the smaller geographies of the same area. For instance, a point coded as ‘5’ in the correct ZIP Code will be less likely to map to the correct block group or census tract for that address. In an effort to protect Personally Identifiable Information (PII), the characteristics for each building are suppressed with a -4 value when the “Number_Reported” is equal to, or less than 10. To learn more about HUD Insured Multifamily Properties visit: https://www.hud.gov/program_offices/housing/mfh Data Dictionary: DD_HUD Insured Multifamilly Properties Date of Coverage: 02/2025

  17. Regional and Community Vitality Index

    • open.canada.ca
    • data.urbandatacentre.ca
    • +3more
    esri rest, fgdb/gdb +7
    Updated Feb 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Natural Resources Canada (2025). Regional and Community Vitality Index [Dataset]. https://open.canada.ca/data/dataset/461123f1-1370-4709-aeda-639783ee8455
    Explore at:
    shp, pdf, mxd, xls, wms, tiff, esri rest, html, fgdb/gdbAvailable download formats
    Dataset updated
    Feb 17, 2025
    Dataset provided by
    Ministry of Natural Resources of Canadahttps://www.nrcan.gc.ca/
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Time period covered
    Jan 1, 2001 - Jan 1, 2023
    Description

    The RVI/CVI database is derived from the CanEcumene 3.0 GDB (Eddy, et. al. 2023) using a selection of socio-economic variables identified in Eddy and Dort (2011) that aim to capture the overall state of socio-economic conditions of communities as ‘human habitats’. This dataset was developed primarily for application in mapping socio-economic conditions of communities and regions for environmental and natural resource management, climate change adaptation, Impact Assessments (IAs) and Regional Assessments (RAs), and Cumulative Effects Assessment (CEA). The RVI/CVI is comprised of five sub-indicators: 1) population change, 2) age structure, 3) education levels, 4) employment levels, and 5) real estate values. Index values are based on percentile ranks of each sub-indicator, and averaged for each community, and for three ranked groups: 1) all of Canada, 2) by province, and 3) by population size. The data covers the Census periods of 2001, 2006, 2011 (NHS), 2016, and 2021. The index is mapped in two ways: 1) as ‘points’ for individual communities (CVI), and 2) as ‘rasters’ for spatial interpolation of point data (RVI). These formats provide an alternative spatial framework to conventional StatsCan CSD framework. (For more information on this approach see Eddy, et. al. 2020). ============================================================================================ Eddy, B.G., Muggridge, M., LeBlanc, R., Osmond, J., Kean, C., and Boyd, E. 2023. The CanEcumene 3.0 GIS Database. Federal Geospatial Platform (FGP), Natural Resources Canada. https://gcgeo.gc.ca/viz/index-en.html?keys=draft-3f599fcb-8d77-4dbb-8b1e-d3f27f932a4b Eddy B.G., Muggridge M, LeBlanc R, Osmond J, Kean C, Boyd E. 2020. An Ecological Approach for Mapping Socio-Economic Data in Support of Ecosystems Analysis: Examples in Mapping Canada’s Forest Ecumene. One Ecosystem 5: e55881. https://doi.org/10.3897/oneeco.5.e55881 Eddy, B.G.; Dort, A. 2011. Integrating Socio-Economic Data for Integrated Land Management (ILM): Examples from the Humber River Basin, western Newfoundland. Geomatica, Vol. 65, No. 3, p. 283-291. doi:10.5623/cig2011-044.

  18. A multi-modal human neuroimaging dataset for data integration: simultaneous...

    • openneuro.org
    Updated Dec 4, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Giulia Lioi; Claire Cury; Lorraine Perronnet; Marsel Mano; Elise Bannier; Anatole Lecuyer; Christian Barillot (2019). A multi-modal human neuroimaging dataset for data integration: simultaneous EEG and MRI acquisition during a motor imagery neurofeedback task: XP1 [Dataset]. http://doi.org/10.18112/openneuro.ds002336.v1.0.1
    Explore at:
    Dataset updated
    Dec 4, 2019
    Dataset provided by
    OpenNeurohttps://openneuro.org/
    Authors
    Giulia Lioi; Claire Cury; Lorraine Perronnet; Marsel Mano; Elise Bannier; Anatole Lecuyer; Christian Barillot
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    ———————————————————————————————— ORIGINAL PAPERS ———————————————————————————————— Mano, Marsel, Anatole Lécuyer, Elise Bannier, Lorraine Perronnet, Saman Noorzadeh, and Christian Barillot. 2017. “How to Build a Hybrid Neurofeedback Platform Combining EEG and FMRI.” Frontiers in Neuroscience 11 (140). https://doi.org/10.3389/fnins.2017.00140 Perronnet, Lorraine, L Anatole, Marsel Mano, Elise Bannier, Maureen Clerc, Christian Barillot, Lorraine Perronnet, et al. 2017. “Unimodal Versus Bimodal EEG-FMRI Neurofeedback of a Motor Imagery Task.” Frontiers in Human Neuroscience 11 (193). https://doi.org/10.3389/fnhum.2017.00193.

    This dataset named XP1 can be pull together with the dataset XP2 (DOI: 10.18112/openneuro.ds002338.v1.0.0). Data acquisition methods have been described in Perronnet et al. (2017, Frontiers in Human Neuroscience). Simultaneous 64 channels EEG and fMRI during right-hand motor imagery and neurofeedback (NF) were acquired in this study (as well as in XP2). For this study, 10 subjects performed three types of NF runs (bimodal EEG-fMRI NF, unimodal EEG-NF and fMRI-NF).

    ———————————————————————————————— EXPERIMENTAL PARADIGM ————————————————————————————————
    Subjects were instructed to perform a kinaesthetic motor imagery of the right hand and to find their own strategy to control and bring the ball to the target. The experimental protocol consisted of 6 EEG-fMRI runs with a 20s block design alternating rest and task motor localizer run (task-motorloc) - 8 blocks X (20s rest+20 s task) motor imagery run without NF (task-MIpre) -5 blocks X (20s rest+20 s task) three NF runs with different NF conditions (task-eegNF, task-fmriNF, task-eegfmriNF) occurring in random order- 10 blocks X (20s rest+20 s task) motor imagery run without NF (task-MIpost) - 5 blocks X (20s rest+20 s task)

    ———————————————————————————————— EEG DATA ———————————————————————————————— EEG data was recorded using a 64-channel MR compatible solution from Brain Products (Brain Products GmbH, Gilching, Germany).

    RAW EEG DATA

    EEG was sampled at 5kHz with FCz as the reference electrode and AFz as the ground electrode, and a resolution of 0.5 microV. Following the BIDs arborescence, raw eeg data for each task can be found for each subject in

    XP1/sub-xp1*/eeg

    in Brain Vision Recorder format (File Version 1.0). Each raw EEG recording includes three files: the data file (.eeg), the header file (.vhdr) and the marker file (*.vmrk). The header file contains information about acquisition parameters and amplifier setup. For each electrode, the impedance at the beginning of the recording is also specified. For all subjects, channel 32 is the ECG channel. The 63 other channels are EEG channels.

    The marker file contains the list of markers assigned to the EEG recordings and their properties (marker type, marker ID and position in data points). Three type of markers are relevant for the EEG processing: R128 (Response): is the fMRI volume marker to correct for the gradient artifact S 99 (Stimulus): is the protocol marker indicating the start of the Rest block S 2 (Stimulus): is the protocol marker indicating the start of the Task (Motor Execution Motor Imagery or Neurofeedback)
    Warning : in few EEG data, the first S99 marker might be missing, but can be easily “added” 20 s before the first S 2.

    PREPROCESSED EEG DATA

    Following the BIDs arborescence, processed eeg data for each task and subject in the pre-processed data folder :

    XP1/derivatives/sub-xp1*/eeg_pp/*eeg_pp.*

    and following the Brain Analyzer format. Each processed EEG recording includes three files: the data file (.dat), the header file (.vhdr) and the marker file (*.vmrk), containing information similar to those described for raw data. In the header file of preprocessed data channels location are also specified. In the marker file the location in data points of the identified heart pulse (R marker) are specified as well.

    EEG data were pre-processed using BrainVision Analyzer II Software, with the following steps: Automatic gradient artifact correction using the artifact template subtraction method (Sliding average calculation with 21 intervals for sliding average and all channels enabled for correction. Downsampling with factor: 25 (200 Hz) Low Pass FIR Filter:Cut-off Frequency: 50 Hz. Ballistocardiogram (pulse) artifact correction using a semiautomatic procedure (Pulse Template searched between 40 s and 240 s in the ECG channel with the following parameters:Coherence Trigger = 0.5, Minimal Amplitude = 0.5, Maximal Amplitude = 1.3. The identified pulses were marked with R. Segmentation relative to the first block marker (S 99) for all the length of the training protocol (las S 2 + 20 s).

    EEG NF SCORES

    Neurofeedback scores can be found in the .mat structures in

    XP1/derivatives/sub-xp1*/NF_eeg/d_sub*NFeeg_scores.mat

    Structures names NF_eeg are composed by the following subfields: ID : Subject ID, for example sub-xp101 lapC3_ERD : a 1x1280 vector of neurofeedback scores. 4 scores per secondes, for the whole session. eeg : a 64x80200 matrix, with the pre-processed EEG signals with the step described above, filtered between 8 and 30 Hz. lapC3_bandpower_8Hz_30Hz : 1x1280 vector. Bandpower of the filtered signal with a laplacian centred on C3, used to estimate the lapC3_ERD. lapC3_filter : 1x64 vector. Laplacian filter centred on C3 channel.

    ———————————————————————————————— BOLD fMRI DATA ———————————————————————————————— All DICOM files were converted to Nifti-1 and then in BIDs format (version 2.1.4) using the software dcm2niix (version v1.0.20190720 GVV7.4.0)

    fMRI acquisitions were performed using echo- planar imaging (EPI) and covering the entire brain with the following parameters

    3T Siemens Verio EPI sequence TR=2 s TE=23 ms Resolution 2x2x4 mm3 FOV = 210×210mm2 N of slices: 32 No slice gap

    As specified in the relative task event files in XP1\ *events.tsv files onset, the scanner began the EPI pulse sequence two seconds prior to the start of the protocol (first rest block), so the the first two TRs should be discarded. The useful TRs for the runs are therefore

    task-motorloc: 320 s (2 to 322) task-MIpre and task-MIpost: 200 s (2 to 202) task-eegNF, task-fmriNF, task-eegfmriNF: 400 s (2 to 402)

    In task events files for the different tasks, each column represents:

    • 'onset': onset time (sec) of an event
    • 'duration': duration (sec) of the event
    • 'trial_type': trial (block) type: rest or task (Rest, Task-ME, Task-MI, Task-NF)
    • ''stim_file’: image presented in a stimulus block: during Rest, Motor Imagery (Task-MI) or Motor execution (Task-ME) instructions were presented. On the other hand, during Neurofeedback blocks (Task-NF) the image presented was a ball moving in a square that the subject could control self-regulating his EEG and/or fMRI brain activity.

    Following the BIDs arborescence, the functional data and relative metadata are found for each subject in the following directory

    XP1/sub-xp1*/func

    BOLD-NF SCORES

    For each subject and NF session, a matlab structure with BOLD-NF features can be found in

    XP1/derivatives/sub-xp1*/NF_bold/

    In view of BOLD-NF scores computation, fMRI data were preprocessed using AutoMRI, a software based on spm8 and with the following steps: slice-time correction, spatial realignment and coregistration with the anatomical scan, spatial smoothing with a 6 mm Gaussian kernel and normalization to the Montreal Neurological Institute template For each session, a first level general linear model analysis modeling was then performed. The resulting activation maps (voxel-wise Family-Wise error corrected at p < 0.05) were used to define two ROIs (9x9x3 voxels) around the maximum of activation in the ipsilesional primary motor area (M1) and supplementary motor area (SMA) respectively.

    The BOLD-NF scores were calculated as the difference between percentage signal change in the two ROIs (SMA and M1) and a large deep background region (slice 3 out of 16) whose activity is not correlated with the NF task. A smoothed version of the NF scores over the precedent three volumes was also computed.

    The NF_boldi structure has the following structure

    NF_bold → .m1 → .nf → .smoothnf
    → .roimean (averaged BOLD signal in the ROI) → .bgmean (averaged BOLD signal in the background slice) → .method
    NFscores.fmri → .sma→ .nf → .smoothnf
    → .roimean (averaged BOLD signal in the ROI) → .bgmean (averaged BOLD signal in the background slice) → .method

    Where the subfield method contains information about the ROI size (.roisize), the background mask (.bgmask) and ROI mask (.roimask).

    More details about signal processing and NF calculation can be found in Perronnet et al. 2017 and Perronnet et al. 2018.

    ———————————————————————————————— ANATOMICAL MRI DATA ———————————————————————————————— As a structural reference for the fMRI analysis, a high resolution 3D T1 MPRAGE sequence was acquired with the following parameters

    3T Siemens Verio 3D T1 MPRAGE TR=1.9 s TE=22.6

  19. A multi-modal human neuroimaging dataset for data integration: simultaneous...

    • openneuro.org
    Updated Sep 24, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Giulia Lioi; Claire Cury; Lorraine Perronnet; Marsel Mano; Elise Bannier; Anatole Lecuyer; Christian Barillot (2020). A multi-modal human neuroimaging dataset for data integration: simultaneous EEG and fMRI acquisition during a motor imagery neurofeedback task: XP2 [Dataset]. http://doi.org/10.18112/openneuro.ds002338.v2.0.1
    Explore at:
    Dataset updated
    Sep 24, 2020
    Dataset provided by
    OpenNeurohttps://openneuro.org/
    Authors
    Giulia Lioi; Claire Cury; Lorraine Perronnet; Marsel Mano; Elise Bannier; Anatole Lecuyer; Christian Barillot
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    ———————————————————————————————— ORIGINAL PAPERS ———————————————————————————————— Lioi, G., Cury, C., Perronnet, L., Mano, M., Bannier, E., Lécuyer, A., & Barillot, C. (2019). Simultaneous MRI-EEG during a motor imagery neurofeedback task: an open access brain imaging dataset for multi-modal data integration Authors. Accepted for publication in Scientific Data. https://doi.org/https://doi.org/10.1101/862375 Mano, Marsel, Anatole Lécuyer, Elise Bannier, Lorraine Perronnet, Saman Noorzadeh, and Christian Barillot. 2017. “How to Build a Hybrid Neurofeedback Platform Combining EEG and FMRI.” Frontiers in Neuroscience 11 (140). https://doi.org/10.3389/fnins.2017.00140 Lorraine Perronnet, Anatole Lecuyer, Marsel Mano, Mathis Fleury, Giulia Lioi, Claire Cury, Maureen Clerc, Fabien Lotte, and Christian Barillot. 2018. “Learning 2-in-1 : Towards Integrated EEG-FMRI-Neurofeedback.” BioRxiv, no. 397729. https://doi.org/10.1101/397729.

    ———————————————————————————————— OVERVIEW ———————————————————————————————— This dataset XP2 can be pull together with the dataset XP1, available here : https://openneuro.org/datasets/ds002336. Data acquisition methods have been described in Perronnet et al. (2017, Frontiers in Human Neuroscience). Simultaneous 64 channel EEG and fMRI during right-hand motor imagery and neurofeedback (NF) were acquired in this study (as well as in XP1). This study involved 16 subjects randomly assigned to two groups: in a first group they performed bimodal EEG-fMRI NF with a bi-dimensional feedback metaphor, in the second group the same task was executed with a mono-dimensional feedback.

    ———————————————————————————————— EXPERIMENTAL PARADIGM ————————————————————————————————

    The experimental protocol consisted of 5 EEG-fMRI runs with a 20s block design alternating rest and task. 1 block = 20s rest + 20s task. Task description : _task-MIpre : motor imagery run without NF. 8 blocks. _task-1dNF or _task-2dNF : bimodal neurofeedback, with either a mono-dimensional neurofeedback display (mean of EEG NF and fMRI NF scores), either a bi-dimensional display (one modality per dimension). The list of subjects with 1d or 2d is given above. Each subjects had 3 runs. 8 blocks per run. _task-MIpost : motor imagery run without NF. 8 blocks. Subjects with mono-dimensional feedback display : xp201 : 1D xp202 : 1D xp203 : 1D xp206 : 1D xp211 : 1D xp218 : 1D xp219 : 1D xp220 : 1D xp222 : 1D

    Subjects with bi-dimensional feedback display : xp204 : 2D xp205 : 2D xp207 : 2D xp210: 2D xp213 : 2D xp216 : 2D xp217 : 2D xp221 : 2D

    ———————————————————————————————— EEG DATA ———————————————————————————————— EEG data was recorded using a 64-channel MR compatible solution from Brain Products (Brain Products GmbH, Gilching, Germany).

    RAW EEG DATA

    EEG was sampled at 5kHz with FCz as the reference electrode and AFz as the ground electrode, and a resolution of 0.5 microV. Following the BIDs arborescence, raw eeg data for each task can be found for each subject in

    XP2/sub-xp2*/eeg

    in Brain Vision Recorder format (File Version 1.0). Each raw EEG recording includes three files: the data file (.eeg), the header file (.vhdr) and the marker file (*.vmrk). The header file contains information about acquisition parameters and amplifier setup. For each electrode, the impedance at the beginning of the recording is also specified. For all subjects, channel 32 is the ECG channel. The 63 other channels are EEG channels.

    The marker file contains the list of markers assigned to the EEG recordings and their properties (marker type, marker ID and position in data points). Three type of markers are relevant for the EEG processing: R128 (Response): is the fMRI volume marker to correct for the gradient artifact S 99 (Stimulus): is the protocol marker indicating the start of the Rest block S 2 (Stimulus): is the protocol marker indicating the start of the Task (Motor Execution Motor Imagery or Neurofeedback)
    Warning : in few EEG data, the first S99 marker might be missing, but can be easily “added” 20 s before the first S 2.

    PREPROCESSED EEG DATA

    Following the BIDs arborescence, processed eeg data for each task can be found for each subject in

    XP2/derivatives/sub-xp2*/eeg_pp/*eeg_pp.*

    and following the Brain Analyzer format. Each processed EEG recording includes three files: the data file (.dat), the header file (.vhdr) and the marker file (*.vmrk), containing information similar to those described for raw data. In the header file of preprocessed data channels location are also specified. In the marker file the location in data points of the identified heart pulse (R marker) are specified as well.

    EEG data were pre-processed using BrainVision Analyzer II Software, with the following steps: Automatic gradient artifact correction using the artifact template subtraction method (Sliding average calculation with 21 intervals for sliding average and all channels enabled for correction. Downsampling with factor: 25 (200 Hz) Low Pass FIR Filter:Cut-off Frequency: 50 Hz. Ballistocardiogram (pulse) artifact correction using a semiautomatic procedure (Pulse Template searched between 40 s and 240 s in the ECG channel with the following parameters:Coherence Trigger = 0.5, Minimal Amplitude = 0.5, Maximal Amplitude = 1.3). A Pulse Artifact marker R was associated to each identified pulse. Segmentation relative to the first block marker (S 99) for all the length of the training protocol (las S 2 + 20 s).

    EEG-NF SCORES

    Neurofeedback scores can be found in the .mat structures in

    XP2/derivatives/sub-xp2*/NF_eeg/d_sub*NFeeg_scores.mat

    Structures names NF_eeg are composed by the following subfields: ID : Subject ID, for example sub-xp201 lapC3_ERD : a 1x1280 vector of neurofeedback scores. 4 scores per secondes, for the whole session. eeg : a 64x80200 matrix, with the pre-processed EEG signals with the step described above, filtered between 8 and 30 Hz. lapC3_bandpower_8Hz_30Hz : 1x1280 vector. Bandpower of the filtered signal with a laplacian centred on C3, used to estimate the lapC3_ERD. lapC3_filter : 1x64 vector. Laplacian filter centred above C3 channel. ———————————————————————————————— BOLD fMRI DATA ———————————————————————————————— All DICOM files were converted to Nifti-1 and then in BIDs format (version 2.1.4) using the software dcm2niix (version v1.0.20190720 GVV7.4.0)

    fMRI acquisitions were performed using echo- planar imaging (EPI) and covered the superior half of the brain with the following parameters 3T Siemens Verio EPI sequence TR=1 s TE=23 ms Resolution 2x2x4 mm N of slices: 16 No slice gap

    As specified in the relative task event files in XP2\ *events.tsv files onset, the scanner began the EPI pulse sequence two seconds prior to the start of the protocol (first rest block), so the the first two TRs should be discarded.

    The useful TRs for the runs are therefore

    -task-MIpre and task-MIpost: 320 s (2 to 302) -task-1dNF and task-2dNF: 320 s (2 to 302)

    In task events files for the different tasks, each column represents:

    • 'onset': onset time (sec) of an event
    • 'duration': duration (sec) of the event
    • 'trial_type': trial (block) type: rest or task (Rest, Task-MI, Task-NF)
    • 'stim_file': image presented in a stimulus block. During Rest or Motor Imagery (Task-MI) instructions were presented to the subject. On the other hand, during Neurofeedback blocks (Task-NF) the image presented was a ball moving in a square for the bidimensional NF (task-2dNF) or a ball moving along a gauge for the unidimensional NF (task-1dNF) that the subject could control self-regulating his EEG and fMRI brain activity.

    Following the BIDs arborescence, the functional data and relative metadata are found for each subject in the following directory

    XP2/sub-xp2*/func

    BOLD-NF SCORES

    For each subject and NF session, a matlab structure with BOLD-NF features can be found in

    XP2/derivatives/sub-xp2*/NF_bold/

    In view of BOLD-NF scores computation, fMRI data were preprocessed using AutoMRI, a software based on spm8 and with the following steps: slice-time correction, spatial realignment and coregistration with the anatomical scan, spatial smoothing with a 8 mm Gaussian kernel and normalization to the Montreal Neurological Institute template For each session, a first level general linear model analysis modeling was then performed. The resulting activation maps (voxel-wise Family-Wise error corrected at p < 0.05) were used to define two ROIs (9x9x3 voxels) around the maximum of activation in the ipsilesional primary motor area (M1) and supplementary motor area (SMA) respectively.

    The BOLD-NF scores were calculated as the difference between percentage signal change in the two ROIs (SMA and M1) and a large deep background region (slice 3 out of 16) whose activity is not correlated with the NF task. A smoothed version of the NF scores over the precedent three volumes was also computed.

    The NF_boldi structure has the following structure

    NF_bold → .m1→ .nf → .smoothnf
    → .roimean (averaged BOLD signal in the ROI) → .bgmean (averaged BOLD signal in the background slice) → .method
    NFscores.fmri → .sma→ .nf

  20. Low-Income Housing Tax Credit Properties

    • hudgis-hud.opendata.arcgis.com
    • data.lojic.org
    • +1more
    Updated Nov 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Housing and Urban Development (2024). Low-Income Housing Tax Credit Properties [Dataset]. https://hudgis-hud.opendata.arcgis.com/maps/HUD::low-income-housing-tax-credit-properties-1/about
    Explore at:
    Dataset updated
    Nov 12, 2024
    Dataset provided by
    United States Department of Housing and Urban Developmenthttp://www.hud.gov/
    Authors
    Department of Housing and Urban Development
    Area covered
    Description

    Created by the Tax Reform Act of 1986, the Low-Income Housing Tax Credit program (LIHTC) gives State and local LIHTC-allocating agencies the equivalent of nearly $8 billion in annual budget authority to issue tax credits for the acquisition, rehabilitation, or new construction of rental housing targeted to lower-income households. Although some data about the program have been made available by various sources, HUD's database is the only complete national source of information on the size, unit mix, and location of individual projects. With the continued support of the national LIHTC database, HUD hopes to enable researchers to learn more about the effects of the tax credit program.HUD has no administrative authority over the LIHTC program. IRS has authority at the federal level and it is structured so that the states truly administer the program. The LIHTC property locations depicted in this map service represent the general location of the property. The locations of individual buildings associated with each property are not depicted here. The location of the property is derived from the address of the building with the most units. Location data for HUD-related properties and facilities are derived from HUD's enterprise geocoding service. While not all addresses are able to be geocoded and mapped to 100% accuracy, we are continuously working to improve address data quality and enhance coverage. Please consider this issue when using any datasets provided by HUD. When using this data, take note of the field titled “LVL2KX” which indicates the overall accuracy of the geocoded address using the following return codes:‘R’ - Interpolated rooftop (high degree of accuracy, symbolized as green)‘4’ - ZIP+4 centroid (high degree of accuracy, symbolized as green)‘B’ - Block group centroid (medium degree of accuracy, symbolized as yellow)‘T’ - Census tract centroid (low degree of accuracy, symbolized as red)‘2’ - ZIP+2 centroid (low degree of accuracy, symbolized as red)‘Z’ - ZIP5 centroid (low degree of accuracy, symbolized as red)‘5’ - ZIP5 centroid (same as above, low degree of accuracy, symbolized as red)Null - Could not be geocoded (does not appear on the map)For the purposes of displaying the location of an address on a map only use addresses and their associated lat/long coordinates where the LVL2KX field is coded ‘R’ or ‘4’. These codes ensure that the address is displayed on the correct street segment and in the correct census block.The remaining LVL2KX codes provide a cascading indication of the most granular level geography for which an address can be confirmed. For example, if an address cannot be accurately interpolated to a rooftop (‘R’), or ZIP+4 centroid (‘4’), then the address will be mapped to the centroid of the next nearest confirmed geography: block group, tract, and so on. When performing any point-in polygon analysis it is important to note that points mapped to the centroids of larger geographies will be less likely to map accurately to the smaller geographies of the same area. For instance, a point coded as ‘5’ in the correct ZIP Code will be less likely to map to the correct block group or census tract for that address. To learn more about the Low-Income Housing Tax Credit Program visit: https://www.hud.gov/program_offices/public_indian_housing/programs/ph/, for questions about the spatial attribution of this dataset, please reach out to us at GISHelpdesk@hud.gov. Data Dictionary: DD_Low Income Tax Credit Program

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Neilsberg Research (2024). Five Points, AL Median Household Income Trends (2010-2021, in 2022 inflation-adjusted dollars) [Dataset]. https://www.neilsberg.com/research/datasets/90cd2e14-73f0-11ee-949f-3860777c1fe6/

Five Points, AL Median Household Income Trends (2010-2021, in 2022 inflation-adjusted dollars)

Explore at:
csv, jsonAvailable download formats
Dataset updated
Jan 11, 2024
Dataset authored and provided by
Neilsberg Research
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Area covered
Alabama, Five Points
Variables measured
Median Household Income, Median Household Income Year on Year Change, Median Household Income Year on Year Percent Change
Measurement technique
The data presented in this dataset is derived from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. It presents the median household income from the years 2010 to 2021 following an initial analysis and categorization of the census data. Subsequently, we adjusted these figures for inflation using the Consumer Price Index retroactive series via current methods (R-CPI-U-RS). For additional information about these estimations, please contact us via email at research@neilsberg.com
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset illustrates the median household income in Five Points, spanning the years from 2010 to 2021, with all figures adjusted to 2022 inflation-adjusted dollars. Based on the latest 2017-2021 5-Year Estimates from the American Community Survey, it displays how income varied over the last decade. The dataset can be utilized to gain insights into median household income trends and explore income variations.

Key observations:

From 2010 to 2021, the median household income for Five Points increased by $3,119 (5.39%), as per the American Community Survey estimates. In comparison, median household income for the United States increased by $4,559 (6.51%) between 2010 and 2021.

Analyzing the trend in median household income between the years 2010 and 2021, spanning 11 annual cycles, we observed that median household income, when adjusted for 2022 inflation using the Consumer Price Index retroactive series (R-CPI-U-RS), experienced growth year by year for 5 years and declined for 6 years.

https://i.neilsberg.com/ch/five-points-al-median-household-income-trend.jpeg" alt="Five Points, AL median household income trend (2010-2021, in 2022 inflation-adjusted dollars)">

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. All incomes have been adjusting for inflation and are presented in 2022-inflation-adjusted dollars.

Years for which data is available:

  • 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021

Variables / Data Columns

  • Year: This column presents the data year from 2010 to 2021
  • Median Household Income: Median household income, in 2022 inflation-adjusted dollars for the specific year
  • YOY Change($): Change in median household income between the current and the previous year, in 2022 inflation-adjusted dollars
  • YOY Change(%): Percent change in median household income between current and the previous year

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Five Points median household income. You can refer the same here

Search
Clear search
Close search
Google apps
Main menu