100+ datasets found
  1. Envestnet | Yodlee's De-Identified Consumer Purchase Data | Row/Aggregate...

    • datarade.ai
    .sql, .txt
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Envestnet | Yodlee, Envestnet | Yodlee's De-Identified Consumer Purchase Data | Row/Aggregate Level | USA Consumer Data covering 3600+ corporations | 90M+ Accounts [Dataset]. https://datarade.ai/data-products/envestnet-yodlee-s-consumer-purchase-data-row-aggregate-envestnet-yodlee
    Explore at:
    .sql, .txtAvailable download formats
    Dataset provided by
    Envestnethttp://envestnet.com/
    Yodlee
    Authors
    Envestnet | Yodlee
    Area covered
    United States of America
    Description

    Envestnet®| Yodlee®'s Consumer Purchase Data (Aggregate/Row) Panels consist of de-identified, near-real time (T+1) USA credit/debit/ACH transaction level data – offering a wide view of the consumer activity ecosystem. The underlying data is sourced from end users leveraging the aggregation portion of the Envestnet®| Yodlee®'s financial technology platform.

    Envestnet | Yodlee Consumer Panels (Aggregate/Row) include data relating to millions of transactions, including ticket size and merchant location. The dataset includes de-identified credit/debit card and bank transactions (such as a payroll deposit, account transfer, or mortgage payment). Our coverage offers insights into areas such as consumer, TMT, energy, REITs, internet, utilities, ecommerce, MBS, CMBS, equities, credit, commodities, FX, and corporate activity. We apply rigorous data science practices to deliver key KPIs daily that are focused, relevant, and ready to put into production.

    We offer free trials. Our team is available to provide support for loading, validation, sample scripts, or other services you may need to generate insights from our data.

    Investors, corporate researchers, and corporates can use our data to answer some key business questions such as: - How much are consumers spending with specific merchants/brands and how is that changing over time? - Is the share of consumer spend at a specific merchant increasing or decreasing? - How are consumers reacting to new products or services launched by merchants? - For loyal customers, how is the share of spend changing over time? - What is the company’s market share in a region for similar customers? - Is the company’s loyal user base increasing or decreasing? - Is the lifetime customer value increasing or decreasing?

    Additional Use Cases: - Use spending data to analyze sales/revenue broadly (sector-wide) or granular (company-specific). Historically, our tracked consumer spend has correlated above 85% with company-reported data from thousands of firms. Users can sort and filter by many metrics and KPIs, such as sales and transaction growth rates and online or offline transactions, as well as view customer behavior within a geographic market at a state or city level. - Reveal cohort consumer behavior to decipher long-term behavioral consumer spending shifts. Measure market share, wallet share, loyalty, consumer lifetime value, retention, demographics, and more.) - Study the effects of inflation rates via such metrics as increased total spend, ticket size, and number of transactions. - Seek out alpha-generating signals or manage your business strategically with essential, aggregated transaction and spending data analytics.

    Use Cases Categories (Our data provides an innumerable amount of use cases, and we look forward to working with new ones): 1. Market Research: Company Analysis, Company Valuation, Competitive Intelligence, Competitor Analysis, Competitor Analytics, Competitor Insights, Customer Data Enrichment, Customer Data Insights, Customer Data Intelligence, Demand Forecasting, Ecommerce Intelligence, Employee Pay Strategy, Employment Analytics, Job Income Analysis, Job Market Pricing, Marketing, Marketing Data Enrichment, Marketing Intelligence, Marketing Strategy, Payment History Analytics, Price Analysis, Pricing Analytics, Retail, Retail Analytics, Retail Intelligence, Retail POS Data Analysis, and Salary Benchmarking

    1. Investment Research: Financial Services, Hedge Funds, Investing, Mergers & Acquisitions (M&A), Stock Picking, Venture Capital (VC)

    2. Consumer Analysis: Consumer Data Enrichment, Consumer Intelligence

    3. Market Data: AnalyticsB2C Data Enrichment, Bank Data Enrichment, Behavioral Analytics, Benchmarking, Customer Insights, Customer Intelligence, Data Enhancement, Data Enrichment, Data Intelligence, Data Modeling, Ecommerce Analysis, Ecommerce Data Enrichment, Economic Analysis, Financial Data Enrichment, Financial Intelligence, Local Economic Forecasting, Location-based Analytics, Market Analysis, Market Analytics, Market Intelligence, Market Potential Analysis, Market Research, Market Share Analysis, Sales, Sales Data Enrichment, Sales Enablement, Sales Insights, Sales Intelligence, Spending Analytics, Stock Market Predictions, and Trend Analysis

  2. World Values Survey, Aggregate Data

    • thearda.com
    • osf.io
    Updated May 31, 2005
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    World Values Survey Association (WVSA) (2005). World Values Survey, Aggregate Data [Dataset]. http://doi.org/10.17605/OSF.IO/9QN4C
    Explore at:
    Dataset updated
    May 31, 2005
    Dataset provided by
    Association of Religion Data Archives
    Authors
    World Values Survey Association (WVSA)
    Dataset funded by
    Bank of Sweden Tercentennary Foundation
    The World Values Survey Association
    Description

    This file provides summary or aggregated measures for the 82 societies participating in the first four waves of the World Value Surveys. Thus, the society, rather than the individuals surveyed, are the unit of analysis.

    "The World Values Survey is a worldwide investigation of sociocultural and political change. It is conducted by a network of social scientists at leading universities all around world.

    Interviews have been carried out with nationally representative samples of the publics of more than 80 societies on all six inhabited continents. A total of four waves have been carried out since 1981 making it possible to carry out reliable global cross-cultural analyses and analysis of changes over time. The World Values Survey has produced evidence of gradual but pervasive changes in what people want out of life. Moreover, the survey shows that the basic direction of these changes is, to some extent, predictable.

    This project is being carried out by an international network of social scientists, with local funding for each survey (though in some cases, it has been possible to raise supplementary funds from outside sources). In exchange for providing the data from interviews with a representative national sample of at least 1,000 people in their own society, each participating group gets immediate access to the data from all of the other participating societies. Thus, they are able to compare the basic values and beliefs of the people of their own society with those of more than 60 other societies. In addition, they are invited to international meetings at which they can compare findings and interpretations with other members of the WVS network."

  3. Supporting publication for 'Prevalence sample-based guidance for reporting...

    • zenodo.org
    bin
    Updated Feb 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zenodo (2025). Supporting publication for 'Prevalence sample-based guidance for reporting 2024 data' [Dataset]. http://doi.org/10.5281/zenodo.14735617
    Explore at:
    binAvailable download formats
    Dataset updated
    Feb 3, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The record is aimed at helping the reporting countries to submit the 2024 sample-based level data to the EFSA Data Collection Framework. We include here two excel files and one XML file, word documentwe and we provide below specific information on their use.

    The two Excel documents help in mapping terms from the matrix catalogue ZOO_CAT_MATRIX used in the aggregated prevalence data model to FoodEx2 codes, and offer examples on how prevalence data can be reported using SSD2 and how data are aggregated afterwards. The XML file is the same example as in the Excel file with similar title but in the XML format that allows for it be uploaded in the Data Collection Framework.

    The word document contains the explanation of the examples privided in the Excel and XML and how the aggregation of data reported at sample-based level performed.

  4. d

    Area Analysis | Aggregated Foot Traffic Data | 11 Countries | GDPR-Compliant...

    • datarade.ai
    .csv, .xls, .xml
    Updated Jul 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Echo Analytics (2024). Area Analysis | Aggregated Foot Traffic Data | 11 Countries | GDPR-Compliant [Dataset]. https://datarade.ai/data-products/v2-echo-analytics-area-activity-global-coverage-11-count-echo-analytics
    Explore at:
    .csv, .xls, .xmlAvailable download formats
    Dataset updated
    Jul 6, 2024
    Dataset authored and provided by
    Echo Analytics
    Area covered
    United States
    Description

    At Echo, our dedication to data curation is unmatched; we focus on providing our clients with an in-depth picture of a physical location based on activity in and around a point of interest over time. Our dataset empowers you to explore the “what” by allowing you to dig deeper into customer movement behaviors, eliminate gaps in your trade area and discover untapped potential. Leverage Echo's Activity datasets to identify new growth opportunities and gain a competitive advantage.

    This sample of our Area Activity data provides you insights into the estimated total unique visitors and visits in an area. This helps you understand frequentation dynamics over time, identify emerging trends in people movements and measure the impact of external factors on how people move across a city.

    Additional Information: - Understand the actual movement patterns of consumers without using PII data, gaining a 360-degree consumer view. Complement your online behavior knowledge with actual offline actions, and better attribute intent based on real-world behaviors. - Echo collects, cleans and updates its footfall on a daily basis. Normalization of the data occurs on a monthly basis. - We provide data aggregation on a weekly, monthly and quarterly basis. - Information about our country offering and data schema can be found here:

    1) Data Schema: https://docs.echo-analytics.com/activity/data-schema
    2) Country Availability: https://docs.echo-analytics.com/activity/country-coverage
    3) Methodology: https://docs.echo-analytics.com/activity/methodology
    

    Echo's commitment to customer service is evident in our exceptional data quality and dedicated team, providing 360° support throughout your location intelligence journey. We handle the complex tasks to deliver analysis-ready datasets to you.

    Business Needs: 1. Site Selection: Leverage footfall data to identify the best location to open a new store. By analyzing areas with high footfall you can select sites that are likely to attract more customers. 2. Urban Planning Development: City planners can use footfall data to optimize the layout and infrastructure of urban areas, guide the development of commercial areas by indicating where pedestrian traffic is heaviest, and aid in traffic management and safety measures. 3. Real Estate Investment: Leverage footfall data to identify lucrative investment opportunities and optimize property management by analyzing pedestrian traffic patterns.

  5. Z

    Data from: Open-data release of aggregated Australian school-level...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Monteiro Lobato, (2020). Open-data release of aggregated Australian school-level information. Edition 2016.1 [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_46086
    Explore at:
    Dataset updated
    Jan 24, 2020
    Dataset authored and provided by
    Monteiro Lobato,
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Australia
    Description

    The file set is a freely downloadable aggregation of information about Australian schools. The individual files represent a series of tables which, when considered together, form a relational database. The records cover the years 2008-2014 and include information on approximately 9500 primary and secondary school main-campuses and around 500 subcampuses. The records all relate to school-level data; no data about individuals is included. All the information has previously been published and is publicly available but it has not previously been released as a documented, useful aggregation. The information includes: (a) the names of schools (b) staffing levels, including full-time and part-time teaching and non-teaching staff (c) student enrolments, including the number of boys and girls (d) school financial information, including Commonwealth government, state government, and private funding (e) test data, potentially for school years 3, 5, 7 and 9, relating to an Australian national testing programme know by the trademark 'NAPLAN'

    Documentation of this Edition 2016.1 is incomplete but the organization of the data should be readily understandable to most people. If you are a researcher, the simplest way to study the data is to make use of the SQLite3 database called 'school-data-2016-1.db'. If you are unsure how to use an SQLite database, ask a guru.

    The database was constructed directly from the other included files by running the following command at a command-line prompt: sqlite3 school-data-2016-1.db < school-data-2016-1.sql Note that a few, non-consequential, errors will be reported if you run this command yourself. The reason for the errors is that the SQLite database is created by importing a series of '.csv' files. Each of the .csv files contains a header line with the names of the variable relevant to each column. The information is useful for many statistical packages but it is not what SQLite expects, so it complains about the header. Despite the complaint, the database will be created correctly.

    Briefly, the data are organized as follows. (a) The .csv files ('comma separated values') do not actually use a comma as the field delimiter. Instead, the vertical bar character '|' (ASCII Octal 174 Decimal 124 Hex 7C) is used. If you read the .csv files using Microsoft Excel, Open Office, or Libre Office, you will need to set the field-separator to be '|'. Check your software documentation to understand how to do this. (b) Each school-related record is indexed by an identifer called 'ageid'. The ageid uniquely identifies each school and consequently serves as the appropriate variable for JOIN-ing records in different data files. For example, the first school-related record after the header line in file 'students-headed-bar.csv' shows the ageid of the school as 40000. The relevant school name can be found by looking in the file 'ageidtoname-headed-bar.csv' to discover that the the ageid of 40000 corresponds to a school called 'Corpus Christi Catholic School'. (3) In addition to the variable 'ageid' each record is also identified by one or two 'year' variables. The most important purpose of a year identifier will be to indicate the year that is relevant to the record. For example, if one turn again to file 'students-headed-bar.csv', one sees that the first seven school-related records after the header line all relate to the school Corpus Christi Catholic School with ageid of 40000. The variable that identifies the important differences between these seven records is the variable 'studentyear'. 'studentyear' shows the year to which the student data refer. One can see, for example, that in 2008, there were a total of 410 students enrolled, of whom 185 were girls and 225 were boys (look at the variable names in the header line). (4) The variables relating to years are given different names in each of the different files ('studentsyear' in the file 'students-headed-bar.csv', 'financesummaryyear' in the file 'financesummary-headed-bar.csv'). Despite the different names, the year variables provide the second-level means for joining information acrosss files. For example, if you wanted to relate the enrolments at a school in each year to its financial state, you might wish to JOIN records using 'ageid' in the two files and, secondarily, matching 'studentsyear' with 'financialsummaryyear'. (5) The manipulation of the data is most readily done using the SQL language with the SQLite database but it can also be done in a variety of statistical packages. (6) It is our intention for Edition 2016-2 to create large 'flat' files suitable for use by non-researchers who want to view the data with spreadsheet software. The disadvantage of such 'flat' files is that they contain vast amounts of redundant information and might not display the data in the form that the user most wants it. (7) Geocoding of the schools is not available in this edition. (8) Some files, such as 'sector-headed-bar.csv' are not used in the creation of the database but are provided as a convenience for researchers who might wish to recode some of the data to remove redundancy. (9) A detailed example of a suitable SQLite query can be found in the file 'school-data-sqlite-example.sql'. The same query, used in the context of analyses done with the excellent, freely available R statistical package (http://www.r-project.org) can be seen in the file 'school-data-with-sqlite.R'.

  6. Aggregated Data: Environmental Monitoring and Observations Effort...

    • data.csiro.au
    Updated Dec 3, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shandiya Balasubramaniam (2024). Aggregated Data: Environmental Monitoring and Observations Effort 2010-present [Dataset]. http://doi.org/10.25919/yyv0-8k80
    Explore at:
    Dataset updated
    Dec 3, 2024
    Dataset provided by
    CSIROhttp://www.csiro.au/
    Authors
    Shandiya Balasubramaniam
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 1900 - Dec 31, 2023
    Area covered
    Dataset funded by
    IMOS
    Atlas of Living Australia
    CSIROhttp://www.csiro.au/
    TERN
    Description

    This collection contains aggregated metadata on environmental monitoring and observing activities from three Australian national research infrastructures (NRIs): biodiversity survey events from the Atlas of Living Australia (ALA), marine observations collected by the Integrated Marine Observing System (IMOS), and site-based monitoring and survey efforts by the Terrestrial Ecosystem Research Network (TERN). This dataset provides a summary breakdown of these efforts by survey topic, region, and time period from 2010 to the present.

    Survey topics are mapped to an EcoAssets Earth Science Features vocabulary based on the Earth Science keywords from the Global Change Master Directory (GCMD) vocabulary, modified to use taxonomic concept URIs from the Australian National Species List (ANSL) in place of the GCMD Earth Science > Biological Classification vocabulary. ANSL categories map more readily to biodiversity survey categories, since GCMD depends on a top-level division between vertebrates and invertebrates rather than offering an animal category. The EcoAssets Earth Science Features vocabulary, including alternative keywords used in ALA, IMOS, or TERN datasets, is included in this collection.

    The primary asset is aggregated_env_monitoring.csv. This contains all faceted data records for the period and supported facets related to time, space, and features observed.

    Two derived assets (summary_monitoring_effort_terrestrial.csv, summary_monitoring_effort_marine.csv) further summarise the faceted data. Each is a pivot of the aggregated dataset.

    vocabulary_earth_science_features.csv contains the hierarchical terms used within this asset to categorise earth science features. treeview_earth_science_features.txt provides a simpler, more readable view. keyword_mapping.csv shows the mappings between these terms and the keywords used in source datasets. The data_sources_env_monitoring.csv file includes information on the source datasets within the Atlas of Living Australia that contributed to this asset. Lineage: This dataset was created by the following pipeline:

    1. Metadata records were collected from the TERN linked data portal (https://linkeddata.tern.org.au/) for all TERN monitoring sites and survey activities. Feature terms follow the TERN Feature Type vocabulary, mapped to the EcoAssets Earth Science Features vocabulary. For features that have been measured continuously at the site, metadata records were created for each relevant year since commission of the site. For other sites and features, metadata records were generated only for years in which the site was visited. TERN metadata records are associated with site coordinates.

    2. Metadata records were harvested for datasets in the Australian Ocean Data Network (AODN, https://portal.aodn.org.au/) portal maintained by IMOS (iso19115-3.2018 format over OAI-PMH). Feature terms follow the GCMD keywords used in these metadata records. Metadata records were created for each year overlapping the data collection period for each dataset. Where the datasets were associated with a bounding box, records were created for each IMCRA region intersecting the bounding box.

    3. Metadata records were created for each biodiversity sample event published to the ALA and associated with a Darwin Core event ID and a named sampling protocol (see https://dwc.tdwg.org/terms/#event). Events were excluded if the set of sampled taxa included multiple kingdoms OR the sampling protocol was associated with <50 samples OR no sample included >1 species. The remaining samples were mapped to feature terms based on the taxonomic scope of all species recorded for the associated protocol. Year and coordinates were taken from the associate sample event.

    4. Metadata records from all sources were combined and include the following values. The feature facet values are offered as a convenience for grouping records without using the hierarchical structure of the EcoAssets Earth Science Features vocabulary:

    • Source National Research Institute (NRI – one of ALA, IMOS, TERN) • Dataset name • Dataset URI • Original keyword from NRI (TERN feature type, IMOS GCMD keyword, ALA taxon) • Decimal latitude (where appropriate) • Decimal longitude (where appropriate) • Year • State or Territory • IBRA7 terrestrial region • IMCRA 4.0 mesoscale marine bioregion • Feature ID from EcoAssets Earth Science Features vocabulary • Feature name associated with feature ID • Feature facet 1 – high-level facet based on feature ID – a top-level GCMD Earth Science category (6 terms) • Feature facet 2 – intermediate-level facet based on feature ID – second-level GCMD/ANSL category (29 terms) • Feature facet 3 – lower-level facet with more fine-grained taxonomic structure based on feature ID – typically a third-level GCMD/ANSL category (36 terms)

  7. h

    Aggregated Brazilian Covid-19 data surveillance - PAMEpi data

    • healthdatagateway.org
    • dtechtive.com
    • +1more
    unknown
    Updated May 4, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Aggregated Brazilian Covid-19 data surveillance - PAMEpi data [Dataset]. http://doi.org/10.5281/zenodo.6384641
    Explore at:
    unknownAvailable download formats
    Dataset updated
    May 4, 2022
    License

    https://pamepi.rondonia.fiocruz.br/en/covid_en.htmlhttps://pamepi.rondonia.fiocruz.br/en/covid_en.html

    Description

    The current file contains community-level aggregate information extracted from health, human mobility, population inequality, and non-pharmaceutical interventions. The integration of variables from different sources facilitates the data analysis and epidemiological studies once the data set is aligned and represents a single entry for each city and day since the beginning of the pandemic in Brazil.

    The data includes, for example, the daily time series of mild to moderate cases resulting from the Flu Syndrome database, hospital occupancy and deaths from the Severe Acute Respiratory Syndrome database, vaccine doses administered daily, etc.

    To familiarize yourself with the data, a data explorer and dictionary are also available at https://pamepi.rondonia.fiocruz.br/en/aggregated_ en.html, and codes used to create the data set can be found on our GitHub directory https://github.com/PAMepi/PAMepi_scripts_datalake.git.

    This work can be cited as: 1. Platform For Analytical Modelis in Epidemiology. (2022). GitHub directory: https://github.com/PAMepi/PAMepi_scripts_datalake.git. PAMepi/PAMepi_scripts_datalake: v1.0.0 (v1.0.0). Zenodo. https://doi.org/10.5281/zenodo.6384641

  8. d

    Bayesian estimation of random-coefficients choice models using aggregate...

    • b2find.dkrz.de
    Updated Oct 24, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Bayesian estimation of random-coefficients choice models using aggregate data (replication data) - Dataset - B2FIND [Dataset]. https://b2find.dkrz.de/dataset/e71e5754-b868-5be3-9ca7-8e4ad78e173c
    Explore at:
    Dataset updated
    Oct 24, 2023
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This article discusses the use of Bayesian methods for estimating logit demand models using aggregate data. We analyze two different demand systems: independent samples and consumer panel. Under the first system, there is a different and independent random sample of N consumers in each period and each consumer makes only a single purchase decision. Under the second system, the same N consumers make a purchase decision in each of T periods. Interestingly, there exists an asymptotic link between these two systems, which has important implications for the estimation of these demand models. The proposed methods are illustrated using simulated and real data.

  9. Replication Data for: Aggregated nanoparticles: Sample preparation and...

    • osti.gov
    Updated Jul 21, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harvard Univ., Cambridge, MA (United States). Integrated Mesoscale Architectures for Sustainable Catalysis (IMASC) (EFRC) (2021). Replication Data for: Aggregated nanoparticles: Sample preparation and analysis by atom probe tomography [Dataset]. http://doi.org/10.7910/DVN/2UTNXQ
    Explore at:
    Dataset updated
    Jul 21, 2021
    Dataset provided by
    United States Department of Energyhttp://energy.gov/
    Office of Sciencehttp://www.er.doe.gov/
    Department of Energy Basic Energy Sciences Programhttp://science.energy.gov/user-facilities/basic-energy-sciences/
    Harvard Univ., Cambridge, MA (United States). Integrated Mesoscale Architectures for Sustainable Catalysis (IMASC) (EFRC)
    Description

    The data underlying this published work have been made publicly available in this repository as part of the IMASC Data Management Plan. This work was supported as part of the Integrated Mesoscale Architectures for Sustainable Catalysis (IMASC), an Energy Frontier Research Center funded by the U.S. Department of Energy, Office of Science, Basic Energy Sciences under Award # DE-SC0012573.

  10. Data from: Using partial aggregation in Spatial Capture Recapture

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    bin
    Updated May 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cyril Milleret; Pierre Dupont; Henrik Brøseth; Jonas Kindberg; J. Andrew Royle; Richard Bischof; Cyril Milleret; Pierre Dupont; Henrik Brøseth; Jonas Kindberg; J. Andrew Royle; Richard Bischof (2022). Data from: Using partial aggregation in Spatial Capture Recapture [Dataset]. http://doi.org/10.5061/dryad.pd612qp
    Explore at:
    binAvailable download formats
    Dataset updated
    May 28, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Cyril Milleret; Pierre Dupont; Henrik Brøseth; Jonas Kindberg; J. Andrew Royle; Richard Bischof; Cyril Milleret; Pierre Dupont; Henrik Brøseth; Jonas Kindberg; J. Andrew Royle; Richard Bischof
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description
    1. Spatial capture-recapture (SCR) models are commonly used for analyzing data collected using non-invasive genetic sampling (NGS). Opportunistic NGS often leads to detections that do not occur at discrete detector locations. Therefore, spatial aggregation of individual detections into fixed detectors (e.g. center of grid cells) is an option to increase computing speed of SCR analyses. However, it may reduce precision and accuracy of parameter estimations.
    2. Using simulations, we explored the impact that spatial aggregation of detections has on a trade-off between computing time and parameter precision and bias, under a range of biological conditions. We used three different observation models: the commonly used Poisson and Bernoulli models, as well as a novel way to partially aggregate detections (Partially Aggregated Binary model (PAB)) to reduce the loss of information after aggregating binary detections. The PAB model divides detectors into K subdetectors and models the frequency of subdetectors with more than one detection as a binomial response with a sample size of K. Finally, we demonstrate the consequences of aggregation and the use of the PAB model using NGS data from the monitoring of wolverine (Gulo gulo) in Norway.
    3. Spatial aggregation of detections, while reducing computation time, does indeed incur costs in terms of reduced precision and accuracy, especially for the parameters of the detection function. SCR models estimated abundance with a low bias (< 10%) even at high degree of aggregation, but only for the Poisson and PAB models. Overall, the cost of aggregation is mitigated when using the Poisson and PAB models. At the same level of aggregation, the PAB observation models out-performs the Bernoulli model in terms of accuracy of estimates, while offering the benefits of a binary observation model (less assumptions about the underlying ecological process) over the count-based model.
    4. We recommend that detector spacing after aggregation does not exceed 1.5 times the scale-parameter of the detection function in order to limit bias. We recommend the use of the PAB observation model when performing spatial aggregation of binary data as it can mitigate the cost of aggregation, compared to the Bernoulli model.
  11. BIDS Phenotype Aggregation Example Dataset

    • openneuro.org
    Updated Jun 4, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samuel Guay; Eric Earl; Hao-Ting Wang; Remi Gau; Dorota Jarecka; David Keator; Melissa Kline Struhl; Satra Ghosh; Louis De Beaumont; Adam G. Thomas (2022). BIDS Phenotype Aggregation Example Dataset [Dataset]. http://doi.org/10.18112/openneuro.ds004130.v1.0.0
    Explore at:
    Dataset updated
    Jun 4, 2022
    Dataset provided by
    OpenNeurohttps://openneuro.org/
    Authors
    Samuel Guay; Eric Earl; Hao-Ting Wang; Remi Gau; Dorota Jarecka; David Keator; Melissa Kline Struhl; Satra Ghosh; Louis De Beaumont; Adam G. Thomas
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    BIDS Phenotype Aggregation Example COPY OF "The NIMH Healthy Research Volunteer Dataset" (ds003982)

    Modality-agnostic files were copied over and the CHANGES file was updated. Data was aggregated using:

    python phenotype.py aggregate subject -i segregated_subject -o aggregated_subject

    phenotype.py came from the GitHub repository: https://github.com/ericearl/bids-phenotype

    THE ORIGINAL DATASET ds003982 README FOLLOWS

    A comprehensive clinical, MRI, and MEG collection characterizing healthy research volunteers collected at the National Institute of Mental Health (NIMH) Intramural Research Program (IRP) in Bethesda, Maryland using medical and mental health assessments, diagnostic and dimensional measures of mental health, cognitive and neuropsychological functioning, structural and functional magnetic resonance imaging (MRI), along with diffusion tensor imaging (DTI), and a comprehensive magnetoencephalography battery (MEG).

    In addition, blood samples are currently banked for future genetic analysis. All data collected in this protocol are broadly shared in the OpenNeuro repository, in the Brain Imaging Data Structure (BIDS) format. In addition, blood samples of healthy volunteers are banked for future analyses. All data collected in this protocol are broadly shared here, in the Brain Imaging Data Structure (BIDS) format. In addition, task paradigms and basic pre-processing scripts are shared on GitHub. This dataset is unique in its depth of characterization of a healthy population in terms of brain health and will contribute to a wide array of secondary investigations of non-clinical and clinical research questions.

    This dataset is licensed under the Creative Commons Zero (CC0) v1.0 License.

    Recruitment

    Inclusion criteria for the study require that participants are adults at or over 18 years of age in good health with the ability to read, speak, understand, and provide consent in English. All participants provided electronic informed consent for online screening and written informed consent for all other procedures. Exclusion criteria include:

    • A history of significant or unstable medical or mental health condition requiring treatment
    • Current self-injury, suicidal thoughts or behavior
    • Current illicit drug use by history or urine drug screen
    • Abnormal physical exam or laboratory result at the time of in-person assessment
    • Less than an 8th grade education or IQ below 70
    • Current employees, or first-degree relatives of NIMH employees

    Study participants are recruited through direct mailings, bulletin boards and listservs, outreach exhibits, print advertisements, and electronic media.

    Clinical Measures

    All potential volunteers first visit the study website (https://nimhresearchvolunteer.ctss.nih.gov), check a box indicating consent, and complete preliminary self-report screening questionnaires. The study website is HIPAA compliant and therefore does not collect PII ; instead, participants are instructed to contact the study team to provide their identity and contact information. The questionnaires include demographics, clinical history including medications, disability status (WHODAS 2.0), mental health symptoms (modified DSM-5 Self-Rated Level 1 Cross-Cutting Symptom Measure), substance use survey (DSM-5 Level 2), alcohol use (AUDIT), handedness (Edinburgh Handedness Inventory), and perceived health ratings. At the conclusion of the questionnaires, participants are again prompted to send an email to the study team. Survey results, supplemented by NIH medical records review (if present), are reviewed by the study team, who determine if the participant is likely eligible for the protocol. These participants are then scheduled for an in-person assessment. Follow-up phone screenings were also used to determine if participants were eligible for in-person screening.

    In-person Assessments

    At this visit, participants undergo a comprehensive clinical evaluation to determine final eligibility to be included as a healthy research volunteer. The mental health evaluation consists of a psychiatric diagnostic interview (Structured Clinical Interview for DSM-5 Disorders (SCID-5), along with self-report surveys of mood (Beck Depression Inventory-II (BD-II) and anxiety (Beck Anxiety Inventory, BAI) symptoms. An intelligence quotient (IQ) estimation is determined with the Kaufman Brief Intelligence Test, Second Edition (KBIT-2). The KBIT-2 is a brief (20-30 minute) assessment of intellectual functioning administered by a trained examiner. There are three subtests, including verbal knowledge, riddles, and matrices.

    Medical Evaluation

    Medical evaluation includes medical history elicitation and systematic review of systems. Biological and physiological measures include vital signs (blood pressure, pulse), as well as weight, height, and BMI. Blood and urine samples are taken and a complete blood count, acute care panel, hepatic panel, thyroid stimulating hormone, viral markers (HCV, HBV, HIV), C-reactive protein, creatine kinase, urine drug screen and urine pregnancy tests are performed. In addition, blood samples that can be used for future genomic analysis, development of lymphoblastic cell lines or other biomarker measures are collected and banked with the NIMH Repository and Genomics Resource (Infinity BiologiX). The Family Interview for Genetic Studies (FIGS) was later added to the assessment in order to provide better pedigree information; the Adverse Childhood Events (ACEs) survey was also added to better characterize potential risk factors for psychopathology. The entirety of the in-person assessment not only collects information relevant for eligibility determination, but it also provides a comprehensive set of standardized clinical measures of volunteer health that can be used for secondary research.

    MRI Scan

    Participants are given the option to consent for a magnetic resonance imaging (MRI) scan, which can serve as a baseline clinical scan to determine normative brain structure, and also as a research scan with the addition of functional sequences (resting state and diffusion tensor imaging). The MR protocol used was initially based on the ADNI-3 basic protocol, but was later modified to include portions of the ABCD protocol in the following manner:

    1. The T1 scan from ADNI3 was replaced by the T1 scan from the ABCD protocol.
    2. The Axial T2 2D FLAIR acquisition from ADNI2 was added, and fat saturation turned on.
    3. Fat saturation was turned on for the pCASL acquisition.
    4. The high-resolution in-plane hippocampal 2D T2 scan was removed and replaced with the whole brain 3D T2 scan from the ABCD protocol (which is resolution and bandwidth matched to the T1 scan).
    5. The slice-select gradient reversal method was turned on for DTI acquisition, and reconstruction interpolation turned off.
    6. Scans for distortion correction were added (reversed-blip scans for DTI and resting state scans).
    7. The 3D FLAIR sequence was made optional and replaced by one where the prescription and other acquisition parameters provide resolution and geometric correspondence between the T1 and T2 scans.

    At the time of the MRI scan, volunteers are administered a subset of tasks from the NIH Toolbox Cognition Battery. The four tasks include:

    1. Flanker inhibitory control and attention task assesses the constructs of attention and executive functioning.
    2. Executive functioning is also assessed using a dimensional change card sort test.
    3. Episodic memory is evaluated using a picture sequence memory test.
    4. Working memory is evaluated using a list sorting test.

    MEG

    An optional MEG study was added to the protocol approximately one year after the study was initiated, thus there are relatively fewer MEG recordings in comparison to the MRI dataset. MEG studies are performed on a 275 channel CTF MEG system (CTF MEG, Coquiltam BC, Canada). The position of the head was localized at the beginning and end of each recording using three fiducial coils. These coils were placed 1.5 cm above the nasion, and at each ear, 1.5 cm from the tragus on a line between the tragus and the outer canthus of the eye. For 48 participants (as of 2/1/2022), photographs were taken of the three coils and used to mark the points on the T1 weighted structural MRI scan for co-registration. For the remainder of the participants (n=16 as of 2/1/2022), a Brainsight neuronavigation system (Rogue Research, Montréal, Québec, Canada) was used to coregister the MRI and fiducial localizer coils in realtime prior to MEG data acquisition.

    Specific Measures within Dataset

    Online and In-person behavioral and clinical measures, along with the corresponding phenotype file name, sorted first by measurement location and then by file name.

    LocationMeasureFile Name
    OnlineAlcohol Use Disorders Identification Test (AUDIT)audit
    Demographicsdemographics
    DSM-5 Level 2 Substance Use - Adultdrug_use
    Edinburgh Handedness Inventory (EHI)ehi
    Health History Formhealth_history_questions
    Perceived Health Rating - selfhealth_rating
  12. g

    STAR network aggregated attendance data | gimi9.com

    • gimi9.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    STAR network aggregated attendance data | gimi9.com [Dataset]. https://gimi9.com/dataset/eu_https-data-explore-star-fr-explore-dataset-tco-billettique-frequentation-agregee-td-
    Explore at:
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Description

    ** STAR traffic data per line per day** this dataset is replaced by the datasethttps://data.explore.star.fr/explore/dataset/tco-billettique-star-frequentation-agregee-td/information/ It will be definitively released at the end of May 2023 This dataset provides STAR traffic data per line per month. The data can be downloaded through the URLs transmitted in this dataset. At the beginning of each month, the attendance data for month N-2 are made available. For example, if we are in May, March data is available. The May data will therefore be available in early July. This dataset offers to download attendance data over a rolling year (no history prior to one year). The structure of the attendance file is as follows: - Column 1: Date of the day of operation - Column 2: Line ID (= lineo) - Column 3: Short name of the line in the commercial sense - Column 4: Attendance The information available in this file can be crossed with the information available on STAR opendata. Be careful, however, to the temporal desynchronisation of the data: opendata offers, for example, the current data of topology and hourly offer of the network while the attendance data is those recorded two months before (months in progress minus two months). So you have to think about keeping opendata information, managing your own versioning and/or using GTFS data to cross-reference the information. Particularities: . Line ID = 9000: Traffic data for bridge parks . Stop point identifier = 65501 or 9999: off-site attendance data. For technical reasons, attendance data, via validations, cannot be located (quantified but not usable data). The attendance can be relocated to the stop point level but not to the line level but it can also be relocated to the stop point and line level.

  13. d

    Data from: Framing the Curriculum of DLI Training and Data Services: Part 2

    • search.dataone.org
    Updated Dec 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chuck Humphrey; Wendy Watkins (2023). Framing the Curriculum of DLI Training and Data Services: Part 2 [Dataset]. http://doi.org/10.5683/SP3/S46LZT
    Explore at:
    Dataset updated
    Dec 28, 2023
    Dataset provided by
    Borealis
    Authors
    Chuck Humphrey; Wendy Watkins
    Description

    This session will focus on the baseline of skills that Data Liberation Initiative (DLI) Contacts should have and the corresponding training to achieve these skills. Introducing newcomers to the language of statistics and data is one of the important tasks of the orientation. Acquiring a technical language often poses a barrier to newcomers. To overcome this hurdle, newcomers must grasp both the meaning of new concepts and its abbreviated language of acronyms. Should we expect the orientation to offer all of the baseline skills or is other instruction needed? Do different local environments result in varying uses of DLI resources? Are the same skills needed among differing environments? How much attention should be paid during the orientation to different models of data service? For example, should the implications of buying services from elsewhere (e.g., Sherlock, IDLS, CHASS, Queen’s, etc.) be covered? What kind of distinctions need to be made for the levels of support for instructional and research uses of data? What about the reference uses of data, that is, using data to answer reference questions? Are there additional skills required of those supporting DLI data for research and reference uses? If there are, what are they and how should they be introduced?

  14. d

    FHV Base Aggregate Report

    • catalog.data.gov
    • data.cityofnewyork.us
    • +1more
    Updated Mar 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.cityofnewyork.us (2025). FHV Base Aggregate Report [Dataset]. https://catalog.data.gov/dataset/fhv-base-aggregate-report
    Explore at:
    Dataset updated
    Mar 22, 2025
    Dataset provided by
    data.cityofnewyork.us
    Description

    Monthly report including total dispatched trips, total dispatched shared trips, and unique dispatched vehicles aggregated by FHV (For-Hire Vehicle) base. These have been tabulated from raw trip record submissions made by bases to the NYC Taxi and Limousine Commission (TLC). This dataset is typically updated monthly on a two-month lag, as bases have until the conclusion of the following month to submit a month of trip records to the TLC. In example, a base has until Feb 28 to submit complete trip records for January. Therefore, the January base aggregates will appear in March at the earliest. The TLC may elect to defer updates to the FHV Base Aggregate Report if a large number of bases have failed to submit trip records by the due date. Note: The TLC publishes base trip record data as submitted by the bases, and we cannot guarantee or confirm their accuracy or completeness. Therefore, this may not represent the total amount of trips dispatched by all TLC-licensed bases. The TLC performs routine reviews of the records and takes enforcement actions when necessary to ensure, to the extent possible, complete and accurate information.

  15. c

    1991 Census: Aggregate Data; Great Britain

    • datacatalogue.cessda.eu
    Updated Mar 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office of Population Censuses and Surveys; General Register Office (Scotland), Census Branch (2025). 1991 Census: Aggregate Data; Great Britain [Dataset]. http://doi.org/10.5255/UKDA-SN-22001-2
    Explore at:
    Dataset updated
    Mar 1, 2025
    Dataset provided by
    Census Division
    Authors
    Office of Population Censuses and Surveys; General Register Office (Scotland), Census Branch
    Area covered
    Northern Ireland, England, Wales, Scotland, United Kingdom
    Variables measured
    Individuals, Families/households, Administrative units (geographical/political), National, Census data, Households, Groups, Subnational
    Measurement technique
    Self-administered questionnaire
    Description

    Abstract copyright UK Data Service and data collection copyright owner.

    The UK censuses took place on 21st April 1991. They were run by the Census Office for Northern Ireland, General Register Office for Scotland, and the Office of Population and Surveys for both England and Wales. The UK comprises the countries of England, Wales, Scotland and Northern Ireland.

    Statistics from the UK censuses help paint a picture of the nation and how we live. They provide a detailed snapshot of the population and its characteristics, and underpin funding allocation to provide public services.


    The aggregate data produced as outputs from censuses in Great Britain provide information on a wide range of demographic and socio-economic characteristics. They are predominantly a collection of aggregated or summary counts of the numbers of people or households resident in specific geographical areas possessing particular characteristics.

    The topics covered by the 1991 Census were virtually the same as those in the 1981 Census. However, new questions were introduced on limiting long-term illness, ethnic group, central heating and term-time address of students. Also a question on weekly hours worked was re-introduced.

    The 100% Sample files include information about total population; population in private households and communal establishments; sex; age; marital status; country of birth; ethnicity; migration; employment status; economic activity; household composition; dependent children; dependant adults; long-term illness; household car availability; housing; housing tenure; housing amenities; central heating; linguistic ability (Welsh/Gaelic in Wales and Scotland respectively).

    The 10% Sample files contain information about socio-economic composition; employment status; occupations; industry of occupation; hours of work; commuting; qualifications, family type; household composition; age; sex; marital status; ethnicity; housing tenure; social class.

    Local Base Statistics (LBS)
    The 1991 Census Local Base Statistics (LBS) have around 20,000 statistical counts (cells) contained in 99 tables and cover the complete range of topics in the 1991 Census. They form the basis of the tables to be reproduced for each county (in England and Wales) and region (in Scotland) and for each local authority district. The LBS are available down to ward level in England and Wales and postcode sector level in Scotland.

    Small Area Statistics (SAS)
    The 1991 Census Small Area Statistics (SAS) tables are an abbreviated version of the Local Base Statistics. They comprise around 10,000 counts for each area and are available as an abstract of some 86 tables for geographic areas down to Enumeration District level in England and Wales and Output Area level in Scotland.

    Data can be accessed through CKAN (to bulk download data).

    Citation: Office of Population Censuses and Surveys; General Register Office for Scotland; Registrar General for Northern Ireland (1997): 1991 Census aggregate data (Edition: 1997). UK Data Service. DOI: https://doi.org/10.5257/census/aggregate-1991-1



    Main Topics:

    Population bases

    Age and marital status

    Communal establishments

    Medical and care establishments

    Hotels and other establishments

    Ethnic group

    Country of birth

    Economic position

    Economic position and ethnic group

    Term-time address

    Persons present

    Long-term illness in households

    Long-term illness in communal establishments

    Long-term illness and economic position

    Migrants

    Wholly moving households

    Ethnic group of migrants

    Imputed residents

    Imputed households

    Tenure and amenities

    Car availability

    Rooms and household size

    Persons per room

    Residents 18 and over

    Visitor households

    Students in households

    Households: 1971/'81/'91 bases

    Dependants in households

    Dependants and long-term illness

    Carers

    Dependent children in households

    Households with children aged 0 - 15

    Women in couples: economic position

    Economic position of household residents

    Age & marital status of household residents

    Earners and dependent children

    Young adults

    Single years of age

    Headship

    Lone 'parents'

    Shared accommodation

    Household composition and housing

    Household composition and ethnic group

    Household composition and long-term illness

    Migrant household heads

    Households with dependent children; housing

    Households with pensioners; housing

    Households with dependants; housing

    Ethnic group; housing

    Country of birth; hold heads and residents

    Country of birth and ethnic group

    Language indicators

    Lifestages

    Occupancy (Occupied; vacant; other accommodation)

    Household spaces and...

  16. E

    Soil aggregate stability data from arable and grassland in Countryside...

    • catalogue.ceh.ac.uk
    zip
    Updated Mar 4, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    A.M. Keith; M.R. Cave; B.A. Dodd; S.M. Smart; G. Turner; A.M. Tye; C.M. Wood (2020). Soil aggregate stability data from arable and grassland in Countryside Survey, Great Britain 2007 [Dataset]. http://doi.org/10.5285/be3793b6-90fb-4e4c-9515-220cc33223b9
    Explore at:
    zipAvailable download formats
    Dataset updated
    Mar 4, 2020
    Dataset provided by
    NERC EDS Environmental Information Data Centre
    Authors
    A.M. Keith; M.R. Cave; B.A. Dodd; S.M. Smart; G. Turner; A.M. Tye; C.M. Wood
    Time period covered
    May 1, 2007 - Oct 31, 2007
    Area covered
    Dataset funded by
    Natural Environment Research Councilhttps://www.ukri.org/councils/nerc
    Description

    This dataset consists of Particle Size Distribution (PSD) measurements made on 419 archived topsoil samples and derived aggregate stability metrics from arable and grassland habitats across Great Britain in 2007. Laser granulometry was used to measure PSD of 1–2 mm aggregates before and after sonication and the difference in their Mean Weight Diameter (MWD) used to indicate aggregate stability. The samples were collected as part of the Countryside Survey monitoring programme, a unique study or ‘audit’ of the natural resources of the UK’s countryside. The analyses were conducted as part of study aiming to quantify how soil quality indicators change across a gradient of agricultural land management and to identify conditions that determine the ability of different soils to resist and recover from perturbations.

  17. d

    FoodPanda Food & Grocery Transaction Data | Email Receipt Data | Asia |...

    • datarade.ai
    .json, .xml, .csv
    Updated Oct 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Measurable AI (2023). FoodPanda Food & Grocery Transaction Data | Email Receipt Data | Asia | Granular & Aggregate Data available [Dataset]. https://datarade.ai/data-products/foodpanda-food-grocery-transaction-data-email-receipt-dat-measurable-ai
    Explore at:
    .json, .xml, .csvAvailable download formats
    Dataset updated
    Oct 13, 2023
    Dataset authored and provided by
    Measurable AI
    Area covered
    Philippines, Pakistan, Malaysia, Taiwan, Hong Kong, Thailand, Singapore
    Description

    The Measurable AI FoodPanda Food & Grocery Transaction dataset is a leading source of email receipts and transaction data, offering data collected directly from users via Proprietary Consumer Apps, with millions of opt-in users.

    We source our email receipt consumer data panel via two consumer apps which garner the express consent of our end-users (GDPR compliant). We then aggregate and anonymize all the transactional data to produce raw and aggregate datasets for our clients.

    Use Cases Our clients leverage our datasets to produce actionable consumer insights such as: - Market share analysis - User behavioral traits (e.g. retention rates) - Average order values - Promotional strategies used by the key players. Several of our clients also use our datasets for forecasting and understanding industry trends better.

    Coverage - Asia (Hong Kong, Taiwan, Singapore, Thailand, Malaysia, Philippines, Pakistan)

    Granular Data Itemized, high-definition data per transaction level with metrics such as - Order value - Items ordered - No. of orders per user - Delivery fee - Service fee - Promotions used - Geolocation data and more

    Aggregate Data - Weekly/ monthly order volume - Revenue delivered in aggregate form, with historical data dating back to 2018. All the transactional e-receipts are sent from the FoodPanda food delivery app to users’ registered accounts.

    Most of our clients are fast-growing Tech Companies, Financial Institutions, Buyside Firms, Market Research Agencies, Consultancies and Academia.

    Our dataset is GDPR compliant, contains no PII information and is aggregated & anonymized with user consent. Contact business@measurable.ai for a data dictionary and to find out our volume in each country.

  18. Aggregated Data: Australian Species Occurrences 1900-2022

    • data.csiro.au
    Updated Sep 26, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Donald Hobern; Shandiya Balasubramaniam (2023). Aggregated Data: Australian Species Occurrences 1900-2022 [Dataset]. http://doi.org/10.25919/xpy6-t550
    Explore at:
    Dataset updated
    Sep 26, 2023
    Dataset provided by
    CSIROhttp://www.csiro.au/
    Authors
    Donald Hobern; Shandiya Balasubramaniam
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 1900 - Dec 31, 2022
    Area covered
    Dataset funded by
    CSIROhttp://www.csiro.au/
    TERN
    Australian Research Data Commons
    IMOS
    Atlas of Living Australia
    Description

    Aggregated Australian species occurrence data from 1900 to the present using a suite of facets of most importance for environmental assessments. Occurrence records were aggregated and organised by the Atlas of Living Australia (ALA, https://ala.org.au/) and include survey and monitoring data collected and managed by the Integrated Marine Observing System (IMOS, https://imos.org.au/) and the Terrestrial Ecosystem Research Network (TERN, https://tern.org.au/).

    Data from these infrastructures and other sources have been organised here as a national public-access dataset.

    This dataset serves as a standardised snapshot of Australian biodiversity occurrence data from which many indicator datasets can more readily be derived (see Has Derivation entries below).

    The primary asset is AggregatedData_AustralianSpeciesOccurrences_1.1.2023-06-13.csv. This contains all faceted data records for the period and supported facets related to time, space, taxonomy and conservation significance.

    Six derived assets (SummaryData-ProtectionStatusAustralianMarineSpeciesOccurrences-1.1.2023-06-13.csv, SummaryData-ProtectionStatusAustralianTerrestrialSpeciesOccurrences-1.1.2023-06-13.csv, SummaryData-IntroducedSpeciesOccurrencesByMarineEcoregion-1.1.2023-06-13.csv, SummaryData-IntroducedSpeciesOccurrencesByTerrestrialEcoregion-1.1.2023-06-13.csv, SummaryData-ThreatenedSpeciesOccurrencesByMarineEcoregion-1.1.2023-06-13.csv, SummaryData-ThreatenedSpeciesOccurrencesByTerrestrialEcoregion-1.1.2023-06-13.csv) demonstrate uses supported by the faceted data. Each is a pivot of the aggregated dataset.

    The data-sources.csv file includes information on the source datasets within the Atlas of Living Australia that contributed to this asset. README.txt documents the columns in each data file.

    Grouping records from this dataset supports comparisons between the number of occurrence records for different regions and/or time periods and/or categories of species and occurrence data. Grouped counts of this kind may serve as useful indications of variation and change across the dimensions compared. Note however that such counts may not accurately reflect real differences in biodiversity. It is important to consider confounding factors (particularly variations in recording effort over time). Grouping all records by a single facet (e.g. IBRA region) may help to expose such factors.

    These data are versioned at 12-month intervals. Previous versions will be linked below under Previous Version. The latest version can always be accessed at https://ecoassets.org.au/data/aggregated-data-australian-species-occurrences/.

    Notes

    GRIIS 1.6 includes a number of vertebrate species listed because some individuals have been translocated or (re-)introduced beyond their remaining ranges for conservation purposes. It is unhelpful for the current analysis to treat these as introduced species. These species were removed from the version of the GRIIS list used in this analysis. In future versions of GRIIS, these species will be documented as native species that have been translocated/reintroduced. Lineage: All species occurrence data aggregated by the ALA as of 2022-12-31 were filtered to include only:

    • Records from 1900 onwards
    • Presence records only (exclude absence records
    • Spatial coordinates present
    • Taxon identified to at least species level
    • Location falls within an IBRA or IMCRA region

    Filtered data were processed to include the following elements:

    1. Accepted taxon ID
    2. Accepted species name
    3. Classification (higher ranks)
    4. Year of occurrence
    5. Coordinates of occurrence
    6. Basis of record (specimen, human observation, etc.)
    7. State or Territory
    8. IBRA7 terrestrial region
    9. IMCRA 4.0 mesoscale marine bioregion
    10. Status of location in CAPAD 2020 (not protected area, protected area, indigenous protected area)
    11. Status of location in Forests of Australia (2013)
    12. Status of location in Forests of Australia (2018)
    13. Status of species on EPBC Act List of Threatened Species (mapped to accepted ALA species using GALAH R library)
    14. Status of species on Global Register of Introduced and Invasive Species – Australia (GRIIS) version 1.6 (mapped to accepted ALA species)

    Processed occurrence data were grouped to count records detected for each distinct combination of eleven primary facets. The resulting dataset is published as follows

    • AggregatedData_AustralianSpeciesOccurrences_1.1.2023-06-13.csv

    This dataset includes the following elements:

    1. Year of occurrence
    2. Basis of record (specimen, human observation, etc.)
    3. State/Territory
    4. IBRA7 terrestrial region
    5. IMCRA 4.0 mesoscale marine bioregion
    6. Status of location in Forests of Australia (2018)
    7. Status of location in Forests of Australia (2013)
    8. Status of location in CAPAD 2020 (not protected, PA – protected area, IPA – indigenous protected area)
    9. Status of species on EPBC Act List of Threatened Species
    10. Status of species on Global Register of Introduced and Invasive Species – Australia (GRIIS) version 1.6
    11. ALA species identifier
    12. Scientific name for species
    13. Count of occurrence records matching the values for elements 1 to 11

    Six derived summary datasets are also included. Each of this is a pivot of data in the main dataset and demonstrates a use case for the information:

    • SummaryData-ProtectionStatusAustralianTerrestrialSpeciesOccurrences-1.1.2023-06-13.csv
    • SummaryData-ProtectionStatusAustralianMarineSpeciesOccurrences-1.1.2023-06-13.csv

    These two datasets include the following columns:

    1. IBRA7 / IMCRA 4.0 bioregion
    2. ALA Species ID
    3. Species scientific name
    4. EPBC status for species
    5. Count of all records for species from region
    6. Count of all records for species from protected areas inside region
    7. Count of all records for species from protected areas under indigenous management inside region
    • SummaryData-ThreatenedSpeciesOccurrencesByTerrestrialEcoregion-1.1.2023-06-13.csv
    • SummaryData-ThreatenedSpeciesOccurrencesByMarineEcoregion-1.1.2023-06-13.csv

    These two datasets include the following columns:

    1. IBRA7 / IMCRA 4.0 bioregion
    2. Starting year of the time period
    3. Ending year of the time period
    4. EPBC status for species
    5. Count of all occurrence records in the region and status for the given period
    6. Count of all distinct species in the region and status for the given period
    • SummaryData-IntroducedSpeciesOccurrencesByTerrestrialEcoregion-1.1.2023-06-13.csv
    • SummaryData-IntroducedSpeciesOccurrencesByMarineEcoregion-1.1.2023-06-13.csv

    These two datasets include the following columns:

    1. IBRA7 / IMCRA 4.0 bioregion
    2. Starting year of the time period
    3. Ending year of the time period
    4. GRIIS status for species (Native, Introduced, Invasive)
    5. Count of all occurrence records in the region and status for the given period
    6. Count of all distinct species in the region and status for the given period
  19. e

    Data from: Land Cover Map 2015 (1km dominant aggregate class, GB)

    • data.europa.eu
    • catalogue.ceh.ac.uk
    • +1more
    unknown, zip
    Updated Oct 15, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Environmental Information Data Centre (2020). Land Cover Map 2015 (1km dominant aggregate class, GB) [Dataset]. https://data.europa.eu/data/datasets/land-cover-map-2015-1km-dominant-aggregate-class-gb?locale=es
    Explore at:
    unknown, zipAvailable download formats
    Dataset updated
    Oct 15, 2020
    Dataset authored and provided by
    Environmental Information Data Centre
    Description

    This dataset consists of the 1km raster, dominant aggregate class version of the Land Cover Map 2015 (LCM2015) for Great Britain. The 1km dominant coverage product is based on the 1km percentage product and reports the aggregated habitat class with the highest percentage cover for each 1km pixel. The 10 aggregate classes are groupings of 21 target classes, which are based on the Joint Nature Conservation Committee (JNCC) Broad Habitats, which encompass the entire range of UK habitats. The aggregate classes group some of the more specialised classes into more general categories. For example, the five coastal classes in the target class are grouped into a single aggregate coastal class. This dataset is derived from the vector version of the Land Cover Map, which contains individual parcels of land cover and is the highest available spatial resolution. LCM2015 is a land cover map of the UK which was produced at the Centre for Ecology & Hydrology by classifying satellite images from 2014 and 2015 into 21 Broad Habitat-based classes. LCM2015 consists of a range of raster and vector products and users should familiarise themselves with the full range (see related records, the CEH web site and the LCM2015 Dataset documentation) to select the product most suited to their needs. LCM2015 was produced at the Centre for Ecology & Hydrology by classifying satellite images from 2014 and 2015 into 21 Broad Habitat-based classes. It is one of a series of land cover maps, produced by UKCEH since 1990. They include versions in 1990, 2000, 2007, 2015, 2017, 2018 and 2019. Full details about this dataset can be found at https://doi.org/10.5285/711c8dc1-0f4e-42ad-a703-8b5d19c92247

  20. d

    Violence Reduction - Victim Demographics - Aggregated

    • catalog.data.gov
    • data.cityofchicago.org
    Updated Mar 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.cityofchicago.org (2025). Violence Reduction - Victim Demographics - Aggregated [Dataset]. https://catalog.data.gov/dataset/violence-reduction-victim-demographics-aggregated
    Explore at:
    Dataset updated
    Mar 14, 2025
    Dataset provided by
    data.cityofchicago.org
    Description

    This dataset contains aggregate data on violent index victimizations at the quarter level of each year (i.e., January – March, April – June, July – September, October – December), from 2001 to the present (1991 to present for Homicides), with a focus on those related to gun violence. Index crimes are 10 crime types selected by the FBI (codes 1-4) for special focus due to their seriousness and frequency. This dataset includes only those index crimes that involve bodily harm or the threat of bodily harm and are reported to the Chicago Police Department (CPD). Each row is aggregated up to victimization type, age group, sex, race, and whether the victimization was domestic-related. Aggregating at the quarter level provides large enough blocks of incidents to protect anonymity while allowing the end user to observe inter-year and intra-year variation. Any row where there were fewer than three incidents during a given quarter has been deleted to help prevent re-identification of victims. For example, if there were three domestic criminal sexual assaults during January to March 2020, all victims associated with those incidents have been removed from this dataset. Human trafficking victimizations have been aggregated separately due to the extremely small number of victimizations. This dataset includes a " GUNSHOT_INJURY_I " column to indicate whether the victimization involved a shooting, showing either Yes ("Y"), No ("N"), or Unknown ("UKNOWN.") For homicides, injury descriptions are available dating back to 1991, so the "shooting" column will read either "Y" or "N" to indicate whether the homicide was a fatal shooting or not. For non-fatal shootings, data is only available as of 2010. As a result, for any non-fatal shootings that occurred from 2010 to the present, the shooting column will read as “Y.” Non-fatal shooting victims will not be included in this dataset prior to 2010; they will be included in the authorized dataset, but with "UNKNOWN" in the shooting column. The dataset is refreshed daily, but excludes the most recent complete day to allow CPD time to gather the best available information. Each time the dataset is refreshed, records can change as CPD learns more about each victimization, especially those victimizations that are most recent. The data on the Mayor's Office Violence Reduction Dashboard is updated daily with an approximately 48-hour lag. As cases are passed from the initial reporting officer to the investigating detectives, some recorded data about incidents and victimizations may change once additional information arises. Regularly updated datasets on the City's public portal may change to reflect new or corrected information. How does this dataset classify victims? The methodology by which this dataset classifies victims of violent crime differs by victimization type: Homicide and non-fatal shooting victims: A victimization is considered a homicide victimization or non-fatal shooting victimization depending on its presence in CPD's homicide victims data table or its shooting victims data table. A victimization is considered a homicide only if it is present in CPD's homicide data table, while a victimization is considered a non-fatal shooting only if it is present in CPD's shooting data tables and absent from CPD's homicide data table. To determine the IUCR code of homicide and non-fatal shooting victimizations, we defer to the incident IUCR code available in CPD's Crimes, 2001-present dataset (available on the City's open data portal). If the IUCR code in CPD's Crimes dataset is inconsistent with the homicide/non-fatal shooting categorization, we defer to CPD's Victims dataset. For a criminal homicide, the only sensible IUCR codes are 0110 (first-degree murder) or 0130 (second-degree murder). For a non-fatal shooting, a sensible IUCR code must signify a criminal sexual assault, a robbery, or, most commonly, an aggravated battery. In rare instances, the IUCR code in CPD's Crimes and Vi

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Envestnet | Yodlee, Envestnet | Yodlee's De-Identified Consumer Purchase Data | Row/Aggregate Level | USA Consumer Data covering 3600+ corporations | 90M+ Accounts [Dataset]. https://datarade.ai/data-products/envestnet-yodlee-s-consumer-purchase-data-row-aggregate-envestnet-yodlee
Organization logoOrganization logo

Envestnet | Yodlee's De-Identified Consumer Purchase Data | Row/Aggregate Level | USA Consumer Data covering 3600+ corporations | 90M+ Accounts

Explore at:
.sql, .txtAvailable download formats
Dataset provided by
Envestnethttp://envestnet.com/
Yodlee
Authors
Envestnet | Yodlee
Area covered
United States of America
Description

Envestnet®| Yodlee®'s Consumer Purchase Data (Aggregate/Row) Panels consist of de-identified, near-real time (T+1) USA credit/debit/ACH transaction level data – offering a wide view of the consumer activity ecosystem. The underlying data is sourced from end users leveraging the aggregation portion of the Envestnet®| Yodlee®'s financial technology platform.

Envestnet | Yodlee Consumer Panels (Aggregate/Row) include data relating to millions of transactions, including ticket size and merchant location. The dataset includes de-identified credit/debit card and bank transactions (such as a payroll deposit, account transfer, or mortgage payment). Our coverage offers insights into areas such as consumer, TMT, energy, REITs, internet, utilities, ecommerce, MBS, CMBS, equities, credit, commodities, FX, and corporate activity. We apply rigorous data science practices to deliver key KPIs daily that are focused, relevant, and ready to put into production.

We offer free trials. Our team is available to provide support for loading, validation, sample scripts, or other services you may need to generate insights from our data.

Investors, corporate researchers, and corporates can use our data to answer some key business questions such as: - How much are consumers spending with specific merchants/brands and how is that changing over time? - Is the share of consumer spend at a specific merchant increasing or decreasing? - How are consumers reacting to new products or services launched by merchants? - For loyal customers, how is the share of spend changing over time? - What is the company’s market share in a region for similar customers? - Is the company’s loyal user base increasing or decreasing? - Is the lifetime customer value increasing or decreasing?

Additional Use Cases: - Use spending data to analyze sales/revenue broadly (sector-wide) or granular (company-specific). Historically, our tracked consumer spend has correlated above 85% with company-reported data from thousands of firms. Users can sort and filter by many metrics and KPIs, such as sales and transaction growth rates and online or offline transactions, as well as view customer behavior within a geographic market at a state or city level. - Reveal cohort consumer behavior to decipher long-term behavioral consumer spending shifts. Measure market share, wallet share, loyalty, consumer lifetime value, retention, demographics, and more.) - Study the effects of inflation rates via such metrics as increased total spend, ticket size, and number of transactions. - Seek out alpha-generating signals or manage your business strategically with essential, aggregated transaction and spending data analytics.

Use Cases Categories (Our data provides an innumerable amount of use cases, and we look forward to working with new ones): 1. Market Research: Company Analysis, Company Valuation, Competitive Intelligence, Competitor Analysis, Competitor Analytics, Competitor Insights, Customer Data Enrichment, Customer Data Insights, Customer Data Intelligence, Demand Forecasting, Ecommerce Intelligence, Employee Pay Strategy, Employment Analytics, Job Income Analysis, Job Market Pricing, Marketing, Marketing Data Enrichment, Marketing Intelligence, Marketing Strategy, Payment History Analytics, Price Analysis, Pricing Analytics, Retail, Retail Analytics, Retail Intelligence, Retail POS Data Analysis, and Salary Benchmarking

  1. Investment Research: Financial Services, Hedge Funds, Investing, Mergers & Acquisitions (M&A), Stock Picking, Venture Capital (VC)

  2. Consumer Analysis: Consumer Data Enrichment, Consumer Intelligence

  3. Market Data: AnalyticsB2C Data Enrichment, Bank Data Enrichment, Behavioral Analytics, Benchmarking, Customer Insights, Customer Intelligence, Data Enhancement, Data Enrichment, Data Intelligence, Data Modeling, Ecommerce Analysis, Ecommerce Data Enrichment, Economic Analysis, Financial Data Enrichment, Financial Intelligence, Local Economic Forecasting, Location-based Analytics, Market Analysis, Market Analytics, Market Intelligence, Market Potential Analysis, Market Research, Market Share Analysis, Sales, Sales Data Enrichment, Sales Enablement, Sales Insights, Sales Intelligence, Spending Analytics, Stock Market Predictions, and Trend Analysis

Search
Clear search
Close search
Google apps
Main menu