Facebook
TwitterThe previous review in this series introduced the notion of data description and outlined some of the more common summary measures used to describe a dataset. However, a dataset is typically only of interest for the information it provides regarding the population from which it was drawn. The present review focuses on estimation of population values from a sample.
Facebook
TwitterThe StudentData table comes from a survey of students at the University of Southampton. Were collected student data to increase student interest and to obtain real data for use in the introductory statistics course.
TYPE: Survey SIZE: 500 observations, 9 variables
VARIABLE DESCRIPTIONS: Gander - student's gander (binary: 'Female', 'Male') Age - student's age (numeric: from 15 to 22) Address - student's home address type (binary: 'U' - urban or 'R' - rural) Height - student's height (numeric: from 140 to 199) Weight - student's weight (numeric: from 50 to 160) Eye - student's eye color (binary: 'Blue', 'Brown', 'Green') Medu - mother's education (numeric: 0 - none, 1 - primary education (4th grade), 2 – 5th to 9th grade, 3 – secondary education or 4 – higher education) Fedu - father's education (numeric: 0 - none, 1 - primary education (4th grade), 2 – 5th to 9th grade, 3 – secondary education or 4 – higher education) Fsize - family size (binary: 'LE3' - less or equal to 3 or 'GT3' - greater than 3)
If treated as a representative sample from a larger population, this data set can be used to illustrate concepts such as conditional distributions, populations, samples and sampling variability, and tests of independence. Alternatively, considering the data as the population of interest, this example can be used to illustrate probability rules based on selecting a student at random from the population.
Facebook
TwitterIn 2025, around ** percent of the adult population in France intended to follow the Tour de France of that year, representing a slight increase on 2024. Since 2016, public interest in the race has generally hovered around ** percent.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
GENERAL INFORMATION
Title of Dataset: A dataset from a survey investigating disciplinary differences in data citation
Date of data collection: January to March 2022
Collection instrument: SurveyMonkey
Funding: Alfred P. Sloan Foundation
SHARING/ACCESS INFORMATION
Licenses/restrictions placed on the data: These data are available under a CC BY 4.0 license
Links to publications that cite or use the data:
Gregory, K., Ninkov, A., Ripp, C., Peters, I., & Haustein, S. (2022). Surveying practices of data citation and reuse across disciplines. Proceedings of the 26th International Conference on Science and Technology Indicators. International Conference on Science and Technology Indicators, Granada, Spain. https://doi.org/10.5281/ZENODO.6951437
Gregory, K., Ninkov, A., Ripp, C., Roblin, E., Peters, I., & Haustein, S. (2023). Tracing data:
A survey investigating disciplinary differences in data citation. Zenodo. https://doi.org/10.5281/zenodo.7555266
DATA & FILE OVERVIEW
File List
Additional related data collected that was not included in the current data package: Open ended questions asked to respondents
METHODOLOGICAL INFORMATION
Description of methods used for collection/generation of data:
The development of the questionnaire (Gregory et al., 2022) was centered around the creation of two main branches of questions for the primary groups of interest in our study: researchers that reuse data (33 questions in total) and researchers that do not reuse data (16 questions in total). The population of interest for this survey consists of researchers from all disciplines and countries, sampled from the corresponding authors of papers indexed in the Web of Science (WoS) between 2016 and 2020.
Received 3,632 responses, 2,509 of which were completed, representing a completion rate of 68.6%. Incomplete responses were excluded from the dataset. The final total contains 2,492 complete responses and an uncorrected response rate of 1.57%. Controlling for invalid emails, bounced emails and opt-outs (n=5,201) produced a response rate of 1.62%, similar to surveys using comparable recruitment methods (Gregory et al., 2020).
Methods for processing the data:
Results were downloaded from SurveyMonkey in CSV format and were prepared for analysis using Excel and SPSS by recoding ordinal and multiple choice questions and by removing missing values.
Instrument- or software-specific information needed to interpret the data:
The dataset is provided in SPSS format, which requires IBM SPSS Statistics. The dataset is also available in a coded format in CSV. The Codebook is required to interpret to values.
DATA-SPECIFIC INFORMATION FOR: MDCDataCitationReuse2021surveydata
Number of variables: 95
Number of cases/rows: 2,492
Missing data codes: 999 Not asked
Refer to MDCDatacitationReuse2021Codebook.pdf for detailed variable information.
Facebook
TwitterHow does your organization use this dataset? What other NYSERDA or energy-related datasets would you like to see on Open NY? Let us know by emailing OpenNY@nyserda.ny.gov. The Low- to Moderate-Income (LMI) New York State (NYS) Census Population Analysis dataset is resultant from the LMI market database designed by APPRISE as part of the NYSERDA LMI Market Characterization Study (https://www.nyserda.ny.gov/lmi-tool). All data are derived from the U.S. Census Bureau’s American Community Survey (ACS) 1-year Public Use Microdata Sample (PUMS) files for 2013, 2014, and 2015. Each row in the LMI dataset is an individual record for a household that responded to the survey and each column is a variable of interest for analyzing the low- to moderate-income population. The LMI dataset includes: county/county group, households with elderly, households with children, economic development region, income groups, percent of poverty level, low- to moderate-income groups, household type, non-elderly disabled indicator, race/ethnicity, linguistic isolation, housing unit type, owner-renter status, main heating fuel type, home energy payment method, housing vintage, LMI study region, LMI population segment, mortgage indicator, time in home, head of household education level, head of household age, and household weight. The LMI NYS Census Population Analysis dataset is intended for users who want to explore the underlying data that supports the LMI Analysis Tool. The majority of those interested in LMI statistics and generating custom charts should use the interactive LMI Analysis Tool at https://www.nyserda.ny.gov/lmi-tool. This underlying LMI dataset is intended for users with experience working with survey data files and producing weighted survey estimates using statistical software packages (such as SAS, SPSS, or Stata).
Facebook
TwitterIn 2023, there were around **** million people in the German-speaking population, aged 14 years and older, who were particularly interested in politics. Around **** million people saw themselves as somewhat of an expert and often gave advice on political topics. In contrast, around ** percent stated that they were barely interested in politics at all. However, in total, almost half of the respondents were somewhat interested in politics. The Allensbach Market and Advertising Media Analysis (Allensbacher Markt- und Werbeträgeranalyse or AWA in German) determines attitudes, consumer habits and media usage of the population in Germany on a broad statistical basis.
Facebook
TwitterA fundamental aspect of ecology is identifying and characterizing population processes. In population studies, we almost always use sampling to make inference about the biological population, which is the population of interest, and the part of the population at risk of sampling is referred to as the statistical population. Ideally, the statistical population is the same as, or accurately represents its corresponding biological population. However, in practice, they rarely align in space and time, which can lead to biased inference. We often view a population misalignment as a temporary emigration process and resolve it with replicate and/or repeat sampling, though this approach is not feasible for all species and habitats. We developed a hierarchical framework to estimate abundance of a biological population of the Kittlitz’s murrelet (Brachyramphus brevirostris), a highly mobile, non-territorial, ice-associated seabird of conservation concern in Alaska and eastern Russia. Our model co..., These data used to estimate abundance and trend of Kittlitz's murrelets were collected in Icy Bay, Alaska, USA between 2005 and 2012. Boat survey data were gathered along line transects using distance sampling, and telemetry data were collected by flying aerial telemetry flights to locate radio-tagged murrelets during the breeding season. Methods are summarized in numerous publications, including the current manuscript in review at Peer Community In Ecology and several others described in the README file., , # Survey data and code to estimate abundance of Brachyramphus murrelets, Icy Bay, Alaska, USA
These data were analyzed and summarized in a manuscript submitted to PCI Ecology as part of a special issue for the EURING Analytical Conference Proceedings, April 2023. The title of the manuscript is “Using multiple datasets to account for misalignment between statistical and biological populations for abundance estimation.â€
File structure is as follows:
DataCode_BRMU_KISSLING_et_al
|- Data | |- AbundanceResults | |- BoatSurveys | |- TelemetrySurveys |- Code | |- JAGS | | |- AbundanceEstimation | | |- ProbPresenceEstimation | | |- TrendEstimation | |- R | | |- AbundanceEstimation | | |- ProbPresenceEstimation | | |- TrendEstimation |- SupplementalInformation
All raw data files are included in the folder "Data." Boat and telemetry survey data were collected between 2005 and 2012, using methods descr...
Facebook
TwitterThis dataset, named "state_trends.csv," contains information about different U.S. states. Let's break down the attributes and understand what each column represents:
In summary, this dataset provides a variety of information about U.S. states, including demographic data, geographical region, psychological region, personality traits, and scores related to interests or proficiencies in various fields such as data science, art, and sports.
Facebook
TwitterSourcing accurate and up-to-date map data across Asia and MENA has historically been difficult for retail brands looking to expand their store networks in these regions. Either the data does not exist or it isn't readily accessible or updated regularly.
GapMaps Map Data uses known population data combined with billions of mobile device location points to provide highly accurate and globally consistent demographics data across Asia and MENA at 150m x 150m grid levels in major cities and 1km grids outside of major cities.
GapMaps Map Data also includes the latest Point-of-Interest (POI) Data for leading retail brands across a range of categories including Fast Food/ QSR, Health & Fitness, Supermarket/Grocery and Cafe sectors which is updated monthly.
With this information, brands can get a detailed understanding of who lives in a catchment, where they work and their spending potential which allows you to:
GapMaps Map Data for Asia and MENA can be utilized in any GIS platform and includes the latest estimates (updated annually) on:
Primary Use Cases for GapMaps Map Data:
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Explore the richness of Montenegro’s demographic landscape with our Montenegro Census Data for 2023. This meticulously compiled dataset, sourced from official statistical sources, including MONSTAT, offers a detailed insight into the population landscape of Montenegro at the municipality (општине) level.
If you require data at a less granular level, such as an Uber grid or square grid, please don't hesitate to get in touch with us at contact@geolocet.com. We understand that different projects may have varying data needs, and we are here to tailor our data to match your specific requirements.
Population Data Montenegro: Gain access to the latest and most accurate population statistics for Montenegro, meticulously organized by municipality (општине).
Demographic Insights: Dive deep into Montenegro’s demographic landscape with data on 22 attributes, including total population, age, and ***gender ***distributions.
All demographic attributes are available at the municipal (општине) level. - Seamless Data Access: Our data is available for instant download in CSV format. No lengthy sign-up processes or NDAs required on our side – just download the data you need. - Data accuracy: Our datasets are created using the latest demographics data available in national statistics bureaus, guaranteeing accuracy and reliability.
Additionally, for those looking to enrich their analysis or project, we offer: - Administrative Boundaries: If you require geographical boundaries (polygons) for the municipalities of Montenegro, consider exploring our Administrative Boundaries product, which seamlessly integrates with this dataset.
Point of Interest (PoI) Data: If you're interested in Points of Interest data, please don't hesitate to contact us for customized datasets to enhance your geographical analysis.
Data Customization: We understand that your project may have specific requirements. Feel free to get in touch with us if you need any modifications or specific data formats such as Excel, JSON, GeoJSON, or Shapefile. We are here to tailor the data to your needs.
Make informed decisions, conduct precise research, and unlock the potential of the Montenegro demographic landscape with our comprehensive dataset. Trust in the accuracy and reliability of our official statistical data to drive your projects forward. Contact us at contact@geolocet.com today to access this invaluable resource and explore Montenegro demographics data like never before.
Facebook
TwitterIn 2024, around **** million people in Germany were somewhat interested in photography. This statistic shows how many people in Germany were interested in photography from 2019 to 2024.The Allensbach Market and Advertising Media Analysis (Allensbacher Markt- und Werbeträgeranalyse or AWA in German) determines attitudes, consumer habits and media usage of the population in Germany on a broad statistical basis.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A quandary exists when comparing trend lines of Google Trends query data among different countries. This approach provides directionality and speed of change, but it does not account for the quantity of movement occurring when comparing large regions to small ones. This study applies the physical concept of momentum to the analysis of Google Trends results to provide a method for comparing trends among countries. By accounting for the volume of interest along with the direction and rate of interest gain/loss, one is able to make accurate quantitative statements about how the public in differently sized regions may shift interests and opinion on different issues. Momentum allows us to identify how countries have responded and how they may respond in the future without the erroneous assumption that the behaviors of large and small populations are equally flexible and responsive to new ideas.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This data collection contains de-identified clinical health service utilisation data from Bendigo Health and the General Practitioners Practices associated with the Loddon Mallee Murray Medicare Local. The collection also includes associated population health data from the ABS, AIHW and the Municipal Health Plans. Health researchers have a major interest in how clinical data can be used to monitor population health and health care in rural and regional Australia through analysing a broad range of factors shown to impact the health of different populations. The Population Health data collection provides students, managers, clinicians and researchers the opportunity to use clinical data in the study of population health, including the analysis of health risk factors, disease trends and health care utilisation and outcomes.Temporal range (data time period):2004 to 2014Spatial coverage:Bendigo Latitude -36.758711200000010000, Bendigo Longitude 144.283745899999990000
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Users can download data or view data tables on topics related to the labor force of the United States. Background Current Population Survey is a joint effort between the Bureau of Labor Statistics and the Census Bureau. It provides information and data on the labor force of the United States, such as: employment, unemployment, earnings, hours of work, school enrollment, health, employee benefits and income. The CPS is conducted monthly and has a sample of approximately 50,000 households. It is representative of the non-institutionalized US population. The sample provides estimates for the nation as a whole and serves as part of model-based estimates for individual states and other geographic areas. User Functionality Users can download data sets or view data tables on their topic of interest. Data can be organized by a variety of demographic variables, including: sex, age, race, marital status and educational attainment. Data is available on a national or state level. Data Notes The CPS is conducted monthly and has a sample of approximately 50,000 households. It is representative of the non-institutionalized US population. The sample provides estimates for th e nation as a whole and serves as part of model-based estimates for individual states and other geographic areas.
Facebook
TwitterApex carnivores are wide-ranging, low-density, hard to detect, and declining throughout most of their range, making population monitoring both critical and challenging. Rapid and inexpensive index calibration survey (ICS) methods have been developed to monitor large African carnivores. ICS methods assume constant detection probability and a predictable relationship between the index and the actual population of interest. The precision and utility of the resulting estimates from ICS methods have been questioned. We assessed the performance of one ICS method for large carnivores - track counts - with data from two long-term studies of African lion populations. We conducted Monte Carlo simulation of intersections between transects (road segments) and lion movement paths (from GPS collar data) at varying survey intensities. Then, using the track count method we estimated population size and its confidence limits.
We found that estimates either overstate precision or are too imprecise to ...
Facebook
TwitterIn 2025, there were around ***** million people in the German-speaking population aged 14 and over who were particularly interested in skin and body care. This was slightly less than in the year before, while the number of people who are hardly or not at all interested has increased compared to 2024. The Allensbach Market and Advertising Media Analysis, or AWA for short, determines the attitudes, consumer habits and media use of the German population on a broad statistical basis.
Facebook
TwitterOur People data is gathered and aggregated via surveys, digital services, and public data sources. We use powerful profiling algorithms to collect and ingest only fresh and reliable data points.
Our comprehensive data enrichment solution includes a variety of data sets that can help you address gaps in your customer data, gain a deeper understanding of your customers, and power superior client experiences.
People Data Schema & Reach: Our data reach represents the total number of counts available within various categories and comprises attributes such as country location, MAU, DAU & Monthly Location Pings:
Data Export Methodology: Since we collect data dynamically, we provide the most updated data and insights via a best-suited method on a suitable interval (daily/weekly/monthly).
People Data Use Cases:
360-Degree Customer View: Get a comprehensive image of customers by the means of internal and external data aggregation.
Data Enrichment: Leverage Online to offline consumer profiles to build holistic audience segments to improve campaign targeting using user data enrichment
Fraud Detection: Use multiple digital (web and mobile) identities to verify real users and detect anomalies or fraudulent activity.
Advertising & Marketing: Understand audience demographics, interests, lifestyle, hobbies, and behaviors to build targeted marketing campaigns.
Using Factori People Data you can solve use cases like:
Acquisition Marketing Expand your reach to new users and customers using lookalike modeling with your first party audiences to extend to other potential consumers with similar traits and attributes.
Lookalike Modeling
Build lookalike audience segments using your first party audiences as a seed to extend your reach for running marketing campaigns to acquire new users or customers
And also, CRM Data Enrichment, Consumer Data Enrichment B2B Data Enrichment B2C Data Enrichment Customer Acquisition Audience Segmentation 360-Degree Customer View Consumer Profiling Consumer Behaviour Data
Here's the schema of People Data:
person_id
first_name
last_name
age
gender
linkedin_url
twitter_url
facebook_url
city
state
address
zip
zip4
country
delivery_point_bar_code
carrier_route
walk_seuqence_code
fips_state_code
fips_country_code
country_name
latitude
longtiude
address_type
metropolitan_statistical_area
core_based+statistical_area
census_tract
census_block_group
census_block
primary_address
pre_address
streer
post_address
address_suffix
address_secondline
address_abrev
census_median_home_value
home_market_value
property_build+year
property_with_ac
property_with_pool
property_with_water
property_with_sewer
general_home_value
property_fuel_type
year
month
household_id
Census_median_household_income
household_size
marital_status
length+of_residence
number_of_kids
pre_school_kids
single_parents
working_women_in_house_hold
homeowner
children
adults
generations
net_worth
education_level
occupation
education_history
credit_lines
credit_card_user
newly_issued_credit_card_user
credit_range_new
credit_cards
loan_to_value
mortgage_loan2_amount
mortgage_loan_type
mortgage_loan2_type
mortgage_lender_code
mortgage_loan2_render_code
mortgage_lender
mortgage_loan2_lender
mortgage_loan2_ratetype
mortgage_rate
mortgage_loan2_rate
donor
investor
interest
buyer
hobby
personal_email
work_email
devices
phone
employee_title
employee_department
employee_job_function
skills
recent_job_change
company_id
company_name
company_description
technologies_used
office_address
office_city
office_country
office_state
office_zip5
office_zip4
office_carrier_route
office_latitude
office_longitude
office_cbsa_code
office_census_block_group
office_census_tract
office_county_code
company_phone
company_credit_score
company_csa_code
company_dpbc
company_franchiseflag
company_facebookurl
company_linkedinurl
company_twitterurl
company_website
company_fortune_rank
company_government_type
company_headquarters_branch
company_home_business
company_industry
company_num_pcs_used
company_num_employees
company_firm_individual
company_msa
company_msa_name
company_naics_code
company_naics_description
company_naics_code2
company_naics_description2
company_sic_code2
company_sic_code2_description
company_sic...
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The Low- to Moderate-Income (LMI) New York State (NYS) Census Population Analysis dataset is based upon research conducted for a forthcoming Low-Income Energy Bill and Usage Study, which will be published in 2026. The data primarily came from the U.S. Census Bureau’s American Community Survey (ACS) 2018-2022 data. Each row in the LMI dataset is an individual record for a household that responded to the survey and each column is a variable of interest for analyzing the low- to moderate-income population. The LMI NYS Census Population Analysis dataset is intended for users who want to explore the underlying data that supports the New York State Low- to Moderate-Income Census Population Analysis Tool (2018-2022). The majority of those interested in LMI statistics and generating custom charts should use the interactive LMI Census Population Tool at https://www.nyserda.ny.gov/About/Publications/Evaluation-Reports/Low--to-Moderate-Income/LMI-Census-Population-Tool. This underlying LMI dataset is intended for users with experience working with survey data files and producing weighted survey estimates using statistical software packages (such as SAS, SPSS, or Stata).
Facebook
Twitterhttps://borealisdata.ca/api/datasets/:persistentId/versions/3.1/customlicense?persistentId=doi:10.5683/SP3/9TET2Thttps://borealisdata.ca/api/datasets/:persistentId/versions/3.1/customlicense?persistentId=doi:10.5683/SP3/9TET2T
This new product will present data for specific census topics and population groups according to selected demographic, cultural, and socio-economic characteristics. These detailed 'profile-type' tables expand the analytical depth of basic census information. Special interest profiles include: ethnic groups, Aboriginal peoples, occupation, industry, and place of work.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The National Population Database (NPD) is a point-based Geographical Information System (GIS) dataset that combines locational information from providers like the Ordnance Survey with population information about those locations, mainly sourced from Government statistics. The points (and sometimes polygons) represent individual buildings, so the NPD allows detailed local analysis for anywhere in Great Britain. The Health & Safety Laboratory (HSL) working with Staffordshire University originally created the NPD in 2004 to help its parent organisation, the Health and Safety Executive (HSE), assess the risks to society of major hazard sites e.g. oil refineries, chemical works and gas holders. Of particular interest to HSE were 'sensitive' populations e.g. schools and hospitals where the people at those locations may be more vulnerable to harm and potentially harder to evacuate in an emergency. The data is split into 5 themes: residential, sensitive populations, transport, workplaces and leisure. More information about the NPD can be found here: https://www.hsl.gov.uk/what-we-do/better-decisions/geoanalytics/national-population-database The NPD was created using various datasets available within Government as part of the Public Sector Mapping Agreement (PSMA) and contains other intellectual property so is only available under license and for a fee. Please contact the HSL GIS Team if you would like to discuss gaining access to the sample or full dataset.
Facebook
TwitterThe previous review in this series introduced the notion of data description and outlined some of the more common summary measures used to describe a dataset. However, a dataset is typically only of interest for the information it provides regarding the population from which it was drawn. The present review focuses on estimation of population values from a sample.