Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
analyze the current population survey (cps) annual social and economic supplement (asec) with r the annual march cps-asec has been supplying the statistics for the census bureau's report on income, poverty, and health insurance coverage since 1948. wow. the us census bureau and the bureau of labor statistics ( bls) tag-team on this one. until the american community survey (acs) hit the scene in the early aughts (2000s), the current population survey had the largest sample size of all the annual general demographic data sets outside of the decennial census - about two hundred thousand respondents. this provides enough sample to conduct state- and a few large metro area-level analyses. your sample size will vanish if you start investigating subgroups b y state - consider pooling multiple years. county-level is a no-no. despite the american community survey's larger size, the cps-asec contains many more variables related to employment, sources of income, and insurance - and can be trended back to harry truman's presidency. aside from questions specifically asked about an annual experience (like income), many of the questions in this march data set should be t reated as point-in-time statistics. cps-asec generalizes to the united states non-institutional, non-active duty military population. the national bureau of economic research (nber) provides sas, spss, and stata importation scripts to create a rectangular file (rectangular data means only person-level records; household- and family-level information gets attached to each person). to import these files into r, the parse.SAScii function uses nber's sas code to determine how to import the fixed-width file, then RSQLite to put everything into a schnazzy database. you can try reading through the nber march 2012 sas importation code yourself, but it's a bit of a proc freak show. this new github repository contains three scripts: 2005-2012 asec - download all microdata.R down load the fixed-width file containing household, family, and person records import by separating this file into three tables, then merge 'em together at the person-level download the fixed-width file containing the person-level replicate weights merge the rectangular person-level file with the replicate weights, then store it in a sql database create a new variable - one - in the data table 2012 asec - analysis examples.R connect to the sql database created by the 'download all microdata' progr am create the complex sample survey object, using the replicate weights perform a boatload of analysis examples replicate census estimates - 2011.R connect to the sql database created by the 'download all microdata' program create the complex sample survey object, using the replicate weights match the sas output shown in the png file below 2011 asec replicate weight sas output.png statistic and standard error generated from the replicate-weighted example sas script contained in this census-provided person replicate weights usage instructions document. click here to view these three scripts for more detail about the current population survey - annual social and economic supplement (cps-asec), visit: the census bureau's current population survey page the bureau of labor statistics' current population survey page the current population survey's wikipedia article notes: interviews are conducted in march about experiences during the previous year. the file labeled 2012 includes information (income, work experience, health insurance) pertaining to 2011. when you use the current populat ion survey to talk about america, subract a year from the data file name. as of the 2010 file (the interview focusing on america during 2009), the cps-asec contains exciting new medical out-of-pocket spending variables most useful for supplemental (medical spending-adjusted) poverty research. confidential to sas, spss, stata, sudaan users: why are you still rubbing two sticks together after we've invented the butane lighter? time to transition to r. :D
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Association of demographic variables with opinions and attitudes.
Facebook
TwitterThe harmonized data set on health, created and published by the ERF, is a subset of Iraq Household Socio Economic Survey (IHSES) 2012. It was derived from the household, individual and health modules, collected in the context of the above mentioned survey. The sample was then used to create a harmonized health survey, comparable with the Iraq Household Socio Economic Survey (IHSES) 2007 micro data set.
----> Overview of the Iraq Household Socio Economic Survey (IHSES) 2012:
Iraq is considered a leader in household expenditure and income surveys where the first was conducted in 1946 followed by surveys in 1954 and 1961. After the establishment of Central Statistical Organization, household expenditure and income surveys were carried out every 3-5 years in (1971/ 1972, 1976, 1979, 1984/ 1985, 1988, 1993, 2002 / 2007). Implementing the cooperation between CSO and WB, Central Statistical Organization (CSO) and Kurdistan Region Statistics Office (KRSO) launched fieldwork on IHSES on 1/1/2012. The survey was carried out over a full year covering all governorates including those in Kurdistan Region.
The survey has six main objectives. These objectives are:
The raw survey data provided by the Statistical Office were then harmonized by the Economic Research Forum, to create a comparable version with the 2006/2007 Household Socio Economic Survey in Iraq. Harmonization at this stage only included unifying variables' names, labels and some definitions. See: Iraq 2007 & 2012- Variables Mapping & Availability Matrix.pdf provided in the external resources for further information on the mapping of the original variables on the harmonized ones, in addition to more indications on the variables' availability in both survey years and relevant comments.
National coverage: Covering a sample of urban, rural and metropolitan areas in all the governorates including those in Kurdistan Region.
1- Household/family. 2- Individual/person.
The survey was carried out over a full year covering all governorates including those in Kurdistan Region.
Sample survey data [ssd]
----> Design:
Sample size was (25488) household for the whole Iraq, 216 households for each district of 118 districts, 2832 clusters each of which includes 9 households distributed on districts and governorates for rural and urban.
----> Sample frame:
Listing and numbering results of 2009-2010 Population and Housing Survey were adopted in all the governorates including Kurdistan Region as a frame to select households, the sample was selected in two stages: Stage 1: Primary sampling unit (blocks) within each stratum (district) for urban and rural were systematically selected with probability proportional to size to reach 2832 units (cluster). Stage two: 9 households from each primary sampling unit were selected to create a cluster, thus the sample size of total survey clusters was 25488 households distributed on the governorates, 216 households in each district.
----> Sampling Stages:
In each district, the sample was selected in two stages: Stage 1: based on 2010 listing and numbering frame 24 sample points were selected within each stratum through systematic sampling with probability proportional to size, in addition to the implicit breakdown urban and rural and geographic breakdown (sub-district, quarter, street, county, village and block). Stage 2: Using households as secondary sampling units, 9 households were selected from each sample point using systematic equal probability sampling. Sampling frames of each stages can be developed based on 2010 building listing and numbering without updating household lists. In some small districts, random selection processes of primary sampling may lead to select less than 24 units therefore a sampling unit is selected more than once , the selection may reach two cluster or more from the same enumeration unit when it is necessary.
Face-to-face [f2f]
----> Preparation:
The questionnaire of 2006 survey was adopted in designing the questionnaire of 2012 survey on which many revisions were made. Two rounds of pre-test were carried out. Revision were made based on the feedback of field work team, World Bank consultants and others, other revisions were made before final version was implemented in a pilot survey in September 2011. After the pilot survey implemented, other revisions were made in based on the challenges and feedbacks emerged during the implementation to implement the final version in the actual survey.
----> Questionnaire Parts:
The questionnaire consists of four parts each with several sections: Part 1: Socio – Economic Data: - Section 1: Household Roster - Section 2: Emigration - Section 3: Food Rations - Section 4: housing - Section 5: education - Section 6: health - Section 7: Physical measurements - Section 8: job seeking and previous job
Part 2: Monthly, Quarterly and Annual Expenditures: - Section 9: Expenditures on Non – Food Commodities and Services (past 30 days). - Section 10 : Expenditures on Non – Food Commodities and Services (past 90 days). - Section 11: Expenditures on Non – Food Commodities and Services (past 12 months). - Section 12: Expenditures on Non-food Frequent Food Stuff and Commodities (7 days). - Section 12, Table 1: Meals Had Within the Residential Unit. - Section 12, table 2: Number of Persons Participate in the Meals within Household Expenditure Other Than its Members.
Part 3: Income and Other Data: - Section 13: Job - Section 14: paid jobs - Section 15: Agriculture, forestry and fishing - Section 16: Household non – agricultural projects - Section 17: Income from ownership and transfers - Section 18: Durable goods - Section 19: Loans, advances and subsidies - Section 20: Shocks and strategy of dealing in the households - Section 21: Time use - Section 22: Justice - Section 23: Satisfaction in life - Section 24: Food consumption during past 7 days
Part 4: Diary of Daily Expenditures: Diary of expenditure is an essential component of this survey. It is left at the household to record all the daily purchases such as expenditures on food and frequent non-food items such as gasoline, newspapers…etc. during 7 days. Two pages were allocated for recording the expenditures of each day, thus the roster will be consists of 14 pages.
----> Raw Data:
Data Editing and Processing: To ensure accuracy and consistency, the data were edited at the following stages: 1. Interviewer: Checks all answers on the household questionnaire, confirming that they are clear and correct. 2. Local Supervisor: Checks to make sure that questions has been correctly completed. 3. Statistical analysis: After exporting data files from excel to SPSS, the Statistical Analysis Unit uses program commands to identify irregular or non-logical values in addition to auditing some variables. 4. World Bank consultants in coordination with the CSO data management team: the World Bank technical consultants use additional programs in SPSS and STAT to examine and correct remaining inconsistencies within the data files. The software detects errors by analyzing questionnaire items according to the expected parameter for each variable.
----> Harmonized Data:
Iraq Household Socio Economic Survey (IHSES) reached a total of 25488 households. Number of households refused to response was 305, response rate was 98.6%. The highest interview rates were in Ninevah and Muthanna (100%) while the lowest rates were in Sulaimaniya (92%).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Open Science in (Higher) Education – data of the February 2017 survey
This data set contains:
Survey structure
The survey includes 24 questions and its structure can be separated in five major themes: material used in courses (5), OER awareness, usage and development (6), collaborative tools used in courses (2), assessment and participation options (5), demographics (4). The last two questions include an open text questions about general issues on the topics and singular open education experiences, and a request on forwarding the respondent’s e-mail address for further questionings. The online survey was created with Limesurvey[1]. Several questions include filters, i.e. these questions were only shown if a participants did choose a specific answer beforehand ([n/a] in Excel file, [.] In SPSS).
Demographic questions
Demographic questions asked about the current position, the discipline, birth year and gender. The classification of research disciplines was adapted to general disciplines at German higher education institutions. As we wanted to have a broad classification, we summarised several disciplines and came up with the following list, including the option “other” for respondents who do not feel confident with the proposed classification:
The current job position classification was also chosen according to common positions in Germany, including positions with a teaching responsibility at higher education institutions. Here, we also included the option “other” for respondents who do not feel confident with the proposed classification:
We chose to have a free text (numerical) for asking about a respondent’s year of birth because we did not want to pre-classify respondents’ age intervals. It leaves us options to have different analysis on answers and possible correlations to the respondents’ age. Asking about the country was left out as the survey was designed for academics in Germany.
Remark on OER question
Data from earlier surveys revealed that academics suffer confusion about the proper definition of OER[2]. Some seem to understand OER as free resources, or only refer to open source software (Allen & Seaman, 2016, p. 11). Allen and Seaman (2016) decided to give a broad explanation of OER, avoiding details to not tempt the participant to claim “aware”. Thus, there is a danger of having a bias when giving an explanation. We decided not to give an explanation, but keep this question simple. We assume that either someone knows about OER or not. If they had not heard of the term before, they do not probably use OER (at least not consciously) or create them.
Data collection
The target group of the survey was academics at German institutions of higher education, mainly universities and universities of applied sciences. To reach them we sent the survey to diverse institutional-intern and extern mailing lists and via personal contacts. Included lists were discipline-based lists, lists deriving from higher education and higher education didactic communities as well as lists from open science and OER communities. Additionally, personal e-mails were sent to presidents and contact persons from those communities, and Twitter was used to spread the survey.
The survey was online from Feb 6th to March 3rd 2017, e-mails were mainly sent at the beginning and around mid-term.
Data clearance
We got 360 responses, whereof Limesurvey counted 208 completes and 152 incompletes. Two responses were marked as incomplete, but after checking them turned out to be complete, and we added them to the complete responses dataset. Thus, this data set includes 210 complete responses. From those 150 incomplete responses, 58 respondents did not answer 1st question, 40 respondents discontinued after 1st question. Data shows a constant decline in response answers, we did not detect any striking survey question with a high dropout rate. We deleted incomplete responses and they are not in this data set.
Due to data privacy reasons, we deleted seven variables automatically assigned by Limesurvey: submitdate, lastpage, startlanguage, startdate, datestamp, ipaddr, refurl. We also deleted answers to question No 24 (email address).
References
Allen, E., & Seaman, J. (2016). Opening the Textbook: Educational Resources in U.S. Higher Education, 2015-16.
First results of the survey are presented in the poster:
Heck, Tamara, Blümel, Ina, Heller, Lambert, Mazarakis, Athanasios, Peters, Isabella, Scherp, Ansgar, & Weisel, Luzian. (2017). Survey: Open Science in Higher Education. Zenodo. http://doi.org/10.5281/zenodo.400561
Contact:
Open Science in (Higher) Education working group, see http://www.leibniz-science20.de/forschung/projekte/laufende-projekte/open-science-in-higher-education/.
[1] https://www.limesurvey.org
[2] The survey question about the awareness of OER gave a broad explanation, avoiding details to not tempt the participant to claim “aware”.
Facebook
Twitterhttps://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
The data relates to the paper that analyses the determinants or factors that best explain student research skills and success in the honours research report module during the COVID-19 pandemic in 2021. The data used have been gathered through an online survey created on the Qualtrics software package. The research questions were developed from demographic factors and subject knowledge including assignments to supervisor influence and other factors in terms of experience or belonging that played a role (see anonymous link at https://unisa.qualtrics.com/jfe/form/SV_86OZZOdyA5sBurY. An SMS was sent to all students of the 2021 module group to make them aware of the survey. They were under no obligation to complete it and all information was regarded as anonymous. We received 39 responses. The raw data from the survey was processed through the SPSS statistical, software package. The data file contains the demographics, frequencies, descriptives, and open questions processed.
The study reported in this paper employed the mixed methods approach comprising a quantitative and qualitative analysis. The quantitative and econometric analysis of the dependent variable, namely, the final marks for the research report and the independent variables that explain it. The results show significance in terms of the assignments and existing knowledge marks in terms of their bachelor’s average mark. We extended the analysis to a qualitative and quantitative survey, which indicated that the mean statistical feedback was above average and therefore strongly agreed/agreed except for library use by the student. Students, therefore, need more guidance in terms of library use and the open questions showed a need for a research methods course in the future. Furthermore, supervision tends to be a significant determinant in all cases. It is also here where supervisors can use social media instruments such as WhatsApp and Facebook to inform students further. This study contributes as the first to investigate the preparation and research skills of students for master's and doctoral studies during the COVID-19 pandemic in an online environment.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Although South Africa is the global epicenter of the HIV epidemic, the uptake of HIV testing and treatment among young people remains low. Concerns about confidentiality impede the utilization of HIV prevention services, which signals the need for discrete HIV prevention measures that leverage youth-friendly platforms. This paper describes the process of developing a youth-friendly internet-enabled HIV risk calculator in collaboration with young people, including young key populations aged between 18 and 24 years old. Using qualitative research, we conducted an exploratory study with 40 young people including young key population (lesbian, gay, bisexual, transgender (LGBT) individuals, men who have sex with men (MSM), and female sex workers). Eligible participants were young people aged between 18–24 years old and living in Soweto. Data was collected through two peer group discussions with young people aged 18–24 years, a once-off group discussion with the [Name of clinic removed for confidentiality] adolescent community advisory board members and once off face-to-face in-depth interviews with young key population groups: LGBT individuals, MSM, and female sex workers. LGBT individuals are identified as key populations because they face increased vulnerability to HIV/AIDS and other health risks due to societal stigma, discrimination, and obstacles in accessing healthcare and support services. The measures used to collect data included a socio-demographic questionnaire, a questionnaire on mobile phone usage, an HIV and STI risk assessment questionnaire, and a semi-structured interview guide. Framework analysis was used to analyse qualitative data through a qualitative data analysis software called NVivo. Descriptive statistics were summarized using SPSS for participant socio-demographics and mobile phone usage. Of the 40 enrolled participants, 58% were male, the median age was 20 (interquartile range 19–22.75), and 86% had access to the internet. Participants’ recommendations were considered in developing the HIV risk calculator. They indicated a preference for an easy-to-use, interactive, real-time assessment offering discrete and private means to self-assess HIV risk. In addition to providing feedback on the language and wording of the risk assessment tool, participants recommended creating a colorful, interactive and informational app. A collaborative and user-driven process is crucial for designing and developing HIV prevention tools for targeted groups. Participants emphasized that privacy, confidentiality, and ease of use contribute to the acceptability and willingness to use internet-enabled HIV prevention methods.
Facebook
TwitterThe main objective of the HEIS survey is to obtain detailed data on household expenditure and income, linked to various demographic and socio-economic variables, to enable computation of poverty indices and determine the characteristics of the poor and prepare poverty maps. Therefore, to achieve these goals, the sample had to be representative on the sub-district level. The raw survey data provided by the Statistical Office was cleaned and harmonized by the Economic Research Forum, in the context of a major research project to develop and expand knowledge on equity and inequality in the Arab region. The main focus of the project is to measure the magnitude and direction of change in inequality and to understand the complex contributing social, political and economic forces influencing its levels. However, the measurement and analysis of the magnitude and direction of change in this inequality cannot be consistently carried out without harmonized and comparable micro-level data on income and expenditures. Therefore, one important component of this research project is securing and harmonizing household surveys from as many countries in the region as possible, adhering to international statistics on household living standards distribution. Once the dataset has been compiled, the Economic Research Forum makes it available, subject to confidentiality agreements, to all researchers and institutions concerned with data collection and issues of inequality.
Data collected through the survey helped in achieving the following objectives: 1. Provide data weights that reflect the relative importance of consumer expenditure items used in the preparation of the consumer price index 2. Study the consumer expenditure pattern prevailing in the society and the impact of demographic and socio-economic variables on those patterns 3. Calculate the average annual income of the household and the individual, and assess the relationship between income and different economic and social factors, such as profession and educational level of the head of the household and other indicators 4. Study the distribution of individuals and households by income and expenditure categories and analyze the factors associated with it 5. Provide the necessary data for the national accounts related to overall consumption and income of the household sector 6. Provide the necessary income data to serve in calculating poverty indices and identifying the poor characteristics as well as drawing poverty maps 7. Provide the data necessary for the formulation, follow-up and evaluation of economic and social development programs, including those addressed to eradicate poverty
National
Sample survey data [ssd]
The Household Expenditure and Income survey sample for 2010, was designed to serve the basic objectives of the survey through providing a relatively large sample in each sub-district to enable drawing a poverty map in Jordan. The General Census of Population and Housing in 2004 provided a detailed framework for housing and households for different administrative levels in the country. Jordan is administratively divided into 12 governorates, each governorate is composed of a number of districts, each district (Liwa) includes one or more sub-district (Qada). In each sub-district, there are a number of communities (cities and villages). Each community was divided into a number of blocks. Where in each block, the number of houses ranged between 60 and 100 houses. Nomads, persons living in collective dwellings such as hotels, hospitals and prison were excluded from the survey framework.
A two stage stratified cluster sampling technique was used. In the first stage, a cluster sample proportional to the size was uniformly selected, where the number of households in each cluster was considered the weight of the cluster. At the second stage, a sample of 8 households was selected from each cluster, in addition to another 4 households selected as a backup for the basic sample, using a systematic sampling technique. Those 4 households were sampled to be used during the first visit to the block in case the visit to the original household selected is not possible for any reason. For the purposes of this survey, each sub-district was considered a separate stratum to ensure the possibility of producing results on the sub-district level. In this respect, the survey framework adopted that provided by the General Census of Population and Housing Census in dividing the sample strata. To estimate the sample size, the coefficient of variation and the design effect of the expenditure variable provided in the Household Expenditure and Income Survey for the year 2008 was calculated for each sub-district. These results were used to estimate the sample size on the sub-district level so that the coefficient of variation for the expenditure variable in each sub-district is less than 10%, at a minimum, of the number of clusters in the same sub-district (6 clusters). This is to ensure adequate presentation of clusters in different administrative areas to enable drawing an indicative poverty map.
It should be noted that in addition to the standard non response rate assumed, higher rates were expected in areas where poor households are concentrated in major cities. Therefore, those were taken into consideration during the sampling design phase, and a higher number of households were selected from those areas, aiming at well covering all regions where poverty spreads.
Face-to-face [f2f]
Raw Data: - Organizing forms/questionnaires: A compatible archive system was used to classify the forms according to different rounds throughout the year. A registry was prepared to indicate different stages of the process of data checking, coding and entry till forms were back to the archive system. - Data office checking: This phase was achieved concurrently with the data collection phase in the field where questionnaires completed in the field were immediately sent to data office checking phase. - Data coding: A team was trained to work on the data coding phase, which in this survey is only limited to education specialization, profession and economic activity. In this respect, international classifications were used, while for the rest of the questions, coding was predefined during the design phase. - Data entry/validation: A team consisting of system analysts, programmers and data entry personnel were working on the data at this stage. System analysts and programmers started by identifying the survey framework and questionnaire fields to help build computerized data entry forms. A set of validation rules were added to the entry form to ensure accuracy of data entered. A team was then trained to complete the data entry process. Forms prepared for data entry were provided by the archive department to ensure forms are correctly extracted and put back in the archive system. A data validation process was run on the data to ensure the data entered is free of errors. - Results tabulation and dissemination: After the completion of all data processing operations, ORACLE was used to tabulate the survey final results. Those results were further checked using similar outputs from SPSS to ensure that tabulations produced were correct. A check was also run on each table to guarantee consistency of figures presented, together with required editing for tables' titles and report formatting.
Harmonized Data: - The Statistical Package for Social Science (SPSS) was used to clean and harmonize the datasets. - The harmonization process started with cleaning all raw data files received from the Statistical Office. - Cleaned data files were then merged to produce one data file on the individual level containing all variables subject to harmonization. - A country-specific program was generated for each dataset to generate/compute/recode/rename/format/label harmonized variables. - A post-harmonization cleaning process was run on the data. - Harmonized data was saved on the household as well as the individual level, in SPSS and converted to STATA format.
Facebook
TwitterTHE CLEANED AND HARMONIZED VERSION OF THE SURVEY DATA PRODUCED AND PUBLISHED BY THE ECONOMIC RESEARCH FORUM REPRESENTS 100% OF THE ORIGINAL SURVEY DATA COLLECTED BY THE NATIONAL INSTITUTE OF STATISTICS (INS) - TUNISIA
The survey aims at estimating the demographic and educational characteristics of the population. It also calculates the economic indicators of the population such as the number of active individuals, the additional demand for jobs, the number of employed and their characteristics, the number of jobs created, the characteristics of the unemployed and the unemployment rate. Furthermore, this survey estimates these indicators on the household level and their living conditions.
The results of this survey were compared with the results of the second quarter of the national survey on population and employment 2011. It should also be noted that the National Institute of Statistics -Tunisia uses the unemployment definition and concepts adopted by the International Labour Organization. This definition implies that, the individual did not work during the week preceding the day of the interview, was looking for a job in the month preceding the date of the interview, is available to work within two weeks after the day of the interview.
In 2010, the National Institute of Statistics has adopted a strict ILO definition for unemployment, by conditioning that the person must perform effective approaches to search for a job in the month preceding the day of the interview.
Covering a representative sample at the national and regional level (governorates).
1- Household/family. 2- Individual/person.
The survey covered a national sample of households and all individuals permanently residing in surveyed households.
Sample survey data [ssd]
THE CLEANED AND HARMONIZED VERSION OF THE SURVEY DATA PRODUCED AND PUBLISHED BY THE ECONOMIC RESEARCH FORUM REPRESENTS 100% OF THE ORIGINAL SURVEY DATA COLLECTED BY THE NATIONAL INSTITUTE OF STATISTICS - TUNISIA (INS)
The sample is drawn from the frame of the 2004 General Census of Population and Housing.
Face-to-face [f2f]
Three modules were designed for data collection:
Household Questionnaire (Module 1): Includes questions regarding household characteristics, living conditions, individuals and their demographic, educational and economic characteristics. This module also provides information on internal and external migration.
Active Employed Questionnaire (Module 2): Includes questions regarding the characteristics of the employed individuals as occupation, industry and wages for employees.
Active Unemployed Questionnaire (Module 3): Includes questions regarding the characteristics of the unemployed as unemployment duration, the last occupation, activity, and the number of days worked during the last year...etc.
Facebook
TwitterThe Associated Press is sharing data from the COVID Impact Survey, which provides statistics about physical health, mental health, economic security and social dynamics related to the coronavirus pandemic in the United States.
Conducted by NORC at the University of Chicago for the Data Foundation, the probability-based survey provides estimates for the United States as a whole, as well as in 10 states (California, Colorado, Florida, Louisiana, Minnesota, Missouri, Montana, New York, Oregon and Texas) and eight metropolitan areas (Atlanta, Baltimore, Birmingham, Chicago, Cleveland, Columbus, Phoenix and Pittsburgh).
The survey is designed to allow for an ongoing gauge of public perception, health and economic status to see what is shifting during the pandemic. When multiple sets of data are available, it will allow for the tracking of how issues ranging from COVID-19 symptoms to economic status change over time.
The survey is focused on three core areas of research:
Instead, use our queries linked below or statistical software such as R or SPSS to weight the data.
If you'd like to create a table to see how people nationally or in your state or city feel about a topic in the survey, use the survey questionnaire and codebook to match a question (the variable label) to a variable name. For instance, "How often have you felt lonely in the past 7 days?" is variable "soc5c".
Nationally: Go to this query and enter soc5c as the variable. Hit the blue Run Query button in the upper right hand corner.
Local or State: To find figures for that response in a specific state, go to this query and type in a state name and soc5c as the variable, and then hit the blue Run Query button in the upper right hand corner.
The resulting sentence you could write out of these queries is: "People in some states are less likely to report loneliness than others. For example, 66% of Louisianans report feeling lonely on none of the last seven days, compared with 52% of Californians. Nationally, 60% of people said they hadn't felt lonely."
The margin of error for the national and regional surveys is found in the attached methods statement. You will need the margin of error to determine if the comparisons are statistically significant. If the difference is:
The survey data will be provided under embargo in both comma-delimited and statistical formats.
Each set of survey data will be numbered and have the date the embargo lifts in front of it in the format of: 01_April_30_covid_impact_survey. The survey has been organized by the Data Foundation, a non-profit non-partisan think tank, and is sponsored by the Federal Reserve Bank of Minneapolis and the Packard Foundation. It is conducted by NORC at the University of Chicago, a non-partisan research organization. (NORC is not an abbreviation, it part of the organization's formal name.)
Data for the national estimates are collected using the AmeriSpeak Panel, NORC’s probability-based panel designed to be representative of the U.S. household population. Interviews are conducted with adults age 18 and over representing the 50 states and the District of Columbia. Panel members are randomly drawn from AmeriSpeak with a target of achieving 2,000 interviews in each survey. Invited panel members may complete the survey online or by telephone with an NORC telephone interviewer.
Once all the study data have been made final, an iterative raking process is used to adjust for any survey nonresponse as well as any noncoverage or under and oversampling resulting from the study specific sample design. Raking variables include age, gender, census division, race/ethnicity, education, and county groupings based on county level counts of the number of COVID-19 deaths. Demographic weighting variables were obtained from the 2020 Current Population Survey. The count of COVID-19 deaths by county was obtained from USA Facts. The weighted data reflect the U.S. population of adults age 18 and over.
Data for the regional estimates are collected using a multi-mode address-based (ABS) approach that allows residents of each area to complete the interview via web or with an NORC telephone interviewer. All sampled households are mailed a postcard inviting them to complete the survey either online using a unique PIN or via telephone by calling a toll-free number. Interviews are conducted with adults age 18 and over with a target of achieving 400 interviews in each region in each survey.Additional details on the survey methodology and the survey questionnaire are attached below or can be found at https://www.covid-impact.org.
Results should be credited to the COVID Impact Survey, conducted by NORC at the University of Chicago for the Data Foundation.
To learn more about AP's data journalism capabilities for publishers, corporations and financial institutions, go here or email kromano@ap.org.
Facebook
Twitterhttps://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de441964https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de441964
Abstract (en): Comparative Cities is a teaching package designed to introduce students to analysis of manuscript schedules of the nineteenth century census for social, urban, family, and demographic history. The files are designed for use with SPSS. It was initially developed at Brown University with assistance of a project grant from the National Endowment for the Humanities. The file is organized to illustrate contrasts among cities at different stages of industrialization and the demographic transition in Europe and America: Pisa, Italy (1841), Amiens, France (1851), Stockport, England (1841 and 1851), and Providence, R.I. (1850, 1865, and 1880). The rural district around Pisa and part of Providence County are also included. There are approximately 1400 cases with information for individuals in each of eleven subfiles. These are random samples from the original 1:10 house samples for the four places made to permit flexible and economical student use. Summaries imbedded in the file permit analysis at the individual, household, or nuclear unit level. There are 142 variables for each individual. The package also contains a coursebook with explanation of each variable, a dictionary with occupational titles that appear in the censuses, course syllabus, and other instructions for use. The files are being used in the separate ongoing research of the two principal investigators and should be used for instructional purposes only. This teaching package can be supplied as two card-image data files, two files of SPSS instruction cards, and associated printed documentation. The package has also been updated with several files designed to be used with microcomputers. Included in the updated materials are four text files (Contents of Tape, Coursebook, Explanatory Materials, and Dictionary of Occupational Titles and Codes), a file of SPSSx data definition statements for use with PC-SPSSx, and a file of data definition statements for use with the Consortium's ABC statistical analysis package. Nine separate sub-files, each derived from the original census data and designed for analysis on micro-computers which are equipped with PC-SPSSx or ABC, are also provided. Finally, the package includes two mainframe SPSSx "Export" files which contain all of the data collected for each city. While these latter files duplicate the SPSS files contained in the earlier Comparative Cities package, they have been modified for use with SPSSx. The original Comparative Cities Teaching Package files can still be supplied as well. These files are oriented towards use of SPSS Version 9 on mainframe computers. 2006-01-12 All files were removed from dataset 20 and flagged as study-level files, so that they will accompany all downloads.2006-01-12 All files were removed from dataset 20 and flagged as study-level files, so that they will accompany all downloads.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Note: n: The sample sizes for cognition, demand, and use. N/A: not applicable. (1). Descriptive statistics, (2). t-test, (3). ANOVA, (4). Chi-squared test, (5). Nonparametric test, (6). Correlation and regression, (7). Statistical graphs and tables, (8). Statistical design, (9). Multiple ANOVA, (10). Analysis of covariance, (11). Multiple linear regression, (12). Logistic regression, (13). Survival analysis, (14). Discriminant analysis, (15). Clustering analysis, (16). Principal components analysis and Factor analysis (PCA & FA),(17). SPSS, (18). SAS, (19). Overall cognition of and demand for medical statistics, (20). Overall cognition of and demand for software.Basic Demographic Characteristics of the Included Studies.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Background: Internally displaced persons (IDPs) are frequently subjected to traumatic events, making them vulnerable to using substances. This study explores predictors and types of substances used by IDPs, prevalence of substance dependence, and reasons for substance use. Methods: Cross-sectional survey data were obtained from 520 IDPs living in camps located in Borno State, Nigeria. The Drug Use Disorders Identification Test (DUDIT) was adapted and administered to the participants. IBM SPSS was used to conduct univariate and multivariate linear regression analyses. Results: More than half (66.2%, n = 344) of the survey participants used at least one substance while a third of them (31.2%, n = 162) used more than one substance. About one in ten respondents met the instrument cut-off for dependence. The most popular substance used was Kolanut (46.5%, n = 242). Popular reasons for substance use were availability of substance, influence from others, and having a disease condition. Education, marital status, employment, and number of substances used were significantly associated with substance dependence. Conclusions: A high prevalence of substance use was found among the IDPs. The study highlights the need for intervention in the substance use problem affecting this vulnerable population
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
*Yem, Kaficho, Guraghe & Tigrie,†Single, divorced & widowed,‡Merchant, student, daily laborer,Socio-Demographic characteristics of Respondents, Jimma Zone, Southwest Ethiopia, September 2012-December 2013 (n = 3463).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Socio-demographic and anthropometric characteristics according to sex in SeRUN® study 2015–2016.
Facebook
TwitterThe aim of this study was to examine the factors related to different patterns of male violence against women. Employing both intra-individual and sociocultural perspectives, the project focused on the relationship between violence against women and previously established risk factors for intimate partner violence including stressors related to work, economic status, and role transitions (e.g., pregnancy), as well as family power dynamics, status discrepancies, and alcohol use. The following research questions were addressed: (1) To what extent do Caucasian, Black, and Hispanic individuals engage in physical violence with their partners? (2) How are socioeconomic stressors associated with violent relationships among Caucasian, Black, and Hispanic couples? (3) To what extent are changes in patterns of physical violence against women associated with different stages of a relationship (e.g., cohabitation, early marriage, pregnancy, marriage)? (4) To what extent do culturally linked attitudes about family structure (family power dynamics) predict violence among Caucasian, Black, and Hispanic couples? (5) To what extent do family strengths and support systems contribute to the cessation of violence among Caucasian, Black, and Hispanic couples? (6) What is the role of alcohol use in violent relationships among Caucasian, Black, and Hispanic couples? The data used for this project came from the first and second waves of the National Survey of Families and Households (NSFH) conducted by the Center for Demography and Ecology at the University of Wisconsin-Madison [NATIONAL SURVEY OF FAMILIES AND HOUSEHOLDS: WAVE I, 1987-1988, AND WAVE II, 1992-1994 (ICPSR 6906)]. The NSFH was designed to cover a broad range of family structures, processes, and relationships with a large enough sample to permit subgroup analysis. For the purposes of this study, the analytical sample focused on only those couples who were cohabiting or married at the time of the first wave of the study and still with the same person at the time of the second wave (N=3,584). Since the study design included oversamples of previously understudied groups (i.e., Blacks, Mexicans, Puerto Ricans), racial and ethnic comparisons were possible. In both waves of the NSFH several identical questions were asked regarding marital conflicts. Both married and cohabiting respondents were asked how often they used various tactics including heated arguments and hitting or throwing things at each other to resolve their conflicts. In addition, respondents were asked if any of their arguments became physical, how many of their fights resulted in either the respondent or their partner hitting, shoving, or throwing things, and if any injuries resulted as a consequence of these fights. This data collection consists of the SPSS syntax used to recode variables from the original NSFH dataset. In addition, new variables, including both composite variables (e.g., self-esteem, hostility, depression) and husband and wife versions of the variables (using information from both respondent and partner), were constructed. New variables were grouped into the following categories: demographic, personality, alcohol and drug use, relationship stages, gender role attitudes, division of labor, fairness in household chores, social support, and isolation. Psychological well-being scales were created to measure autonomy, positive relations with others, purpose in life, self-acceptance, environmental mastery, and personal growth. Additional scales were created to measure relationship conflict, sex role gender attitudes, personal mastery, alcohol use, and hostility. The Rosenberg Self Esteem Scale and the Center for Epidemiological Studies Depression Scale (CES-D) were also utilized.
Facebook
TwitterStudy setting We conducted this study in the paediatrics departments of the Ola During Children's Hospital (ODCH), Rokupa Government Hospital (RGH), and the King Harman Maternity and Children Hospital (KHMCH) located in Freetown, the Capital city of Sierra Leone. Ola During Children's hospital is a tertiary teaching hospital and the leading paediatric referral hospital in Sierra Leone. Rokupa Government and KHMCH are secondary hospitals that provide comprehensive emergency obstetric and newborn care inpatient and outpatient paediatric and maternity services.Study design and duration This study was conducted between April 2021 to July 2021 and had two phases. Phase 1 was a descriptive cross-sectional retrospective study of paediatrics prescriptions from the respective pharmacy departments from May 1 to May 31, 2021. In phase 2, we conducted a point prevalence descriptive inpatient chart review that lasted for one week to assess MEs and pDDIs among the paediatric patient population.Study populationThis data set is the SPSS file with both variable and data view. It contains the variables that were analysed for the two phases of the study namely: For phase 1 of the study, the population included paediatric prescriptions that came to the respective pharmacy departments in May 2021. Phase 2 included inpatients <16 years irrespective of their working diagnosis and gender and whose parents or guardians consented to participate in the study.Data collection procedure and tool For phase 1 of the study, the data collection tool was adapted from the Sierra Leone Pharmacy and Drugs Act 2001, the World Health Organization (WHO) guidelines for prescription writing, and a previous study [12, 32, 33]. Seventeen essential elements were selected for this study and compiled into a single data collection tool. We manually extracted all data through a review of prescriptions accessed from the pharmacies. The data collection tool for phase 2 was adapted from the WHO guide on reporting and learning systems for medication errors, American Society of Health System Pharmacists (ASHP) guidelines for preventing medication errors in hospitals, and previous studies [3, 4, 18, 34]. Data collection tools were piloted, and feedback was used to develop the final versions used in the study. The treatment charts were reviewed, and the following were extracted and entered into the data collection tool: wrong patient, wrong dose, wrong route, wrong medicine, wrong dosage form, wrong time of administration, contraindication including allergy, wrong duration, dose omitted or delay, wrong frequency, wrong indication, unnecessary medicine, and therapeutic duplication. In addition, nurses were accompanied during the medicine administration rounds and patients and caretakers were interviewed to gather information when necessary. Ethical consideration Clearance to conduct the study was obtained from the Research, Innovation and Publication Review Committee of the Faculty of Pharmaceutical Sciences, College of Medicine and Allied Health Sciences, University of Sierra Leone. The management of the hospitals permitted the study to be done in their facilities. Written informed consent was obtained from parents/caregivers after explaining the purpose and procedures of the study. Parents gave consent before data was collected, and they were not coerced to participate in the study. Patient information was coded and kept confidential. Data analyses The researchers evaluated the completion of the essential elements for each prescription, such as the use of the generic names, recommended abbreviations, and prescription legibility. We determined the accuracy score out of 34 total points. Each element was assessed, scoring 0, 1 or 2 for 'not completed', 'partially completed', or 'fully completed', respectively. Legibility was scored subjectively according to the prescription quality index (PQI) as 0, 1, or 2 for 'illegible', 'barely legible' or 'legible', respectively, by two or more persons [35]. The global accuracy score (GAS) for each prescription was determined by calculating the total percentage achieved out of 34 possible points for the 17 prescription elements considered. The GAS was then classified into one of four scores: 100%, 80% – 99%, 40% – 79%, and less than 40%. The desired prescription-writing accuracy score, or gold standard, is 100%. The definition and severity categorisation of the National Coordinating Council of Medication Error Reporting and Prevention (NCCMERP) was used [5]. Potential drug-drug interactions (pDDIs) were assessed by the Drug.com interaction checker and classified into no interaction, minor, moderate, and major [36]. The data obtained was cleaned and coded and then entered into Statistical Package for Social Sciences (SPSS) version 20 (IBM Statistics, Armonk, NY, USA) for analysis. Descriptive statistics were applied, and results were presented as frequency, percentages, mean, and standard deviation. Inferential statistics, including the Kruskal Wallis, Mann-Whitney U and Pearson correlation, were employed, and a p-value of < 0.05 was considered statistically significant.
Facebook
TwitterAttribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
This dataset contains cross-sectional survey data collected from 925 Chinese college students examining the relationships between mobile phone addiction, aggression, self-control, and psychological symptoms. Data were collected in April 2024 through an online survey system accessible via computers, iPads, and smartphones. Participants were recruited from humanities disciplines (Chinese Language and Literature, Radio and Television Studies, and related fields) at a university in western China. The sample comprised students across different educational levels: 668 bachelor's degree students (72.22%), 160 associate-to-bachelor transfer students (17.30%), and 97 master's students (10.49%), all in their first or second years of study. The dataset includes responses to four validated psychological instruments: the 17-item Mobile Phone Addiction Index Scale (MPAIS), the 22-item Buss-Perry Aggression Questionnaire (BPAQ), the 19-item Self-Control Scale (SCS), and the 90-item Symptom Checklist-90 (SCL-90). Additionally, comprehensive demographic information was collected, including age, gender, educational level, family income, parental education levels, and family structure variables. All responses were recorded on 5-point Likert scales with appropriate reverse coding applied where necessary. The tabular dataset contains 925 rows (individual participants) and approximately 170 columns representing survey items, computed subscale scores, total scores, and demographic variables. Missing data analysis revealed less than 5% missing values distributed completely at random, with 25 incomplete responses excluded from the final dataset. Data are stored in SPSS (.sav) format and CSV format for broader accessibility. The dataset enables replication of the reported moderated mediation analyses using PROCESS macro Model 14, as well as network analyses examining symptom-level interactions. All data have been de-identified to protect participant confidentiality while maintaining the integrity necessary for secondary analyses exploring digital technology use and mental health relationships among emerging adults.
Facebook
Twitterhttps://www.icpsr.umich.edu/web/ICPSR/studies/29883/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/29883/terms
The Consumer Expenditure Survey (CE) program provides a continuous and comprehensive flow of data on the buying habits of American consumers including data on their expenditures, income, and consumer unit (families and single consumers) characteristics. These data are used widely in economic research and analysis, and in support of revisions of the Consumer Price Index.The CE program consists of two surveys, the Diary Survey and the quarterly Interview Survey (ICPSR 29884). The Diary Survey is designed to obtain data on frequently purchased smaller items, including food and beverages, both at home and in food establishments, housekeeping supplies, tobacco, nonprescription drugs, and personal care products and services. Each consumer unit (CU) records its expenditures in a diary for two consecutive 1-week periods. Although the diary was designed to collect information on expenditures that could not be easily recalled over time, respondents are asked to report all expenses (except overnight travel) that the CU incurs during the survey week.The microdata in this collection are available as SAS, STATA, SPSS data sets or ASCII text and comma-delimited files. The 2009 Diary release contains five sets of data files (FMLY, MEMB, EXPN, DTAB, DTID) and three processing files. The FMLY, MEMB, EXPN, DTAB, and DTID files are organized by the quarter of the calendar year in which the data were collected. There are four quarterly data sets for each of these files.The FMLY files contain CU characteristics, income, and summary level expenditures; the MEMB files contain member characteristics and income data; the EXPN files contain detailed weekly expenditures at the Universal Classification Code (UCC) level; the DTAB files contains the CU's reported annual income values or the mean of the five imputed income values in the multiple imputation method; and the DTID files contain the five imputed income values. The summary level expenditure and income information on the FMLY files permits the data user to link consumer spending, by general expenditure category, and household characteristics and demographics on one set of files.The three processing files enhance computer processing and tabulation of data, and provide descriptive information on item codes. The three processing files are: (1) an aggregation scheme file used in the published consumer expenditure tables (DSTUB), (2) a UCC file that contains UCCs and their abbreviated titles, identifying the expenditure, income, or demographic item represented by each UCC, and (3) a sample program file that contains the computer program used in Section VII.A. SAMPLE PROGRAM of the Diary User Guide. The processing files are further explained in Section III.E.5. PROCESSING FILES of the same User Guide documentation. There is also a second user guide, "User's Guide to Income Imputation in the CE", which includes information on how to appropriately use the imputed income data.Demographic and family characteristics data include age, sex, race, marital status, and CU relationships each CU member. Income information, such as wage, salary, unemployment compensation, child support, and alimony, as well as information on the employment of each CU member age 14 and over was also collected.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analytic sample selection process.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
analyze the current population survey (cps) annual social and economic supplement (asec) with r the annual march cps-asec has been supplying the statistics for the census bureau's report on income, poverty, and health insurance coverage since 1948. wow. the us census bureau and the bureau of labor statistics ( bls) tag-team on this one. until the american community survey (acs) hit the scene in the early aughts (2000s), the current population survey had the largest sample size of all the annual general demographic data sets outside of the decennial census - about two hundred thousand respondents. this provides enough sample to conduct state- and a few large metro area-level analyses. your sample size will vanish if you start investigating subgroups b y state - consider pooling multiple years. county-level is a no-no. despite the american community survey's larger size, the cps-asec contains many more variables related to employment, sources of income, and insurance - and can be trended back to harry truman's presidency. aside from questions specifically asked about an annual experience (like income), many of the questions in this march data set should be t reated as point-in-time statistics. cps-asec generalizes to the united states non-institutional, non-active duty military population. the national bureau of economic research (nber) provides sas, spss, and stata importation scripts to create a rectangular file (rectangular data means only person-level records; household- and family-level information gets attached to each person). to import these files into r, the parse.SAScii function uses nber's sas code to determine how to import the fixed-width file, then RSQLite to put everything into a schnazzy database. you can try reading through the nber march 2012 sas importation code yourself, but it's a bit of a proc freak show. this new github repository contains three scripts: 2005-2012 asec - download all microdata.R down load the fixed-width file containing household, family, and person records import by separating this file into three tables, then merge 'em together at the person-level download the fixed-width file containing the person-level replicate weights merge the rectangular person-level file with the replicate weights, then store it in a sql database create a new variable - one - in the data table 2012 asec - analysis examples.R connect to the sql database created by the 'download all microdata' progr am create the complex sample survey object, using the replicate weights perform a boatload of analysis examples replicate census estimates - 2011.R connect to the sql database created by the 'download all microdata' program create the complex sample survey object, using the replicate weights match the sas output shown in the png file below 2011 asec replicate weight sas output.png statistic and standard error generated from the replicate-weighted example sas script contained in this census-provided person replicate weights usage instructions document. click here to view these three scripts for more detail about the current population survey - annual social and economic supplement (cps-asec), visit: the census bureau's current population survey page the bureau of labor statistics' current population survey page the current population survey's wikipedia article notes: interviews are conducted in march about experiences during the previous year. the file labeled 2012 includes information (income, work experience, health insurance) pertaining to 2011. when you use the current populat ion survey to talk about america, subract a year from the data file name. as of the 2010 file (the interview focusing on america during 2009), the cps-asec contains exciting new medical out-of-pocket spending variables most useful for supplemental (medical spending-adjusted) poverty research. confidential to sas, spss, stata, sudaan users: why are you still rubbing two sticks together after we've invented the butane lighter? time to transition to r. :D