28 datasets found

Data Cleaning Sample
borealisdata.ca
dataone.org
Updated Jul 13, 2023
Cite
Rong Luo (2023). Data Cleaning Sample [Dataset]. http://doi.org/10.5683/SP3/ZCN177
Unique identifier
https://doi.org/10.5683/SP3/ZCN177
Dataset updated
Jul 13, 2023
Dataset provided by
Borealis
Authors
Rong Luo
License
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
Description
Sample data for exercises in Further Adventures in Data Cleaning.
Data from: Current and projected research data storage needs of Agricultural...
catalog.data.gov
agdatacommons.nal.usda.gov
+2more
Updated Apr 21, 2025
Cite
Agricultural Research Service (2025). Current and projected research data storage needs of Agricultural Research Service researchers in 2016 [Dataset]. https://catalog.data.gov/dataset/current-and-projected-research-data-storage-needs-of-agricultural-research-service-researc-f33da
Dataset updated
Apr 21, 2025
Dataset provided by
Agricultural Research Service (https://www.ars.usda.gov/)
Description
The USDA Agricultural Research Service (ARS) recently established SCINet, which consists of a shared high performance computing resource, Ceres, and the dedicated high-speed Internet2 network used to access Ceres. Current and potential SCINet users are using and generating very large datasets, so SCINet needs to be provisioned with adequate data storage for their active computing. It is not designed to hold data beyond active research phases. At the same time, the National Agricultural Library has been developing the Ag Data Commons, a research data catalog and repository designed for public data release and professional data curation. Ag Data Commons needs to anticipate the size and nature of data it will be tasked with handling.
The ARS Web-enabled Databases Working Group, organized under the SCINet initiative, conducted a study to establish baseline data storage needs and practices, and to make projections that could inform future infrastructure design, purchases, and policies. The SCINet Web-enabled Databases Working Group helped develop the survey which is the basis for an internal report. While the report was for internal use, the survey and resulting data may be generally useful and are being released publicly.
From October 24 to November 8, 2016 we administered a 17-question survey (Appendix A) by emailing a Survey Monkey link to all ARS Research Leaders, intending to cover data storage needs of all 1,675 SY (Category 1 and Category 4) scientists. We designed the survey to accommodate either individual researcher responses or group responses. Research Leaders could decide, based on their unit's practices or their management preferences, whether to delegate response to a data management expert in their unit, to all members of their unit, or to themselves collate responses from their unit before reporting in the survey.
Larger storage ranges cover vastly different amounts of data, so the implications here could be significant depending on whether the true amount is at the lower or higher end of the range. Therefore, we requested more detail from "Big Data users," those 47 respondents who indicated they had more than 10 to 100 TB or over 100 TB total current data (Q5). All other respondents are called "Small Data users." Because not all of these follow-up requests were successful, we used actual follow-up responses to estimate likely responses for those who did not respond. We defined active data as data that would be used within the next six months. All other data would be considered inactive, or archival. To calculate per person storage needs we used the high end of the reported range divided by 1 for an individual response, or by G, the number of individuals in a group response. For Big Data users we used the actual reported values or estimated likely values.
Resources in this dataset:
Resource Title: Appendix A: ARS data storage survey questions. File Name: Appendix A.pdf. Resource Description: The full list of questions asked with the possible responses. The survey was not administered using this PDF but the PDF was generated directly from the administered survey using the Print option under Design Survey. Asterisked questions were required. A list of Research Units and their associated codes was provided in a drop down not shown here. Resource Software Recommended: Adobe Acrobat, url: https://get.adobe.com/reader/
Resource Title: CSV of Responses from ARS Researcher Data Storage Survey. File Name: Machine-readable survey response data.csv. Resource Description: CSV file includes raw responses from the administered survey, as downloaded unfiltered from Survey Monkey, including incomplete responses. Also includes additional classification and calculations to support analysis. Individual email addresses and IP addresses have been removed. This information is the same data as in the Excel spreadsheet (also provided).
Resource Title: Responses from ARS Researcher Data Storage Survey. File Name: Data Storage Survey Data for public release.xlsx. Resource Description: MS Excel worksheet that includes raw responses from the administered survey, as downloaded unfiltered from Survey Monkey, including incomplete responses. Also includes additional classification and calculations to support analysis. Individual email addresses and IP addresses have been removed. Resource Software Recommended: Microsoft Excel, url: https://products.office.com/en-us/excel
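The per-person calculation described in this record (the high end of the reported range, divided by 1 for an individual response or by G for a group response) can be sketched as below. This is only an illustration: the range labels and their upper bounds in TB are hypothetical placeholders, not the survey's actual answer options, and the function name is invented here.

```python
# Hypothetical upper bounds (in TB) for reported storage ranges.
# The real answer options are in Appendix A and are not reproduced here.
RANGE_HIGH_END_TB = {
    "under 1 TB": 1.0,
    "1 to 10 TB": 10.0,
}


def per_person_storage_tb(reported_range, group_size=1):
    """Return the high end of the reported range divided by the people covered.

    group_size is 1 for an individual response, or G for a group response.
    """
    if group_size < 1:
        raise ValueError("group_size must be at least 1")
    return RANGE_HIGH_END_TB[reported_range] / group_size


# Individual response: divide by 1.
individual = per_person_storage_tb("1 to 10 TB")
# Group response covering G = 4 researchers: divide by 4.
group = per_person_storage_tb("1 to 10 TB", group_size=4)
```

Using the high end of each range gives a deliberately generous per-person figure, which is presumably why the record notes that wide ranges can have significant implications.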
Enterprise Survey 2009-2019, Panel Data - Slovenia
microdata.worldbank.org
catalog.ihsn.org
Updated Aug 6, 2020
Cite
World Bank Group (WBG) (2020). Enterprise Survey 2009-2019, Panel Data - Slovenia [Dataset]. https://microdata.worldbank.org/index.php/catalog/3762
Dataset updated
Aug 6, 2020
Dataset provided by
World Bank Group (http://www.worldbank.org/)
European Investment Bank (http://eib.org/)
European Bank for Reconstruction and Development (http://ebrd.com/)
Time period covered
2008 - 2019
Area covered
Slovenia
Description
Abstract

The documentation covers Enterprise Survey panel datasets that were collected in Slovenia in 2009, 2013 and 2019.

The Slovenia ES 2009 was conducted between 2008 and 2009. The Slovenia ES 2013 was conducted between March 2013 and September 2013. Finally, the Slovenia ES 2019 was conducted between December 2018 and November 2019. The objective of the Enterprise Survey is to gain an understanding of what firms experience in the private sector.

As part of its strategic goal of building a climate for investment, job creation, and sustainable growth, the World Bank has promoted improving the business environment as a key strategy for development, which has led to a systematic effort in collecting enterprise data across countries. The Enterprise Surveys (ES) are an ongoing World Bank project in collecting both objective data based on firms' experiences and enterprises' perception of the environment in which they operate.

Geographic coverage

National

Analysis unit

The primary sampling unit of the study is the establishment. An establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must take its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.

Universe

As is standard for the ES, the Slovenia ES was based on the following size stratification: small (5 to 19 employees), medium (20 to 99 employees), and large (100 or more employees).

Kind of data

Sample survey data [ssd]

Sampling procedure

The samples for Slovenia ES 2009, 2013 and 2019 were selected using stratified random sampling, following the methodology explained in the Sampling Manual for Slovenia 2009 ES and for Slovenia 2013 ES, and in the Sampling Note for 2019 Slovenia ES.

Three levels of stratification were used in this country: industry, establishment size, and oblast (region). The original sample designs with specific information on the industries and regions chosen are included in the attached Excel file (Sampling Report.xls) for Slovenia 2009 ES. For Slovenia 2013 and 2019 ES, specific information on the industries and regions chosen is described in the "The Slovenia 2013 Enterprise Surveys Data Set" and "The Slovenia 2019 Enterprise Surveys Data Set" reports respectively, Appendix E.

For the Slovenia 2009 ES, industry stratification was designed in the way that follows: the universe was stratified into manufacturing industries, services industries, and one residual (core) sector as defined in the sampling manual. Each industry had a target of 90 interviews. For the manufacturing industries sample sizes were inflated by about 17% to account for potential non-response cases when requesting sensitive financial data and also because of likely attrition in future surveys that would affect the construction of a panel. For the other industries (residuals) sample sizes were inflated by about 12% to account for under sampling in firms in service industries.

For Slovenia 2013 ES, industry stratification was designed in the way that follows: the universe was stratified into one manufacturing industry, and two service industries (retail, and other services).

Finally, for Slovenia 2019 ES, three levels of stratification were used in this country: industry, establishment size, and region. The original sample design with specific information of the industries and regions chosen is described in "The Slovenia 2019 Enterprise Surveys Data Set" report, Appendix C. Industry stratification was done as follows: Manufacturing – combining all the relevant activities (ISIC Rev. 4.0 codes 10-33), Retail (ISIC 47), and Other Services (ISIC 41-43, 45, 46, 49-53, 55, 56, 58, 61, 62, 79, 95).

For Slovenia 2009 and 2013 ES, size stratification was defined following the standardized definition for the rollout: small (5 to 19 employees), medium (20 to 99 employees), and large (more than 99 employees). For stratification purposes, the number of employees was defined on the basis of reported permanent full-time workers. This seems to be an appropriate definition of the labor force since seasonal/casual/part-time employment is not a common practice, except in the sectors of construction and agriculture.
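The size stratification above can be sketched as a small classifier. The function name is hypothetical, and returning None for establishments with fewer than 5 permanent full-time workers is an assumption made for illustration; the description only defines the three strata.

```python
def size_stratum(permanent_full_time_workers):
    """Map a count of permanent full-time workers to an ES size stratum.

    Strata follow the description: small (5 to 19), medium (20 to 99),
    large (more than 99). Counts below 5 fall outside the defined strata,
    so None is returned for them (an assumption of this sketch).
    """
    n = permanent_full_time_workers
    if n < 5:
        return None
    if n <= 19:
        return "small"
    if n <= 99:
        return "medium"
    return "large"


# Made-up employee counts, one per stratum boundary.
strata = [size_stratum(n) for n in (4, 5, 19, 20, 99, 100)]
```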

For Slovenia 2009 ES, regional stratification was defined in 2 regions. These regions are Vzhodna Slovenija and Zahodna Slovenija. The Slovenia sample contains panel data. The wave 1 panel “Investment Climate Private Enterprise Survey implemented in Slovenia” consisted of 223 establishments interviewed in 2005. A total of 57 establishments have been re-interviewed in the 2008 Business Environment and Enterprise Performance Survey.

For Slovenia 2013 ES, regional stratification was defined in 2 regions (city and the surrounding business area) throughout Slovenia.

Finally, for Slovenia 2019 ES, regional stratification was done across two regions: Eastern Slovenia (NUTS code SI03) and Western Slovenia (SI04).

Mode of data collection

Computer Assisted Personal Interview [capi]

Research instrument

Questionnaires have common questions (core module) and, respectively, additional manufacturing- and services-specific questions. The eligible manufacturing industries have been surveyed using the Manufacturing questionnaire (includes the core module, plus manufacturing specific questions). Retail firms have been interviewed using the Services questionnaire (includes the core module plus retail specific questions) and the residual eligible services have been covered using the Services questionnaire (includes the core module). Each variation of the questionnaire is identified by the index variable, a0.

Response rate

Survey non-response must be differentiated from item non-response. The former refers to refusals to participate in the survey altogether whereas the latter refers to the refusals to answer some specific questions. Enterprise Surveys suffer from both problems and different strategies were used to address these issues.

Item non-response was addressed by two strategies: (a) For sensitive questions that may generate negative reactions from the respondent, such as corruption or tax evasion, enumerators were instructed to collect the refusal to respond as (-8). (b) Establishments with incomplete information were re-contacted in order to complete this information, whenever necessary. However, there were clear cases of low response.
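Because refusals are stored as the numeric code -8, an analysis would typically need to separate them from substantive answers before computing any statistics. A minimal sketch, assuming responses arrive as a plain list of numbers; the function name and the sample values are made up for illustration.

```python
REFUSAL_CODE = -8  # code enumerators recorded for a refusal to respond


def split_refusals(responses):
    """Separate substantive answers from refusals coded as -8."""
    answered = [r for r in responses if r != REFUSAL_CODE]
    refusals = len(responses) - len(answered)
    return answered, refusals


# Made-up item responses containing two refusals.
answered, refusals = split_refusals([3, -8, 1, -8, 2])
```

Leaving the -8 values in place would distort means and totals, which is the main reason to filter them explicitly.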

For 2009 and 2013 Slovenia ES, the survey non-response was addressed by maximizing efforts to contact establishments that were initially selected for interview. Up to 4 attempts were made to contact the establishment for interview at different times/days of the week before a replacement establishment (with similar strata characteristics) was suggested for interview. Survey non-response did occur but substitutions were made in order to potentially achieve strata-specific goals. Further research is needed on survey non-response in the Enterprise Surveys regarding potential introduction of bias.

For 2009, the number of contacted establishments per realized interview was 6.18. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The relatively low ratio of contacted establishments per realized interview (6.18) suggests that the main source of error in estimates in Slovenia may be selection bias and not frame inaccuracy.

For 2013, the number of realized interviews per contacted establishment was 25%. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The number of rejections per contact was 44%.

Finally, for 2019, the number of interviews per contacted establishments was 9.7%. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The share of rejections per contact was 75.2%.
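The three waves report yield in two different forms: contacts per realized interview for 2009, and interviews per contact (as a percentage) for 2013 and 2019. The two are reciprocals, so they can be put on a common scale as sketched below. The function names are hypothetical; only the figures 6.18, 25% and 9.7% come from the description.

```python
def interviews_per_contact_pct(contacts_per_interview):
    """Convert contacts per realized interview into interviews per contact (%)."""
    return 100.0 / contacts_per_interview


def contacts_per_realized_interview(interviews_pct):
    """Convert interviews per contact (%) into contacts per realized interview."""
    return 100.0 / interviews_pct


# 2009 is reported as 6.18 contacts per interview; express it as a percentage.
pct_2009 = interviews_per_contact_pct(6.18)
# 2013 and 2019 are reported as percentages; express them as contacts per interview.
contacts_2013 = contacts_per_realized_interview(25.0)
contacts_2019 = contacts_per_realized_interview(9.7)
```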
Employee Vehicle Personal Use 2020 (Excel)
opendata.greatersudbury.ca
hub.arcgis.com
Updated Aug 14, 2020
Cite
City of Greater Sudbury (2020). Employee Vehicle Personal Use 2020 (Excel) [Dataset]. https://opendata.greatersudbury.ca/documents/8ad1b3ec2c254d06af9db35db0f6b6a7
Dataset updated
Aug 14, 2020
Dataset authored and provided by
City of Greater Sudbury
Description
Download Employee Vehicle Personal Use Excel Sheet. This dataset lists the employee name and the taxable benefit for personal use of a City of Greater Sudbury vehicle, reported as travel expenses for the year 2020. Expenses are broken down in separate tabs by quarter (Q1, Q2, Q3 and Q4). Data for other years is available in separate datasets. Updated quarterly when expenses are prepared.
GP Workforce Statistics
data.gov.au
cloud.csiss.gmu.edu
+1more
html
Updated Nov 18, 2015
Cite
Department of Health (2015). GP Workforce Statistics [Dataset]. https://data.gov.au/dataset/gp-workforce-statistics
Available download formats: html
Dataset updated
Nov 18, 2015
Dataset provided by
Department of Health
Description
Information on general practice statistics such as GP type, age group and place of basic qualification. Excel spreadsheet & PDF of GP workforce statistics.
Survey on Interest Rate Controls 2019 - Albania, Algeria, Anguilla...and 103...
microdata.worldbank.org
catalog.ihsn.org
+1more
Updated Oct 26, 2023
Cite
World Bank Group - Finance, Competitiveness and Innovation Global Practice (2023). Survey on Interest Rate Controls 2019 - Albania, Algeria, Anguilla...and 103 more [Dataset]. https://microdata.worldbank.org/index.php/catalog/3812
Dataset updated
Oct 26, 2023
Dataset provided by
World Bank Group (http://www.worldbank.org/)
Authors
World Bank Group - Finance, Competitiveness and Innovation Global Practice
Time period covered
2019
Area covered
Albania, Algeria, Anguilla...and 103 more
Description
Abstract

The Survey on Interest Rate Controls 2020 was conducted as a World Bank Group study on interest rate controls (IRCs) in lending and deposit markets around the world. The study aims to identify the different types of formal (or de jure) controls, the countries that apply them, how they implement them, and the reasons for doing so. The objective of the study is to advance knowledge on this topic by providing an evidence base for investigating the impact of IRCs on economic outcomes.

The survey investigates present IRCs in each surveyed country, the reasons why they have been applied, the framework and resources associated with their application and the details as to their level and functioning. The focus is on legal forms of control (i.e. codified into law) as opposed to de facto controls. The new database on interest rate controls, a popular form of financial repression, is based on a survey of 108 countries, representing 88 percent of global gross domestic product. The interest rate controls presented in this dataset were in effect in 2019.

Geographic coverage

Global Survey, covering 108 countries, representing 88 percent of global GDP.

Analysis unit

Regulation at the national level.

Universe

Banking supervisors and Local Banking Associations.

Kind of data

Sample survey data [ssd]

Mode of data collection

Mail Questionnaire [mail]

Research instrument

Bank supervisors and banking associations were provided with a standard Excel file structured in five parts, each placed in a different Excel sheet. Part A: Introduction. Countries with no IRCs in place were asked to only answer this sheet and leave the rest blank. Part B: Presented the definitions of controls, institutions, products and additional aspects that will be covered in the survey. Part C: Introduced a set of qualitative questions to describe the IRCs in place. Part D: Displayed a set of tables to quantitatively describe the IRCs in place. Part E: Laid out the final set of questions, covering sanctions and control mechanisms that support the IRCs' enforcement. The questionnaire is provided in the Documentation section in PDF and Excel formats.
Taking Part 2010/11 quarter 4: Statistical release
gov.uk
Updated Aug 9, 2011
Cite
Department for Digital, Culture, Media & Sport (2011). Taking Part 2010/11 quarter 4: Statistical release [Dataset]. https://www.gov.uk/government/statistics/taking-part-the-national-survey-of-culture-leisure-and-sport-2010-11
Dataset updated
Aug 9, 2011
Dataset provided by
GOV.UK (http://gov.uk/)
Authors
Department for Digital, Culture, Media & Sport
Description
The latest estimates from the 2010/11 Taking Part adult survey produced by DCMS were released on 30 June 2011 according to the arrangements approved by the UK Statistics Authority.

Released:

30 June 2011

Period covered:

April 2010 to April 2011

Geographic coverage:

National and Regional level data for England.

Next release date:

Further analysis of the 2010/11 adult dataset and data for child participation will be published on 18 August 2011.

Summary

The latest data from the 2010/11 Taking Part survey provides reliable national estimates of adult engagement with sport, libraries, the arts, heritage and museums & galleries. This release also presents analysis on volunteering and digital participation in our sectors and a look at cycling and swimming proficiency in England. The Taking Part survey is a continuous annual survey of adults and children living in private households in England, and carries the National Statistics badge, meaning that it meets the highest standards of statistical quality.

Statistical Report

Taking Part: The National Survey of Culture, Leisure and Sport 2010/11 (PDF 713kb): http://www.culture.gov.uk/images/research/Taking_Part_Y6_Release.pdf

Taking Part: The National Survey of Culture, Leisure and Sport 2010/11 (Word 674kb): http://www.culture.gov.uk/images/research/Taking_Part_Y6_Release.doc

Statistical Worksheets

These spreadsheets contain the data and sample sizes for each sector included in the survey:

The Arts (Excel 79kb): http://www.culture.gov.uk/images/research/Y6_Q4_Figures_Arts.xls

Cycling and swimming proficiency (Excel 38kb): http://www.culture.gov.uk/images/research/Y6_Q4_Figures_Cycling_and_swimming_proficiency.xls

Digital Participation (Excel 51kb): http://www.culture.gov.uk/images/research/Y6_Q4_Figures_Digital_participation.xls

Heritage (Excel 45kb): http://www.culture.gov.uk/images/research/Y6_Q4_Figures_Heritage.xls

Libraries (Excel 42kb): http://www.culture.gov.uk/images/research/Y6_Q4_Figures_Libraries.xls

Museums and Galleries (Excel 43kb): http://www.culture.gov.uk/images/research/Y6_Q4_Museums_and_Galleries2.xls

Sport (Excel 44kb): http://www.culture.gov.uk/images/research/Y6_Q4_Sports2.xls

Volunteering (Excel 87kb): http://www.culture.gov.uk/images/research/Y6_Q4_Figures_Volunteering.xls

Previous release

The previous Taking Part release was published on 31 March 2011 and can be found online.

Taking Part: The National Survey of Culture, Leisure and Sport January - December 2010: http://www.culture.gov.uk/publications/7995.aspx

The UK Statistics Authority

This release is published in accordance with the Code of Practice for Official Statistics (2009), as produced by the UK Statistics Authority (UKSA) (http://www.statisticsauthority.gov.uk/). The UKSA has the overall objective of promoting and safeguarding the production and publication of official statistics that serve the public good. It monitors and reports on all official statistics, and promotes good practice in this area.

Pre-release access

The document below contains a list of Ministers and Officials who have received privileged early access to this release of Taking Part data. In line with best practice, the list has been kept to a minimum and those given access for briefing purposes had a maximum of 24 hours.

Pre-release access list (PDF 7kb): http://www.culture.gov.uk/images/publications/TP_Pre-release_access_300611.pdf

The responsible statistician for this release is Neil Wilson. For any queries please contact the Taking Part team on 020 7211 6968 or takingpart@culture.gsi.gov.uk.

Related information

Taking Part Survey Questionnaires: http://www.culture.gov.uk/what_we_do/research_and_statistics/7387.aspx

Taking Part Technical Reports: http://www.culture.gov.uk/what_we_do/research_and_statistics/7388.aspx
Business Activity Survey 2009 - Samoa
microdata.pacificdata.org
Updated Jul 2, 2019
Cite
Samoa Bureau of Statistics (2019). Business Activity Survey 2009 - Samoa [Dataset]. https://microdata.pacificdata.org/index.php/catalog/253
Dataset updated
Jul 2, 2019
Dataset authored and provided by
Samoa Bureau of Statistics
Time period covered
2009
Area covered
Samoa
Description
Abstract

The intention is to collect data for the calendar year 2009 (or the nearest year for which each business keeps its accounts). The survey is considered a one-off survey, although for accurate NAs, such a survey should be conducted at least every five years to enable regular updating of the ratios, etc., needed to adjust the ongoing indicator data (mainly VAGST) to NA concepts. The questionnaire will be drafted by FSD, largely following the previous BAS, updated to current accounting terminology where necessary. The questionnaire will be pilot tested, using some accountants who are likely to complete a number of the forms on behalf of their business clients, and a small sample of businesses. Consultations will also include Ministry of Finance, Ministry of Commerce, Industry and Labour, Central Bank of Samoa (CBS), Samoa Tourism Authority, Chamber of Commerce, and other business associations (hotels, retail, etc.).

The questionnaire will collect a number of items of information about the business ownership, locations at which it operates and each establishment for which detailed data can be provided (in the case of complex businesses), contact information, and other general information needed to clearly identify each unique business. The main body of the questionnaire will collect data on income and expenses, to enable value added to be derived accurately. The questionnaire will also collect data on capital formation, and will contain supplementary pages for relevant industries to collect volume of production data for selected commodities and to collect information to enable an estimate of value added generated by key tourism activities.

The principal user of the data will be FSD which will incorporate the survey data into benchmarks for the NA, mainly on the current published production measure of GDP. The information on capital formation and other relevant data will also be incorporated into the experimental estimates of expenditure on GDP. The supplementary data on volumes of production will be used by FSD to redevelop the industrial production index which has recently been transferred under the SBS from the CBS. The general information about the business ownership, etc., will be used to update the Business Register.

Outputs will be produced in a number of formats, including a printed report containing descriptive information of the survey design, data tables, and analysis of the results. The report will also be made available on the SBS website in “.pdf” format, and the tables will be available on the SBS website in excel tables. Data by region may also be produced, although at a higher level of aggregation than the national data. All data will be fully confidentialised, to protect the anonymity of all respondents. Consideration may also be made to provide, for selected analytical users, confidentialised unit record files (CURFs).

A high level of accuracy is needed because the principal purpose of the survey is to develop revised benchmarks for the NA. The initial plan was that the survey will be conducted as a stratified sample survey, with full enumeration of large establishments and a sample of the remainder.

Geographic coverage

National Coverage

Analysis unit

The main statistical unit to be used for the survey is the establishment. For simple businesses that undertake a single activity at a single location there is a one-to-one relationship between the establishment and the enterprise. For large and complex enterprises, however, it is desirable to separate each activity of an enterprise into establishments to provide the most detailed information possible for industrial analysis. The business register will need to be developed in such a way that records the links between establishments and their parent enterprises. The business register will be created from administrative records and may not have enough information to recognize all establishments of complex enterprises. Large businesses will be contacted prior to the survey post-out to determine if they have separate establishments. If so, the extended structure of the enterprise will be recorded on the business register and a questionnaire will be sent to the enterprise to be completed for each establishment.

SBS has decided to follow the New Zealand simplified version of its statistical units model for the 2009 BAS. Future surveys may consider location units and enterprise groups if they are found to be useful for statistical collections.

It should be noted that while establishment data may enable the derivation of detailed benchmark accounts, it may be necessary to aggregate up to enterprise level data for the benchmarks if the ongoing data used to extrapolate the benchmark forward (mainly VAGST) are only available at the enterprise level.

Universe

The BAS surveys covered all employing units, and excluded small non-employing units such as the market sellers. The surveys also excluded central government agencies engaged in public administration (ministries, public education and health, etc.). They cover only businesses that pay the VAGST (threshold SAT$75,000 and upwards).

Kind of data

Sample survey data [ssd]

Sampling procedure

- Total sample size was 1240.
- Out of the 1240, 902 successfully completed the questionnaire.
- The remaining 338 either never responded or were omitted (some businesses were omitted from the sample as they did not meet the requirement to be surveyed).
- Selection was all employing units paying VAGST (threshold SAT$75,000 upwards).
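The counts in the sampling procedure can be checked for consistency and turned into a completion rate, as in the sketch below. The variable names are hypothetical. Note that the rate is computed against the full sample of 1240, which includes units later omitted as out of scope; the description does not say how many of the 338 were omitted, so a rate among eligible units cannot be derived from it.

```python
total_sample = 1240        # total sample size reported
completed = 902            # questionnaires successfully completed
not_completed = 338        # never responded or omitted from the sample

# The two groups should account for the whole sample.
counts_consistent = completed + not_completed == total_sample

# Share of the full sample that completed the questionnaire, in percent.
completion_rate_pct = 100.0 * completed / total_sample
```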

WILL CONFIRM LATER!!

OSO LE MEA E LE FAASA...AEA :-)

Mode of data collection

Mail Questionnaire [mail]

Research instrument

General instructions, authority for the survey, etc;

Business demography information on ownership, contact details, structure, etc.;

Employment;

Income;

Expenses;

Inventories;

Profit or loss and reconciliation to business accounts' profit and loss;

Fixed assets - purchases, disposals, book values

Thank you and signature of respondent.

Supplementary Pages

Additional pages have been prepared to collect data for a limited range of industries.

1. Production data. To rebase and redevelop the Industrial Production Index (IPI), it is intended to collect volume of production information from a selection of large manufacturing businesses. The selection of businesses and products is critical to the usefulness of the IPI. The products must be homogeneous, and be of enough importance to the economy to justify collecting the data. Significance criteria should be established for the selection of products to include in the IPI, and the 2009 BAS provides an opportunity to collect benchmark data for a range of products known to be significant (based on information in the existing IPI, CPI weights, export data, etc.) as well as open questions for respondents to provide information on other significant products.

2. Tourism. There is a strong demand for estimates of tourism value added. To estimate tourism value added using the international standard Tourism Satellite Account methodology requires the use of an input-output table, which is beyond the capacity of SBS at present. However, some indicative estimates of the main parts of the economy influenced by tourism can be derived if the necessary data are collected. Tourism is a demand concept, based on defining tourists (the international standard includes both international and domestic tourists), what products are characteristically purchased by tourists, and which industries supply those products. Some questions targeted at those industries that have significant involvement with tourists (hotels, restaurants, transport and tour operators, vehicle hire, etc.), on how much of their income is sourced from tourism would provide valuable indicators of the size of the direct impact of tourism.

Cleaning operations

Partial imputation was done at the time of receipt of questionnaires, after follow-up procedures to obtain fully completed questionnaires had been followed. Imputation followed a process in which ratios from responding units in the imputation cell were applied to the partial data that was supplied. Procedures were established during the editing stage (a) to preserve the integrity of the questionnaires as supplied by respondents, and (b) to record all changes made to the questionnaires during editing. If SBS staff write on the form, for example, this should be done only in red pen, to distinguish the alterations from the original information.

Additional edit checks were developed, including checks against external data at enterprise/establishment level. The external data sources include VAGST (for turnover and purchases) and SNPF (for salaries and wages and employment data). Editing and imputation processes were undertaken by FSD using Excel.
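The ratio step described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the procedure SBS actually ran in Excel: the field names (`cell`, `turnover`, `purchases`), the sample figures, and the choice of a ratio of cell totals are all hypothetical. The original records are left untouched and every imputed value is flagged, in the spirit of points (a) and (b).

```python
def impute_by_cell_ratio(records, known, target):
    """Fill missing `target` values with `known` times the cell ratio target/known.

    Assumption: the ratio is the ratio of totals over units in the same
    imputation cell that reported both items.
    """
    # Accumulate (known, target) totals per cell from fully responding units.
    totals = {}
    for rec in records:
        if rec[known] is not None and rec[target] is not None:
            k, t = totals.get(rec["cell"], (0.0, 0.0))
            totals[rec["cell"]] = (k + rec[known], t + rec[target])

    result = []
    for rec in records:
        row = dict(rec)          # copy, so the supplied record is preserved
        row["imputed"] = False   # flag recording whether a change was made
        if row[target] is None and row[known] is not None:
            k, t = totals.get(row["cell"], (0.0, 0.0))
            if k > 0:
                row[target] = row[known] * t / k
                row["imputed"] = True
        result.append(row)
    return result


# Hypothetical partial responses: unit 3 did not report purchases.
records = [
    {"id": 1, "cell": "retail", "turnover": 100.0, "purchases": 60.0},
    {"id": 2, "cell": "retail", "turnover": 300.0, "purchases": 180.0},
    {"id": 3, "cell": "retail", "turnover": 50.0, "purchases": None},
]
cleaned = impute_by_cell_ratio(records, known="turnover", target="purchases")
```

Here the responding units give a cell ratio of 240/400, so unit 3's purchases are imputed from its reported turnover while its original record still shows the item as missing.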

Sampling error estimates

Not applicable.
Supplement 1. Excel-sheet calculator and calculator instructions.
wiley.figshare.com
html
Updated May 30, 2023
Cite
Anne Chao; Robert K. Colwell; Chih-Wei Lin; Nicholas J. Gotelli (2023). Supplement 1. Excel-sheet calculator and calculator instructions. [Dataset]. http://doi.org/10.6084/m9.figshare.3530930.v1
Explore at:
Available download formats: html
Unique identifier
https://doi.org/10.6084/m9.figshare.3530930.v1
Dataset updated
May 30, 2023
Dataset provided by
Wiley (https://www.wiley.com/)
Authors
Anne Chao; Robert K. Colwell; Chih-Wei Lin; Nicholas J. Gotelli
License
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
Description
File List

Supp1ExcelGuide.pdf
Supp2ExcelCalculator.xls
ExcelCalculatorAbundanceData.pdf
ExcelCalculatorIncidenceData.pdf

Description

Supp1ExcelGuide.pdf contains a complete description of the variables and how to use the Excel Spreadsheet calculator. Supp2ExcelCalculator.xls is an Excel spreadsheet with formulas to calculate the statistics described in the paper.
Title: Anemia Project Data Set Open Access Deposited
datacore.iu.edu
Updated Sep 22, 2020
Cite
Vitzthum, Virginia J. (2020). Title: Anemia Project Data Set Open Access Deposited [Dataset]. https://datacore.iu.edu/concern/data_sets/w3763676r?locale=en
Explore at:
Dataset updated
Sep 22, 2020
Dataset provided by
PLOS One
Authors
Vitzthum, Virginia J.
Description
The data are stored in two formats: a single Excel 2010 file with two worksheets (one for each phase of data collection) and two csv files (one for each phase of data collection; data are identical to those in the corresponding Excel file worksheets). A Codebook (pdf format) describes the variables i...n detail.
Petre_Slide_CategoricalScatterplotFigShare.pptx
figshare.com
pptx
Updated Sep 19, 2016
Cite
Benj Petre; Aurore Coince; Sophien Kamoun (2016). Petre_Slide_CategoricalScatterplotFigShare.pptx [Dataset]. http://doi.org/10.6084/m9.figshare.3840102.v1
Explore at:
Available download formats: pptx
Unique identifier
https://doi.org/10.6084/m9.figshare.3840102.v1
Dataset updated
Sep 19, 2016
Dataset provided by
Figshare (http://figshare.com/)
Authors
Benj Petre; Aurore Coince; Sophien Kamoun
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
Categorical scatterplots with R for biologists: a step-by-step guide

Benjamin Petre1, Aurore Coince2, Sophien Kamoun1

1 The Sainsbury Laboratory, Norwich, UK; 2 Earlham Institute, Norwich, UK

Weissgerber and colleagues (2015) recently stated that ‘as scientists, we urgently need to change our practices for presenting continuous data in small sample size studies’. They called for more scatterplot and boxplot representations in scientific papers, which ‘allow readers to critically evaluate continuous data’ (Weissgerber et al., 2015). In the Kamoun Lab at The Sainsbury Laboratory, we recently implemented a protocol to generate categorical scatterplots (Petre et al., 2016; Dagdas et al., 2016). Here we describe the three steps of this protocol: 1) formatting of the data set in a .csv file, 2) execution of the R script to generate the graph, and 3) export of the graph as a .pdf file.

Protocol

• Step 1: format the data set as a .csv file. Store the data in a three-column Excel file as shown in the PowerPoint slide. The first column ‘Replicate’ indicates the biological replicates. In the example, the month and year during which the replicate was performed are indicated. The second column ‘Condition’ indicates the conditions of the experiment (in the example, a wild type and two mutants called A and B). The third column ‘Value’ contains continuous values. Save the Excel file as a .csv file (File -> Save as -> in ‘File Format’, select .csv). This .csv file is the input file to import into R.

• Step 2: execute the R script (see Notes 1 and 2). Copy the script shown in the PowerPoint slide and paste it into the R console. Execute the script. In the dialog box, select the input .csv file from step 1. The categorical scatterplot will appear in a separate window. Dots represent the values for each sample; colors indicate replicates. Boxplots are superimposed; black dots indicate outliers.

• Step 3: save the graph as a .pdf file. Resize the window as needed and save the graph as a .pdf file (File -> Save as). See the PowerPoint slide for an example.
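For readers who prefer to build the step 1 input file programmatically rather than through Excel, a minimal Python sketch of the three-column layout follows. The column names come from the protocol; the replicate labels, condition names and values are invented placeholders, and the helper name `write_dataset` is hypothetical. The sketch covers only the input format and does not replace the R script.

```python
import csv
import io


def write_dataset(rows, handle):
    """Write rows as a three-column .csv with the header the protocol describes."""
    writer = csv.writer(handle, lineterminator="\n")
    writer.writerow(["Replicate", "Condition", "Value"])
    writer.writerows(rows)


# Placeholder measurements: (replicate label, condition, continuous value).
rows = [
    ("Jan2016", "WT", 1.2),
    ("Jan2016", "MutantA", 0.7),
    ("Feb2016", "MutantB", 2.4),
]

buffer = io.StringIO()           # swap for open("input.csv", "w", newline="") to write a file
write_dataset(rows, buffer)
csv_text = buffer.getvalue()
```

Each row is one measurement, so a replicate label appears once per condition measured in that replicate.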

Notes

• Note 1: install the ggplot2 package. The R script requires the package ‘ggplot2’ to be installed. To install it, go to Packages & Data -> Package Installer, enter ‘ggplot2’ in the Package Search field and click on ‘Get List’. Select ‘ggplot2’ in the Package column and click on ‘Install Selected’. Install all dependencies as well.

• Note 2: use a log scale for the y-axis. To use a log scale for the y-axis of the graph, use the command line below in place of command line #7 in the script.

# 7 Display the graph in a separate window. Dot colors indicate replicates
graph + geom_boxplot(outlier.colour='black', colour='black') + geom_jitter(aes(col=Replicate)) + scale_y_log10() + theme_bw()

References

Dagdas YF, Belhaj K, Maqbool A, Chaparro-Garcia A, Pandey P, Petre B, et al. (2016) An effector of the Irish potato famine pathogen antagonizes a host autophagy cargo receptor. eLife 5:e10856.

Petre B, Saunders DGO, Sklenar J, Lorrain C, Krasileva KV, Win J, et al. (2016) Heterologous Expression Screens in Nicotiana benthamiana Identify a Candidate Effector of the Wheat Yellow Rust Pathogen that Associates with Processing Bodies. PLoS ONE 11(2):e0149035

Weissgerber TL, Milic NM, Winham SJ, Garovic VD (2015) Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm. PLoS Biol 13(4):e1002128

https://cran.r-project.org/

http://ggplot2.org/
South Fork Cherry River Water Quality
conservation-abra.hub.arcgis.com
Updated Feb 22, 2023
Cite
Allegheny-Blue Ridge Alliance (2023). South Fork Cherry River Water Quality [Dataset]. https://conservation-abra.hub.arcgis.com/maps/3b366a6bc44e4392847b71ec82038173
Explore at:
Dataset updated
Feb 22, 2023
Dataset authored and provided by
Allegheny-Blue Ridge Alliance
Area covered

Description
Purpose: This feature layer describes water quality sampling data performed at several operating coal mines in the South Fork of Cherry watershed, West Virginia.

Source & Data: Data was downloaded from WV Department of Environmental Protection's ApplicationXtender online database and EPA's ECHO online database between January and April, 2023.

There are five data sets here: Surface Water Monitoring Sites, which contains basic information about monitoring sites (name, lat/long, etc.) and NPDES Outlet Monitoring Sites, which contains similar information about outfall discharges surrounding the active mines. Biological Assessment Stations (BAS) contain similar information for pre-project biological sampling. NOV Summary contains locations of Notices of Violation received by South Fork Coal Company from WV Department of Environmental Protection. The Quarterly Monitoring Reports table contains the sampling data for the Surface Water Monitoring Sites, which actually goes as far back as 2018 for some mines. Parameters of concern include iron, aluminum and selenium, among others.

A relationship class between Surface Water Monitoring Sites and the Quarterly Monitoring Reports allows access to individual sample results.

Processing: Notices of Violation were obtained from the WV DEP AppXtender database for Mining and Reclamation Article 3 (SMCRA) Permitting, and Mining and Reclamation NPDES Permitting. Violation data were entered into Excel and loaded into ArcGIS Pro as a CSV text file with Lat/Long coordinates for each Violation. The CSV file was converted to a point feature class.

Water quality data were downloaded in PDF format from the WVDEP AppXtender website. Non-searchable PDFs were converted via Optical Character Recognition, so that data could be copied. Sample results were copied and pasted manually to Notepad++, and several columns were re-ordered. Data was grouped by sample station and sorted chronologically. Sample data, contained in the associated table (SW_QM_Reports) were linked back to the monitoring station locations using the Station_ID text field in a geodatabase relationship class.

Water monitoring station locations were taken from published Drainage Maps and from water quality reports. A CSV table was created with station Lat/Long locations and loaded into ArcGIS Pro. It was then converted to a point feature class.

Stream Crossings and Road Construction Areas were digitized as polygon feature classes from project Drainage and Progress maps that were converted to TIFF image format from PDF and georeferenced.

The ArcGIS Pro map - South Fork Cherry River Water Quality, was published as a service definition to ArcGIS Online.

Symbology:

• NOV Summary - dark blue, solid point
• Lost Flats Surface Water Monitoring Sites: Data Available - medium blue point, black outline
• Lost Flats Surface Water Monitoring Sites: No Data Available - no-fill point, thick medium blue outline
• Lost Flats NPDES Outlet Monitoring Sites - orange point, black outline
• Blue Knob Surface Water Monitoring Sites: Data Available - medium blue point, black outline
• Blue Knob Surface Water Monitoring Sites: No Data Available - no-fill point, thick medium blue outline
• Blue Knob NPDES Outlet Monitoring Sites - orange point, black outline
• Blue Knob Biological Assessment Stations: Data Available - medium green point, black outline
• Blue Knob Biological Assessment Stations: No Data Available - no-fill point, thick medium green outline
• Rocky Run Surface Water Monitoring Sites: Data Available - medium blue point, black outline
• Rocky Run Surface Water Monitoring Sites: No Data Available - no-fill point, thick medium blue outline
• Rocky Run NPDES Outlet Monitoring Sites - orange point, black outline
• Rocky Run Biological Assessment Stations: Data Available - medium green point, black outline
• Rocky Run Biological Assessment Stations: No Data Available - no-fill point, thick medium green outline
• Rocky Run Stream Crossings: turquoise blue polygon with red outline
• Rocky Run Haul Road Construction Areas: dark red (40% transparent) polygon with black outline
• Haul Road No 2 Surface Water Monitoring Sites: Data Available - medium blue point, black outline
• Haul Road No 2 Surface Water Monitoring Sites: No Data Available - no-fill point, thick medium blue outline
• Haul Road No 2 NPDES Outlet Monitoring Sites - orange point, black outline
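The link between monitoring stations and their quarterly sample results, which the description says is made through the Station_ID text field, can be pictured outside ArcGIS with a small Python sketch. Only the Station_ID field name comes from the description; the station identifiers, coordinates, dates and values below are invented, and the function name `link_reports` is hypothetical. A geodatabase relationship class does more than this simple grouping.

```python
from collections import defaultdict

# Hypothetical station and report rows; only "Station_ID" is named in the source.
stations = [
    {"Station_ID": "SW-1", "lat": 38.25, "lon": -80.45},
    {"Station_ID": "SW-2", "lat": 38.26, "lon": -80.47},
]
reports = [
    {"Station_ID": "SW-1", "date": "2022-03-15", "parameter": "iron", "value": 0.8},
    {"Station_ID": "SW-1", "date": "2021-12-10", "parameter": "iron", "value": 1.1},
]


def link_reports(stations, reports):
    """Group sample rows by Station_ID and attach them to each station."""
    by_station = defaultdict(list)
    for rep in reports:
        by_station[rep["Station_ID"]].append(rep)

    linked = {}
    for st in stations:
        # ISO dates sort chronologically as plain strings.
        samples = sorted(by_station.get(st["Station_ID"], []), key=lambda r: r["date"])
        linked[st["Station_ID"]] = {
            "station": st,
            "samples": samples,
            "data_available": bool(samples),  # mirrors the two symbology classes
        }
    return linked


linked = link_reports(stations, reports)
```

A station with no matching rows ends up with an empty sample list, which corresponds to the "No Data Available" symbol class in the list above.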
Taking Part 2011/12 Quarter 2: Statistical Release
gov.uk
Updated Dec 22, 2011
Cite
Department for Digital, Culture, Media & Sport (2011). Taking Part 2011/12 Quarter 2: Statistical Release [Dataset]. https://www.gov.uk/government/statistics/taking-part-2011-12-quarter-2-statistical-release
Explore at:
Dataset updated
Dec 22, 2011
Dataset provided by
GOV.UK (http://gov.uk/)
Authors
Department for Digital, Culture, Media & Sport
Description
The Taking Part survey is a continuous face to face household survey of adults aged 16 and over in England and children aged 5-15 years old. This latest release presents rolling estimates incorporating data from the second quarter of year seven of the survey.

Released:

21 December 2011

Period covered:

October 2010 to September 2011

Geographic coverage:

National and Regional level data for England.

Next release date:

A release of rolling annual estimates for adults, including the third quarter of the 2011/12 survey year, is scheduled for the end of March 2012.

Summary

The latest data from the 2010/11 Taking Part survey provides reliable national estimates of adult and child engagement with sport, libraries, the arts, heritage and museums & galleries. This release builds on the first release of data from 2010/11 to look at a number of areas in depth and present measures that begin to consider broader definitions of participation in our sectors. The report also looks at some of the other measures in the survey that provide estimates of volunteering and charitable giving and civic engagement.

The Taking Part survey is a continuous annual survey of adults and children living in private households in England, and carries the National Statistics badge, meaning that it meets the highest standards of statistical quality.

Statistical Report

Taking Part 2011/12 Quarter 2: Statistical Release (PDF 496kb)*: http://www.culture.gov.uk/images/research/TakingPart_2011-12_Q2_StatisticalRelease_updatev2.pdf

Taking Part 2011/12 Quarter 2: Statistical Release (Word 803kb)*: http://www.culture.gov.uk/images/research/TakingPart_2011-12_Q2_StatisticalRelease_updatev2.doc

Dashboard

Taking Part 2011/12 Quarter 2: Dashboard (PDF 215kb): http://www.culture.gov.uk/images/research/taking-part-dashboard_Q2.pdf

Statistical Worksheets

These spreadsheets contain the data and sample sizes to support the material in this release:

Archives (Excel 95kb): http://www.culture.gov.uk/images/research/TP_Y7_Q2_Figures_Archives.xls

Arts (Excel 81kb): http://www.culture.gov.uk/images/research/TP_Y7_Q2_Figures_Arts.xls

Cycling and swimming proficiency (Excel 71kb): http://www.culture.gov.uk/images/research/TP_Y7_Q2_Figures_Cycling_and_swimming_proficiency.xls

Digital participation (Excel 73kb): http://www.culture.gov.uk/images/research/TP_Y7_Q2_Figures_Digital_participation.xls

Heritage (Excel 87kb): http://www.culture.gov.uk/images/research/TP_Y7_Q2_Figures_Heritage.xls

Libraries (Excel 72kb): http://www.culture.gov.uk/images/research/TP_Y7_Q2_Figures_Libraries.xls

Museums and Galleries (Excel 81kb): http://www.culture.gov.uk/images/research/TP_Y7_Q2_Figures_Museums_and_Galleries.xls

Olympics (Excel 90kb): http://www.culture.gov.uk/images/research/TP_Y7_Q2_Figures_Olympics.xls

Sports (Excel 73kb): http://www.culture.gov.uk/images/research/TP_Y7_Q2_Figures_Sports.xls

Big Society (Excel 196kb): http://www.culture.gov.uk/images/research/TP_Y7_Q2_Big_Society.xls

Previous release

The previous Taking Part release was published on 29 September 2011 and can be found online. It also provides spreadsheets containing the data and sample sizes for each sector included in the survey.

Taking Part 2011/12 Quarter 1: http://www.culture.gov.uk/publications/8462.aspx

Pre-release access

The document below contains a list of Ministers and Officials who have received privileged early access to this release of Taking Part data. In line with best practice, the list has been kept to a minimum and those given access for briefing purposes had a maximum of 24 hours.

Pre-release access (PDF 60kb): http://www.culture.gov.uk/images/research/pre-release-access-list-tp-qrt2.pdf

The UK Statistics Authority

This release is published in accordance with the Code of Practice for Official Statistics (2009), as produced by the UK Statistics Authority (UKSA). The UKSA has the overall objective of promoting and s
Replication Data for: Benchmarking density functional methods for harmonic...
dataverse.azure.uit.no
dataverse.no
pdf, txt, xlsx
Updated Sep 28, 2023
Cite
Md Mehboob Alam; Md Mehboob Alam (2023). Replication Data for: Benchmarking density functional methods for harmonic vibrational frequencies. IN REVIEW [Dataset]. http://doi.org/10.18710/2DQK6Z
Explore at:
Available download formats: txt(76901), xlsx(2733050), txt(2038), txt(89673), txt(99700), txt(75646), txt(164788), txt(368175), txt(71700), txt(104821), txt(101625), pdf(1666984), txt(271747), txt(70615)
Unique identifier
https://doi.org/10.18710/2DQK6Z
Dataset updated
Sep 28, 2023
Dataset provided by
DataverseNO
Authors
Md Mehboob Alam
License
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
Description
The files contain data for reproducing all the results in the article "Benchmarking density functional methods for harmonic vibrational frequencies" (IN REVIEW). The file frequency_data_for_statistical_analysis.xlsx is an Excel file containing 11 differently named worksheets. Each worksheet contains the name of the XC functionals used. All the quantities are calculated using the standard mathematical formulas of Excel. The distribution_of_signed_error_plot.pdf is a pdf file containing the distribution of signed error obtained for each molecule using 17 different XC functionals. The distribution plots are obtained using the distribution formula given in the upcoming article. All the plots have been created using GNUPLOT software. The text files are tab-delimited text files obtained from the Excel worksheets.
Raw data excel and report
figshare.com
xlsx
Updated Oct 28, 2022
Cite
Cameron Burns; ben peacock (2022). Raw data excel and report [Dataset]. http://doi.org/10.6084/m9.figshare.21400044.v1
Explore at:
Available download formats: xlsx
Unique identifier
https://doi.org/10.6084/m9.figshare.21400044.v1
Dataset updated
Oct 28, 2022
Dataset provided by
Figshare (http://figshare.com/)
figshare
Authors
Cameron Burns; ben peacock
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
Lyophilised GFP exosome reference standard provided by ISEVxTech from Merck (Exosome standards, fluorescent recombinant, expressed in human cells | Sigma-Aldrich (sigmaaldrich.com)).

Unless otherwise mentioned all samples and standards had data acquired over one minute and where water is used as a diluent it is HPLC grade water.

Lyophilised Merck rEVs were resuspended in 100µl of water and then placed on ice (all subsequent sample handling was performed on ice as per manufacturer instruction). QC beads (2.17E+10 particles per ml (P/ml)) were diluted 1 in 100 with water to a volume of 100µl and used to align the Nanoanalyzer and create the concentration standard used to assess the samples. A size bead cocktail (68nm, 91nm, 113nm and 155nm) of silica nanospheres was diluted 1 in 100 to a volume of 100µl with water and then analysed to create the size standard (figure 1). 100µl of TE buffer was analysed in order to provide a blank for the samples. Samples were serially diluted 1 in 25, 1 in 50 and 1 in 100 with TE buffer and then immediately analysed in triplicate. Data analysis was performed using the nFCM professional software; files were saved in FCS format, with associated PDF reports found on Figshare.
Taking Part 2011/12 Quarter 3: Statistical Release
gov.uk
Updated Mar 29, 2012
Cite
Department for Digital, Culture, Media & Sport (2012). Taking Part 2011/12 Quarter 3: Statistical Release [Dataset]. https://www.gov.uk/government/statistics/taking-part-2011-12-quarter-3-statistical-release
Explore at:
Dataset updated
Mar 29, 2012
Dataset provided by
GOV.UK (http://gov.uk/)
Authors
Department for Digital, Culture, Media & Sport
Description
The Taking Part survey has run since 2005 and is the key evidence source for DCMS. It is a continuous face to face household survey of adults aged 16 and over in England and children aged 5-15 years old. This latest release presents rolling estimates incorporating data from the third quarter of year seven of the survey.

Released:

29 March 2012

Period covered:

January 2011 - December 2011

Geographic coverage:

National and Regional level data for England.

Next release date:

A release of rolling annual estimates for adults, including the fourth quarter of the 2011/12 survey year, is scheduled for the end of June 2012.

Summary

The latest data from the 2011/12 Taking Part survey provides reliable national estimates of adult and child engagement with sport, libraries, the arts, heritage and museums and galleries. This release builds on the 2010/11 data and on the quarter 1 and quarter 2 releases from earlier in 2011/12 to look at a number of areas in depth and present measures that begin to consider broader definitions of participation in our sectors. The report also looks at some of the other measures in the survey that provide estimates of volunteering and charitable giving and civic engagement.

The Taking Part survey is a continuous annual survey of adults and children living in private households in England, and carries the National Statistics badge, meaning that it meets the highest standards of statistical quality.

Statistical Report

Taking Part 2011/12 Quarter 3: Statistical Release (PDF 468kb): http://www.culture.gov.uk/images/research/TakingPart_2011-12_Q3_StatisticalRelease.pdf

Taking Part 2011/12 Quarter 3: Statistical Release (Word 1mb): http://www.culture.gov.uk/images/research/TakingPart_2011-12_Q3_StatisticalRelease.doc

Dashboard

Taking Part 2011/12 Quarter 3: Dashboard (PDF 166kb): http://www.culture.gov.uk/images/research/taking-part-dashboard_Yr7-Q3.pdf

Statistical Worksheets

These spreadsheets contain the data and sample sizes to support the material in this release:

Archives (Excel 89kb): http://www.culture.gov.uk/images/research/TP_Y7_Q3_Figures_Archives.xls

Arts (Excel 80kb): http://www.culture.gov.uk/images/research/TP_Y7_Q3_Figures_Arts.xls

Cycling and swimming proficiency (Excel 71kb): http://www.culture.gov.uk/images/research/TP_Y7_Q3_Figures_Cycling_and_swimming_proficiency.xls

Digital participation (Excel 75kb): http://www.culture.gov.uk/images/research/TP_Y7_Q3_Figures_Digital_participation.xls

Heritage (Excel 78kb): http://www.culture.gov.uk/images/research/TP_Y7_Q3_Figures_Heritage.xls

Libraries (Excel 71kb): http://www.culture.gov.uk/images/research/TP_Y7_Q3_Figures_Libraries.xls

Museums and Galleries (Excel 74kb): http://www.culture.gov.uk/images/research/TP_Y7_Q3_Figures_Museums_and_Galleries.xls

Olympics (Excel 87kb): http://www.culture.gov.uk/images/research/TP_Y7_Q3_Figures_Olympics.xls

Sports (Excel 73kb): http://www.culture.gov.uk/images/research/TP_Y7_Q3_Figures_Sports.xls

Big Society (Excel 196kb): http://www.culture.gov.uk/images/research/TP_Y7_Q3_Big_Society.xls

Previous release

The previous Taking Part release was published on 21 December 2011 and can be found online. It also provides spreadsheets containing the data and sample sizes for each sector included in the survey.

Taking Part 2011/12 Quarter 2: http://www.culture.gov.uk/publications/8734.aspx

Pre-release access

The document below contains a list of Ministers and Officials who have received privileged early access to this release of Taking Part data. In line with best practice, the list has been kept to a minimum and those given access for briefing purposes had a maximum of 24 hours.

Pre-release access (PDF 63kb): http://www.culture.gov.uk/images/research/TP_Q3_prerelease_access-list.pdf

The UK Statistics Authority

This release is published in accordance with the Code of Practice for Off
Current Practices in Clinical Gait Analysis in Europe - Survey
data.niaid.nih.gov
data-staging.niaid.nih.gov
Updated Jul 10, 2024
Cite
Stéphane ARMAND; Morgan SANGEUX; Zimi SAWACHA; Brian HORSAK (2024). Current Practices in Clinical Gait Analysis in Europe - Survey [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10124976
Explore at:
Dataset updated
Jul 10, 2024
Authors
Stéphane ARMAND; Morgan SANGEUX; Zimi SAWACHA; Brian HORSAK
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Area covered
Europe
Description
This dataset contains anonymous raw data from a questionnaire on the practice of clinical gait analysis in Europe. This work was initiated by ESMAC (European Society for Movement Analysis in Adults and Children). It includes the analysis of 75 questions answered by 97 laboratories.

The dataset contains 5 files:
- Survey_ESMAC_Questions is a pdf file containing the questions asked.
- Survey_ESMAC_Data.xlsx is an Excel file containing the raw data and the data modified for the analysis. The modifications made were notified in two sheets of the file.
- Survey_ESMAC_Results.pdf is a file containing the export of the results in PDF format.
- Survey_ESMAC_Results.html is a file containing the export of results in HTML format.
Data from: General Practice Workforce
digital.nhs.uk
Updated Nov 28, 2024
Cite
(2024). General Practice Workforce [Dataset]. https://digital.nhs.uk/data-and-information/publications/statistical/general-and-personal-medical-services
Explore at:
Dataset updated
Nov 28, 2024
License
https://digital.nhs.uk/about-nhs-digital/terms-and-conditions
Time period covered
Sep 30, 2015 - Oct 31, 2024
Description
The General Practice Workforce series of Official Statistics presents a snapshot of the primary care general practice workforce. A snapshot statistic relates to the situation at a specific date, which for these workforce statistics is now the last calendar day each month. This monthly snapshot reflects the general practice workforce at 31 October 2024. These statistics present full-time equivalent (FTE) and headcount figures by four staff groups, (GPs, Nurses, Direct Patient Care (DPC) and administrative staff), with breakdowns of individual job roles within these high-level groups. For the purposes of NHS workforce statistics, we define full-time working to be 37.5 hours per week. Full-time equivalent is a standardised measure of the workload of an employed person. Using FTE, we can convert part-time and additional working hours into an equivalent number of full-time staff. For example, an individual working 37.5 hours would be classed as 1.0 FTE while a colleague working 30 hours would be 0.8 FTE. The term “headcount” relates to distinct individuals, and as the same person may hold more than one role, care should be taken when interpreting headcount figures. Please refer to the Using this Publication section for information and guidance about the contents of this publication and how it can and cannot be used. England-level time series figures for all job roles are available in the Excel bulletin tables back to September 2015 when this series of Official Statistics began. The Excel file also includes Sub-ICB Location-level FTE and headcount breakdowns for the current reporting period. CSVs containing practice-level summaries and Sub-ICB Location-level counts of individuals are also available. Please refer to the Publication content, analysis, and release schedule in the Using this publication section for more details of what’s available. 
We are continually working to improve our publications to ensure their contents are as useful and relevant as possible for our users. We welcome feedback from all users to PrimaryCareWorkforce@nhs.net.
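The full-time equivalent and headcount definitions above can be illustrated with a short Python sketch. The 37.5-hour full-time week and the 30-hour example come from the description; the role records, person identifiers and variable names are hypothetical and are not taken from the published tables.

```python
FULL_TIME_HOURS = 37.5  # full-time week as defined in the description


def fte(hours_per_week):
    """Convert contracted weekly hours into full-time equivalents."""
    return hours_per_week / FULL_TIME_HOURS


# Hypothetical role records: (person_id, staff_group, weekly_hours).
# Person "p2" holds two roles, so they count once in headcount but
# contribute the hours of both roles to the FTE total.
roles = [
    ("p1", "GP", 37.5),
    ("p2", "Nurse", 30.0),
    ("p2", "Admin", 7.5),
]

total_fte = sum(fte(hours) for _, _, hours in roles)
headcount = len({person for person, _, _ in roles})
fte_by_group = {group: fte(hours) for _, group, hours in roles}
```

In this toy example the 30-hour role works out to 0.8 FTE, matching the description, and headcount counts distinct individuals across all roles. Counting distinct people within each staff group separately would instead count "p2" once under Nurses and once under administrative staff, which is why headcount figures need care when summed across groups.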
Household Health Survey 2012-2013, Economic Research Forum (ERF)...
catalog.ihsn.org
datacatalog.ihsn.org
Updated Jun 26, 2017
Cite
Central Statistical Organization (CSO) (2017). Household Health Survey 2012-2013, Economic Research Forum (ERF) Harmonization Data - Iraq [Dataset]. https://catalog.ihsn.org/index.php/catalog/6937
Explore at:
Dataset updated
Jun 26, 2017
Dataset provided by
Kurdistan Regional Statistics Office (KRSO)
Central Statistical Organization (CSO)
Economic Research Forum
Time period covered
2012 - 2013
Area covered
Iraq
Description
Abstract

The harmonized data set on health, created and published by the ERF, is a subset of Iraq Household Socio Economic Survey (IHSES) 2012. It was derived from the household, individual and health modules, collected in the context of the above mentioned survey. The sample was then used to create a harmonized health survey, comparable with the Iraq Household Socio Economic Survey (IHSES) 2007 micro data set.

----> Overview of the Iraq Household Socio Economic Survey (IHSES) 2012:

Iraq is considered a leader in household expenditure and income surveys: the first was conducted in 1946, followed by surveys in 1954 and 1961. After the establishment of the Central Statistical Organization, household expenditure and income surveys were carried out every 3-5 years (1971/1972, 1976, 1979, 1984/1985, 1988, 1993, 2002/2007). Implementing the cooperation between CSO and WB, the Central Statistical Organization (CSO) and Kurdistan Region Statistics Office (KRSO) launched fieldwork on IHSES on 1/1/2012. The survey was carried out over a full year covering all governorates including those in Kurdistan Region.

The survey has six main objectives. These objectives are:

Provide data for poverty analysis and measurement, and monitor, evaluate and update the implementation of the Poverty Reduction National Strategy issued in 2009.

Provide a comprehensive data system to assess household social and economic conditions and prepare the indicators related to human development.

Provide data that meet the needs and requirements of national accounts.

Provide detailed indicators on consumption expenditure that support decision-making related to production, consumption, export and import.

Provide detailed indicators on the sources of household and individual income.

Provide the data necessary for formulating a new consumer price index number.

The raw survey data provided by the Statistical Office were then harmonized by the Economic Research Forum, to create a comparable version with the 2006/2007 Household Socio Economic Survey in Iraq. Harmonization at this stage only included unifying variables' names, labels and some definitions. See: Iraq 2007 & 2012- Variables Mapping & Availability Matrix.pdf provided in the external resources for further information on the mapping of the original variables on the harmonized ones, in addition to more indications on the variables' availability in both survey years and relevant comments.

Geographic coverage

National coverage: Covering a sample of urban, rural and metropolitan areas in all the governorates including those in Kurdistan Region.

Analysis unit

1- Household/family. 2- Individual/person.

Universe

The survey was carried out over a full year covering all governorates including those in Kurdistan Region.

Kind of data

Sample survey data [ssd]

Sampling procedure

----> Design:

The sample size was 25488 households for the whole of Iraq: 216 households in each of the 118 districts, organized into 2832 clusters of 9 households each, distributed across districts and governorates for rural and urban areas.

----> Sample frame:

The listing and numbering results of the 2009-2010 Population and Housing Survey were adopted in all governorates, including the Kurdistan Region, as the frame for selecting households. The sample was selected in two stages. Stage 1: Primary sampling units (blocks) within each stratum (district), for urban and rural areas, were systematically selected with probability proportional to size to reach 2832 units (clusters). Stage 2: 9 households were selected from each primary sampling unit to create a cluster; thus the total sample size was 25488 households distributed across the governorates, 216 households in each district.

----> Sampling Stages:

In each district, the sample was selected in two stages. Stage 1: Based on the 2010 listing and numbering frame, 24 sample points were selected within each stratum through systematic sampling with probability proportional to size, in addition to the implicit breakdown by urban and rural and the geographic breakdown (sub-district, quarter, street, county, village and block). Stage 2: Using households as secondary sampling units, 9 households were selected from each sample point using systematic equal probability sampling. The sampling frames for each stage can be developed from the 2010 building listing and numbering without updating household lists. In some small districts, random selection of primary sampling units may yield fewer than 24 distinct units, so a sampling unit may be selected more than once; where necessary, two or more clusters may be drawn from the same enumeration unit.
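The two selection stages described above can be sketched as follows. This is a minimal illustration, not the survey's actual selection program: the function names, the block sizes and the random seed are all hypothetical.

```python
import random
from itertools import accumulate

def systematic_pps(sizes, n, rng):
    """Stage 1: pick n units systematically with probability proportional
    to size. Returns unit indices; a large unit may appear more than once."""
    total = sum(sizes)
    step = total / n
    start = rng.random() * step          # random start in [0, step)
    cum = list(accumulate(sizes))        # cumulative size boundaries
    picks, idx = [], 0
    for k in range(n):
        point = start + k * step
        # Advance to the unit whose cumulative range contains the point.
        while idx < len(cum) - 1 and cum[idx] <= point:
            idx += 1
        picks.append(idx)
    return picks

def systematic_equal(units, n, rng):
    """Stage 2: pick n units systematically with equal probability."""
    step = len(units) / n
    start = rng.random() * step
    last = len(units) - 1
    return [units[min(int(start + k * step), last)] for k in range(n)]

rng = random.Random(0)                               # fixed seed, illustrative
sizes = [rng.randint(50, 300) for _ in range(60)]    # hypothetical block sizes
picks = systematic_pps(sizes, 24, rng)               # 24 sample points
households = list(range(sizes[picks[0]]))            # hypothetical household list
chosen = systematic_equal(households, 9, rng)        # 9 households per point

# A small district dominated by one large block: the same block is hit repeatedly.
small = systematic_pps([10, 1000, 10], 24, rng)
```

The last call mirrors the small-district case in the text: when there are fewer than 24 blocks, the systematic walk lands in the same block several times, which corresponds to drawing more than one cluster from it.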

Mode of data collection

Face-to-face [f2f]

Research instrument

----> Preparation:

The questionnaire of the 2006 survey was adopted as the basis for the 2012 questionnaire, to which many revisions were made. Two rounds of pre-tests were carried out. Revisions were made based on feedback from the fieldwork team, World Bank consultants and others; further revisions were made before the final version was implemented in a pilot survey in September 2011. After the pilot survey, further revisions were made based on the challenges and feedback that emerged during its implementation, producing the final version used in the actual survey.

----> Questionnaire Parts:

The questionnaire consists of four parts, each with several sections:

Part 1: Socio-Economic Data:
- Section 1: Household Roster
- Section 2: Emigration
- Section 3: Food Rations
- Section 4: housing
- Section 5: education
- Section 6: health
- Section 7: Physical measurements
- Section 8: job seeking and previous job

Part 2: Monthly, Quarterly and Annual Expenditures:
- Section 9: Expenditures on Non-Food Commodities and Services (past 30 days).
- Section 10: Expenditures on Non-Food Commodities and Services (past 90 days).
- Section 11: Expenditures on Non-Food Commodities and Services (past 12 months).
- Section 12: Expenditures on Non-food Frequent Food Stuff and Commodities (7 days).
- Section 12, Table 1: Meals Had Within the Residential Unit.
- Section 12, Table 2: Number of Persons Participating in the Meals within Household Expenditure Other Than its Members.

Part 3: Income and Other Data:
- Section 13: Job
- Section 14: paid jobs
- Section 15: Agriculture, forestry and fishing
- Section 16: Household non-agricultural projects
- Section 17: Income from ownership and transfers
- Section 18: Durable goods
- Section 19: Loans, advances and subsidies
- Section 20: Shocks and strategy of dealing in the households
- Section 21: Time use
- Section 22: Justice
- Section 23: Satisfaction in life
- Section 24: Food consumption during past 7 days

Part 4: Diary of Daily Expenditures: The expenditure diary is an essential component of this survey. It is left with the household to record all daily purchases, such as expenditures on food and frequent non-food items (gasoline, newspapers, etc.), over 7 days. Two pages were allocated for recording the expenditures of each day, thus the roster consists of 14 pages.

Cleaning operations

----> Raw Data:

Data Editing and Processing: To ensure accuracy and consistency, the data were edited at the following stages:
1. Interviewer: Checks all answers on the household questionnaire, confirming that they are clear and correct.
2. Local Supervisor: Checks that the questions have been correctly completed.
3. Statistical analysis: After exporting data files from Excel to SPSS, the Statistical Analysis Unit uses program commands to identify irregular or illogical values, in addition to auditing some variables.
4. World Bank consultants in coordination with the CSO data management team: The World Bank technical consultants use additional programs in SPSS and Stata to examine and correct remaining inconsistencies within the data files. The software detects errors by analyzing questionnaire items according to the expected parameters for each variable.
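A check of values against expected parameters for each variable, as in the later editing stages above, might look like the sketch below. The survey's own checks were written in SPSS and are not reproduced here; the variable names and ranges are hypothetical and chosen only for illustration.

```python
# Hypothetical expected ranges per variable (not the survey's actual rules).
EXPECTED = {"age": (0, 110), "household_size": (1, 40)}

def find_irregular(records, expected=EXPECTED):
    """Return (row_index, variable, value) for missing or out-of-range values."""
    issues = []
    for i, rec in enumerate(records):
        for var, (lo, hi) in expected.items():
            value = rec.get(var)
            if value is None or not (lo <= value <= hi):
                issues.append((i, var, value))
    return issues

records = [
    {"age": 34, "household_size": 6},
    {"age": 250, "household_size": 4},   # age outside the expected range
    {"age": 12, "household_size": 0},    # household size below the minimum
]
issues = find_irregular(records)
print(issues)
# prints: [(1, 'age', 250), (2, 'household_size', 0)]
```

Flagged rows would then be reviewed and corrected rather than dropped automatically.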

----> Harmonized Data:

The SPSS package is used to harmonize the Iraq Household Socio Economic Survey (IHSES) 2007 with Iraq Household Socio Economic Survey (IHSES) 2012.

The harmonization process starts with raw data files received from the Statistical Office.

A program is generated for each dataset to create harmonized variables.

Data are saved at the household and individual levels in SPSS and then converted to STATA for dissemination.
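The harmonization program for each dataset essentially maps raw variable names onto a shared set of harmonized names. A minimal sketch of that idea follows; the actual mapping is documented in the Variables Mapping & Availability Matrix, so the raw names, harmonized names and values below are all hypothetical.

```python
# Hypothetical raw-to-harmonized variable maps, one per survey year.
MAPS = {
    2007: {"q101": "hh_size", "gov": "governorate"},
    2012: {"s1_q5": "hh_size", "govcode": "governorate"},
}

def harmonize(record, year):
    """Rename mapped variables; drop variables with no harmonized counterpart."""
    mapping = MAPS[year]
    return {mapping[name]: value for name, value in record.items() if name in mapping}

raw_2007 = {"q101": 7, "gov": "Baghdad", "extra": 1}
raw_2012 = {"s1_q5": 5, "govcode": "Basrah"}

h07 = harmonize(raw_2007, 2007)
h12 = harmonize(raw_2012, 2012)
# Both records now share the same variable names and can be pooled or compared.
```

Keeping one map per survey year mirrors the description of a separate program generated for each dataset.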

Response rate

The Iraq Household Socio Economic Survey (IHSES) reached a total of 25488 households. The number of households that refused to respond was 305, and the response rate was 98.6%. The highest interview rates were in Ninevah and Muthanna (100%), while the lowest rate was in Sulaimaniya (92%).
Statistical Abstract of the United States, 2011
dataverse-staging.rdmc.unc.edu
Updated Oct 28, 2011
Cite
UNC Dataverse (2011). Statistical Abstract of the United States, 2011 [Dataset]. https://dataverse-staging.rdmc.unc.edu/dataset.xhtml?persistentId=hdl:1902.29/CD-10849
Explore at:
Dataset updated
Oct 28, 2011
Dataset provided by
UNC Dataverse
License
https://dataverse-staging.rdmc.unc.edu/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=hdl:1902.29/CD-10849
Description
"The Statistical Abstract of the United States, published since 1878, is the standard summary of statistics on the social, political, and economic organization of the United States. It is designed to serve as a convenient volume for statistical reference and as a guide to other statistical publications and sources. The latter function is served by the introductory text to each section, the source note appearing below each table, and Appendix I, which comprises the Guide to Sources of Statistics, the Guide to State Statistical Abstracts, and the Guide to Foreign Statistical Abstracts. The Statistical Abstract sections and tables are compiled into one Adobe PDF named StatAbstract2009.pdf. This PDF is bookmarked by section and by table and can be searched using the Acrobat Search feature. The Statistical Abstract on CD-ROM is best viewed using Adobe Acrobat 5, or any subsequent version of Acrobat or Acrobat Reader. The Statistical Abstract tables and the metropolitan areas tables from Appendix II are available as Excel (.xls or .xlw) spreadsheets. In most cases, these spreadsheet files offer the user direct access to more data than are shown either in the publication or Adobe Acrobat. These files usually contain more years of data, more geographic areas, and/or more categories of subjects than those shown in the Acrobat version. The extensive selection of statistics is provided for the United States, with selected data for regions, divisions, states, metropolitan areas, cities, and foreign countries from reports and records of government and private agencies. Software on the disc can be used to perform full-text searches, view official statistics, open tables as Lotus worksheets or Excel workbooks, and link directly to source agencies and organizations for supporting information. Except as indicated, figures are for the United States as presently constituted.
Although emphasis in the Statistical Abstract is primarily given to national data, many tables present data for regions and individual states and a smaller number for metropolitan areas and cities. Statistics for the Commonwealth of Puerto Rico and for island areas of the United States are included in many state tables and are supplemented by information in Section 29. Additional information for states, cities, counties, metropolitan areas, and other small units, as well as more historical data are available in various supplements to the Abstract. Statistics in this edition are generally for the most recent year or period available by summer 2006. Each year over 1,400 tables and charts are reviewed and evaluated; new tables and charts of current interest are added, continuing series are updated, and less timely data are condensed or eliminated. Text notes and appendices are revised as appropriate. This year we have introduced 72 new tables covering a wide range of subject areas. These cover a variety of topics including: learning disability for children, people impacted by the hurricanes in the Gulf Coast area, employees with alternative work arrangements, adult computer and Internet users by selected characteristics, North America cruise industry, women- and minority-owned businesses, and the percentage of the adult population considered to be obese.
Some of the annually surveyed topics are population; vital statistics; health and nutrition; education; law enforcement, courts and prison; geography and environment; elections; state and local government; federal government finances and employment; national defense and veterans affairs; social insurance and human services; labor force, employment, and earnings; income, expenditures, and wealth; prices; business enterprise; science and technology; agriculture; natural resources; energy; construction and housing; manufactures; domestic trade and services; transportation; information and communication; banking, finance, and insurance; arts, entertainment, and recreation; accommodation, food services, and other services; foreign commerce and aid; outlying areas; and comparative international statistics." Note to Users: This CD is part of a collection located in the Data Archive of the Odum Institute for Research in Social Science, at the University of North Carolina at Chapel Hill. The collection is located in Room 10, Manning Hall. Users may check the CDs out subscribing to the honor system. Items can be checked out for a period of two weeks. Loan forms are located adjacent to the collection.

Rong Luo (2023). Data Cleaning Sample [Dataset]. http://doi.org/10.5683/SP3/ZCN177

Data Cleaning Sample

Explore at:

167 scholarly articles cite this dataset (View in Google Scholar)

