Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The dataset is created for the year 2018 in Germany for 267 popular initiatives at the local level where the information was collected regarding the following variables: topic of initiative, State legislation, turnout, validity of the initiative, success of the popular initiative, proportion of yes voters in case of a referendum, size of municipality, profile of initiators (local council or citizens), proportion of no voters, index of mobilization, repartition index (difference between yes and no voters), approval rate, number of inhabitants, correction of initiative, status of the case (open/closed).The dataset was created with the help of the existing popular initiatives registered by the University of Wuppertal and the association Mehr Demokratie in Germany.The idea of the dataset is to evaluate in details on which factors the success of popular initiatives depend in the different German States (Länder). A repartition index (difference between yes and no voters) and a mobilization index (repartition index multiplied by the turnout) were calculated and added in the dataset. All the other variables were also created in order to balance the result of these initiatives. The final aim is to be able to measure how direct democratic tools influence local politics in Germany. This is why it is important to examine the prevailing factors for the satisfaction of citizens who use these procedures.In this dataset, the destiny of an initiative (failure/success) can be taken as the dependent variable and all the others could be classified as independent variables.Direct democracy offers possibilities for citizens to influence political decisions especially at the local level. In Germany, the local political systems have been affected by the introduction of direct democratic tools such as citizen initiatives and local referenda since the Reunification. The State legislations defined new conditions for citizen initiatives and municipal referenda with a minimum number of valid signatures for initiatives and a minimum approval rate for the referenda. In the attached file, you will find the dataset in Excel file as well as a .dta file that you can open with the software Stata (https://www.stata.com/).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Canada Trademarks Dataset
18 Journal of Empirical Legal Studies 908 (2021), prepublication draft available at https://papers.ssrn.com/abstract=3782655, published version available at https://onlinelibrary.wiley.com/share/author/CHG3HC6GTFMMRU8UJFRR?target=10.1111/jels.12303
Dataset Selection and Arrangement (c) 2021 Jeremy Sheff
Python and Stata Scripts (c) 2021 Jeremy Sheff
Contains data licensed by Her Majesty the Queen in right of Canada, as represented by the Minister of Industry, the minister responsible for the administration of the Canadian Intellectual Property Office.
This individual-application-level dataset includes records of all applications for registered trademarks in Canada since approximately 1980, and of many preserved applications and registrations dating back to the beginning of Canada’s trademark registry in 1865, totaling over 1.6 million application records. It includes comprehensive bibliographic and lifecycle data; trademark characteristics; goods and services claims; identification of applicants, attorneys, and other interested parties (including address data); detailed prosecution history event data; and data on application, registration, and use claims in countries other than Canada. The dataset has been constructed from public records made available by the Canadian Intellectual Property Office. Both the dataset and the code used to build and analyze it are presented for public use on open-access terms.
Scripts are licensed for reuse subject to the Creative Commons Attribution License 4.0 (CC-BY-4.0), https://creativecommons.org/licenses/by/4.0/. Data files are licensed for reuse subject to the Creative Commons Attribution License 4.0 (CC-BY-4.0), https://creativecommons.org/licenses/by/4.0/, and also subject to additional conditions imposed by the Canadian Intellectual Property Office (CIPO) as described below.
Terms of Use:
As per the terms of use of CIPO's government data, all users are required to include the above-quoted attribution to CIPO in any reproductions of this dataset. They are further required to cease using any record within the datasets that has been modified by CIPO and for which CIPO has issued a notice on its website in accordance with its Terms and Conditions, and to use the datasets in compliance with applicable laws. These requirements are in addition to the terms of the CC-BY-4.0 license, which require attribution to the author (among other terms). For further information on CIPO’s terms and conditions, see https://www.ic.gc.ca/eic/site/cipointernet-internetopic.nsf/eng/wr01935.html. For further information on the CC-BY-4.0 license, see https://creativecommons.org/licenses/by/4.0/.
The following attribution statement, if included by users of this dataset, is satisfactory to the author, but the author makes no representations as to whether it may be satisfactory to CIPO:
The Canada Trademarks Dataset is (c) 2021 by Jeremy Sheff and licensed under a CC-BY-4.0 license, subject to additional terms imposed by the Canadian Intellectual Property Office. It contains data licensed by Her Majesty the Queen in right of Canada, as represented by the Minister of Industry, the minister responsible for the administration of the Canadian Intellectual Property Office. For further information, see https://creativecommons.org/licenses/by/4.0/ and https://www.ic.gc.ca/eic/site/cipointernet-internetopic.nsf/eng/wr01935.html.
Details of Repository Contents:
This repository includes a number of .zip archives which expand into folders containing either scripts for construction and analysis of the dataset or data files comprising the dataset itself. These folders are as follows:
If users wish to construct rather than download the datafiles, the first script that they should run is /py/sftp_secure.py. This script will prompt the user to enter their IP Horizons SFTP credentials; these can be obtained by registering with CIPO at https://ised-isde.survey-sondage.ca/f/s.aspx?s=59f3b3a4-2fb5-49a4-b064-645a5e3a752d&lang=EN&ds=SFTP. The script will also prompt the user to identify a target directory for the data downloads. Because the data archives are quite large, users are advised to create a target directory in advance and ensure they have at least 70GB of available storage on the media in which the directory is located.
The sftp_secure.py script will generate a new subfolder in the user’s target directory called /XML_raw. Users should note the full path of this directory, which they will be prompted to provide when running the remaining python scripts. Each of the remaining scripts, the filenames of which begin with “iterparse”, corresponds to one of the data files in the dataset, as indicated in the script’s filename. After running one of these scripts, the user’s target directory should include a /csv subdirectory containing the data file corresponding to the script; after running all the iterparse scripts the user’s /csv directory should be identical to the /csv directory in this repository. Users are invited to modify these scripts as they see fit, subject to the terms of the licenses set forth above.
With respect to the Stata do-files, only one of them is relevant to construction of the dataset itself. This is /do/CA_TM_csv_cleanup.do, which converts the .csv versions of the data files to .dta format, and uses Stata’s labeling functionality to reduce the size of the resulting files while preserving information. The other do-files generate the analyses and graphics presented in the paper describing the dataset (Jeremy N. Sheff, The Canada Trademarks Dataset, 18 J. Empirical Leg. Studies (forthcoming 2021)), available at https://papers.ssrn.com/abstract=3782655). These do-files are also licensed for reuse subject to the terms of the CC-BY-4.0 license, and users are invited to adapt the scripts to their needs.
The python and Stata scripts included in this repository are separately maintained and updated on Github at https://github.com/jnsheff/CanadaTM.
This repository also includes a copy of the current version of CIPO's data dictionary for its historical XML trademarks archive as of the date of construction of this dataset.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The zip files contain several files with wills from Mexico between 1810 and 1910 collected in order to measure Mexican wealth distribution in its first century of independence. The main file is wills_clean.xlsx, which contains the full collection of wills; in that file, you will find variables for year, state, and wealth, not excluding debts, debts and wealth (net wealth). You can combine this file with the do file cleaningroutine_for_social_tables to produce the detailed social tables. The rest of the files consist of data files with the social tables (for comparison) and xlsx files with the wills from the main file divided by decade to facilitate calculations using the do file inequality_analysis_ routine_clean.do from which you will be able to reproduce the rest of the analysis (unbalanced sample and generalized beta, lognormal, etc.) Note: The calculation programs are .do files; thus, they require stata to be executed. Some of the detailed social tables are dta files, and thus also stata files. You can open them in R and work with them or convert them to any other data format. The wills come from 5 different Mexican archives: Archivo Histórico de Notarias de la Ciudad de México, Archivo General del Estado de Yucatán, Archivo Municipal de Saltillo, Archivo Histórico de la Ciudad de Morelia and, Testamentos del Colegio de Sonora.
These data are part of NACJD's Fast Track Release and are distributed as they were received from the data depositor. The files have been zipped by NACJD for release, but not checked or processed except for the removal of direct identifiers. Users should refer to the accompanying readme files for a brief dscription of the files available with this collection and consult the investigator(s) if further information is needed. The qualitative data are not available as part of the data collection at this time. Numerous high-profile events involving student victimization on school buses have raised critical questions regarding the safety of school-based transportation for children, the efforts taken by school districts to protect students on buses, and the most effective transportation-based behavioral management strategies for reducing misconduct. To address these questions, a national web-based survey was administered to public school district-level transportation officials throughout the United States to assess the prevalence of misconduct on buses, identify strategies to address misconduct, and describe effective ways to reduce student misbehavior on buses. Telephone interviews were also conducted with a small group of transportation officials to understand the challenges of transportation-based behavioral management, to determine successful strategies to create safe and positive school bus environments, and to identify data-driven approaches for tracking and assessing disciplinary referrals. The collection includes 10 Stata data files: BVSBS_analysis file.dta (n=2,595; 1058 variables) Title Crosswalk File.dta (n=2,594; 3 variables) Lessons Learned and Open Dummies.dta (n=1,543; 200 variables) CCD dataset.dta (n=12,494; 89 variables) BVSB_REGION.dta (n=4; 3 variables) BVSB_SCHOOLS.dta (n=3; 3 variables) BVSB_STUDENTS.dta (n=3; 3 variables) BVSB_URBAN.dta (n=8; 3 variables) BVSB_WHITE.dta (n=3; 3 variables) FINALRAKER.dta (n=2,595; 2 variables)
Open Data: dta and do files with codebook. Visit https://dataone.org/datasets/sha256%3A2f12c5ceaeef6d27970926f3a983679fd21c34b8277efd01cfbc9274ca921a04 for complete metadata about this dataset.
https://doi.org/10.17026/fp39-0x58https://doi.org/10.17026/fp39-0x58
Learning is crucial for everyone. The association between biological lifestyle factors (i.e., physical activity, sleep, and nutrition) and learning performance has been well established for children, adolescents and college students in traditional education. Evidence for these associations for adult distance students is lacking however. The Adult Learning Open University Determinants (ALOUD) study is the first to identify the determinants of learning performance within adult distance education. Over the course of 1 year, all new students (n = 4945) of the Open University of the Netherlands were approached. At the start of their study, 2040 students fully participated in this observational study by filling out an online questionnaire and performing cognitive tests. Learning performance was measured over the course of 14 months, the subscription period of the first course bought by the student. The ALOUD study might result in tailor-made educational innovations for adults participating in distance education and, finally, a more successful distance education student population. Of the 4945 students who were approached, 2842 responded and 2040 fully participated at baseline.The variables 'beroep' and 'opmerkingen' are removed due to privacy concerns.Because this is the biological part of the dataset, the variables regarding social support, life events, coping, self-efficacy, affect, perseverance, self esteem, test anxiety, goal orientation and learning strategies are not included.The variables 'dob_year_survey' and 'dob_year_OUdata' at the end of the dataset indicate year of birth.The Open University (NL) provided the .sav file. DANS converted this file to .por and dta. format and added these formats. As a result, the data is available in .sav, .por and .dta format.
This exercise dataset was created for researchers interested in learning how to use the models described in the "Handbook on Impact Evaluation: Quantitative Methods and Practices" by S. Khandker, G. Koolwal and H. Samad, World Bank, October 2009 (permanent URL http://go.worldbank.org/FE8098BI60).
Public programs are designed to reach certain goals and beneficiaries. Methods to understand whether such programs actually work, as well as the level and nature of impacts on intended beneficiaries, are main themes of this book. Has the Grameen Bank, for example, succeeded in lowering consumption poverty among the rural poor in Bangladesh? Can conditional cash transfer programs in Mexico and Latin America improve health and schooling outcomes for poor women and children? Does a new road actually raise welfare in a remote area in Tanzania, or is it a "highway to nowhere?"
This handbook reviews quantitative methods and models of impact evaluation. It begings by reviewing the basic issues pertaining to an evaluation of an intervention to reach certain targets and goals. It then focuses on the experimental design of an impact evaluation, highlighting its strengths and shortcomings, followed by discussions on various non-experimental methods. The authors also cover methods to shed light on the nature and mechanisms by which different participants are benefiting from the program.
The handbook provides STATA exercises in the context of evaluating major microcredit programs in Bangladesh, such as the Grameen Bank. This dataset provides both the related Stata data files and the Stata programs.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Data for paper published in PLOS ONE 14.07.2023 These files were used for the statistical analysis of the hemparc feasibility trial using Stata software verson 17, and are as follows, both Stata format and .csv format as appropriate. The .do file is a simple text file.
hepmarc_data minimum dataset: .csv, .dta: See doi:10.1136/bmjopen-2019-035596 for study protocol describing all data collected hepmarc Data dictionary .xls, .dta; description of each data fields in minimum dataset hepmarc AE listing: Adverse events listing .csv, .dta hepmarc SAP v1.0 240322_.xls .dta; description of each data fields in minimum dataset hepmarc data.do Stata .do file used to perform the analysis
Notes: Each particpant's age has been altered by a random amount to preserve anonymity. There are two rows for two of the participants who each reported two adverse reactions.
Abstract Objectives Maraviroc may reduce hepatic inflammation in people with HIV and non-alcoholic fatty liver disease (HIV-NAFLD) through CCR5-receptor antagonism, which warrants further exploration.
Methods We performed an open-label 96-week randomised-controlled feasibility trial of maraviroc plus optimised background therapy (OBT) versus OBT alone, in a 1:1 ratio, for people with virologically-suppressed HIV-1 and NAFLD without cirrhosis. Dosing followed recommendations for HIV therapy in the Summary of Product Characteristics for maraviroc. The primary outcomes were safety, recruitment and retention rates, adherence and data completeness. Secondary outcomes included the change in Fibroscan-assessed liver stiffness measurements (LSM), controlled attenuation parameter (CAP) and Enhanced Liver Fibrosis (ELF) scores.
Results Fifty-three participants (53/60, 88% of target) were recruited; 23 received maraviroc plus OBT; 89% were male; 19% had type 2 diabetes mellitus. The median baseline LSM, CAP & ELF scores were 6.2 (IQR 4.6-7.8) kPa, 325 (IQR 279-351) dB/m and 9.1 (IQR 8.6-9.6) respectively.
Primary outcomes: all individuals eligible after screening were randomised; there was 92% (SD 6.6%) adherence to maraviroc [target >90%]; 83% (95%CI 70%-92%) participant retention [target >65%]; 5.5% of data were missing [target
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
The Transgender Archives at the University of Victoria Libraries is home to the world's largest collection of materials related to the history of Trans+ activism. Since 2017, the Mapping Transgender Activism Project has been a partnership between UVic's Chair in Transgender Studies and UVic Libraries. The project has involved undergraduate and graduate student interns using pre-defined methodologies to read, review, and analyze rare periodicals catalogued by UVic Libraries dating from 1960 onwards. The project records historically important datapoints that appear in each periodical document, including names, dates, events, keywords, and locations, in addition to other textual data. The data is grouped into three main categories: "Issues," "Articles," and "Events." The top category is "Issues." All "Articles" and "Events" data correspond with the "Issues" file. Due to the COVID-19 global pandemic, the internship moved online during 2020-2021, resulting in some Transgender Archives material being accessed through the Digital Transgender Archive. The "DTA links" file lists these issues and links. The "Issue Cover Scans" file includes scanned cover images of all indexed issues. Each data file includes a "Data Dictionary" file providing full details. These files are open to review. Data files are restricted and access is provided upon request After submitting your request for access, your request will be reviewed by the office of the Chair in Transgender Studies. Afterwards, we will respond within a few business days. If you have any questions, please email transarc@uvic.ca
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The Stata data file "cap_dietary_combined.dta” and equivalent excel file of the same name comprises nutrition-related data collected by adolescent secondary school students during a "Citizen Science" project in the districts of Kavre and Jumla, Nepal during April to June 2022. The project was part of a CIFF-funded Children in All Policies 2030 (CAP2030) project . The data were generated by the students using a mobile device data collection form developed using "Open Data Kit (ODK) Collect" electronic data collection platform by Kathmandu Living Labs (KLL) and University College London (UCL) for the purposes of this study. The data collected were part of a learning exercise for students to raise awareness of under- and over- nutrition and poor diets in their locale. The form could be used to record 24-hour dietary recall of different healthy food groups and unhealthy sentinel foods amongst children under 5 years, older children and adolescents and adults. Height and weight could also be recorded where stadiometers and weighing scales were available. The form provided feedback to the data collector / respondent in terms of saying what their dietary diversity and unhealthy eating scores were and suggested what foods should be consumed for a healthy diet. In the case of under- or over- nutrition a message appeared with dietary / health advice. Since the students were using 10 android tablets to record information in and around their schools, the dataset may contain several copies of the same respondent recorded by different individuals, so should not be relied upon for precise prevalence of nutritional status. Rather, the data serve to demonstrate the potential of citizen science methods with Nepali school students to record such information. The nutrition app and the process of gathering the data are described in a paper entitled "Citizen science for climate change resilience: engaging adolescents to study climate hazards, biodiversity and nutrition in rural Nepal" submitted to Wellcome Open Research in Feb 2023. The data contributed to Table 6, Figure 7 and Annex 6 of this paper.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The Stata data file "CAP_Demographics_Jumla_Kavre_recoded.dta” and equivalent excel file of the same name comprises data collected by adolescent secondary school students during a "Citizen Science" project in the district of Kavre in the central hills of Nepal during April 2022 and in the district of Jumla in the remote mountains of West Nepal during June 2022. The project was part of a CIFF-funded Children in All Policies 2030 (CAP2030) project.
The data were generated by the students using a mobile device data collection form developed using "Open Data Kit (ODK) Collect" electronic data collection platform by Kathmandu Living Labs (KLL) and University College London (UCL) for the purposes of this study. Researchers from KLL and UCL trained the adolescents to record basic socio-demographic information about themselves and their households including caste/ethnicity, religion, education, water sources, assets, household characteristics, and income sources. The form also asked about their access to mobile phones or other devices and internet and their concerns with respect to climate change. The resulting data describe the participants in the citizen science project, but their names and addresses have been removed. The app and the process of gathering the data are described in a paper entitled "Citizen science for climate change resilience: engaging adolescents to study climate hazards, biodiversity and nutrition in rural Nepal" submitted to Wellcome Open Research in Feb 2023. The data contributed to Tables 2 and 3 of this paper.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Uganda_data.csv: The dataset contains mosquito count data from monthly entomological surveillance conducted at 330 households between 2011 and 2016 for three study sites in Uganda.Uganda_housing_covariates.dta: The data file contains household-level housing covariates for the three study sites in Uganda, including roof type, wall type, floor type, open/closed eaves, types of airbricks, and house type. Site IDs and household IDs are also available.TororoHH_info.xlsx: A full list of enumerated households in Tororo, Uganda.KanunguHH_info.xlsx: A full list of enumerated households in Kanungu, Uganda.JinjaHH_info.xlsx: A full list of enumerated households in Jinja, Uganda.Uganda_environmental_covariates.csv: Environmental covariate data for three study sites in Uganda.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This set contains data for the following publication:
Herrala, R., Yliniemi, K., Vapaavuori, J., Lundström, M. (2025) Selective gold recovery with electrochemically-assisted aqueous reduction, Journal of Sustainable Metallurgy.
The Readme file within the dataset contains detailed information about the dta organization, contents, and file types. This is summarized here:
The data has been collected between 2022 and 2024.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The Stata data file "CAP2030_Plant_Atlas_Jumla_2022-06-20-19-18-13_labelled.dta” and equivalent excel file of the same name comprises data collected by adolescent secondary school students during a "Citizen Science" project in the district of Jumla in the remote mountains of West Nepal during June 2022. The project was part of a CIFF-funded Children in All Policies 2030 (CAP2030) project.
The data were generated by the students using a mobile device data collection form developed using "Open Data Kit (ODK) Collect" electronic data collection platform by Kathmandu Living Labs (KLL) and University College London (UCL) for the purposes of this study. KLL and UCL adapted a CommCare form, and a printed 'Plant Atlas' developed by Bristol University and HERD International as part of the ‘Micropoll’ project, implemented in Jumla district between 2021 and 2022. Citizen science users of the app used the printed photographic 'Plant Atlas', which depicts different species of plant and their flowers and encodes information about its Nepali, English and Scientific names in a QR code. Data collectors had to scan the QR code of the plant once they had matched it to a species in their locale. They then went on to record any insects visiting the flowers and any pests of diseases affecting the plant. They took photographs of the plants and of pests or diseases. Researchers from KLL and UCL trained the adolescents to record the plants identified and associated pollinators or pests and take photos. The resulting datafile includes the latitude/longitude, name of the plant and category (crop, wild), date it was recorded, and the district. Links to photographs of the plant are included but require login to the KLL server. Users of the data may contact KLL (contact@kathmandulivinglabs.org) or UCL (n.saville@ucl.ac.uk) if access to photographs is required. The data were generated as part of a learning exercise for students to raise awareness of biodiversity in their locale and to develop a sense of environmental stewardship. Since the students were using 10 android tablets to record information in a reasonably limited geographical area, the dataset may contain several copies of the same plant recorded by different individuals, so cannot be used for calculation of prevalence of species. Rather, the data serve to demonstrate the potential of citizen science methods with Nepali school students to record such information. The app and the process of gathering the data are described in a paper entitled "Citizen science for climate change resilience: engaging adolescents to study climate hazards, biodiversity and nutrition in rural Nepal" submitted to Wellcome Open Research in Feb 2023. The data contributed to Annex 3 of this paper.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The dataset is created for the year 2018 in Germany for 267 popular initiatives at the local level where the information was collected regarding the following variables: topic of initiative, State legislation, turnout, validity of the initiative, success of the popular initiative, proportion of yes voters in case of a referendum, size of municipality, profile of initiators (local council or citizens), proportion of no voters, index of mobilization, repartition index (difference between yes and no voters), approval rate, number of inhabitants, correction of initiative, status of the case (open/closed).The dataset was created with the help of the existing popular initiatives registered by the University of Wuppertal and the association Mehr Demokratie in Germany.The idea of the dataset is to evaluate in details on which factors the success of popular initiatives depend in the different German States (Länder). A repartition index (difference between yes and no voters) and a mobilization index (repartition index multiplied by the turnout) were calculated and added in the dataset. All the other variables were also created in order to balance the result of these initiatives. The final aim is to be able to measure how direct democratic tools influence local politics in Germany. This is why it is important to examine the prevailing factors for the satisfaction of citizens who use these procedures.In this dataset, the destiny of an initiative (failure/success) can be taken as the dependent variable and all the others could be classified as independent variables.Direct democracy offers possibilities for citizens to influence political decisions especially at the local level. In Germany, the local political systems have been affected by the introduction of direct democratic tools such as citizen initiatives and local referenda since the Reunification. The State legislations defined new conditions for citizen initiatives and municipal referenda with a minimum number of valid signatures for initiatives and a minimum approval rate for the referenda. In the attached file, you will find the dataset in Excel file as well as a .dta file that you can open with the software Stata (https://www.stata.com/).