2 datasets found
  1. Data from: Diversity in citations to a single study: Supplementary data set...

    • zenodo.org
    • data.niaid.nih.gov
    bin
    Updated Aug 25, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rhodri Ivor Leng; Rhodri Ivor Leng (2021). Diversity in citations to a single study: Supplementary data set for citation context network analysis [Dataset]. http://doi.org/10.5281/zenodo.5244800
    Explore at:
    binAvailable download formats
    Dataset updated
    Aug 25, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Rhodri Ivor Leng; Rhodri Ivor Leng
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Introduction

    This document describes the data set used for all analyses in 'Diversity in citations to a single study: A citation context network analysis of how evidence from a prospective cohort study was cited' accepted for publication in Quantitative Science Studies [1].

    Data Collection

    The data collection procedure has been fully described [1]. Concisely, the data set contains bibliometric data collected from Web of Science Core Collection via the University of Edinburgh’s Library subscription concerning all papers that cited a cohort study, Paul et al. [2], in the period <1985. This includes a full list of citing papers, and the citations between these papers. Additionally, it includes textual passages (citation contexts) from 343 citing papers, which were manually recovered from the full-text documents accessible via the University of Edinburgh’s Library subscription. These data have been cleaned, converted into network readable datasets, and are coded into particular classifications reflecting content, which are described fully in the supplied code book and within the manuscript [1].

    Data description

    All relevant data can be found in the attached file 'Supplementary_material_Leng_QSS_2021.xlsx', which contains the following five workbooks:

    • “Overview” includes a list of the content of the workbooks.
    • “Code Book” contains the coding rules and definitions used for the classification of findings and paper titles.
    • “Node attribute list” includes a workbook containing all node attributes for the citation network, which includes Paul et al. [2] and its citing papers as of 1984. Highlighted in yellow at the bottom of this workbook is two papers that were discarded due to duplication - remove these if analysing this dataset in a network analysis. The columns refer to:
    1. Id, the node identifier
    2. Label, the formal citation of the paper to which data within this row corresponds. Citation is in the following format: last name of first author, year of publication, journal of publication, volume number, start page, and DOI (if available).
    3. Title, the paper title for the paper in question.
    4. Publication_year, the year of publication.
    5. Document_type, the document type (e.g. review, article)
    6. WoS_ID, the paper’s unique Web of Science accession number.
    7. Citation_context, a column specifying whether citation context data is available from that paper
    8. Explanans, the title explanans terms for that paper;
    9. Explanandum, the explanandum terms for that paper.
    10. Combined_Title_Classification, the combined terms used for fig 2 of the published manuscript.
    11. Serum_cholesterol_(SC), a column identifying papers that cited the serum cholesterol findings.
    12. Blood_Pressure_(BP), a column identifying papers that cited the blood pressure findings.
    13. Coffee_(C), a column identifying papers that cited the coffee findings.
    14. Diet_(D), a column identifying papers that cited the dietary findings.
    15. Smoking_(S), a column identifying papers that cited the smoking findings.
    16. Alcohol_(A), a column identifying papers that cited the alcohol findings.
    17. Physical_Activity_(PA), a column identifying papers that cited the physical activity findings.
    18. Body_Fatness (BF), a column identifying papers that cited the body fatness findings.
    19. Indegree, the number of within network citations to that paper, calculated for the network shown in Fig 4 of the manuscript.
    20. Outdegree, the number of within network references of that paper as calculated for the network in Fig 4.
    21. Main_component, a column specifying whether a node is contained in the largest weakly connect component as shown in Fig 4 of the manuscript.
    22. Cluster, provides the cluster membership number as discussed within the manuscript (Fig 5).
    • “Edge list” includes a workbook including the edges for the network. The columns refer to:
    1. Source, contains the node identifier of the citing paper.
    2. Target, contains the node identifier of the cited paper.
    • “Citation context classification” includes a workbook containing the WoS accession number for the paper analysed, and any finding category discussed in that paper established via context analysis (see the code book for definitions). The columns refer to:
    1. Id, the node identifier
    2. Finding_Class, the findings discussed from Paul et al. within the body of the citing paper.
    • “Citation context data” includes a workbook containing the WoS accession number for papers in which citation context data was available, the citation context passages, the reference number or format of Paul et al. within the citing paper, and the finding categories discussed in those contexts (see code book for definitions). The columns refer to:
    1. Id, the node identifier
    2. Citation_context, the passage copied from the full text of the citing paper containing discussion of the findings of Paul et al.
    3. Reference_in_citing_article, the reference number or format of Paul et al. within the citing paper.
    4. Finding_class, the findings discussed from Paul et al. within the body of the citing paper.

    Software recommended for analysis

    For the analyses performed within the manuscript, Gephi version 0.9.2 was used [3], and both the edge and node lists are in a format that is easily read into this software. The Sci2 tool was used to parse data initially [4].

    Notes

    1. Leng, R. I. (Forthcoming). Diversity in citations to a single study: A citation context network analysis of how evidence from a prospective cohort study was cited. Quantitative Science Studies.
    2. Paul, O., Lepper, M. H., Phelan, W. H., Dupertuis, G. W., Macmillan, A., McKean, H., et al. (1963). A longitudinal study of coronary heart disease. Circulation, 28, 20-31. https://doi.org/10.1161/01.cir.28.1.20.
    3. Bastian, M., Heymann, S., & Jacomy, M. (2009). Gephi: an open source software for exploring and manipulating networks. International AAAI Conference on Weblogs and Social Media.
    4. Sci2 Team. (2009). Science of Science (Sci2) Tool. Indiana University and SciTech Strategies. Stable URL: https://sci2.cns.iu.edu
  2. g

    Development and Validation of the Critical Consciousness Scale - Archival...

    • search.gesis.org
    Updated Apr 2, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Diemer, Matthew A. (2021). Development and Validation of the Critical Consciousness Scale - Archival Version [Dataset]. http://doi.org/10.3886/ICPSR36564
    Explore at:
    Dataset updated
    Apr 2, 2021
    Dataset provided by
    ICPSR - Interuniversity Consortium for Political and Social Research
    GESIS search
    Authors
    Diemer, Matthew A.
    License

    https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de520798https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de520798

    Description

    Abstract (en): This study sought to develop and validate a measure of critical consciousness. The measure, referred to as the Critical Consciousness Scale (CCS), examines the capacity of oppressed or marginalized people to critically analyze their social and political conditions, endorsement of societal equality, and action to change perceived inequities. Critical reflection items were developed to gauge respondents' consciousness of racial, gendered, and socioeconomic inequalities in terms of occupational and educational opportunity. Survey questions assessed respondents' opinions on sociopolitical issues and the degree and frequency of participation in individual and/or collective action. Information was collected on topics such as the frequency that respondents participated in discussions regarding social or political issues and their opinions on statements such as whether certain racial or ethnic groups have fewer chances to get good jobs. Demographic variables include race, gender, age, class, and educational attainment of a respondent's mother and father. ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection: Checked for undocumented or out-of-range codes.. Presence of Common Scales: Critical Consciousness Scale (CCS) Students age 13-19 who attend high school in large Midwestern cities. Users are strongly encouraged to reference the documentation found in the codebook for detailed explanations concerning sampling strategies. self-enumerated questionnaire, on-site questionnaireVariables CCS9_R, CCS10_R, CCS11_R, CCS12_R, CCS13_R, CCS14_R, CCS15_R, CCS20_R, CCS24_R, CCS25_FLAG_R, and CCS32_FLAG_R were recoded in order to reverse code the value labels and data. Due to an administrative error, data for variables CCS25_FLAG, CCS25_FLAG_R, CCS26_FLAG, CCS27_FLAG, CCS28_FLAG, CCS29_FLAG, CCS30_FLAG, CCS31_FLAG, CCS32_FLAG, and CCS32_FLAG_R was not collected for all participants. These items were ignored in the process of validating the Critical Consciousness Scale. Users should take caution when analyzing these variables given the incomplete data collection. Users interested in the validation of these questions are encouraged to reference Fostering marginalized youths' academic achievement and critical consciousness through a values-affirmation intervention by Luke Rapa.The original 46-item questionnaire developed to validate the Critical Consciousness Scale included all Critical Consciousness Scale items found in the dataset. A series of analyses resulted in a pared-down 22-item questionnaire. Both questionnaires are included in the codebook for reference.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Rhodri Ivor Leng; Rhodri Ivor Leng (2021). Diversity in citations to a single study: Supplementary data set for citation context network analysis [Dataset]. http://doi.org/10.5281/zenodo.5244800
Organization logo

Data from: Diversity in citations to a single study: Supplementary data set for citation context network analysis

Related Article
Explore at:
binAvailable download formats
Dataset updated
Aug 25, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Rhodri Ivor Leng; Rhodri Ivor Leng
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Introduction

This document describes the data set used for all analyses in 'Diversity in citations to a single study: A citation context network analysis of how evidence from a prospective cohort study was cited' accepted for publication in Quantitative Science Studies [1].

Data Collection

The data collection procedure has been fully described [1]. Concisely, the data set contains bibliometric data collected from Web of Science Core Collection via the University of Edinburgh’s Library subscription concerning all papers that cited a cohort study, Paul et al. [2], in the period <1985. This includes a full list of citing papers, and the citations between these papers. Additionally, it includes textual passages (citation contexts) from 343 citing papers, which were manually recovered from the full-text documents accessible via the University of Edinburgh’s Library subscription. These data have been cleaned, converted into network readable datasets, and are coded into particular classifications reflecting content, which are described fully in the supplied code book and within the manuscript [1].

Data description

All relevant data can be found in the attached file 'Supplementary_material_Leng_QSS_2021.xlsx', which contains the following five workbooks:

  • “Overview” includes a list of the content of the workbooks.
  • “Code Book” contains the coding rules and definitions used for the classification of findings and paper titles.
  • “Node attribute list” includes a workbook containing all node attributes for the citation network, which includes Paul et al. [2] and its citing papers as of 1984. Highlighted in yellow at the bottom of this workbook is two papers that were discarded due to duplication - remove these if analysing this dataset in a network analysis. The columns refer to:
  1. Id, the node identifier
  2. Label, the formal citation of the paper to which data within this row corresponds. Citation is in the following format: last name of first author, year of publication, journal of publication, volume number, start page, and DOI (if available).
  3. Title, the paper title for the paper in question.
  4. Publication_year, the year of publication.
  5. Document_type, the document type (e.g. review, article)
  6. WoS_ID, the paper’s unique Web of Science accession number.
  7. Citation_context, a column specifying whether citation context data is available from that paper
  8. Explanans, the title explanans terms for that paper;
  9. Explanandum, the explanandum terms for that paper.
  10. Combined_Title_Classification, the combined terms used for fig 2 of the published manuscript.
  11. Serum_cholesterol_(SC), a column identifying papers that cited the serum cholesterol findings.
  12. Blood_Pressure_(BP), a column identifying papers that cited the blood pressure findings.
  13. Coffee_(C), a column identifying papers that cited the coffee findings.
  14. Diet_(D), a column identifying papers that cited the dietary findings.
  15. Smoking_(S), a column identifying papers that cited the smoking findings.
  16. Alcohol_(A), a column identifying papers that cited the alcohol findings.
  17. Physical_Activity_(PA), a column identifying papers that cited the physical activity findings.
  18. Body_Fatness (BF), a column identifying papers that cited the body fatness findings.
  19. Indegree, the number of within network citations to that paper, calculated for the network shown in Fig 4 of the manuscript.
  20. Outdegree, the number of within network references of that paper as calculated for the network in Fig 4.
  21. Main_component, a column specifying whether a node is contained in the largest weakly connect component as shown in Fig 4 of the manuscript.
  22. Cluster, provides the cluster membership number as discussed within the manuscript (Fig 5).
  • “Edge list” includes a workbook including the edges for the network. The columns refer to:
  1. Source, contains the node identifier of the citing paper.
  2. Target, contains the node identifier of the cited paper.
  • “Citation context classification” includes a workbook containing the WoS accession number for the paper analysed, and any finding category discussed in that paper established via context analysis (see the code book for definitions). The columns refer to:
  1. Id, the node identifier
  2. Finding_Class, the findings discussed from Paul et al. within the body of the citing paper.
  • “Citation context data” includes a workbook containing the WoS accession number for papers in which citation context data was available, the citation context passages, the reference number or format of Paul et al. within the citing paper, and the finding categories discussed in those contexts (see code book for definitions). The columns refer to:
  1. Id, the node identifier
  2. Citation_context, the passage copied from the full text of the citing paper containing discussion of the findings of Paul et al.
  3. Reference_in_citing_article, the reference number or format of Paul et al. within the citing paper.
  4. Finding_class, the findings discussed from Paul et al. within the body of the citing paper.

Software recommended for analysis

For the analyses performed within the manuscript, Gephi version 0.9.2 was used [3], and both the edge and node lists are in a format that is easily read into this software. The Sci2 tool was used to parse data initially [4].

Notes

  1. Leng, R. I. (Forthcoming). Diversity in citations to a single study: A citation context network analysis of how evidence from a prospective cohort study was cited. Quantitative Science Studies.
  2. Paul, O., Lepper, M. H., Phelan, W. H., Dupertuis, G. W., Macmillan, A., McKean, H., et al. (1963). A longitudinal study of coronary heart disease. Circulation, 28, 20-31. https://doi.org/10.1161/01.cir.28.1.20.
  3. Bastian, M., Heymann, S., & Jacomy, M. (2009). Gephi: an open source software for exploring and manipulating networks. International AAAI Conference on Weblogs and Social Media.
  4. Sci2 Team. (2009). Science of Science (Sci2) Tool. Indiana University and SciTech Strategies. Stable URL: https://sci2.cns.iu.edu
Search
Clear search
Close search
Google apps
Main menu