2 datasets found

Data from: Diversity in citations to a single study: Supplementary data set...
zenodo.org
data.niaid.nih.gov
bin
Updated Aug 25, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rhodri Ivor Leng; Rhodri Ivor Leng (2021). Diversity in citations to a single study: Supplementary data set for citation context network analysis [Dataset]. http://doi.org/10.5281/zenodo.5244800
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.5244800
Dataset updated
Aug 25, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Rhodri Ivor Leng; Rhodri Ivor Leng
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Introduction

This document describes the data set used for all analyses in 'Diversity in citations to a single study: A citation context network analysis of how evidence from a prospective cohort study was cited' accepted for publication in Quantitative Science Studies [1].

Data Collection

The data collection procedure has been fully described [1]. Concisely, the data set contains bibliometric data collected from Web of Science Core Collection via the University of Edinburgh’s Library subscription concerning all papers that cited a cohort study, Paul et al. [2], in the period <1985. This includes a full list of citing papers, and the citations between these papers. Additionally, it includes textual passages (citation contexts) from 343 citing papers, which were manually recovered from the full-text documents accessible via the University of Edinburgh’s Library subscription. These data have been cleaned, converted into network readable datasets, and are coded into particular classifications reflecting content, which are described fully in the supplied code book and within the manuscript [1].

Data description

All relevant data can be found in the attached file 'Supplementary_material_Leng_QSS_2021.xlsx', which contains the following five workbooks:

“Overview” includes a list of the content of the workbooks.

“Code Book” contains the coding rules and definitions used for the classification of findings and paper titles.

“Node attribute list” includes a workbook containing all node attributes for the citation network, which includes Paul et al. [2] and its citing papers as of 1984. Highlighted in yellow at the bottom of this workbook is two papers that were discarded due to duplication - remove these if analysing this dataset in a network analysis. The columns refer to:

Id, the node identifier

Label, the formal citation of the paper to which data within this row corresponds. Citation is in the following format: last name of first author, year of publication, journal of publication, volume number, start page, and DOI (if available).

Title, the paper title for the paper in question.

Publication_year, the year of publication.

Document_type, the document type (e.g. review, article)

WoS_ID, the paper’s unique Web of Science accession number.

Citation_context, a column specifying whether citation context data is available from that paper

Explanans, the title explanans terms for that paper;

Explanandum, the explanandum terms for that paper.

Combined_Title_Classification, the combined terms used for fig 2 of the published manuscript.

Serum_cholesterol_(SC), a column identifying papers that cited the serum cholesterol findings.

Blood_Pressure_(BP), a column identifying papers that cited the blood pressure findings.

Coffee_(C), a column identifying papers that cited the coffee findings.

Diet_(D), a column identifying papers that cited the dietary findings.

Smoking_(S), a column identifying papers that cited the smoking findings.

Alcohol_(A), a column identifying papers that cited the alcohol findings.

Physical_Activity_(PA), a column identifying papers that cited the physical activity findings.

Body_Fatness (BF), a column identifying papers that cited the body fatness findings.

Indegree, the number of within network citations to that paper, calculated for the network shown in Fig 4 of the manuscript.

Outdegree, the number of within network references of that paper as calculated for the network in Fig 4.

Main_component, a column specifying whether a node is contained in the largest weakly connect component as shown in Fig 4 of the manuscript.

Cluster, provides the cluster membership number as discussed within the manuscript (Fig 5).

“Edge list” includes a workbook including the edges for the network. The columns refer to:

Source, contains the node identifier of the citing paper.

Target, contains the node identifier of the cited paper.

“Citation context classification” includes a workbook containing the WoS accession number for the paper analysed, and any finding category discussed in that paper established via context analysis (see the code book for definitions). The columns refer to:

Id, the node identifier

Finding_Class, the findings discussed from Paul et al. within the body of the citing paper.

“Citation context data” includes a workbook containing the WoS accession number for papers in which citation context data was available, the citation context passages, the reference number or format of Paul et al. within the citing paper, and the finding categories discussed in those contexts (see code book for definitions). The columns refer to:

Id, the node identifier

Citation_context, the passage copied from the full text of the citing paper containing discussion of the findings of Paul et al.

Reference_in_citing_article, the reference number or format of Paul et al. within the citing paper.

Finding_class, the findings discussed from Paul et al. within the body of the citing paper.

Software recommended for analysis

For the analyses performed within the manuscript, Gephi version 0.9.2 was used [3], and both the edge and node lists are in a format that is easily read into this software. The Sci2 tool was used to parse data initially [4].

Notes

Leng, R. I. (Forthcoming). Diversity in citations to a single study: A citation context network analysis of how evidence from a prospective cohort study was cited. Quantitative Science Studies.

Paul, O., Lepper, M. H., Phelan, W. H., Dupertuis, G. W., Macmillan, A., McKean, H., et al. (1963). A longitudinal study of coronary heart disease. Circulation, 28, 20-31. https://doi.org/10.1161/01.cir.28.1.20.

Bastian, M., Heymann, S., & Jacomy, M. (2009). Gephi: an open source software for exploring and manipulating networks. International AAAI Conference on Weblogs and Social Media.

Sci2 Team. (2009). Science of Science (Sci2) Tool. Indiana University and SciTech Strategies. Stable URL: https://sci2.cns.iu.edu
g
Development and Validation of the Critical Consciousness Scale - Archival...
search.gesis.org
Updated Apr 2, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Diemer, Matthew A. (2021). Development and Validation of the Critical Consciousness Scale - Archival Version [Dataset]. http://doi.org/10.3886/ICPSR36564
Explore at:
Unique identifier
https://doi.org/10.3886/ICPSR36564
Dataset updated
Apr 2, 2021
Dataset provided by
ICPSR - Interuniversity Consortium for Political and Social Research
GESIS search
Authors
Diemer, Matthew A.
License
https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de520798https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de520798
Description
Abstract (en): This study sought to develop and validate a measure of critical consciousness. The measure, referred to as the Critical Consciousness Scale (CCS), examines the capacity of oppressed or marginalized people to critically analyze their social and political conditions, endorsement of societal equality, and action to change perceived inequities. Critical reflection items were developed to gauge respondents' consciousness of racial, gendered, and socioeconomic inequalities in terms of occupational and educational opportunity. Survey questions assessed respondents' opinions on sociopolitical issues and the degree and frequency of participation in individual and/or collective action. Information was collected on topics such as the frequency that respondents participated in discussions regarding social or political issues and their opinions on statements such as whether certain racial or ethnic groups have fewer chances to get good jobs. Demographic variables include race, gender, age, class, and educational attainment of a respondent's mother and father. ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection: Checked for undocumented or out-of-range codes.. Presence of Common Scales: Critical Consciousness Scale (CCS) Students age 13-19 who attend high school in large Midwestern cities. Users are strongly encouraged to reference the documentation found in the codebook for detailed explanations concerning sampling strategies. self-enumerated questionnaire, on-site questionnaireVariables CCS9_R, CCS10_R, CCS11_R, CCS12_R, CCS13_R, CCS14_R, CCS15_R, CCS20_R, CCS24_R, CCS25_FLAG_R, and CCS32_FLAG_R were recoded in order to reverse code the value labels and data. Due to an administrative error, data for variables CCS25_FLAG, CCS25_FLAG_R, CCS26_FLAG, CCS27_FLAG, CCS28_FLAG, CCS29_FLAG, CCS30_FLAG, CCS31_FLAG, CCS32_FLAG, and CCS32_FLAG_R was not collected for all participants. These items were ignored in the process of validating the Critical Consciousness Scale. Users should take caution when analyzing these variables given the incomplete data collection. Users interested in the validation of these questions are encouraged to reference Fostering marginalized youths' academic achievement and critical consciousness through a values-affirmation intervention by Luke Rapa.The original 46-item questionnaire developed to validate the Critical Consciousness Scale included all Critical Consciousness Scale items found in the dataset. A series of analyses resulted in a pared-down 22-item questionnaire. Both questionnaires are included in the codebook for reference.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Rhodri Ivor Leng; Rhodri Ivor Leng (2021). Diversity in citations to a single study: Supplementary data set for citation context network analysis [Dataset]. http://doi.org/10.5281/zenodo.5244800

Data from: Diversity in citations to a single study: Supplementary data set for citation context network analysis

Explore at:

binAvailable download formats

Unique identifier

https://doi.org/10.5281/zenodo.5244800

Dataset updated

Aug 25, 2021

Dataset provided by

Zenodohttp://zenodo.org/

Authors

Rhodri Ivor Leng; Rhodri Ivor Leng

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Introduction

This document describes the data set used for all analyses in 'Diversity in citations to a single study: A citation context network analysis of how evidence from a prospective cohort study was cited' accepted for publication in Quantitative Science Studies [1].

Data Collection

The data collection procedure has been fully described [1]. Concisely, the data set contains bibliometric data collected from Web of Science Core Collection via the University of Edinburgh’s Library subscription concerning all papers that cited a cohort study, Paul et al. [2], in the period <1985. This includes a full list of citing papers, and the citations between these papers. Additionally, it includes textual passages (citation contexts) from 343 citing papers, which were manually recovered from the full-text documents accessible via the University of Edinburgh’s Library subscription. These data have been cleaned, converted into network readable datasets, and are coded into particular classifications reflecting content, which are described fully in the supplied code book and within the manuscript [1].

Data description

All relevant data can be found in the attached file 'Supplementary_material_Leng_QSS_2021.xlsx', which contains the following five workbooks:

“Overview” includes a list of the content of the workbooks.
“Code Book” contains the coding rules and definitions used for the classification of findings and paper titles.
“Node attribute list” includes a workbook containing all node attributes for the citation network, which includes Paul et al. [2] and its citing papers as of 1984. Highlighted in yellow at the bottom of this workbook is two papers that were discarded due to duplication - remove these if analysing this dataset in a network analysis. The columns refer to:

Id, the node identifier
Label, the formal citation of the paper to which data within this row corresponds. Citation is in the following format: last name of first author, year of publication, journal of publication, volume number, start page, and DOI (if available).
Title, the paper title for the paper in question.
Publication_year, the year of publication.
Document_type, the document type (e.g. review, article)
WoS_ID, the paper’s unique Web of Science accession number.
Citation_context, a column specifying whether citation context data is available from that paper
Explanans, the title explanans terms for that paper;
Explanandum, the explanandum terms for that paper.
Combined_Title_Classification, the combined terms used for fig 2 of the published manuscript.
Serum_cholesterol_(SC), a column identifying papers that cited the serum cholesterol findings.
Blood_Pressure_(BP), a column identifying papers that cited the blood pressure findings.
Coffee_(C), a column identifying papers that cited the coffee findings.
Diet_(D), a column identifying papers that cited the dietary findings.
Smoking_(S), a column identifying papers that cited the smoking findings.
Alcohol_(A), a column identifying papers that cited the alcohol findings.
Physical_Activity_(PA), a column identifying papers that cited the physical activity findings.
Body_Fatness (BF), a column identifying papers that cited the body fatness findings.
Indegree, the number of within network citations to that paper, calculated for the network shown in Fig 4 of the manuscript.
Outdegree, the number of within network references of that paper as calculated for the network in Fig 4.
Main_component, a column specifying whether a node is contained in the largest weakly connect component as shown in Fig 4 of the manuscript.
Cluster, provides the cluster membership number as discussed within the manuscript (Fig 5).

“Edge list” includes a workbook including the edges for the network. The columns refer to:

Source, contains the node identifier of the citing paper.
Target, contains the node identifier of the cited paper.

“Citation context classification” includes a workbook containing the WoS accession number for the paper analysed, and any finding category discussed in that paper established via context analysis (see the code book for definitions). The columns refer to:

Id, the node identifier
Finding_Class, the findings discussed from Paul et al. within the body of the citing paper.

“Citation context data” includes a workbook containing the WoS accession number for papers in which citation context data was available, the citation context passages, the reference number or format of Paul et al. within the citing paper, and the finding categories discussed in those contexts (see code book for definitions). The columns refer to:

Id, the node identifier
Citation_context, the passage copied from the full text of the citing paper containing discussion of the findings of Paul et al.
Reference_in_citing_article, the reference number or format of Paul et al. within the citing paper.
Finding_class, the findings discussed from Paul et al. within the body of the citing paper.

Software recommended for analysis

For the analyses performed within the manuscript, Gephi version 0.9.2 was used [3], and both the edge and node lists are in a format that is easily read into this software. The Sci2 tool was used to parse data initially [4].

Notes

Leng, R. I. (Forthcoming). Diversity in citations to a single study: A citation context network analysis of how evidence from a prospective cohort study was cited. Quantitative Science Studies.
Paul, O., Lepper, M. H., Phelan, W. H., Dupertuis, G. W., Macmillan, A., McKean, H., et al. (1963). A longitudinal study of coronary heart disease. Circulation, 28, 20-31. https://doi.org/10.1161/01.cir.28.1.20.
Bastian, M., Heymann, S., & Jacomy, M. (2009). Gephi: an open source software for exploring and manipulating networks. International AAAI Conference on Weblogs and Social Media.
Sci2 Team. (2009). Science of Science (Sci2) Tool. Indiana University and SciTech Strategies. Stable URL: https://sci2.cns.iu.edu

Clear search

Close search

Google apps

Main menu

Data from: Diversity in citations to a single study: Supplementary data set...

Development and Validation of the Critical Consciousness Scale - Archival...

Data from: Diversity in citations to a single study: Supplementary data set for citation context network analysis