Saved datasets
Last updated
Download format
Usage rights
License from data provider
Please review the applicable license to make sure your contemplated use is permitted.
Cost to access
Described as free to access or have a license that allows redistribution.
100+ datasets found
  1. z

    Citation network data sets for 'Oxytocin – a social peptide? Deconstructing...

    Updated Oct 19, 2021
  2. Effects of oxytocin administration and conditioned oxytocin on brain...

    Updated Mar 19, 2020
  3. v

    France import data of Oxytocin from Indonesia

    Updated Mar 4, 2023
  4. m

    Data for: The Effect of Oxytocin on Group Formation and Strategic Thinking...

    Updated Apr 9, 2018
  5. z

    Data from: Breastfeeding dynamically changes endogenous oxytocin levels and...

    • +1more
    Updated Aug 8, 2020
  6. N

    l-DOPA and oxytocin influence the neurocomputational mechanisms of...

    Updated Mar 11, 2023
  7. v

    Global import data of Oxytocin

    Updated Mar 17, 2023
  8. v

    Yemen import data of Oxytocin

    Updated Feb 22, 2023
  9. h

    Affectionate touch and diurnal oxytocin levels: An ecological momentary...

    Updated May 16, 2023
  10. v

    Brazil import data of Oxytocin from India

    Updated Sep 4, 2023
  11. N

    Data from: Oxytocin and brain activity in humans: A systematic review and...

    Updated Jun 30, 2018
  12. e

    Data from: Oxytocin receptor

    Updated Aug 31, 2022
  13. Table of NDC products with oxytocin

  14. v

    Paraguay import data of Oxytocin

    Updated Apr 14, 2023
  15. v

    Ghana import data of Oxytocin injection

    Updated Jan 4, 2023
  16. u

    Data from: Labor induction with oxytocin in pregnant rats is not associated...

    Updated Apr 4, 2023
  17. N

    Data from: Oxytocin modulates the intrinsic dynamics between...

    Updated Sep 23, 2018
  18. d

    A dataset for oxytocin loaded nanoparticles and their effects on...

    • +1more
    Updated May 8, 2022
  19. N

    Human endogenous oxytocin and its neural correlates show adaptive responses...

    Updated Mar 23, 2023
  20. m

    Data from: Experimental data exploring the effects of intranasal oxytocin on...

    Updated Oct 26, 2021
Click to copy link
Link copied
Leng, Rhodri Ivor (2021). Citation network data sets for 'Oxytocin – a social peptide? Deconstructing the evidence' [Dataset].

Citation network data sets for 'Oxytocin – a social peptide? Deconstructing the evidence'

Explore at:
csvAvailable download formats
Dataset updated
Oct 19, 2021
Dataset provided by
University of Edinburgh
Leng, Rhodri Ivor

Attribution 4.0 (CC BY 4.0)
License information was derived automatically

Area covered
United States


This note describes the data sets used for all analyses contained in the manuscript 'Oxytocin - a social peptide?’[1]

Data Collection

The datasets described here were originally retrieved from Web of Science (WoS) Core Collection via the University of Edinburgh’s library subscription [2]. The aim of the original study for which these data were gathered was to survey peer-reviewed primary studies on oxytocin and social behaviour. To capture relevant papers, we used the following query:

TI = (“oxytocin” OR “pitocin” OR “syntocinon”) AND TS = (“social*” OR “pro$social” OR “anti$social”)

The final search was performed on the 13 September 2021. This returned a total of 2,747 records, of which 2,049 were classified by WoS as ‘articles’. Given our interest in primary studies only – articles reporting original data – we excluded all other document types. We further excluded all articles sub-classified as ‘book chapters’ or as ‘proceeding papers’ in order to limit our analysis to primary studies published in peer-reviewed academic journals. This reduced the set to 1,977 articles. All of these were published in the English language, and no further language refinements were unnecessary.

All available metadata on these 1,977 articles was exported as plain text ‘flat’ format files in four batches, which we later merged together via Notepad++. Upon manually examination, we discovered examples of papers classified as ‘articles’ by WoS that were, in fact, reviews. To further filter our results, we searched all available PMIDs in PubMed (1,903 had associated PMIDs - ~96% of set). We then filtered results to identify all records classified as ‘review’, ‘systematic review’, or ‘meta-analysis’, identifying 75 records [3] (thus, ~4% of records classified by WoS were classified as reviews in PubMed). After examining a sample and agreeing with the PubMed classification, these were removed these from our dataset - leaving a total of 1,902 articles.

From these data, we constructed two datasets via parsing out relevant reference data via the Sci2 Tool [4]. First, we constructed a ‘node-attribute-list’ by first linking unique reference strings (‘Cite Me As’ column in WoS data files) to unique identifiers, we then parsed into this dataset information on the identify of a paper, including the title of the article, all authors, journal publication, year of publication, total citations as recorded from WoS, and WoS accession number. Second, we constructed an ‘edge-list’ that records the citations from a citing paper in the ‘Source’ column and identifies the cited paper in the ‘Target’ column, using the unique identifies as described previously to link these data to the node-attribute-list.

We then constructed a network in which papers are nodes, and citation links between nodes are directed edges between nodes. We used Gephi Version 0.9.2 [5] to manually clean these data by merging duplicate references that are caused by different reference formats or by referencing errors. To do this, we needed to retain both all retrieved records (1,902) as well as including all of their references to papers whether these were included in our original search or not. In total, this produced a network of 46,633 nodes (unique reference strings) and 112,520 edges (citation links). Thus, the average reference list size of these articles is ~59 references. The mean indegree (within network citations) is 2.4 (median is 1) for the entire network reflecting a great diversity in referencing choices among our 1,902 articles.

After merging duplicates, we then restricted the network to include only articles fully retrieved (1,902), and retrained only those that were connected together by citations links in a large interconnected network (i.e. the largest component). In total, 1,892 (99.5%) of our initial set were connected together via citation links, meaning a total of ten papers were removed from the following analysis – and these were neither connected to the largest component, nor did they form connections with one another (i.e. these were ‘isolates’).

This left us with a network of 1,892 nodes connected together by 26,019 edges. It is this network that is described by the ‘node-attribute-list’ and ‘edge-list’ provided here. This network has a mean in-degree of 13.76 (median in-degree of 4). By restricting our analysis in this way, we lose 44,741 unique references (96%) and 86,501 citations (77%) from the full network, but retain a set of articles tightly knitted together, all of which have been fully retrieved due to possessing certain terms related to oxytocin AND social behaviour in their title, abstract, or associated keywords.

Before moving on, we calculated indegree for all nodes in this network – this counts the number of citations to a given paper from other papers within this network – and have included this in the node-attribute-list. We further clustered this network via modularity maximisation via the Leiden algorithm [6]. We set the algorithm to resolution 1, and allowed the algorithm to run over 100 iterations and 100 restarts. This gave Q=0.43 and identified seven clusters, which we describe in detail within the body of the paper. We have included cluster membership as an attribute in the node-attribute-list.

For additional analysis, we also analysed the full reference list data to examine the most commonly cited references between 2016 and 2021 - the results of this are described in OTSOC_Cited_2016-2021.csv. This takes the reference lists of all retrieved papers within the network and examines their full reference lists (including references to other papers not contained within the network). These data were cleaned by matching DOIs and manual cleansing.

Data description

We include here two network datasets: (i) ‘OTSOC-node-attribute-list.csv’ consists of the attributes of 1,892 primary articles retrieved from WoS that include terms indicating a focus on oxytocin and social behaviour; (ii) ‘OTSOC-edge-list.csv’ records the citations between these papers. Together, these can be imported into a range of different software for network analysis; however, we have formatted these for ease of upload into Gephi 0.9.2. Finally, we include (iii) 'OTSOC_Cited_2016-2021' that lists all papers cited by >10 papers in the OTSOC network following any analysis of the bibliographies of retrieved papers. Below, we detail their contents:

1. ‘OTSOC-node-attribute-list.csv’ is a comma-separate values file that contains all node attributes for the citation network (n=1,892) analysed in the paper. The columns refer to:

Id, the unique identifier

Label, the reference string of the paper to which the attributes in this row correspond. This is taken from the ‘Cite Me As’ column from the original WoS download. The reference string is in the following format: last name of first author, publication year, journal, volume, start page, and DOI (if available).

Wos_id, unique Web of Science (WoS) accession number. These can be used to query WoS to find further data on all papers via the ‘UT= ’ field tag.

Title, paper title.

Authors, all named authors.

Journal, journal of publication.

Pub_year, year of publication.

Wos_citations, total number of citations recorded by WoS Core Collection to a given paper as of 13 September 2021

Indegree, the number of within network citations to a given paper, calculated for the network shown in Figure 1 of the manuscript.

Cluster, provides the cluster membership number as discussed within the manuscript (Figure 1). This was established via modularity maximisation via the Leiden algorithm (Res 1; Q=0.43|7 clusters)

2. ‘OTSOC-edge -list.csv’ is a comma-separated values file that contains all citation links between the 1,892 articles (n=26,019). The columns refer to:

Source, the unique identifier of the citing paper.

Target, the unique identifier of the cited paper.

Type, edges are ‘Directed’, and this column tells Gephi to regard all edges as such.

Syr_date, this contains the date of publication of the citing paper.

Tyr_date, this contains the date of publication of the cited paper.

3. 'OTSOC_Cited_2016-2021.csv' is a comma-separated values file that contain citations to all cited references that were cited by at least 10 of the retrieved papers within the OTSOC network published from 2016 onwards. The columns refer to:

Reference, the cited reference string extracted from the bibliographies of retrieved papers.

Publication year, the publication year of the cited reference.

DOI, the DOI of the cited reference.

indegree_2016, the total number of citations to a cited reference from papers published in 2016 and contained within the OTSOC network.

indegree_2017, the total number of citations to a cited reference from papers published in 2017 and contained within the OTSOC network.

indegree_2018, the total number of citations to a cited reference from papers published in 2018 and contained within the OTSOC network.

indegree_2019, the total number of citations to a cited

Clear search
Close search
Google apps
Main menu