11 datasets found
  1. Sample Graph Datasets in CSV Format

    • zenodo.org
    csv
    Updated Dec 9, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Edwin Carreño; Edwin Carreño (2024). Sample Graph Datasets in CSV Format [Dataset]. http://doi.org/10.5281/zenodo.14335015
    Explore at:
    csvAvailable download formats
    Dataset updated
    Dec 9, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Edwin Carreño; Edwin Carreño
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Sample Graph Datasets in CSV Format

    Note: none of the data sets published here contain actual data, they are for testing purposes only.

    Description

    This data repository contains graph datasets, where each graph is represented by two CSV files: one for node information and another for edge details. To link the files to the same graph, their names include a common identifier based on the number of nodes. For example:

    • dataset_30_nodes_interactions.csv:contains 30 rows (nodes).
    • dataset_30_edges_interactions.csv: contains 47 rows (edges).
    • the common identifier dataset_30 refers to the same graph.

    CSV nodes

    Each dataset contains the following columns:

    Name of the ColumnTypeDescription
    UniProt IDstringprotein identification
    labelstringprotein label (type of node)
    propertiesstringa dictionary containing properties related to the protein.

    CSV edges

    Each dataset contains the following columns:

    Name of the ColumnTypeDescription
    Relationship IDstringrelationship identification
    Source IDstringidentification of the source protein in the relationship
    Target IDstringidentification of the target protein in the relationship
    labelstringrelationship label (type of relationship)
    propertiesstringa dictionary containing properties related to the relationship.

    Metadata

    GraphNumber of NodesNumber of EdgesSparse graph

    dataset_30*

    30

    47

    Y

    dataset_60*

    60

    181

    Y

    dataset_120*

    120

    689

    Y

    dataset_240*

    240

    2819

    Y

    dataset_300*

    300

    4658

    Y

    dataset_600*

    600

    18004

    Y

    dataset_1200*

    1200

    71785

    Y

    dataset_2400*

    2400

    288600

    Y

    dataset_3000*

    3000

    449727

    Y

    dataset_6000*

    6000

    1799413

    Y

    dataset_12000*

    12000

    7199863

    Y

    dataset_24000*

    24000

    28792361

    Y

    dataset_30000*

    30000

    44991744

    Y

    This repository include two (2) additional tiny graph datasets to experiment before dealing with larger datasets.

    CSV nodes (tiny graphs)

    Each dataset contains the following columns:

    Name of the ColumnTypeDescription
    IDstringnode identification
    labelstringnode label (type of node)
    propertiesstringa dictionary containing properties related to the node.

    CSV edges (tiny graphs)

    Each dataset contains the following columns:

    Name of the ColumnTypeDescription
    IDstringrelationship identification
    sourcestringidentification of the source node in the relationship
    targetstringidentification of the target node in the relationship
    labelstringrelationship label (type of relationship)
    propertiesstringa dictionary containing properties related to the relationship.

    Metadata (tiny graphs)

    GraphNumber of NodesNumber of EdgesSparse graph
    dataset_dummy*36N
    dataset_dummy2*36N
  2. m

    Dataset of development of business during the COVID-19 crisis

    • data.mendeley.com
    • narcis.nl
    Updated Nov 9, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tatiana N. Litvinova (2020). Dataset of development of business during the COVID-19 crisis [Dataset]. http://doi.org/10.17632/9vvrd34f8t.1
    Explore at:
    Dataset updated
    Nov 9, 2020
    Authors
    Tatiana N. Litvinova
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    To create the dataset, the top 10 countries leading in the incidence of COVID-19 in the world were selected as of October 22, 2020 (on the eve of the second full of pandemics), which are presented in the Global 500 ranking for 2020: USA, India, Brazil, Russia, Spain, France and Mexico. For each of these countries, no more than 10 of the largest transnational corporations included in the Global 500 rating for 2020 and 2019 were selected separately. The arithmetic averages were calculated and the change (increase) in indicators such as profitability and profitability of enterprises, their ranking position (competitiveness), asset value and number of employees. The arithmetic mean values of these indicators for all countries of the sample were found, characterizing the situation in international entrepreneurship as a whole in the context of the COVID-19 crisis in 2020 on the eve of the second wave of the pandemic. The data is collected in a general Microsoft Excel table. Dataset is a unique database that combines COVID-19 statistics and entrepreneurship statistics. The dataset is flexible data that can be supplemented with data from other countries and newer statistics on the COVID-19 pandemic. Due to the fact that the data in the dataset are not ready-made numbers, but formulas, when adding and / or changing the values in the original table at the beginning of the dataset, most of the subsequent tables will be automatically recalculated and the graphs will be updated. This allows the dataset to be used not just as an array of data, but as an analytical tool for automating scientific research on the impact of the COVID-19 pandemic and crisis on international entrepreneurship. The dataset includes not only tabular data, but also charts that provide data visualization. The dataset contains not only actual, but also forecast data on morbidity and mortality from COVID-19 for the period of the second wave of the pandemic in 2020. The forecasts are presented in the form of a normal distribution of predicted values and the probability of their occurrence in practice. This allows for a broad scenario analysis of the impact of the COVID-19 pandemic and crisis on international entrepreneurship, substituting various predicted morbidity and mortality rates in risk assessment tables and obtaining automatically calculated consequences (changes) on the characteristics of international entrepreneurship. It is also possible to substitute the actual values identified in the process and following the results of the second wave of the pandemic to check the reliability of pre-made forecasts and conduct a plan-fact analysis. The dataset contains not only the numerical values of the initial and predicted values of the set of studied indicators, but also their qualitative interpretation, reflecting the presence and level of risks of a pandemic and COVID-19 crisis for international entrepreneurship.

  3. f

    Tables, data analysis and graphs.

    • figshare.com
    • plos.figshare.com
    xlsx
    Updated Jun 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wesley D. Black (2023). Tables, data analysis and graphs. [Dataset]. http://doi.org/10.1371/journal.pone.0238901.s002
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Wesley D. Black
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A MS-Excel notebook of several spreadsheets pertaining to Figs 1–9 and Table 1, including ANOVA analysis, means, standard deviations, and charts/graphs, and also an adaptation of Andersen 1958 Table 1 adjusted to convert CFU/plate to CFU/m3 with the 1.25x adjustment factor for use of plastic Petri dishes. (XLSX)

  4. d

    Replication Data for: A Fistful of Dollars: Financial Incentives, Peer...

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bauer, Rob; Eberhardt, Inka; Smeets, Paul (2023). Replication Data for: A Fistful of Dollars: Financial Incentives, Peer Information, and Retirement Savings [Dataset]. https://search.dataone.org/view/sha256%3A926f0b859a9c0d0ba39fd820a0f93fad856f1f8fb22ef8c76a16ed58fffbf4e2
    Explore at:
    Dataset updated
    Nov 13, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Bauer, Rob; Eberhardt, Inka; Smeets, Paul
    Description

    Files used to create the tables and graphs in the paper A Fistful of Dollars: Financial Incentives, Peer Information, and Retirement Savings (Bauer, Eberhardt, & Smeets, forthcoming). Sample datasets are included. Programs are written in Stata.

  5. d

    Key generic technology prediction in patent citation using graph neural...

    • dataone.org
    • data.niaid.nih.gov
    • +1more
    Updated Jun 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    M. L. Ding (2024). Key generic technology prediction in patent citation using graph neural networks [Dataset]. http://doi.org/10.5061/dryad.nk98sf803
    Explore at:
    Dataset updated
    Jun 5, 2024
    Dataset provided by
    Dryad Digital Repository
    Authors
    M. L. Ding
    Time period covered
    Jan 11, 2024
    Description

    With the rapid advancement of the Fourth Industrial Revolution, international competition in technology and industry is intensifying. However, in the era of big data and large-scale science, making accurate judgments about the key areas of technology and innovative trends has become exceptionally difficult. This paper constructs a patent indicator evaluation system based on the dimensions of key and generic patent citation, integrates graph neural network modeling to predict key common technologies, and confirms the effectiveness of the method using the field of genetic engineering as an example. According to the LDA topic model, the main technical R&D directions in genetic engineering are genetic analysis and detection technologies, the application of microorganisms in industrial production, virology research involving vaccine development and immune responses, high-throughput sequencing and analysis technologies in genomics, targeted drug design and molecular therapeutic strategies..., These datasets were obtained by the Incopat patent database for cited patents (2013-2022) in the field of genetic engineering. Details for the datasets are provided in the README file. This directory contains the selection of the patent datasets. 1) Table of key generic indicators for nodes (partial 1).csv This file consists of 10 indicators of patents: technical coverage, patent families, patent family citation, patent cooperation, enterprise-enterprise cooperation, industry-university-research cooperation, claims, citation frequency, layout countries, and layout countries. 2) Table of key generic indicators for nodes (partial 2).csv This file consists of 10 indicators of patents: technical convergence, cited countries, inventors, citations, homologous countries/areas, degree centrality, closeness centrality, betweenness centrality, eigenvector centrality, and PageRank. 3) patent.content The content file contains descriptions of the patents in the following format:

    This README file was generated on 2023-11-25 by Mingli Ding.

    GENERAL INFORMATION

    1. Author Information Investigators Contact Information Name: Mingli Ding; Wangke Yu; Shuhua Wang Institution: Jingdezhen Ceramic University Address: Jingdezhen, Jiangxi, China Email: mlding1@163.com
    2. Date of data collection:2013-2022

    DATA & FILE OVERVIEW

    1. File List:

    A) Table of key generic indicators for nodes (partial 1).csv

    B) Table of key generic indicators for nodes (partial 2).csv

    C) patent.content

    D) patent.cites

    E) Graph neural network modeling highest accuracy for different dimensions.csv

    F) Prediction effects of key generic technologies.csv

    DATA-SPECIFIC INFORMATION FOR: Table of key generic indicators for nodes (partial 1).csv

    1. Number of variables: 10
    2. Number of cases/rows: 72489
    3. Variable List:
    • technical coverage: number ...
  6. u

    Population status of Canada's migratory birds - Status of bird species...

    • data.urbandatacentre.ca
    • gimi9.com
    • +2more
    Updated Oct 1, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Population status of Canada's migratory birds - Status of bird species listed in the Migratory Birds Convention Act in relation to population goals, Canada [Dataset]. https://data.urbandatacentre.ca/dataset/gov-canada-a7b8267d-298c-42a1-b074-018401d8fedc
    Explore at:
    Dataset updated
    Oct 1, 2024
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Area covered
    Canada
    Description

    The Canadian Environmental Sustainability Indicators (CESI) program provides data and information to track Canada's performance on key environmental sustainability issues. The Population status of Canada's migratory birds indicator reports the proportion of bird species listed in the Migratory Birds Convention Act whose populations fall within, or are above or below national population goals. It provides a snapshot assessment of the state of bird populations in Canada. Some bird species are managed towards specific population levels (for example, some hunted species or species of conservation concern). While the indicator reports whether species' populations are within acceptable bounds, it does not indicate if management goals are being met. This information is provided to Canadians in a number of formats including: static and interactive maps, charts and graphs, HTML and CSV data tables and downloadable reports. See the supplementary documentation for data sources and details on how those data were collected and how the indicator was calculated. Supplemental Information Canadian Environmental Sustainability Indicators - Home page: https://www.canada.ca/environmental-indicators

  7. Z

    tBiomedL: Larger Semantic Table Annotations Benchmark for Biomedical Domain

    • data.niaid.nih.gov
    • zenodo.org
    Updated Dec 7, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jimènez-Ruiz, Ernesto (2023). tBiomedL: Larger Semantic Table Annotations Benchmark for Biomedical Domain [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10283118
    Explore at:
    Dataset updated
    Dec 7, 2023
    Dataset provided by
    Hassanzadeh, Oktie
    Jimènez-Ruiz, Ernesto
    König-Ries, Birgitta
    Abdelmageed, Nora
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    tBiomedL is a dataset for tabular data to knowledge graph matching. It is derived for the Biodiversity domain and has two types of tables. On the one hand, Horizontal Relational Tables are where each table represents a collection of entities. On the other hand, Entity Tables represent a single entity. We supported ground truth data from Wikidata as a target knowledge graph (KG). tBiomedL is generated by KG2Tables using five levels of a recursive hierarchy of related concepts in Wikidata. It is the successor work of tBiomed tBiomedL contains 860,479 entity and horizontal tables, while this repository contains only a sample of 1% of the total of the entire benchmark with its ground truth data (gt). The Full size of this dataset is 27 GB. We will update this repository with the full dataset, including the test fold with its ground truth data in the Future. Please get in touch if you are interested in the full dataset, The supported tasks for semantic table annotations are:

    Topic Detection (TD) links the entire table to an entity or a class from the target KG. Cell Entity Annotation (CEA) maps individual table cells to entities from the target KG. Column Type Annotation (CTA) links individual table columns to classes from the target KG. Column Property Annotation (CPA) detects the relations between column pairs from the target knowledge graph. Row Annotation (RA) annotates the entire row to a KG entity or property.

  8. Microdata: Australian Census Longitudinal Dataset, 2006-2011

    • data.gov.au
    • data.wu.ac.at
    html
    Updated May 2, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Australian Bureau of Statistics (2016). Microdata: Australian Census Longitudinal Dataset, 2006-2011 [Dataset]. https://data.gov.au/data/dataset/microdata-australian-census-longitudinal-dataset-2006-2011
    Explore at:
    htmlAvailable download formats
    Dataset updated
    May 2, 2016
    Dataset provided by
    Australian Bureau of Statisticshttp://abs.gov.au/
    License

    Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
    License information was derived automatically

    Area covered
    Australia
    Description

    The Australian Census Longitudinal Dataset (ACLD) brings together a 5% sample from the 2006 Census with records from the 2011 Census to create a research tool for exploring how Australian society is changing over time. In taking a longitudinal view of Australians, the ACLD may uncover new insights into the dynamics and transitions that drive social and economic change over time, conveying how these vary for diverse population groups and geographies. It is envisaged that the 2016 and successive Censuses will be added in the future, as well as administrative data sets. The ACLD is released in ABS TableBuilder and as a microdata product in the ABS Data Laboratory.

    The Census of Population and Housing is conducted every five years and aims to measure accurately the number of people and dwellings in Australia on Census Night.

    Microdata products are the most detailed information available from a Census or survey and are generally the responses to individual questions on the questionnaire. They also include derived data from answers to two or more questions and are released with the approval of the Australian Statistician. The following microdata products are available for this longitudinal dataset: •ACLD in TableBuilder - an online tool for creating tables and graphs. •ACLD in ABS Data Laboratory (ABSDL) - for in-depth analysis using a range of statistical software packages.

  9. U

    Summary of dissolved pesticide concentrations in discrete surface-water...

    • data.usgs.gov
    • datasets.ai
    • +2more
    Updated Apr 29, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adam Johnson; Joseph Kennedy (2017). Summary of dissolved pesticide concentrations in discrete surface-water samples collected on the islands of Kauaʻi and Oʻahu, Hawaiʻi November 2016–April 2017 [Dataset]. http://doi.org/10.5066/F7BG2N79
    Explore at:
    Dataset updated
    Apr 29, 2017
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Authors
    Adam Johnson; Joseph Kennedy
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Time period covered
    Nov 21, 2016 - Apr 29, 2017
    Area covered
    Kauai, Hawaii, O‘ahu
    Description

    This dataset includes pesticide-concentration results for 32 discrete water samples that were collected on the islands of Kauai and Oahu between November 21, 2016 and April 29, 2017. The water samples were collected for the Pesticide-Monitoring Program of Surface Water in the State of Hawaii. This dataset consists of five files: a summary file, a sample-list file, two results files, and this metadata file. The summary file (Summary_of_pesticide_results_for_discrete_samples_Hawaii_Nov2016_Apr2017.pdf) includes maps and a table of the sample sites, graphs that summarize the most frequently detected pesticide compounds in the water samples, and tables that summarize comparisons between (1) concentrations of detected pesticide compounds and (2) water-quality standards, criteria, and benchmarks. The sample-list file [List_of_discrete_samples_Hawaii_Nov2016_Apr2017.csv] contains a list of 32 samples and attributes that describe where, when, and how each sample was collected. The first r ...

  10. d

    Data from: A global meta-analysis of the impacts of tree plantations on...

    • datadryad.org
    zip
    Updated Jan 3, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chao Wang (2022). A global meta-analysis of the impacts of tree plantations on biodiversity [Dataset]. http://doi.org/10.5061/dryad.zcrjdfnd4
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 3, 2022
    Dataset provided by
    Dryad
    Authors
    Chao Wang
    Time period covered
    2021
    Description

    Aim: Planted forests are becoming increasingly common worldwide for a variety of reasons including water conservation and carbon sequestration, whereas the effects of tree plantations on biodiversity are unclear as to whether planted ecosystems are ‘green deserts’ or valuable habitats for biodiversity.

    Location: Global.

    Time period: 1980–2020.

    Taxa studied: Flora, fauna, and microorganisms.

    Methods: By conducting a meta-analysis of 361 observations from 138 sites worldwide, we explored the global patterns and associated drivers of biodiversity responding to tree plantations by comparing biodiversity levels in plantations and adjacent habitats (primary or secondary forests).

    Results: Overall, the biodiversity (species richness) and abundance across multi-trophic levels in tree plantations was lower than that in primary forests, reached similar values to secondary succession, but varied with plantation and management regimes. Specifically, the biodiversity across multi-trophic levels...

  11. tBiodiv: Semantic Table Annotations Benchmark for Biodiversity Domain

    • zenodo.org
    zip
    Updated Dec 7, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nora Abdelmageed; Nora Abdelmageed; Ernesto Jimènez-Ruiz; Ernesto Jimènez-Ruiz; Oktie Hassanzadeh; Oktie Hassanzadeh; Birgitta König-Ries; Birgitta König-Ries (2023). tBiodiv: Semantic Table Annotations Benchmark for Biodiversity Domain [Dataset]. http://doi.org/10.5281/zenodo.10283015
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 7, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Nora Abdelmageed; Nora Abdelmageed; Ernesto Jimènez-Ruiz; Ernesto Jimènez-Ruiz; Oktie Hassanzadeh; Oktie Hassanzadeh; Birgitta König-Ries; Birgitta König-Ries
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    tBiodiv is a dataset for tabular data to knowledge graph matching. It is derived for the Biodiversity domain and has two types of tables. On the one hand, Horizontal Relational Tables are where each table represents a collection of entities. On the other hand, Entity Tables represent a single entity. We supported ground truth data from Wikidata as a target knowledge graph (KG).

    tBiodiv is generated by KG2Tables using two levels of a recursive hierarchy of related concepts in Wikidata.

    tBiodiv contains 57,426 entity and horizontal tables, while this repository contains only a sample of 1% of the total generated tables of the entire benchmark with its ground truth data (gt). The Full size of this dataset is 122 GB. We will update this repository with the full dataset in the Future.

    Please get in touch if you are interested in the full dataset,

    The supported tasks for semantic table annotations are:

    1. Topic Detection (TD) links the entire table to an entity or a class from the target KG.
    2. Cell Entity Annotation (CEA) maps individual table cells to entities from the target KG.
    3. Column Type Annotation (CTA) links individual table columns to classes from the target KG.
    4. Column Property Annotation (CPA) detects the relations between column pairs from the target knowledge graph.
    5. Row Annotation (RA) annotates the entire row to a KG entity or property.

  12. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Edwin Carreño; Edwin Carreño (2024). Sample Graph Datasets in CSV Format [Dataset]. http://doi.org/10.5281/zenodo.14335015
Organization logo

Sample Graph Datasets in CSV Format

Explore at:
csvAvailable download formats
Dataset updated
Dec 9, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Edwin Carreño; Edwin Carreño
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Sample Graph Datasets in CSV Format

Note: none of the data sets published here contain actual data, they are for testing purposes only.

Description

This data repository contains graph datasets, where each graph is represented by two CSV files: one for node information and another for edge details. To link the files to the same graph, their names include a common identifier based on the number of nodes. For example:

  • dataset_30_nodes_interactions.csv:contains 30 rows (nodes).
  • dataset_30_edges_interactions.csv: contains 47 rows (edges).
  • the common identifier dataset_30 refers to the same graph.

CSV nodes

Each dataset contains the following columns:

Name of the ColumnTypeDescription
UniProt IDstringprotein identification
labelstringprotein label (type of node)
propertiesstringa dictionary containing properties related to the protein.

CSV edges

Each dataset contains the following columns:

Name of the ColumnTypeDescription
Relationship IDstringrelationship identification
Source IDstringidentification of the source protein in the relationship
Target IDstringidentification of the target protein in the relationship
labelstringrelationship label (type of relationship)
propertiesstringa dictionary containing properties related to the relationship.

Metadata

GraphNumber of NodesNumber of EdgesSparse graph

dataset_30*

30

47

Y

dataset_60*

60

181

Y

dataset_120*

120

689

Y

dataset_240*

240

2819

Y

dataset_300*

300

4658

Y

dataset_600*

600

18004

Y

dataset_1200*

1200

71785

Y

dataset_2400*

2400

288600

Y

dataset_3000*

3000

449727

Y

dataset_6000*

6000

1799413

Y

dataset_12000*

12000

7199863

Y

dataset_24000*

24000

28792361

Y

dataset_30000*

30000

44991744

Y

This repository include two (2) additional tiny graph datasets to experiment before dealing with larger datasets.

CSV nodes (tiny graphs)

Each dataset contains the following columns:

Name of the ColumnTypeDescription
IDstringnode identification
labelstringnode label (type of node)
propertiesstringa dictionary containing properties related to the node.

CSV edges (tiny graphs)

Each dataset contains the following columns:

Name of the ColumnTypeDescription
IDstringrelationship identification
sourcestringidentification of the source node in the relationship
targetstringidentification of the target node in the relationship
labelstringrelationship label (type of relationship)
propertiesstringa dictionary containing properties related to the relationship.

Metadata (tiny graphs)

GraphNumber of NodesNumber of EdgesSparse graph
dataset_dummy*36N
dataset_dummy2*36N
Search
Clear search
Close search
Google apps
Main menu