100+ datasets found
  1. Dataset_Graph

    • springernature.figshare.com
    bin
    Updated Jan 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hadi Yazdi; Qiguan Shu; Thomas Rötzer; Frank Petzold; Ferdinand Ludwig (2024). Dataset_Graph [Dataset]. http://doi.org/10.6084/m9.figshare.23943060.v1
    Explore at:
    binAvailable download formats
    Dataset updated
    Jan 2, 2024
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Hadi Yazdi; Qiguan Shu; Thomas Rötzer; Frank Petzold; Ferdinand Ludwig
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The "Dataset_Graph.zip" file contains the graph models of the trees in the dataset. These graph models are saved in the "pickle" format, which is a binary format used for serializing Python objects. The graph models capture the structural information and relationships of the cylinders in each tree, representing the hierarchical organization of the branches.

  2. Wikipedia Knowledge Graph dataset

    • zenodo.org
    • produccioncientifica.ugr.es
    • +1more
    pdf, tsv
    Updated Jul 17, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wenceslao Arroyo-Machado; Wenceslao Arroyo-Machado; Daniel Torres-Salinas; Daniel Torres-Salinas; Rodrigo Costas; Rodrigo Costas (2024). Wikipedia Knowledge Graph dataset [Dataset]. http://doi.org/10.5281/zenodo.6346900
    Explore at:
    tsv, pdfAvailable download formats
    Dataset updated
    Jul 17, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Wenceslao Arroyo-Machado; Wenceslao Arroyo-Machado; Daniel Torres-Salinas; Daniel Torres-Salinas; Rodrigo Costas; Rodrigo Costas
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Wikipedia is the largest and most read online free encyclopedia currently existing. As such, Wikipedia offers a large amount of data on all its own contents and interactions around them, as well as different types of open data sources. This makes Wikipedia a unique data source that can be analyzed with quantitative data science techniques. However, the enormous amount of data makes it difficult to have an overview, and sometimes many of the analytical possibilities that Wikipedia offers remain unknown. In order to reduce the complexity of identifying and collecting data on Wikipedia and expanding its analytical potential, after collecting different data from various sources and processing them, we have generated a dedicated Wikipedia Knowledge Graph aimed at facilitating the analysis, contextualization of the activity and relations of Wikipedia pages, in this case limited to its English edition. We share this Knowledge Graph dataset in an open way, aiming to be useful for a wide range of researchers, such as informetricians, sociologists or data scientists.

    There are a total of 9 files, all of them in tsv format, and they have been built under a relational structure. The main one that acts as the core of the dataset is the page file, after it there are 4 files with different entities related to the Wikipedia pages (category, url, pub and page_property files) and 4 other files that act as "intermediate tables" making it possible to connect the pages both with the latter and between pages (page_category, page_url, page_pub and page_link files).

    The document Dataset_summary includes a detailed description of the dataset.

    Thanks to Nees Jan van Eck and the Centre for Science and Technology Studies (CWTS) for the valuable comments and suggestions.

  3. E

    Code and data for 'Improved vapor pressure predictions using group...

    • edmond.mpg.de
    exe, zip
    Updated Mar 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matteo Krueger; Thomas Berkemeier; Matteo Krueger; Thomas Berkemeier (2025). Code and data for 'Improved vapor pressure predictions using group contribution-assisted graph convolutional neural networks (GC2NN)' [Dataset]. http://doi.org/10.17617/3.GIKHJL
    Explore at:
    zip(115013), zip(105301), zip(2234685), zip(34286), zip(85257), exe(191951991)Available download formats
    Dataset updated
    Mar 19, 2025
    Dataset provided by
    Edmond
    Authors
    Matteo Krueger; Thomas Berkemeier; Matteo Krueger; Thomas Berkemeier
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We propose a novel approach to predict saturation vapor pressures using group contribution-assisted graph convolutional neural networks (GC2NN), which use both, molecular descriptors like molar mass and functional group counts, as well as molecular graphs containing atom and bond features, as representations of molecular structure. Molecular graphs allow the ML model to better infer molecular connectivity and spatial relations compared to methods using other, non-structural embeddings. We achieve best results with an adaptive-depth GC2NN, where the number of evaluated graph layers depends on molecular size. We apply the model to compounds relevant for the formation of SOA, achieving strong agreement between predicted and experimentally-determined vapor pressure. In this study, we present two models: a general model with broader scope, achieving a mean absolute error (MAE) of 0.67 log-units (R2 = 0.86), and a specialized model focused on atmospheric compounds (MAE = 0.36 log-units, R2 = 0.97).

  4. q

    Thinking deeply about quantitative analysis: Building a Biologist's Toolkit

    • qubeshub.org
    Updated Aug 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sarah Bray; Paul Duffin; James Wagner (2021). Thinking deeply about quantitative analysis: Building a Biologist's Toolkit [Dataset]. http://doi.org/10.24918/cs.2016.4
    Explore at:
    Dataset updated
    Aug 26, 2021
    Dataset provided by
    QUBES
    Authors
    Sarah Bray; Paul Duffin; James Wagner
    Description

    Vision and Change in Undergraduate Biology Education encouraged faculty to focus on core concepts and competencies in undergraduate curriculum. We created a sophomore-level course, Biologists' Toolkit, to focus on the competencies of quantitative reasoning and scientific communication. We introduce students to the statistical analysis of data using the open source statistical language and environment, R and R Studio, in the first two-thirds of the course. During this time the students learn to write basic computer commands to input data and conduct common statistical analysis. The students also learn to graphically represent their data using R. In a final project, we assign students unique data sets that require them to develop a hypothesis that can be explored with the data, analyze and graph the data, search literature related to their data set, and write a report that emulates a scientific paper. The final report includes publication quality graphs and proper reporting of data and statistical results. At the end of the course students reported greater confidence in their ability to read and make graphs, analyze data, and develop hypotheses. Although programming in R has a steep learning curve, we found that students who learned programming in R developed a robust strategy for data analyses and they retained and successfully applied those skills in other courses during their junior and senior years.

  5. Quantitative questions - analysed data

    • figshare.com
    xlsx
    Updated Aug 24, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ashleigh Prince (2023). Quantitative questions - analysed data [Dataset]. http://doi.org/10.6084/m9.figshare.24029238.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Aug 24, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Ashleigh Prince
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Excel spreadsheet contains the quantitative questions (Questions 1, 3 and 4). Each question is analysed in the form of a frequency distribution table and a pie chart.

  6. Dataset_QSM

    • springernature.figshare.com
    zip
    Updated Jan 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hadi Yazdi; Qiguan Shu; Thomas Rötzer; Frank Petzold; Ferdinand Ludwig (2024). Dataset_QSM [Dataset]. http://doi.org/10.6084/m9.figshare.23943195.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 2, 2024
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Hadi Yazdi; Qiguan Shu; Thomas Rötzer; Frank Petzold; Ferdinand Ludwig
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The "Dataset_QSM.zip" file includes three directories: "opt," "optcsv," and "trans," which correspond to each project in the dataset. The "opt" directory contains the main Quantitative Structure Model (QSM) files in ".mat" format. These files store the structural information of the tree cylinders, including their geometry and other relevant attributes. In the "optcsv" directory, you can find the extracted features from the QSM files in a more accessible format, specifically as ".csv" files. These files contain the selected features of the cylinders, making it easier to work with and analyze the QSM data. Lastly, the "trans" directory holds the transformation information files. These files provide the necessary details for converting the location coordinates of the cylinders to the project's coordinate system.

  7. d

    Data from: Use of vectors in financial graphs

    • search.dataone.org
    • dataverse.harvard.edu
    • +1more
    Updated Nov 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wong, Dr Abdul Rahim (2023). Use of vectors in financial graphs [Dataset]. http://doi.org/10.7910/DVN/BEM1LH
    Explore at:
    Dataset updated
    Nov 8, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Wong, Dr Abdul Rahim
    Description

    Use of vectors in financial graphs By using mathematical vectors calculations as financial modeling then further into a new form of quantitative analysis instrument for linear financial computation graphs. A new tool in financial data analysis as an indicator.

  8. The Beta Cell in Its Cluster: Stochastic Graphs of Beta Cell Connectivity in...

    • figshare.com
    ai
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Deborah A. Striegel; Manami Hara; Vipul Periwal (2023). The Beta Cell in Its Cluster: Stochastic Graphs of Beta Cell Connectivity in the Islets of Langerhans [Dataset]. http://doi.org/10.1371/journal.pcbi.1004423
    Explore at:
    aiAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Deborah A. Striegel; Manami Hara; Vipul Periwal
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Pancreatic islets of Langerhans consist of endocrine cells, primarily α, β and δ cells, which secrete glucagon, insulin, and somatostatin, respectively, to regulate plasma glucose. β cells form irregular locally connected clusters within islets that act in concert to secrete insulin upon glucose stimulation. Due to the central functional significance of this local connectivity in the placement of β cells in an islet, it is important to characterize it quantitatively. However, quantification of the seemingly stochastic cytoarchitecture of β cells in an islet requires mathematical methods that can capture topological connectivity in the entire β-cell population in an islet. Graph theory provides such a framework. Using large-scale imaging data for thousands of islets containing hundreds of thousands of cells in human organ donor pancreata, we show that quantitative graph characteristics differ between control and type 2 diabetic islets. Further insight into the processes that shape and maintain this architecture is obtained by formulating a stochastic theory of β-cell rearrangement in whole islets, just as the normal equilibrium distribution of the Ornstein-Uhlenbeck process can be viewed as the result of the interplay between a random walk and a linear restoring force. Requiring that rearrangements maintain the observed quantitative topological graph characteristics strongly constrained possible processes. Our results suggest that β-cell rearrangement is dependent on its connectivity in order to maintain an optimal cluster size in both normal and T2D islets.

  9. D

    Data from: Supplemental Material: "2D, 2.5D, or 3D? An Exploratory Study on...

    • darus.uni-stuttgart.de
    Updated Aug 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stefan Paul Feyer; Bruno Pinaud; Stephen Kobourov; Nicolas Brich; Michael Krone; Andreas Kerren; Falk Schreiber; Karsten Klein (2023). Supplemental Material: "2D, 2.5D, or 3D? An Exploratory Study on Multilayer Network Visualizations in Virtual Reality" [Dataset]. http://doi.org/10.18419/DARUS-3387
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 1, 2023
    Dataset provided by
    DaRUS
    Authors
    Stefan Paul Feyer; Bruno Pinaud; Stephen Kobourov; Nicolas Brich; Michael Krone; Andreas Kerren; Falk Schreiber; Karsten Klein
    License

    https://darus.uni-stuttgart.de/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18419/DARUS-3387https://darus.uni-stuttgart.de/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18419/DARUS-3387

    Dataset funded by
    DFG
    NSF
    ELLIIT
    Description

    Dataset containing supplemental material for the publication "2D, 2.5D, or 3D? An Exploratory Study on Multilayer Network Visualizations in Virtual Reality" This dataset contains: 1) archive containing all raw quantitative results, 2) archive containing all raw qualitative data, 3) archive containing the graphs used for the experiment (.graphml file format), 4) the code to generate the graph library (C++ files using OGDF), 5) a PDF document containing detailed results (with p-values and more charts), 6) a video showing the experimentation from a participant's point of view. 7) complete graph library generated by our graph generator for the experiment

  10. m

    A Quantitative Monitoring Study of Environmental Factors Activating Caihua...

    • data.mendeley.com
    Updated Oct 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiang He (2023). A Quantitative Monitoring Study of Environmental Factors Activating Caihua and Wooden Heritage Cracks in the Palace Museum,processed graphs and masks [Dataset]. http://doi.org/10.17632/3x3hwrpd9f.1
    Explore at:
    Dataset updated
    Oct 2, 2023
    Authors
    Xiang He
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Some data in the article "A Quantitative Monitoring Study of Environmental Factors Activating Caihua and Wooden Heritage Cracks in the Palace Museum". File name represents the monitoring date and time.

  11. Data Visualization Cheat sheets and Resources

    • kaggle.com
    zip
    Updated Feb 20, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kash (2021). Data Visualization Cheat sheets and Resources [Dataset]. https://www.kaggle.com/kaushiksuresh147/data-visualization-cheat-cheats-and-resources
    Explore at:
    zip(133638507 bytes)Available download formats
    Dataset updated
    Feb 20, 2021
    Authors
    Kash
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The Data Visualization Corpus

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1430847%2F29f7950c3b7daf11175aab404725542c%2FGettyImages-1187621904-600x360.jpg?generation=1601115151722854&alt=media" alt="">

    Data Visualization

    Data visualization is the graphical representation of information and data. By using visual elements like charts, graphs, and maps, data visualization tools provide an accessible way to see and understand trends, outliers, and patterns in data.

    In the world of Big Data, data visualization tools and technologies are essential to analyze massive amounts of information and make data-driven decisions

    The Data Visualizaion Copus

    The Data Visualization corpus consists:

    • 32 cheat sheets: This includes A-Z about the techniques and tricks that can be used for visualization, Python and R visualization cheat sheets, Types of charts, and their significance, Storytelling with data, etc..

    • 32 Charts: The corpus also consists of a significant amount of data visualization charts information along with their python code, d3.js codes, and presentations relation to the respective charts explaining in a clear manner!

    • Some recommended books for data visualization every data scientist's should read:

      1. Beautiful Visualization by Julie Steele and Noah Iliinsky
      2. Information Dashboard Design by Stephen Few
      3. Knowledge is beautiful by David McCandless (Short abstract)
      4. The Functional Art: An Introduction to Information Graphics and Visualization by Alberto Cairo
      5. The Visual Display of Quantitative Information by Edward R. Tufte
      6. storytelling with data: a data visualization guide for business professionals by cole Nussbaumer knaflic
      7. Research paper - Cheat Sheets for Data Visualization Techniques by Zezhong Wang, Lovisa Sundin, Dave Murray-Rust, Benjamin Bach

    Suggestions:

    In case, if you find any books, cheat sheets, or charts missing and if you would like to suggest some new documents please let me know in the discussion sections!

    Resources:

    Request to kaggle users:

    • A kind request to kaggle users to create notebooks on different visualization charts as per their interest by choosing a dataset of their own as many beginners and other experts could find it useful!

    • To create interactive EDA using animation with a combination of data visualization charts to give an idea about how to tackle data and extract the insights from the data

    Suggestion and queries:

    Feel free to use the discussion platform of this data set to ask questions or any queries related to the data visualization corpus and data visualization techniques

    Kindly upvote the dataset if you find it useful or if you wish to appreciate the effort taken to gather this corpus! Thank you and have a great day!

  12. Dataset: A continuous open source data collection platform for architectural...

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Jan 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Darius Sas; Darius Sas; Alessandro Gilardi; Ilaria Pigazzini; Francesca Arcelli Fontana; Alessandro Gilardi; Ilaria Pigazzini; Francesca Arcelli Fontana (2024). Dataset: A continuous open source data collection platform for architectural technical debt assessment [Dataset]. http://doi.org/10.5281/zenodo.10044706
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 1, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Darius Sas; Darius Sas; Alessandro Gilardi; Ilaria Pigazzini; Francesca Arcelli Fontana; Alessandro Gilardi; Ilaria Pigazzini; Francesca Arcelli Fontana
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset and replication package of the study "A continuous open source data collection platform for architectural technical debt assessment".

    Abstract

    Architectural decisions are the most important source of technical debt. In recent years, researchers spent an increasing amount of effort investigating this specific category of technical debt, with quantitative methods, and in particular static analysis, being the most common approach to investigate such a topic.

    However, quantitative studies are susceptible, to varying degrees, to external validity threats, which hinder the generalisation of their findings.

    In response to this concern, researchers strive to expand the scope of their study by incorporating a larger number of projects into their analyses. This practice is typically executed on a case-by-case basis, necessitating substantial data collection efforts that have to be repeated for each new study.

    To address this issue, this paper presents our initial attempt at tackling this problem and enabling researchers to study architectural smells at large scale, a well-known indicator of architectural technical debt. Specifically, we introduce a novel approach to data collection pipeline that leverages Apache Airflow to continuously generate up-to-date, large-scale datasets using Arcan, a tool for architectural smells detection (or any other tool).

    Finally, we present the publicly-available dataset resulting from the first three months of execution of the pipeline, that includes over 30,000 analysed commits and releases from over 10,000 open source GitHub projects written in 5 different programming languages and amounting to over a billion of lines of code analysed.

  13. f

    Speech Graphs Provide a Quantitative Measure of Thought Disorder in...

    • plos.figshare.com
    tiff
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Natalia B. Mota; Nivaldo A. P. Vasconcelos; Nathalia Lemos; Ana C. Pieretti; Osame Kinouchi; Guillermo A. Cecchi; Mauro Copelli; Sidarta Ribeiro (2023). Speech Graphs Provide a Quantitative Measure of Thought Disorder in Psychosis [Dataset]. http://doi.org/10.1371/journal.pone.0034928
    Explore at:
    tiffAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Natalia B. Mota; Nivaldo A. P. Vasconcelos; Nathalia Lemos; Ana C. Pieretti; Osame Kinouchi; Guillermo A. Cecchi; Mauro Copelli; Sidarta Ribeiro
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundPsychosis has various causes, including mania and schizophrenia. Since the differential diagnosis of psychosis is exclusively based on subjective assessments of oral interviews with patients, an objective quantification of the speech disturbances that characterize mania and schizophrenia is in order. In principle, such quantification could be achieved by the analysis of speech graphs. A graph represents a network with nodes connected by edges; in speech graphs, nodes correspond to words and edges correspond to semantic and grammatical relationships. Methodology/Principal FindingsTo quantify speech differences related to psychosis, interviews with schizophrenics, manics and normal subjects were recorded and represented as graphs. Manics scored significantly higher than schizophrenics in ten graph measures. Psychopathological symptoms such as logorrhea, poor speech, and flight of thoughts were grasped by the analysis even when verbosity differences were discounted. Binary classifiers based on speech graph measures sorted schizophrenics from manics with up to 93.8% of sensitivity and 93.7% of specificity. In contrast, sorting based on the scores of two standard psychiatric scales (BPRS and PANSS) reached only 62.5% of sensitivity and specificity. Conclusions/SignificanceThe results demonstrate that alterations of the thought process manifested in the speech of psychotic patients can be objectively measured using graph-theoretical tools, developed to capture specific features of the normal and dysfunctional flow of thought, such as divergence and recurrence. The quantitative analysis of speech graphs is not redundant with standard psychometric scales but rather complementary, as it yields a very accurate sorting of schizophrenics and manics. Overall, the results point to automated psychiatric diagnosis based not on what is said, but on how it is said.

  14. Z

    Appendix for "Is JavaScript Call Graph Extraction Solved Yet? A Comparative...

    • data.niaid.nih.gov
    Updated Sep 23, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gábor Lóki (2022). Appendix for "Is JavaScript Call Graph Extraction Solved Yet? A Comparative Study of Static and Dynamic Tools" [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7104953
    Explore at:
    Dataset updated
    Sep 23, 2022
    Dataset provided by
    Gábor Antal
    Zoltán Herczeg
    Gábor Lóki
    Péter Hegedűs
    Rudolf Ferenc
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Abstract

    The popularity and wide adoption of JavaScript both at the client and server-side makes its code analysis more essential than ever before. Most of the algorithms for vulnerability analysis, coding issue detection, or type inference rely on the call graph representation of the underlying program. Luckily, there are quite a few tools to get this job done already. However, their performance in vitro and especially in vivo has not yet been extensively compared and evaluated.

    In this paper, we systematically compare five static and two dynamic approaches for building JavaScript call graphs on 26 WebKit SunSpider benchmark programs and two static and two dynamic methods on 12 real-world Node.js modules. The tools under examination using static techniques were npm call graph, IBM WALA, Google Closure Compiler, Approximate Call Graph, and Type Analyzer for JavaScript. We performed dynamic analyzes relying on the nodejs-cg tool (a customized Node.js runtime) and the NodeProf instrumentation and profiling framework.

    We provide a quantitative evaluation of the results, and a result quality analysis based on 941 manually validated call edges. On the SunSpider programs, which do not take any inputs, so dynamic extraction could be complete, all the static tools also performed well. For example, TAJS found 93% of all edges while having a 97% precision compared to the precise dynamic call graph. When it comes to real-world Node.js modules, our evaluation shows that static tools struggle with parsing the code and fail to detect a significant amount of call edges that dynamic approaches can capture. Nonetheless, a significant number of edges not detected by dynamic approaches are also reported. Among these, however, there are also edges that are real, but for some reason the unit tests did not execute the branches in which these calls were included.

  15. C

    Data from: Dataset on Spodoptera used to conduct the practical comparison of...

    • dataverse.cirad.fr
    application/x-gzip +1
    Updated May 8, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CIRAD Dataverse (2020). Dataset on Spodoptera used to conduct the practical comparison of FCA extensions to Model Indeterminate Value of Ternary Data [Dataset]. http://doi.org/10.18167/DVN1/VNCZYA
    Explore at:
    application/x-gzip(17438), pdf(84480)Available download formats
    Dataset updated
    May 8, 2020
    Dataset provided by
    CIRAD Dataverse
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains an except from Knomana knowledge base on Spodoptera realized January 9, 2020, and the files (input and output) resulting from the qualitative and quantitative evaluation of three classification methods (Graph-FCA, RCA, and TCA) on the except.

  16. Dataset Chart Hours Television Digital Social Intervention Chicago & Los...

    • zenodo.org
    • data.niaid.nih.gov
    Updated Aug 15, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr. David Render PhD; Dr. David Render PhD (2022). Dataset Chart Hours Television Digital Social Intervention Chicago & Los Angeles Research PhD [Dataset]. http://doi.org/10.5281/zenodo.6991324
    Explore at:
    Dataset updated
    Aug 15, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Dr. David Render PhD; Dr. David Render PhD
    License

    Attribution 2.0 (CC BY 2.0)https://creativecommons.org/licenses/by/2.0/
    License information was derived automatically

    Area covered
    Chicago, Los Angeles
    Description

    Dataset chart Quantitative Information Social Issues Racial Mental Emotional PhD Dr.David Render Solving Categorizing Identifying Social Issues Human Impact In Part National Case Studies Chicagoland Business & Los Angeles Economic Territories

  17. o

    A Mixed-Methods Framework Combining Directed Acyclic Graphs and Alkire...

    • openicpsr.org
    Updated Mar 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Guberney Muñetón-Santa; Carlos Andrés Pérez-Aguirre; Julián Andrés Angarita-Suárez (2025). A Mixed-Methods Framework Combining Directed Acyclic Graphs and Alkire Foster Methodology for Evaluating Participatory Dynamics [Dataset]. http://doi.org/10.3886/E224181V1
    Explore at:
    Dataset updated
    Mar 25, 2025
    Dataset provided by
    Instituto de Estudios Regionales, Universidad de Antioquia
    Grupo Epidemiología, Facultad Nacional de Salud Pública, Universidad de Antioquia
    Authors
    Guberney Muñetón-Santa; Carlos Andrés Pérez-Aguirre; Julián Andrés Angarita-Suárez
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 2023 - Dec 31, 2023
    Area covered
    Medellín
    Description

    Data for the paper "A Mixed-Methods Framework Combining Directed Acyclic Graphs and Alkire Foster Methodology for Evaluating Participatory Dynamics"Abstract:Effective measurement of citizen participation is crucial for promoting democratic governance, yet traditional approaches often fail to capture its complex, context-specific and causal dynamics. To address these limitations, this paper presents a novel mixed-methods approach that integrates Directed Acyclic Graphs with the Alkire-Foster method. The proposed approach uses qualitative insights, obtained through expert workshops and the construction of directed acyclic graphs, to model the causal relationships that influence the quality of citizen participation. This qualitative understanding then informs the quantitative construction of a multidimensional indicator using the transparent and decomposable Alkire-Foster methodology. Applied to the case study of Medellín, Colombia, using 2023 survey data, the proposed methodology yielded a Multidimensional Participation Index of 0.727 for participating individuals, highlighting the significant contribution of participatory practices and enabling conditions to overall quality. Gender analysis revealed nuanced dimensions of equity, with women having a slightly higher multidimensional participation index than men. This new methodology offers significant advantages over conventional approaches in terms of theoretical grounding, interpretability, context sensitivity and policy relevance. Future research should focus on the validation and refinement of the methodology in different contexts and the further use of directed acyclic graphs for prospective policy impact assessment. By providing a more robust and nuanced measure of citizen participation, this research contributes to advancing both the understanding and practice of democratic and accountable governance.

  18. d

    Best Management Practices Statistical Estimator (BMPSE) Version 1.2.0

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Jul 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2024). Best Management Practices Statistical Estimator (BMPSE) Version 1.2.0 [Dataset]. https://catalog.data.gov/dataset/best-management-practices-statistical-estimator-bmpse-version-1-2-0
    Explore at:
    Dataset updated
    Jul 6, 2024
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Description

    The Best Management Practices Statistical Estimator (BMPSE) version 1.2.0 was developed by the U.S. Geological Survey (USGS), in cooperation with the Federal Highway Administration (FHWA) Office of Project Delivery and Environmental Review to provide planning-level information about the performance of structural best management practices for decision makers, planners, and highway engineers to assess and mitigate possible adverse effects of highway and urban runoff on the Nation's receiving waters (Granato 2013, 2014; Granato and others, 2021). The BMPSE was assembled by using a Microsoft Access® database application to facilitate calculation of BMP performance statistics. Granato (2014) developed quantitative methods to estimate values of the trapezoidal-distribution statistics, correlation coefficients, and the minimum irreducible concentration (MIC) from available data. Granato (2014) developed the BMPSE to hold and process data from the International Stormwater Best Management Practices Database (BMPDB, www.bmpdatabase.org). Version 1.0 of the BMPSE contained a subset of the data from the 2012 version of the BMPDB; the current version of the BMPSE (1.2.0) contains a subset of the data from the December 2019 version of the BMPDB. Selected data from the BMPDB were screened for import into the BMPSE in consultation with Jane Clary, the data manager for the BMPDB. Modifications included identifying water quality constituents, making measurement units consistent, identifying paired inflow and outflow values, and converting BMPDB water quality values set as half the detection limit back to the detection limit. Total polycyclic aromatic hydrocarbons (PAH) values were added to the BMPSE from BMPDB data; they were calculated from individual PAH measurements at sites with enough data to calculate totals. The BMPSE tool can sort and rank the data, calculate plotting positions, calculate initial estimates, and calculate potential correlations to facilitate the distribution-fitting process (Granato, 2014). For water-quality ratio analysis the BMPSE generates the input files and the list of filenames for each constituent within the Graphical User Interface (GUI). The BMPSE calculates the Spearman’s rho (ρ) and Kendall’s tau (τ) correlation coefficients with their respective 95-percent confidence limits and the probability that each correlation coefficient value is not significantly different from zero by using standard methods (Granato, 2014). If the 95-percent confidence limit values are of the same sign, then the correlation coefficient is statistically different from zero. For hydrograph extension, the BMPSE calculates ρ and τ between the inflow volume and the hydrograph-extension values (Granato, 2014). For volume reduction, the BMPSE calculates ρ and τ between the inflow volume and the ratio of outflow to inflow volumes (Granato, 2014). For water-quality treatment, the BMPSE calculates ρ and τ between the inflow concentrations and the ratio of outflow to inflow concentrations (Granato, 2014; 2020). The BMPSE also calculates ρ between the inflow and the outflow concentrations when a water-quality treatment analysis is done. The current version (1.2.0) of the BMPSE also has the option to calculate urban-runoff quality statistics from inflows to BMPs by using computer code developed for the Highway Runoff Database (Granato and Cazenas, 2009;Granato, 2019). Granato, G.E., 2013, Stochastic empirical loading and dilution model (SELDM) version 1.0.0: U.S. Geological Survey Techniques and Methods, book 4, chap. C3, 112 p., CD-ROM https://pubs.usgs.gov/tm/04/c03 Granato, G.E., 2014, Statistics for stochastic modeling of volume reduction, hydrograph extension, and water-quality treatment by structural stormwater runoff best management practices (BMPs): U.S. Geological Survey Scientific Investigations Report 2014–5037, 37 p., http://dx.doi.org/10.3133/sir20145037. Granato, G.E., 2019, Highway-Runoff Database (HRDB) Version 1.1.0: U.S. Geological Survey data release, https://doi.org/10.5066/P94VL32J. Granato, G.E., and Cazenas, P.A., 2009, Highway-Runoff Database (HRDB Version 1.0)--A data warehouse and preprocessor for the stochastic empirical loading and dilution model: Washington, D.C., U.S. Department of Transportation, Federal Highway Administration, FHWA-HEP-09-004, 57 p. https://pubs.usgs.gov/sir/2009/5269/disc_content_100a_web/FHWA-HEP-09-004.pdf Granato, G.E., Spaetzel, A.B., and Medalie, L., 2021, Statistical methods for simulating structural stormwater runoff best management practices (BMPs) with the stochastic empirical loading and dilution model (SELDM): U.S. Geological Survey Scientific Investigations Report 2020–5136, 41 p., https://doi.org/10.3133/sir20205136

  19. Characterization of peptide-protein relationships in protein ambiguity...

    • data.niaid.nih.gov
    xml
    Updated Jul 30, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Karin Schork; Martin Eisenacher (2021). Characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs (data set D1) [Dataset]. https://data.niaid.nih.gov/resources?id=pxd024684
    Explore at:
    xmlAvailable download formats
    Dataset updated
    Jul 30, 2021
    Dataset provided by
    Medizinisches Proteom-Center, Ruhr-University Bochum
    Ruhr-University Bochum, Medical Faculty, Medizinisches Proteom-Center, Medical Bioinformatics Ruhr-University Bochum, Medical Proteome Analysis, Center for Proteindiagnostics (PRODI)
    Authors
    Karin Schork; Martin Eisenacher
    Variables measured
    Proteomics
    Description

    Motivation: In bottom-up mass spectrometry proteins are enzymatically digested before measurement. The relationship between proteins and peptides can be represented by bipartite graphs that can be split into connected components. This representation is useful to aid protein inference and quantification, which is complex due to the occurrence of shared peptides. We conducted a comprehensive analysis of these bipartite graphs using peptides from an in silico digestion of protein databases as well as quantified peptides. Results: The graphs based on quantified peptides are smaller and have less complex structures compared to the database level. However, the proportion of protein nodes without unique peptides and the proportion of graphs that contain these proteins increase. Large differences between the two underlying organisms (mouse and yeast) on database as well as quantitative level could be observed. Insights of this analysis may be useful for the development of protein inference and quantification algorithms. Link to preprint: https://www.biorxiv.org/content/10.1101/2021.07.28.454128v1?ct=

  20. Size of Federal Reserve's balance sheet 2007-2025

    • statista.com
    Updated Jul 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Size of Federal Reserve's balance sheet 2007-2025 [Dataset]. https://www.statista.com/statistics/1121448/fed-balance-sheet-timeline/
    Explore at:
    Dataset updated
    Jul 2, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Aug 1, 2007 - Jun 25, 2025
    Area covered
    United States
    Description

    The Federal Reserve's balance sheet has undergone significant changes since 2007, reflecting its response to major economic crises. From a modest *** trillion U.S. dollars at the end of 2007, it ballooned to approximately **** trillion U.S. dollars by June 2025. This dramatic expansion, particularly during the 2008 financial crisis and the COVID-19 pandemic - both of which resulted in negative annual GDP growth in the U.S. - showcases the Fed's crucial role in stabilizing the economy through expansionary monetary policies. Impact on inflation and interest rates The Fed's expansionary measures, while aimed at stimulating economic growth, have had notable effects on inflation and interest rates. Following the quantitative easing in 2020, inflation in the United States reached ***** percent in 2022, the highest since 1991. However, by *************, inflation had declined to *** percent. Concurrently, the Federal Reserve implemented a series of interest rate hikes, with the rate peaking at **** percent in ***********, before the first rate cut since ************** occurred in **************. Financial implications for the Federal Reserve The expansion of the Fed's balance sheet and subsequent interest rate hikes have had significant financial implications. In 2023, the Fed reported a negative net income of ***** billion U.S. dollars, a stark contrast to the ***** billion U.S. dollars profit in 2022. This unprecedented shift was primarily due to rapidly rising interest rates, which caused the Fed's interest expenses to soar to over *** billion U.S. dollars in 2023. Despite this, the Fed's net interest income on securities acquired through open market operations reached a record high of ****** billion U.S. dollars in the same year.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Hadi Yazdi; Qiguan Shu; Thomas Rötzer; Frank Petzold; Ferdinand Ludwig (2024). Dataset_Graph [Dataset]. http://doi.org/10.6084/m9.figshare.23943060.v1
Organization logo

Dataset_Graph

Explore at:
5 scholarly articles cite this dataset (View in Google Scholar)
binAvailable download formats
Dataset updated
Jan 2, 2024
Dataset provided by
Figsharehttp://figshare.com/
Authors
Hadi Yazdi; Qiguan Shu; Thomas Rötzer; Frank Petzold; Ferdinand Ludwig
License

CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically

Description

The "Dataset_Graph.zip" file contains the graph models of the trees in the dataset. These graph models are saved in the "pickle" format, which is a binary format used for serializing Python objects. The graph models capture the structural information and relationships of the cylinders in each tree, representing the hierarchical organization of the branches.

Search
Clear search
Close search
Google apps
Main menu