Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Excel spreadsheet contains the quantitative questions (Questions 1, 3 and 4). Each question is analysed in the form of a frequency distribution table and a pie chart.
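The frequency-distribution analysis described above can be sketched in a few lines of Python. This is a minimal illustration with made-up survey responses, not the dataset's actual spreadsheet contents: each row of the table carries the label, absolute frequency, and percentage share, which are exactly the numbers a pie chart slices up.

```python
from collections import Counter

# Hypothetical responses to one quantitative survey question
responses = ["Agree", "Agree", "Neutral", "Disagree", "Agree", "Neutral"]

def frequency_table(values):
    """Return [(value, count, share_percent)] sorted by descending count."""
    counts = Counter(values)
    total = sum(counts.values())
    return [(v, n, round(100 * n / total, 1))
            for v, n in counts.most_common()]

table = frequency_table(responses)
for value, count, share in table:
    print(f"{value:<10}{count:>3}{share:>8}%")
```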
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Underlying quantitative data in support of the charts in Fig 6 in [1].
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
Data visualization is the graphical representation of information and data. By using visual elements like charts, graphs, and maps, data visualization tools provide an accessible way to see and understand trends, outliers, and patterns in data.
In the world of Big Data, data visualization tools and technologies are essential for analyzing massive amounts of information and making data-driven decisions.
32 cheat sheets: an A-Z of the techniques and tricks that can be used for visualization, Python and R visualization cheat sheets, types of charts and their significance, storytelling with data, and more.
32 charts: the corpus also contains a significant amount of information on data visualization charts, along with their Python code, d3.js code, and presentations explaining each chart in a clear manner!
Some recommended books on data visualization that every data scientist should read:
If you find any books, cheat sheets, or charts missing, or would like to suggest new documents, please let me know in the discussion section!
A kind request to Kaggle users: please create notebooks on different visualization charts as per your interest, choosing a dataset of your own, as many beginners and experts alike could find them useful!
Create interactive EDA using animation combined with data visualization charts, to give an idea of how to tackle data and extract insights from it.
Feel free to use this dataset's discussion platform to ask any questions related to the data visualization corpus or data visualization techniques.
This data package includes the underlying data and files to replicate the calculations, charts, and tables presented in A Portfolio Model of Quantitative Easing, PIIE Working Paper 16-7. If you use the data, please cite as: Christensen, Jens H. E., and Signe Krogstrup. (2016). A Portfolio Model of Quantitative Easing. PIIE Working Paper 16-7. Peterson Institute for International Economics.
Attribution 2.0 (CC BY 2.0): https://creativecommons.org/licenses/by/2.0/
License information was derived automatically
Dataset chart keywords: Quantitative Information, Social Issues, Racial, Mental, Emotional, PhD Dr. David Render, Solving, Categorizing, Identifying Social Issues, Human Impact In Part, National Case Studies, Chicagoland Business & Los Angeles Economic Territories
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Primers used for quantitative real-time PCR, CHART-PCR and ChIP.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We propose a novel approach to predict saturation vapor pressures using group contribution-assisted graph convolutional neural networks (GC2NN), which use both molecular descriptors, such as molar mass and functional group counts, and molecular graphs containing atom and bond features as representations of molecular structure. Molecular graphs allow the ML model to better infer molecular connectivity and spatial relations compared to methods using other, non-structural embeddings. We achieve the best results with an adaptive-depth GC2NN, where the number of evaluated graph layers depends on molecular size. We apply the model to compounds relevant for the formation of secondary organic aerosol (SOA), achieving strong agreement between predicted and experimentally determined vapor pressures. In this study, we present two models: a general model with broader scope, achieving a mean absolute error (MAE) of 0.67 log-units (R2 = 0.86), and a specialized model focused on atmospheric compounds (MAE = 0.36 log-units, R2 = 0.97).
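The two error metrics quoted above, MAE and R2, are standard regression diagnostics. A minimal sketch of how they are computed, using hypothetical predicted versus observed log10 vapor pressures (these numbers are illustrative, not from the study):

```python
# Hypothetical observed vs. predicted log10 vapor pressures
y_true = [-4.2, -6.1, -2.8, -5.0, -3.3]
y_pred = [-4.0, -6.5, -2.5, -5.2, -3.1]

def mae(y_true, y_pred):
    """Mean absolute error, in the same units as the targets (log-units)."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def r2(y_true, y_pred):
    """Coefficient of determination: 1 - residual SS / total SS."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot

print(f"MAE = {mae(y_true, y_pred):.2f} log-units, R2 = {r2(y_true, y_pred):.2f}")
```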
Attribution 3.0 (CC BY 3.0): https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
Deep-water benthic ostracodes from the Pliocene-Pleistocene interval of ODP Leg 107, Hole 654A (Tyrrhenian Sea) were studied. From a total of 106 samples, 40 species considered autochthonous were identified. Detailed investigations have established the biostratigraphic distribution of the most frequent ostracode taxa. The extinction levels of Agrenocythere pliocenica (a psychrospheric ostracode) in Hole 654A and in some Italian land sections lead to the conclusion that the removal of psychrospheric conditions took place in the Mediterranean Sea during or after the time interval corresponding to the Small Gephyrocapsa Zone (upper part of the early Pleistocene), and not at the beginning of the Quaternary, as previously stated. Based on a reduced matrix of quantitative data of 63 samples and 20 variables of ostracodes, four varimax assemblages were extracted by a Q-mode factor analysis. Six factors and eight varimax assemblages were recognized from the Q-mode factor analysis of the quantitative data of 162 samples and 47 variables of the benthic foraminifers. The stratigraphic distributions of the varimax assemblages of the two faunistic groups were plotted against the calcareous plankton biostratigraphic scheme and compared in order to trace the relationship between the benthic foraminifer and ostracode varimax assemblages. General results show that the two populations, belonging to quite different taxa, display almost coeval changes along the Pliocene-Pleistocene sequence of Hole 654A, essentially induced by paleoenvironmental modifications. Mainly on the basis of the benthic foraminifer assemblages (which are quantitatively better represented than the ostracode assemblages), it is possible to identify such modifications as variations in sedimentation depth and in bottom oxygen content.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Underlying quantitative data in support of the charts in Figs 4A and C in [1].
This data package includes the underlying data and files to replicate the calculations, charts, and tables presented in Quantity Theory of Money Redux? Will Inflation Be the Legacy of Quantitative Easing?, PIIE Policy Brief 15-7. If you use the data, please cite as: Cline, William R. (2015). Quantity Theory of Money Redux? Will Inflation Be the Legacy of Quantitative Easing?. PIIE Policy Brief 15-7. Peterson Institute for International Economics.
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Wikipedia is the largest and most read free online encyclopedia currently in existence. As such, Wikipedia offers a large amount of data on all its own contents and the interactions around them, as well as different types of open data sources. This makes Wikipedia a unique data source that can be analyzed with quantitative data science techniques. However, the enormous amount of data makes it difficult to get an overview, and many of the analytical possibilities that Wikipedia offers remain unknown. In order to reduce the complexity of identifying and collecting data on Wikipedia and to expand its analytical potential, after collecting data from various sources and processing them, we have generated a dedicated Wikipedia Knowledge Graph aimed at facilitating the analysis and contextualization of the activity and relations of Wikipedia pages, in this case limited to its English edition. We share this Knowledge Graph dataset openly, aiming for it to be useful for a wide range of researchers, such as informetricians, sociologists, or data scientists.
There are a total of 9 files, all in tsv format, built under a relational structure. The main file, which acts as the core of the dataset, is the page file; alongside it there are 4 files with different entities related to the Wikipedia pages (the category, url, pub and page_property files) and 4 other files that act as "intermediate tables", making it possible to connect the pages both with those entities and with each other (the page_category, page_url, page_pub and page_link files).
The document Dataset_summary includes a detailed description of the dataset.
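The relational layout described above (entity files joined through intermediate tables) can be sketched with mock tsv content. The column names below (page_id, category_id, title, name) are assumptions for illustration; the actual schema is described in the Dataset_summary document.

```python
import csv
import io

# Minimal mock-ups of three of the nine tsv files; the real column
# names may differ from these assumed ones.
page_tsv = "page_id\ttitle\n1\tData visualization\n2\tPie chart\n"
category_tsv = "category_id\tname\n10\tStatistical charts\n"
page_category_tsv = "page_id\tcategory_id\n1\t10\n2\t10\n"

def read_tsv(text):
    """Parse tab-separated text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))

pages = {r["page_id"]: r["title"] for r in read_tsv(page_tsv)}
cats = {r["category_id"]: r["name"] for r in read_tsv(category_tsv)}

# The intermediate table resolves the many-to-many relation
# between pages and categories.
links = [(pages[r["page_id"]], cats[r["category_id"]])
         for r in read_tsv(page_category_tsv)]
print(links)
```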
Thanks to Nees Jan van Eck and the Centre for Science and Technology Studies (CWTS) for the valuable comments and suggestions.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Supplementary Materials
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The "Dataset_Graph.zip" file contains the graph models of the trees in the dataset. These graph models are saved in the "pickle" format, which is a binary format used for serializing Python objects. The graph models capture the structural information and relationships of the cylinders in each tree, representing the hierarchical organization of the branches.
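Files in the pickle format are read back with Python's standard `pickle` module. A minimal round-trip sketch follows; the toy adjacency dictionary here only mimics the hierarchical branch structure described above, and the actual objects inside Dataset_Graph.zip may be of a different type (only unpickle files from sources you trust).

```python
import os
import pickle
import tempfile

# A toy tree graph: each node maps to its child branches, mimicking
# the hierarchical cylinder organization described above (assumed shape).
tree_graph = {"trunk": ["branch_a", "branch_b"],
              "branch_a": ["twig_1"],
              "branch_b": [],
              "twig_1": []}

# Serialize to a .pickle file, then load it back, as one would with
# the files extracted from Dataset_Graph.zip.
path = os.path.join(tempfile.gettempdir(), "tree_0001.pickle")
with open(path, "wb") as f:
    pickle.dump(tree_graph, f)
with open(path, "rb") as f:
    loaded = pickle.load(f)

print(loaded["trunk"])
```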
Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Research in digital fabrication, specifically in 3D concrete printing (3DCP), has seen a substantial increase in publication output in the past five years, making it hard to keep up with the latest developments. The 3dcp.fyi database aims to provide the research community with a comprehensive, up-to-date, and manually curated literature dataset documenting the development of the field from its early beginnings in the late 1990s to its resurgence in the 2010s until today. The data set is compiled using a systematic approach. A thorough literature search was conducted in scientific databases, following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) scheme. This was then enhanced iteratively with non-indexed literature through a snowball citation search. The authors of the articles were assigned unique and persistent identifiers (ORCID® IDs) through a systematic process that combined querying APIs systematically and manually curating data. The works in the data set also include references to other works, as long as those referenced works are also included within the same data set. A citation network graph is created where scientific articles are represented as vertices, and their citations to other scientific articles are the edges. The constructed network graph is subjected to detailed analysis using specific graph-theoretic algorithms, like PageRank. These algorithms evaluate the structure and connections within the graph, yielding quantitative metrics. Currently, the high-quality dataset contains more than 2600 manually curated scientific works, including journal articles, conference articles, books, and theses, with more than 40000 cross-references and 2000 authors, opening up the possibility for more detailed analysis. The data is published on https://3dcp.fyi, ready for import into several reference managers, and is continuously updated. 
We encourage researchers to enrich the database by submitting their publications, adding missing works, or suggesting new features.
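The graph-theoretic analysis described above (works as vertices, citations as edges, ranked with PageRank) can be sketched with a plain power-iteration implementation. The four-article network below is purely illustrative, not drawn from the 3dcp.fyi data:

```python
# Tiny citation network: each article maps to the articles it cites.
citations = {"A": ["B", "C"], "B": ["C"], "C": [], "D": ["C"]}

def pagerank(graph, damping=0.85, iters=50):
    """Power-iteration PageRank over a dict-of-lists adjacency graph."""
    n = len(graph)
    ranks = {v: 1 / n for v in graph}
    for _ in range(iters):
        new = {v: (1 - damping) / n for v in graph}
        for v, out in graph.items():
            # Dangling vertices (no outgoing citations) spread their
            # rank uniformly over the whole graph.
            targets = out if out else list(graph)
            share = damping * ranks[v] / len(targets)
            for t in targets:
                new[t] += share
        ranks = new
    return ranks

ranks = pagerank(citations)
print(max(ranks, key=ranks.get))  # "C", the most-cited work
```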
License unknown (licentieonbekend): http://standaarden.overheid.nl/owms/terms/licentieonbekend
The quantitative sensitivity is defined as the product of the groundwater residence time and the infiltration rate. When a water particle remains underground twice as long, it is twice as important for the water quality in an aquifer. The same applies to the infiltration rate: when twice as much infiltration occurs in one area as in another (and the residence time is the same), the area is twice as important for the overall groundwater quality. The map therefore indicates which areas are important for the 'overall water quality', taking into account only conservative transport, i.e. without degradation and retardation. Insight into the quantitative sensitivity is important because the most important characteristic of groundwater systems is that the greater part remains shallow in the subsurface for only a short time, while a very small part of an area largely determines the deeper water quality. The chart shows the quantitative sensitivity on a logarithmic scale. No data is yet available for the municipality of Vijfheerenlanden, which has been part of the province of Utrecht since 1 January 2019.
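The definition above is a simple product, mapped to a log10 scale for display. A minimal sketch with illustrative numbers (the area names, units, and values are assumptions, not data from the map):

```python
import math

def quantitative_sensitivity(residence_time_years, infiltration_mm_per_year):
    """Sensitivity = residence time x infiltration rate (illustrative units)."""
    return residence_time_years * infiltration_mm_per_year

# Hypothetical areas: (residence time in years, infiltration in mm/year)
areas = {"dune area": (200.0, 300.0), "clay polder": (50.0, 30.0)}

for name, (t, q) in areas.items():
    s = quantitative_sensitivity(t, q)
    # log10 scale, matching how the chart displays the sensitivity
    print(f"{name}: {s:.0f} (log10 = {math.log10(s):.2f})")
```

Note the doubling behaviour stated in the text: doubling either factor doubles the sensitivity, since the measure is a plain product.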
One of the major duties the Bank of England (BoE) is tasked with is keeping inflation rates low and stable. The Bank of England's usual tactic for keeping inflation down, and therefore the price of goods and services stable, is to lower the Bank Rate. Such a measure was used in 2008 during the global recession, when the BoE lowered the bank base rate from **** percent to *** percent. Due to the economic fears surrounding the COVID-19 virus, as of 19 March 2020 the bank base rate was set to its lowest ever standing. The issue with lowering interest rates is that there is a limit to how low they can go.
Quantitative easing
Quantitative easing is a measure that central banks can use to inject money into the economy, in the hope of boosting spending and investment. It is the creation of digital money in order to purchase government bonds. By purchasing large amounts of government bonds, the interest rates on those bonds are lowered. This in turn lowers the interest rates offered on loans such as mortgages or business loans, encouraging spending and stimulating the economy.
Large enterprises jump at the opportunity
After the initial stimulus of *** billion British pounds through quantitative easing in March 2020, the Bank of England announced in June that it would increase the amount by a further 100 billion British pounds. In March 2020, the headline flow of borrowing by non-financial industries, including the construction, transport, real estate, and manufacturing sectors, increased significantly.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The dataset and replication package of the study "A continuous open source data collection platform for architectural technical debt assessment".
Abstract
Architectural decisions are the most important source of technical debt. In recent years, researchers have spent an increasing amount of effort investigating this specific category of technical debt, with quantitative methods, and in particular static analysis, being the most common approach to the topic.
However, quantitative studies are susceptible, to varying degrees, to external validity threats, which hinder the generalisation of their findings.
In response to this concern, researchers strive to expand the scope of their study by incorporating a larger number of projects into their analyses. This practice is typically executed on a case-by-case basis, necessitating substantial data collection efforts that have to be repeated for each new study.
To address this issue, this paper presents our initial attempt at tackling this problem and enabling researchers to study architectural smells, a well-known indicator of architectural technical debt, at large scale. Specifically, we introduce a novel data collection pipeline that leverages Apache Airflow to continuously generate up-to-date, large-scale datasets using Arcan, a tool for architectural smell detection (or any other tool).
Finally, we present the publicly available dataset resulting from the first three months of execution of the pipeline, which includes over 30,000 analysed commits and releases from over 10,000 open source GitHub projects written in 5 different programming languages and amounting to over a billion lines of code analysed.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The dataset includes 15 visual diagrams (pie and bar charts) comparing the distribution of agricultural residues, OFMSW, and used cooking oil across each state in Nigeria, province in South Africa, and county in Kenya. These summaries provide a comparative overview of regional feedstock strengths. The charts complement quantitative analyses by providing visual summaries of feedstock availability.
Legal disclaimer: https://www.kappasignal.com/p/legal-disclaimer.html
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.
Historical daily stock prices (open, high, low, close, volume)
Fundamental data (e.g., market capitalization, price-to-earnings (P/E) ratio, dividend yield, earnings per share (EPS), price/earnings-to-growth (PEG) ratio, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price-to-sales ratio, credit rating)
Technical indicators (e.g., moving averages, RSI, MACD, average directional index, Aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution (A/D) line, parabolic SAR, Bollinger Bands, Fibonacci retracement levels, Williams %R, commodity channel index)
Feature engineering based on financial data and technical indicators
Sentiment analysis data from social media and news articles
Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)
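Two of the technical indicators listed above, the simple moving average and RSI, can be sketched in plain Python. The closing-price series below is hypothetical, and this is only one common variant of each indicator, not the dataset's exact computation:

```python
# Hypothetical daily closing prices
closes = [10.0, 10.5, 10.2, 10.8, 11.0, 10.7, 11.2, 11.5]

def sma(prices, window):
    """Simple moving average over a trailing window."""
    return [sum(prices[i - window + 1 : i + 1]) / window
            for i in range(window - 1, len(prices))]

def rsi(prices, period=14):
    """Relative Strength Index (simple-average variant, 0-100)."""
    gains, losses = [], []
    for prev, cur in zip(prices, prices[1:]):
        change = cur - prev
        gains.append(max(change, 0.0))
        losses.append(max(-change, 0.0))
    avg_gain = sum(gains[:period]) / period
    avg_loss = sum(losses[:period]) / period
    if avg_loss == 0:
        return 100.0  # no down moves in the window
    return 100 - 100 / (1 + avg_gain / avg_loss)

print(sma(closes, 3))
```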
Stock price prediction
Portfolio optimization
Algorithmic trading
Market sentiment analysis
Risk management
Researchers investigating the effectiveness of machine learning in stock market prediction
Analysts developing quantitative buy/sell trading strategies
Individuals interested in building their own stock market prediction models
Students learning about machine learning and financial applications
The dataset may include different levels of granularity (e.g., daily, hourly)
Data cleaning and preprocessing are essential before model training
Regular updates are recommended to maintain the accuracy and relevance of the data
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Underlying quantitative data in support of the chart in Fig 4B in [1].