80 datasets found

R
Graphs And Charts Dataset
universe.roboflow.com
zip
Updated Nov 24, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Learn (2024). Graphs And Charts Dataset [Dataset]. https://universe.roboflow.com/learn-aouhg/graphs-and-charts
Explore at:
zipAvailable download formats
Dataset updated
Nov 24, 2024
Dataset authored and provided by
Learn
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
Bar Chart Line Graph Pie Bounding Boxes
Description
Graphs And Charts

## Overview Graphs And Charts is a dataset for object detection tasks - it contains Bar Chart Line Graph Pie annotations for 384 images. ## Getting Started You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model. ## License This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
d
State and regional sensitivity spreadsheets for bar and pie charts
datasets.ai
catalog.data.gov
53
Updated Sep 11, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Environmental Protection Agency (2024). State and regional sensitivity spreadsheets for bar and pie charts [Dataset]. https://datasets.ai/datasets/state-and-regional-sensitivity-spreadsheets-for-bar-and-pie-charts
Explore at:
53Available download formats
Dataset updated
Sep 11, 2024
Dataset authored and provided by
U.S. Environmental Protection Agency
Description
These files represent the state and regional summaries of sensitivities to formaldehyde, acetaldehyde and ozone to various sources and compounds.

This dataset is associated with the following publication: Luecken, D., S. Napelenok, M. Strum, R. Scheffe, and S. Phillips. Sensitivity of Ambient Atmospheric Formaldehyde and Ozone to Precursor Species and Source Types Across the United States. ENVIRONMENTAL SCIENCE & TECHNOLOGY. American Chemical Society, Washington, DC, USA, 52(8): 4668–4675, (2018).
i
Data Visualization SVG Illustrations Dataset
illustrations.design
illuhub.com
svg
Updated Sep 7, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Illuhub (2025). Data Visualization SVG Illustrations Dataset [Dataset]. https://illustrations.design/illustrations/technology-electronics/data-visual
Explore at:
svgAvailable download formats
Dataset updated
Sep 7, 2025
Dataset authored and provided by
Illuhub
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2024 - Present
Area covered
Worldwide
Variables measured
Category, File Format, Subcategory
Description
Specialized collection of 0 free data visualization SVG illustrations from the technology & electronics category. Data visualization illustrations including bar charts, network graphs, and information graphics Examples include: bar chart, network graph.
f
Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm
plos.figshare.com
docx
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic (2023). Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm [Dataset]. http://doi.org/10.1371/journal.pbio.1002128
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pbio.1002128
Dataset updated
May 31, 2023
Dataset provided by
PLOS Biology
Authors
Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Figures in scientific publications are critically important because they often show the data supporting key findings. Our systematic review of research articles published in top physiology journals (n = 703) suggests that, as scientists, we urgently need to change our practices for presenting continuous data in small sample size studies. Papers rarely included scatterplots, box plots, and histograms that allow readers to critically evaluate continuous data. Most papers presented continuous data in bar and line graphs. This is problematic, as many different data distributions can lead to the same bar or line graph. The full data may suggest different conclusions from the summary statistics. We recommend training investigators in data presentation, encouraging a more complete presentation of data, and changing journal editorial policies. Investigators can quickly make univariate scatterplots for small sample size studies using our Excel templates.
Tableau Dummy Dataset for Practice
kaggle.com
Updated Aug 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Piush Dave (2025). Tableau Dummy Dataset for Practice [Dataset]. https://www.kaggle.com/datasets/piyushdave/tableau-dummy-dataset-for-practice
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 21, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Piush Dave
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Domain-Specific Dataset and Visualization Guide

This package contains 20 realistic datasets in CSV format across different industries, along with 20 text files suggesting visualization ideas. Each dataset includes about 300 rows of synthetic but domain-appropriate data. They are designed for data analysis, visualization practice, machine learning projects, and dashboard building.

What’s inside

20 CSV files, one for each domain:

Education

E-Commerce

Healthcare

Finance

Retail

Social Media

Manufacturing

Sports

Transport

Hospitality

Telecom

Banking

Real Estate

Gaming

Agriculture

Automobile

Energy

Insurance

Government

Entertainment

20 TXT files, each listing 10 relevant graphing options for the dataset.

MASTER_INDEX.csv, which summarizes all domains with their column names.

Use cases

Practice data cleaning, exploration, and visualization in Excel, Tableau, Power BI, or Python.

Build dashboards for specific industries.

Train beginner-level machine learning models such as classification and regression.

Use in classroom teaching or workshops as ready-made datasets.

Example

Education dataset has columns like StudentName, Class, Subject, Marks, AttendancePercent. Suggested graphs: bar chart of average marks by subject, scatter plot of marks vs attendance percent, line chart of attendance over time.

E-Commerce dataset has columns like OrderDate, Product, Category, Price, Quantity, Total. Suggested graphs: line chart of revenue trend, bar chart of revenue by category, pie chart of payment mode share.
Chart Viewer
city-of-lawrenceville-arcgis-hub-lville.hub.arcgis.com
Updated Sep 22, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
esri_en (2021). Chart Viewer [Dataset]. https://city-of-lawrenceville-arcgis-hub-lville.hub.arcgis.com/items/be4582b38d764de0a970b986c824acde
Explore at:
Dataset updated
Sep 22, 2021
Dataset provided by
Esrihttp://esri.com/
Authors
esri_en
Description
Use the Chart Viewer template to display bar charts, line charts, pie charts, histograms, and scatterplots to complement a map. Include multiple charts to view with a map or side by side with other charts for comparison. Up to three charts can be viewed side by side or stacked, but you can access and view all the charts that are authored in the map. Examples: Present a bar chart representing average property value by county for a given area. Compare charts based on multiple population statistics in your dataset. Display an interactive scatterplot based on two values in your dataset along with an essential set of map exploration tools. Data requirements The Chart Viewer template requires a map with at least one chart configured. Key app capabilities Multiple layout options - Choose Stack to display charts stacked with the map, or choose Side by side to display charts side by side with the map. Manage chart - Reorder, rename, or turn charts on and off in the app. Multiselect chart - Compare two charts in the panel at the same time. Bookmarks - Allow users to zoom and pan to a collection of preset extents that are saved in the map. Home, Zoom controls, Legend, Layer List, Search Supportability This web app is designed responsively to be used in browsers on desktops, mobile phones, and tablets. We are committed to ongoing efforts towards making our apps as accessible as possible. Please feel free to leave a comment on how we can improve the accessibility of our apps for those who use assistive technologies.
D
Data from: Debunking strategies for misleading bar charts
phys-techsciences.datastations.nl
csv, html +2
Updated Aug 30, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
W Wijnker; W Wijnker (2022). Debunking strategies for misleading bar charts [Dataset]. http://doi.org/10.17026/DANS-ZT5-QG5E
Explore at:
html(5363707), text/x-r-notebook(104408), csv(323294), csv(430892), zip(19082)Available download formats
Unique identifier
https://doi.org/10.17026/DANS-ZT5-QG5E
Dataset updated
Aug 30, 2022
Dataset provided by
DANS Data Station Physical and Technical Sciences
Authors
W Wijnker; W Wijnker
License
https://doi.org/10.17026/fp39-0x58https://doi.org/10.17026/fp39-0x58
Description
This deposit includes the data that was collected in an experimental study on debunking strategies for misleading bar charts, involving 2 surveys (one week delay) with a total of 24 unique bar charts each with two bars, filled in by 441 representative (age, ethnicity, gender) participants from the USA. De experiment compares four methods for correcting misleading bar charts with truncated vertical axes by measuring the participants evaluated difference between the bars at five time points. Measures were taken on a visual analogue scale. The first survey also included a short graph literacy scale and a question on highest completed educational level. Date Submitted: 2022-06-24
A fine-grained dataset of visualisation and interaction practices in...
zenodo.org
csv
Updated Sep 19, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tommaso Battisti; Tommaso Battisti (2025). A fine-grained dataset of visualisation and interaction practices in web-based Digital Humanities projects [Dataset]. http://doi.org/10.5281/zenodo.17158330
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.17158330
Dataset updated
Sep 19, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Tommaso Battisti; Tommaso Battisti
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Description

This dataset classifies 587 visualisation–interaction units extracted from 186 web-based Digital Humanities projects, previously classified in a related dataset https://doi.org/10.5281/zenodo.14192758" target="_blank" rel="noopener">[1], allowing cross-references between them. Each row represents a distinct combination of visualisation technique(s) (e.g., map, bar chart, network) and associated interactive features within a project. The dataset provides a finer-grained view of design choices, documenting how visualisation and interaction methods are implemented, including their connection to narrative or non-narrative contexts, temporal encodings, and multi-view or reconfiguration strategies.

The building blocks of our dataset: defining visualisation–interaction units

A visualisation–interaction unit is a distinct configuration combining a visualisation technique (or multiple techniques when linked through coordinated views) with a specific set of interactive features. Following [2], we consider these elements as working interdependently to achieve a shared data-related goal.
These units form the basic level of analysis in our dataset, with each row representing one unit. Units are distinguished not only by their visualisation and interaction techniques, but also by their temporal characteristics and narrative context. Temporal encodings—such as time axes, animated transitions, or other time-based variables—define a new unit even if the visualisation and interaction remain unchanged. Similarly, an identical configuration appearing in both a narrative and a non-narrative context counts as two separate units, reflecting their differing intent and function.
Accordingly, the number of units in a project does not directly correspond to the number of visualisations it contains. Two otherwise identical charts are treated as distinct units if they differ in interactive features, temporal encoding, or narrative context, while exact duplicates without variation are counted as a single unit. For example, if a project contains five bar charts that all support drill-down on click, they are counted as a single unit. Conversely, if the same five bar charts each offer different interaction techniques, they are treated as separate units based on their unique visualisation–interaction combinations.
Every unit includes at least one visualisation technique, although interaction may or may not be present (for instance, in a static chart).

Classification schema: categories and columns

Identifiers. Three columns in the dataset are dedicated to uniquely identifying units and their relationships within projects:

project_id: the identifier of the project to which the unit belongs. This reuses the same incremental IDs from https://doi.org/10.5281/zenodo.14192758" target="_blank" rel="noopener">[1] to enable cross-referencing between datasets.

vis_unit_id: the identifier of the individual visualisation–interaction unit. IDs increment within each project and reset to 1 for a new project.

visualisation_version: an identifier used to track interactive transformations of visualisations. Multiple rows can share the same project_id and vis_unit_id if they represent different states of the same view, triggered by user interactions that modify the visual form.

Narrativity. We record whether a visualisation–interaction unit is employed within a narrative context. Some projects contain units exclusively in narrative or non-narrative settings, while others include units in both. The relevant columns are:

non_narrative: a boolean value indicating whether the unit appears in non-narrative contexts.

narrative: a boolean value indicating whether the unit is used in narrative contexts (including both strongly guided, author-driven data stories and more interactive, user-driven narratives).

Visualisation techniques. We adopt, and where necessary adapt, the terminology and definitions from [3]. Each column corresponds to a specific type of visualisation and indicates (by means of a boolean value) whether that visualisation technique is present in a given visualisation–interaction unit. The following columns and inclusion criteria are used to encode this information:

plot: visual representations that map data points onto a two-dimensional coordinate system.

cluster_or_set: sets or cluster-based visualisations used to unveil possible inter-object similarities.

map: geographical maps used to show spatial insights. While we do not specify the variants of maps (e.g., pin maps, dot density maps, flow maps, etc.), we make an exception for maps where each data point is represented by another visualisation (e.g., a map where each data point is a pie chart) by accounting for the presence of both in their respective columns.

network: visual representations highlighting relational aspects through nodes connected by links or edges.

hierarchical_diagram: tree-like structures such as tree diagrams, radial trees, but also dendrograms. They differ from networks for their strictly hierarchical structure and absence of closed connection loops.

treemap: still hierarchical, but highlighting quantities expressed by means of area size. It also includes circle packing variants.

word_cloud: clouds of words, where each instance’s size is proportional to its frequency in a related context

bars: includes bar charts, histograms, and variants. It coincides with “bar charts” in [7] but with a more generic term to refer to all bar-based visualisations.

line_chart: the display of information as sequential data points connected by straight-line segments.

area_chart: similar to a line chart but with a filled area below the segments. It also includes density plots.

pie_chart: circular graphs divided into slices, which can also use multi-level solutions.

plot_3d: plots that use a third dimension to encode an additional variable.

proportional_area: representations used to compare values through area size. Typically, using circle- or square-like shapes.

timeline: the display of a list of data points or spans in chronological order. They include timelines working either with a scale or simply displaying events in sequence. As in [3], we also include structured solutions resembling Gantt chart layouts.

other: it includes all other types of non-temporal visualisations that do not fall into the aforementioned categories.

Temporal encodings. We identify techniques used to encode temporality (except for timelines, where temporal encoding is tacitly assumed). Columns:

temporal_dimension: to report when time is mapped to any dimension of a visualisation. We use the term “dimension” and not “axis” as in [3] as more appropriate for radial layouts or more complex representational choices.

animation: temporality is perceived through an animation changing the visualisation according to time flow.

visual_variable: another visual encoding strategy is used to represent any temporality-related variable (e.g. colour).

Multi-type coordinated views. Tracking coordinated views across the dataset is limited to cases where multiple visualisation types can be clearly identified within a single view. For these instances, a dedicated column indicates which visualisation—if any—plays a central or dominant role:

primary_visualisation: contains the name of the visualisation technique (as defined in the corresponding column) that holds a dominant role in the coordinated view. If no single type can be considered guiding because multiple types have similar perceived importance, the column contains "NA".

Interaction techniques. A set of categories to assess affordable interaction techniques based on the concept of user intent [2] and user-allowed data actions [4]. The following categories roughly match the “processing”, “mapping”, and “presentation” actions from [4] and the manipulative subset of methods of the “how” an interaction is performed in the conception of [5]. Only interactions that affect the visual representation or the aspect of data points, symbols, and glyphs are taken into consideration. A two-level analysis is enabled by the columns, referring to interaction categories also explored at an aggregated project level in [1], and their values, exposing more specific interaction techniques (multiple values are divided by a semicolumn). They include:

basic_selection: the demarcation of an element either for the duration of the interaction (highlight) or more permanently until the occurrence of another selection (mark).

advanced_selection: the demarcation of an element triggers the demarcation of related instances within the same visualisation
f
Summary of Annotation pipeline and Prediction pipeline.
plos.figshare.com
xlsx
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Han Zhuang; Tzu-Yang Huang; Daniel E. Acuna (2023). Summary of Annotation pipeline and Prediction pipeline. [Dataset]. http://doi.org/10.1371/journal.pcbi.1009650.s001
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pcbi.1009650.s001
Dataset updated
Jun 1, 2023
Dataset provided by
PLOS Computational Biology
Authors
Han Zhuang; Tzu-Yang Huang; Daniel E. Acuna
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
With annotation pipeline, we applied compound figure classification, subfigure separation, and bar chart classification to obtain bar charts from this sample and then ask annotators to annotate graphical integrity issues on these bar chart. With prediction pipeline, we applied our whole graphical integrity issues detector on this sample. Both sets are similar, as demonstrated by analysis in Fig 2. (XLSX)
h
llm-distribution-sample
huggingface.co
Updated Oct 7, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rob van Volt (2025). llm-distribution-sample [Dataset]. https://huggingface.co/datasets/robvanvolt/llm-distribution-sample
Explore at:
Dataset updated
Oct 7, 2025
Authors
Rob van Volt
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
LLM Distribution Evaluation Dataset

This dataset contains 1000 synthetic graphs with questions and answers about statistical distributions, designed to evaluate large language models' ability to analyze data visualizations.

Dataset Description Dataset Summary

This dataset contains diverse statistical visualizations (bar charts, line plots, scatter plots, histograms, area charts, and step plots) with associated questions about:

Normality testing Distribution… See the full description on the dataset page: https://huggingface.co/datasets/robvanvolt/llm-distribution-sample.
G
Textiles, Clothing and Rubber Products
open.canada.ca
datasets.ai
+1more
jpg, pdf
Updated Mar 14, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Natural Resources Canada (2022). Textiles, Clothing and Rubber Products [Dataset]. https://open.canada.ca/data/en/dataset/4bcefc01-7c19-5e7b-98ae-85c307ad88cb
Explore at:
pdf, jpgAvailable download formats
Dataset updated
Mar 14, 2022
Dataset provided by
Natural Resources Canada
License
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Description
Contained within the 3rd Edition (1957) of the Atlas of Canada is a plate that shows six condensed maps of the distribution of plants producing the following: leather footwear, womens and childrens factory made clothing, synthetic textiles and silks, mens factory made clothing, cotton textiles, and rubber products. All data for these maps is for 1954 with the exception of the rubber products map which is for 1955. Each map is accompanied by a bar graph and pie chart. The bar graphs show the value of production by major categories of products. The pie charts show the percentage distribution of persons employed in each manufacturing industry by province.
E
EconBiz Images for Text Extraction from Scholarly Figures
live.european-language-grid.eu
data.niaid.nih.gov
+2more
json
Updated Apr 14, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2016). EconBiz Images for Text Extraction from Scholarly Figures [Dataset]. https://live.european-language-grid.eu/catalogue/corpus/7506
Explore at:
jsonAvailable download formats
Dataset updated
Apr 14, 2016
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
"Scholarly figures are data visualizations like bar charts, pie charts, line graphs, maps, scatter plots or similar figures. Text extraction from scholarly figures is useful in many application scenarios, since text in scholarly figures often contains information that is not present in the surrounding text. This dataset is a corpus of 121 scholarly figures from the economics domain evaluating text extraction tools. We randomly extracted these figures from a corpus of 288,000 open access publications from EconBiz. The dataset resembles a wide variety of scholarly figures from bar charts to maps. We manually labeled the figures to create the gold standard.
We adjusted the provided gold standard to have a uniform format for all datasets. Each figure is accompanied by a TSV file (tab-separated values) where each entry corresponds to a text line which has the following structure:
X-coordinate of the center of the bounding box in pixel
Y-coordinate of the center of the bounding box in pixel
Width of the bounding box in pixel
Height of the bounding box in pixel
Rotation angle around its center in degree
Text inside the bounding box
In addition we provide the ground truth in JSON format. A schema file is included in each dataset as well. The dataset is accompanied with a ReadMe file with further information about the figures and their origin.
If you use this dataset in your own work, please cite one of the papers in the references."
u
Code book of RTL visualization in Arabic News media
rdr.ucl.ac.uk
xlsx
Updated Jul 3, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Muna Alebri; No ̈elle Rakotondravony; Lane Harrison (2024). Code book of RTL visualization in Arabic News media [Dataset]. http://doi.org/10.5522/04/26150749.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.5522/04/26150749.v1
Dataset updated
Jul 3, 2024
Dataset provided by
University College London
Authors
Muna Alebri; No ̈elle Rakotondravony; Lane Harrison
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
In this project, we aimed to map the visualisation design space of visualisation embedded in right-to-left (RTL) scripts. We aimed to expand our knowledge of visualisation design beyond the dominance of research based on left-to-right (LTR) scripts. Through this project, we identify common design practices regarding the chart structure, the text, and the source. We also identify ambiguity, particularly regarding the axis position and direction, suggesting that the community may benefit from unified standards similar to those found on web design for RTL scripts. To achieve this goal, we curated a dataset that covered 128 visualisations found in Arabic news media and coded these visualisations based on the chart composition (e.g., chart type, x-axis direction, y-axis position, legend position, interaction, embellishment type), text (e.g., availability of text, availability of caption, annotation type), and source (source position, attribution to designer, ownership of the visualisation design). Links are also provided to the articles and the visualisations. This dataset is limited for stand-alone visualisations, whether they were single-panelled or included small multiples. We also did not consider infographics in this project, nor any visualisation that did not have an identifiable chart type (e.g., bar chart, line chart). The attached documents also include some graphs from our analysis of the dataset provided, where we illustrate common design patterns and their popularity within our sample.
Regression table.
plos.figshare.com
xlsx
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Han Zhuang; Tzu-Yang Huang; Daniel E. Acuna (2023). Regression table. [Dataset]. http://doi.org/10.1371/journal.pcbi.1009650.s002
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pcbi.1009650.s002
Dataset updated
Jun 1, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Han Zhuang; Tzu-Yang Huang; Daniel E. Acuna
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This table shows our analysis of the relationship between proportional ink violations and a group of variables (journal rank, research field, researcher seniority, affiliation country, and year of publication). (XLSX)
A
‘Hr Analytics Job Prediction’ analyzed by Analyst-2
analyst-2.ai
Updated Nov 13, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Hr Analytics Job Prediction’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-hr-analytics-job-prediction-d026/785df16e/?iid=019-996&v=presentation
Explore at:
Dataset updated
Nov 13, 2021
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Hr Analytics Job Prediction’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/mfaisalqureshi/hr-analytics-and-job-prediction on 30 September 2021.

--- Dataset description provided by original source is as follows ---

Context

Hr Data Analytics This dataset contains information about employees who worked in a company.

Content

This dataset contains columns: Satisfactory Level, Number of Project, Average Monthly Hours, Time Spend Company, Promotion Last 5
Years, Department, Salary

Acknowledgements

You can download, copy and share this dataset for analysis and Predictions employees Behaviour.

Inspiration

Answer the following questions would be worthy 1- Do Exploratory Data analysis to figure out which variables have a direct and clear impact on employee retention (i.e. whether they leave the company or continue to work) 2- Plot bar charts showing the impact of employee salaries on retention 3- Plot bar charts showing a correlation between department and employee retention 4- Now build a logistic regression model using variables that were narrowed down in step 1 5- Measure the accuracy of the model

--- Original source retains full ownership of the source dataset ---
u
Combined HMM post-processing analyses of river plots and bar plots of the...
zivahub.uct.ac.za
txt
Updated Feb 8, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Glenn Moncrieff; Keletso Moilwe; Jasper Slingsby; Vernon Visser (2024). Combined HMM post-processing analyses of river plots and bar plots of the gross (grey) and net (colour) change in each land cover class (i.e. NVF, Aliens, Indigenous Forest, Grassland, and Mixed Woody Grassland) under their respective management classes (i.e. Barloworld/Commercial, Communal, Forestry/Conservation, and Plantations) between 1990 and 2020 [Dataset]. http://doi.org/10.25375/uct.24885267.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.25375/uct.24885267.v1
Dataset updated
Feb 8, 2024
Dataset provided by
University of Cape Town
Authors
Glenn Moncrieff; Keletso Moilwe; Jasper Slingsby; Vernon Visser
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset was used to produce land cover change analyses for Figures 3.8 (A-D) and Figures 3.9 (A -D) as part of a master's thesis titled- Repeatable methods for classification of alien and native vegetation in the Montane grasslands (2024).This dataset encompasses forty-five data entries. This includes 11 Hidden Markov Model (HMM) post-processed GeoTIFFS for 1990 until 2020 under each management class (i.e. Barloworld/Commercial, Communal, Forestry/Conservation, and Plantations). An r. script is also included to combine all GeoTIFFS then format and run code to produce river plots and bar graphs for the four management classes. The dataset aims to illustrate the changes among each land cover class (i.e. NVF, Aliens, Indigenous Forest, Grassland, and Mixed Woody Grassland) under the different management classes. This helps identify drivers of land cover change in the various management classes and the entire study area. It is also important to note that the land cover changes will portray realistic changes because they are HMM post-processed.Date of data collection: February 2020Location of data collection: Blyde River Canyon Conservancy and its surrounds, in Mpumalanga/Limpopo Provinces, South Africa.
Comparative Visualisation of Biomass Feedstock Quantities Across Sub-Saharan...
zenodo.org
Updated May 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Raymond Atughwe; Raymond Atughwe; Siddharth Gadkari; Michael Short; Siddharth Gadkari; Michael Short (2025). Comparative Visualisation of Biomass Feedstock Quantities Across Sub-Saharan Africa Subnational Units [Dataset]. http://doi.org/10.5281/zenodo.15288696
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.15288696
Dataset updated
May 11, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Raymond Atughwe; Raymond Atughwe; Siddharth Gadkari; Michael Short; Siddharth Gadkari; Michael Short
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Apr 27, 2025
Area covered
Africa, Sub-Saharan Africa
Description
The dataset includes 15 visual diagrams (pie and bar charts) comparing the distribution of agricultural residues, OFMSW, and used cooking oil across each state in Nigeria, province in South Africa, and county in Kenya. These summaries provide a comparative overview of regional feedstock strengths. The charts complement quantitative analyses by providing visual summaries of feedstock availability.
h
JGraphQA
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Akira Kinoshita, JGraphQA [Dataset]. https://huggingface.co/datasets/r-g2-2024/JGraphQA
Explore at:
Authors
Akira Kinoshita
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
JGraphQA

Introduction

We introduce JGraphQA, a multimodal benchmark designed to evaluate the chart understanding capabilities of Large Multimodal Models (LMMs) in Japanese. To create JGraphQA, we first conducted a detailed analysis of the existing ChartQA benchmark. Then, focusing on Japanese investor relations (IR) materials, we collected a total of 100 images consisting of four types: pie charts, line charts, bar charts, and tables. For each image, we created two… See the full description on the dataset page: https://huggingface.co/datasets/r-g2-2024/JGraphQA.
Adult Data Set ( Census Income dataset)
kaggle.com
Updated Mar 7, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
KritiDoneria (2021). Adult Data Set ( Census Income dataset) [Dataset]. https://www.kaggle.com/kritidoneria/adultdatasetxai/tasks
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 7, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
KritiDoneria
Description
The dataset used is US Census data which is an extraction of the 1994 census data which was donated to the UC Irvine’s Machine Learning Repository. The data contains approximately 32,000 observations with over 15 variables. The dataset was downloaded from: http://archive.ics.uci.edu/ml/datasets/Adult. The dependent variable in our analysis will be income level and who earns above $50,000 a year using SQL queries, Proportion Analysis using bar charts and Simple Decision Tree to understand the important variables and their influence on prediction.
m
Student Flows Across Regions and Settlements in Kazakhstan, 2020–2024: A...
data.mendeley.com
Updated Aug 26, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anel Tarakbay (2025). Student Flows Across Regions and Settlements in Kazakhstan, 2020–2024: A Comprehensive Dataset [Dataset]. http://doi.org/10.17632/nvthgwbpgr.2
Explore at:
Unique identifier
https://doi.org/10.17632/nvthgwbpgr.2
Dataset updated
Aug 26, 2025
Authors
Anel Tarakbay
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Kazakhstan
Description
The dataset contains comprehensive data on internal student mobility between settlements and regions in Kazakhstan from the year 2020 through to year 2024. It contains yearly totals of transferring students, broken down by origin and destination sites, both at the regional and settlement levels. The data was derived from the Kazakhstan National Education Database and represents official administrative counts for transferring schools. It is provided to help facilitate research on educational mobility, educational inequality, urbanization, and school planning and forecasting.

-This dataset consists of six individual tables derived from aggregated student migration data from Kazakhstan for the period from 2020 to 2024. The tables are in .csv form and can be found in the 'Dataset/data' folder. -There is a detailed variable description in the provided 'Dataset/data/Codebook.xlsx' that explains every field utilized in the dataset. -Each table is also accompanied by a related visualisation (such as bar charts, maps, line graphs) that presents main patterns and insights. The visualisations appear in the 'Dataset/data/visualizations' folder and they correspond to every table by figure number for easy reference. -The Dataset/code folder contains the Python code used to process and analyze the raw data from the National Education Database (NEDB), along with a README.txt file that provides a step-by-step explanation of the methodology.

Facebook

Twitter

Click to copy link

Link copied

Cite

Learn (2024). Graphs And Charts Dataset [Dataset]. https://universe.roboflow.com/learn-aouhg/graphs-and-charts

Graphs And Charts Dataset

graphs-and-charts

graphs-and-charts-dataset

Explore at:

zipAvailable download formats

Dataset updated

Nov 24, 2024

Dataset authored and provided by

Learn

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Variables measured

Bar Chart Line Graph Pie Bounding Boxes

Description

Graphs And Charts

## Overview

Graphs And Charts is a dataset for object detection tasks - it contains Bar Chart Line Graph Pie annotations for 384 images.

## Getting Started

You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.

  ## License

  This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).

Clear search

Close search

Google apps

Main menu

Graphs And Charts Dataset

Graphs And Charts

State and regional sensitivity spreadsheets for bar and pie charts

Data Visualization SVG Illustrations Dataset

Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm

Tableau Dummy Dataset for Practice

Chart Viewer

Data from: Debunking strategies for misleading bar charts

A fine-grained dataset of visualisation and interaction practices in...

Description

The building blocks of our dataset: defining visualisation–interaction units

Classification schema: categories and columns

Summary of Annotation pipeline and Prediction pipeline.

llm-distribution-sample

Textiles, Clothing and Rubber Products

EconBiz Images for Text Extraction from Scholarly Figures

Code book of RTL visualization in Arabic News media

Regression table.

‘Hr Analytics Job Prediction’ analyzed by Analyst-2

Context

Content

Acknowledgements

Inspiration

Combined HMM post-processing analyses of river plots and bar plots of the...

Comparative Visualisation of Biomass Feedstock Quantities Across Sub-Saharan...

JGraphQA

Adult Data Set ( Census Income dataset)

Student Flows Across Regions and Settlements in Kazakhstan, 2020–2024: A...

Graphs And Charts Dataset

graphs-and-charts

graphs-and-charts-dataset

Graphs And Charts