Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A collection of files used for a data visualization project for the Digital Humanities Praxis course at the Graduate Center, CUNY. The files represent raw data (csv), data used for the visualization(s) (gephi), and the visualizations themselves (pdf). A write-up on the project can be located at the GC Academic Commons site: http://dhpraxis14.commons.gc.cuny.edu/2014/11/12/its-big-data-to-me-data-viz-part-2
This dataset was created by Serhii Kotiv.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Data Visualization 2 (trail) is a dataset for object detection tasks - it contains Food 5Sze annotations for 7,580 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
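If you use the Roboflow Python package to pull the dataset into a project, the download step looks roughly like the sketch below; the API key, workspace, project slug, version number, and export format are all placeholders rather than the actual identifiers of this dataset.

```python
# Hypothetical sketch of a Roboflow dataset download; identifiers are placeholders.
from roboflow import Roboflow

rf = Roboflow(api_key="YOUR_API_KEY")                      # personal API key from your Roboflow account
project = rf.workspace("your-workspace").project("data-visualization-2-trail")
dataset = project.version(1).download("coco")              # export format is an assumption (e.g., COCO)

print("Downloaded to:", dataset.location)                  # local folder with images and annotations
```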
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/by/4.0/).
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by Mohamed Aldamrdash
Released under Apache 2.0
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
In this project, we aimed to map the visualisation design space of visualisations embedded in right-to-left (RTL) scripts. We aimed to expand our knowledge of visualisation design beyond the dominance of research based on left-to-right (LTR) scripts. Through this project, we identify common design practices regarding the chart structure, the text, and the source. We also identify ambiguity, particularly regarding the axis position and direction, suggesting that the community may benefit from unified standards similar to those found in web design for RTL scripts. To achieve this goal, we curated a dataset that covered 128 visualisations found in Arabic news media and coded these visualisations based on the chart composition (e.g., chart type, x-axis direction, y-axis position, legend position, interaction, embellishment type), text (e.g., availability of text, availability of caption, annotation type), and source (source position, attribution to designer, ownership of the visualisation design). Links are also provided to the articles and the visualisations. This dataset is limited to stand-alone visualisations, whether they were single-panelled or included small multiples. We also did not consider infographics in this project, nor any visualisation that did not have an identifiable chart type (e.g., bar chart, line chart). The attached documents also include some graphs from our analysis of the dataset provided, where we illustrate common design patterns and their popularity within our sample.
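As a minimal sketch of how the coded dataset could be explored once loaded, the snippet below uses pandas; the CSV file name and column identifiers are hypothetical, since the description above names the coded aspects rather than the exact headers.

```python
import pandas as pd

# Hypothetical file and column names; the published dataset's headers may differ.
df = pd.read_csv("rtl_visualisations.csv")

# Tabulate chart-composition aspects, e.g. x-axis direction and y-axis position.
print(df["x_axis_direction"].value_counts(dropna=False))
print(pd.crosstab(df["chart_type"], df["y_axis_position"]))
```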
This dataset was created by Bharat Kumar.
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
In 2012, GreyNet published a page on its website and made accessible the first edition of IDGL, the International Directory of Organizations in Grey Literature. The latest update of this PDF publication was in August 2016, providing a list of some 280 organizations in 40 countries worldwide that have contact with the Grey Literature Network Service. The listing appears by country, followed by the names of the organizations in alphabetical order, which are then linked to a URL.

This year GreyNet International marks its Twenty-Fifth Anniversary and seeks to more fully showcase organizations whose involvement in grey literature is in one or more ways linked to GreyNet.org. Examples include members, partners, conference hosts, sponsors, authors, service providers, committee members, associate editors, etc.

This revised and updated edition of IDGL will benefit from the use of visualization software mapping the cities in which GreyNet's contacts are located. Behind each point of contact are a number of fields that can be grouped and cross-tabulated for further data analysis. Such fields include the source, name of organization, acronym, affiliate's job title, sector of information, subject/discipline, city, state, country, ISO code, continent, and URL. Eight of the twelve fields require input, while the other four fields do not.

The population of the study was derived by extracting records from GreyNet's in-house administrative file. Only recipients on GreyNet's Distribution List as of February 2017 were included. The records were then further filtered, and only those that allowed for completion of the required fields remained. This set of records was then converted to Excel format, duplications were removed, and further normalization of field entries took place. In all, 510 records form the corpus of this study. In the coming months, an in-depth analysis of the data will be carried out, the results of which will be recorded and made visually accessible.

The expected outcome of the project will not only produce a revised, expanded, and updated publication of IDGL, but will also provide a visual overview of GreyNet as an international organization serving diverse communities with shared interests in grey literature. It will be a demonstration of GreyNet's commitment to research, publication, open access, education, and public awareness in this field of library and information science. Finally, this study will serve to pinpoint geographic and subject-based areas currently within as well as outside of GreyNet's catchment.
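The record preparation described above (keeping only records with the required fields, removing duplicates, normalizing entries) could be sketched as follows; the file name, column names, and the choice of which eight fields are required are assumptions for illustration only.

```python
import pandas as pd

# Hypothetical export of GreyNet's administrative file; headers may differ from
# the twelve fields listed above, and the required-field selection is illustrative.
records = pd.read_excel("greynet_contacts.xlsx")

required = ["source", "organization", "sector", "subject", "city",
            "country", "iso_code", "continent"]
prepared = (
    records.dropna(subset=required)   # keep only records with the required fields filled in
           .drop_duplicates()         # remove duplicate entries
           .assign(country=lambda d: d["country"].str.strip().str.title())  # simple normalization
)
print(len(prepared), "records in the corpus")
```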
Etalab Open License 2.0: https://spdx.org/licenses/etalab-2.0.html
Here is the link to the web application developed for the data analysis embedded in the BENEFIT-MED project.
According to our latest research, the global construction data visualization software market size reached USD 2.1 billion in 2024, driven by the increasing adoption of digital solutions across the construction industry. The market is poised to exhibit a robust CAGR of 13.2% from 2025 to 2033, with the total market value expected to reach USD 6.1 billion by 2033. This significant growth is fueled by the rising demand for real-time data analytics, improved project transparency, and the growing complexity of construction projects worldwide. The construction data visualization software market is witnessing accelerated adoption as organizations strive to enhance project efficiency, reduce costs, and mitigate risks through actionable insights derived from advanced data visualization tools.
The construction data visualization software market is experiencing remarkable momentum as the industry increasingly recognizes the value of data-driven decision-making. One of the primary growth factors is the escalating complexity of construction projects, which necessitates robust tools for managing, analyzing, and visualizing vast datasets. As projects become larger and more intricate, stakeholders require comprehensive dashboards and visual analytics to track progress, monitor budgets, and anticipate potential risks. This has resulted in a surge in demand for construction data visualization software, which empowers users to convert raw data into actionable insights, streamline workflows, and facilitate better communication across teams. Furthermore, the integration of Building Information Modeling (BIM) and other digital construction technologies is amplifying the need for sophisticated visualization solutions, enabling seamless collaboration and data sharing among architects, engineers, contractors, and project owners.
Another significant driver propelling the construction data visualization software market is the growing emphasis on cost control and resource optimization. Construction firms are under increasing pressure to deliver projects on time and within budget, while maintaining high standards of quality and safety. Data visualization tools play a crucial role in this regard by providing real-time visibility into project metrics, enabling early identification of cost overruns, schedule delays, and resource bottlenecks. By leveraging advanced analytics and interactive dashboards, construction companies can make informed decisions, optimize resource allocation, and improve overall project performance. The adoption of cloud-based solutions further enhances accessibility and scalability, allowing organizations of all sizes to harness the power of data visualization without significant upfront investments in IT infrastructure.
The rapid digital transformation sweeping across the construction sector is also a key catalyst for market expansion. As the industry embraces technologies such as the Internet of Things (IoT), artificial intelligence, and machine learning, the volume and complexity of construction data are increasing exponentially. Construction data visualization software is evolving to accommodate these trends, offering advanced features such as predictive analytics, automated reporting, and customizable dashboards. These capabilities enable stakeholders to gain deeper insights into project dynamics, anticipate challenges, and drive continuous improvement. Additionally, the growing focus on sustainability and regulatory compliance is prompting construction firms to adopt data visualization tools that facilitate transparent reporting and support environmentally responsible decision-making.
From a regional perspective, North America currently dominates the construction data visualization software market, accounting for the largest revenue share in 2024. This leadership position is attributed to the presence of major industry players, high digital adoption rates, and a strong focus on innovation and process optimization. Europe and Asia Pacific are also witnessing rapid growth, fueled by increasing infrastructure investments, government initiatives promoting digitalization, and the rising adoption of cloud-based construction solutions. Emerging markets in Latin America and the Middle East & Africa are expected to experience steady growth, supported by urbanization trends and the modernization of construction practices. Overall, the global construction data visualization software market is set to witness robust expansion across all major regions.
This project was done to analyze sales data: to identify trends, top-selling products, and revenue metrics for business decision-making. I did this project, offered by MeriSKILL, to learn more and be exposed to real-world projects and challenges that will provide me with valuable industry experience and help me develop my data analytical skills.
More on this project is on Medium
Research dissemination and knowledge translation are imperative in social work. Methodological developments in data visualization techniques have improved the ability to convey meaning and reduce erroneous conclusions. The purpose of this project is to examine: (1) How are empirical results presented visually in social work research? (2) To what extent do top social work journals vary in the publication of data visualization techniques? (3) What is the predominant type of analysis presented in tables and graphs? (4) How can current data visualization methods be improved to increase understanding of social work research?
Method: A database was built from a systematic literature review of the four most recent issues of Social Work Research and six other highly ranked journals in social work, based on the 2009 5-year impact factor (Thomson Reuters ISI Web of Knowledge). Overall, 294 articles were reviewed. Articles without any form of data visualization were not included in the final database. The number of articles reviewed by journal is: Child Abuse & Neglect (38), Child Maltreatment (30), American Journal of Community Psychology (31), Family Relations (36), Social Work (29), Children and Youth Services Review (112), and Social Work Research (18). Articles with any type of data visualization (table, graph, other) were included in the database and coded sequentially by two reviewers based on the type of visualization method and the type of analyses presented (descriptive, bivariate, measurement, estimate, predicted value, other). Additional review by the entire research team was required for 68 articles. Codes were discussed until 100% agreement was reached. The final database includes 824 data visualization entries.
A variety of cumulative (over reporting periods) data points for Rescue Plan projects: individual recipient count, business recipient count, nonprofit recipient count, and expenditure.
Additional Information: Category: ARPA; Update Frequency: As Necessary
Metadata Link: https://www.portlandmaps.com/metadata/index.cfm?&action=DisplayLayer&LayerID=61045
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This package includes a Tableau file and good/bad figures for the visual sequencing disorder group. Dataset: Medical Cost.csv is used for the creation of visualizations.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains the raw experimental data and supplementary materials for the "Asymmetry Effects in Virtual Reality Rod and Frame Test". The materials included are:
• Raw Experimental Data: older.csv and young.csv
• Mathematica Notebooks: a collection of Mathematica notebooks used for data analysis and visualization. These notebooks provide scripts for processing the experimental data, performing statistical analyses, and generating the figures used in the project.
• Unity Package: a Unity package featuring a sample scene related to the project. The scene was built using Unity’s Universal Rendering Pipeline (URP). To utilize this package, ensure that URP is enabled in your Unity project. Instructions for enabling URP can be found in the Unity URP Documentation.
Requirements:
• For Data Files: software capable of opening CSV files (e.g., Microsoft Excel, Google Sheets, or any programming language that can read CSV formats).
• For Mathematica Notebooks: Wolfram Mathematica software to run and modify the notebooks.
• For Unity Package: Unity Editor version compatible with URP (2019.3 or later recommended). URP must be installed and enabled in your Unity project.
Usage Notes:
• The dataset facilitates comparative studies between different age groups based on the collected variables (see the loading sketch after these notes).
• Users can modify the Mathematica notebooks to perform additional analyses.
• The Unity scene serves as a reference to the project setup and can be expanded or integrated into larger projects.
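A minimal loading sketch for the first usage note, assuming only that older.csv and young.csv can be read with pandas (no column names are assumed):

```python
import pandas as pd

# Load the raw experimental data for both age groups and tag each row.
older = pd.read_csv("older.csv").assign(age_group="older")
young = pd.read_csv("young.csv").assign(age_group="young")
data = pd.concat([older, young], ignore_index=True)

# Quick overview of the collected variables, split by age group.
print(data.groupby("age_group").describe())
```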
Citation: Please cite this dataset when using it in your research or publications.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Results of an interview study with twelve experts on their project processes and their recommendations for visualizations in libraries. Recommendations were elicited with the SHIRA method (Structured Hierarchical Interviewing for Requirement Analysis) [1], which allows concrete qualities and implementation suggestions to be generated from abstract qualities.
The file contains two pages: the first presents a meta-model constructed from all steps identified during library visualization projects; the second gathers and analyzes all SHIRA suggestions.
Feel free to contact me if you have any questions!
[1] M. Hassenzahl, R. Wessler, and K.-C. Hamborg. Exploring and understanding product qualities that users desire. Conference on Human-Computer Interaction IHM-HCI’2001, 2, 2001.
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
Dataset Name: Online Store Dataset
Description: The Online Store dataset is a comprehensive collection of 500 rows of synthetic e-commerce product data. Designed to simulate an online retail environment similar to major e-commerce platforms like Amazon, this dataset includes a diverse range of attributes for each product. The dataset provides valuable insights into product characteristics, pricing, stock levels, and customer feedback, making it ideal for analysis, machine learning, and data visualization projects.
Features:
ID: Unique identifier for each product.
Product_Name: Name of the product, generated using random words to simulate real-world product names.
Category: Product category (e.g., Electronics, Clothing, Books, Home, Toys, Sports).
Price: Product price, ranging from $10 to $500.
Stock: Number of items available in stock.
Rating: Customer rating of the product (1 to 5 stars).
Reviews: Number of customer reviews.
Brand: Brand of the product.
Date_Added: Date when the product was added to the catalog.
Discount: Percentage discount applied to the product.
Use Cases:
Data Analysis: Explore trends and patterns in e-commerce product data.
Machine Learning: Build and train models for product recommendation, pricing strategies, or customer segmentation.
Data Visualization: Create visualizations to analyze product categories, pricing distribution, and customer reviews.
Notes:
The data is synthetic and randomly generated, reflecting typical attributes found in e-commerce platforms. This dataset can be used for educational purposes, practice, and experimentation with various data analysis and machine learning techniques.
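As a minimal sketch of the data-analysis use case, assuming the dataset ships as a single CSV file (the file name below is hypothetical; column names follow the feature list above):

```python
import pandas as pd

# Hypothetical file name; columns follow the documented features.
store = pd.read_csv("online_store.csv", parse_dates=["Date_Added"])

# Average price and rating per category, plus the share of discounted products.
summary = store.groupby("Category").agg(
    avg_price=("Price", "mean"),
    avg_rating=("Rating", "mean"),
    discounted_share=("Discount", lambda d: (d > 0).mean()),
)
print(summary.sort_values("avg_price", ascending=False))
```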
Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Given the wide diversity in applications of biological mass spectrometry, custom data analyses are often needed to fully interpret the results of an experiment. Such bioinformatics scripts necessarily include similar basic functionality to read mass spectral data from standard file formats, process it, and visualize it. Rather than having to reimplement this functionality, to facilitate this task, spectrum_utils is a Python package for mass spectrometry data processing and visualization. Its high-level functionality enables developers to quickly prototype ideas for computational mass spectrometry projects in only a few lines of code. Notably, the data processing functionality is highly optimized for computational efficiency to be able to deal with the large volumes of data that are generated during mass spectrometry experiments. The visualization functionality makes it possible to easily produce publication-quality figures as well as interactive spectrum plots for inclusion on web pages. spectrum_utils is available for Python 3.6+, includes extensive online documentation and examples, and can be easily installed using conda. It is freely available as open source under the Apache 2.0 license at https://github.com/bittremieux/spectrum_utils.
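As a rough illustration of the intended workflow (construct a spectrum, process it, plot it), here is a minimal sketch; the peak arrays are made-up placeholder values, and exact method names and signatures can vary between spectrum_utils versions, so the calls below approximate the package's documented quickstart rather than serving as an API reference.

```python
import matplotlib.pyplot as plt
import numpy as np
import spectrum_utils.plot as sup
import spectrum_utils.spectrum as sus

# Placeholder peaks; in practice the m/z and intensity arrays would be read
# from a standard file format using an appropriate reader.
mz = np.array([100.1, 150.2, 200.3, 250.4, 300.5])
intensity = np.array([10.0, 50.0, 5.0, 100.0, 25.0])

spec = sus.MsmsSpectrum("example_spectrum", precursor_mz=300.5,
                        precursor_charge=2, mz=mz, intensity=intensity)

# Basic processing: restrict the m/z range and drop very low-intensity peaks.
spec = spec.set_mz_range(min_mz=100, max_mz=1400).filter_intensity(min_intensity=0.05)

# Static plot of the processed spectrum.
fig, ax = plt.subplots(figsize=(8, 4))
sup.spectrum(spec, ax=ax)
plt.savefig("spectrum.png", dpi=300, bbox_inches="tight")
```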
This dataset includes all valid felony, misdemeanor, and violation crimes reported to the New York City Police Department (NYPD) for all complete quarters so far this year (2016). Offenses occurring at intersections are represented at the X Coordinate and Y Coordinate of the intersection. Crimes occurring anywhere other than an intersection are geo-located to the middle of the block. For additional details, please see the attached data dictionary in the ‘About’ section.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains a list of 186 Digital Humanities projects leveraging information visualisation techniques. Each project has been classified according to visualisation and interaction methods, narrativity and narrative solutions, domain, methods for the representation of uncertainty and interpretation, and the employment of critical and custom approaches to visually represent humanities data.
The project_id column contains unique internal identifiers assigned to each project. Meanwhile, the last_access column records the most recent date (in DD/MM/YYYY format) on which each project was reviewed based on the web address specified in the url column.
The remaining columns can be grouped into descriptive categories aimed at characterising projects according to different aspects:
Narrativity. It reports the presence of information visualisation techniques employed within narrative structures. Here, the term narrative encompasses both author-driven linear data stories and more user-directed experiences where the narrative sequence is determined by user exploration [1]. We define two columns to identify projects using visualisation techniques in narrative or non-narrative sections. Both conditions can be true for projects employing visualisations in both contexts. Columns:
non_narrative (boolean)
narrative (boolean)
Domain. The humanities domain to which the project is related. We rely on [2] and the chapters of the first part of [3] to abstract a set of general domains. Column:
domain (categorical):
History and archaeology
Art and art history
Language and literature
Music and musicology
Multimedia and performing arts
Philosophy and religion
Other: both extra-list domains and cases of collections without a unique or specific thematic focus.
Visualisation of uncertainty and interpretation. Building upon the frameworks proposed by [4] and [5], a set of categories was identified, highlighting a distinction between precise and impressional communication of uncertainty. Precise methods explicitly represent quantifiable uncertainty such as missing, unknown, or uncertain data, precisely locating and categorising it using visual variables and positioning. Two sub-categories are interactive distinction, when uncertain data is not visually distinguishable from the rest of the data but can be dynamically isolated or included/excluded categorically through interaction techniques (usually filters); and visual distinction, when uncertainty visually “emerges” from the representation by means of dedicated glyphs and spatial or visual cues and variables. On the other hand, impressional methods communicate the constructed and situated nature of data [6], exposing the interpretative layer of the visualisation and indicating more abstract and unquantifiable uncertainty using graphical aids or interpretative metrics. Two sub-categories are: ambiguation, when the use of graphical expedients—like permeable glyph boundaries or broken lines—visually conveys the ambiguity of a phenomenon; and interpretative metrics, when expressive, non-scientific, or non-punctual metrics are used to build a visualisation. Column:
uncertainty_interpretation (categorical):
Interactive distinction
Visual distinction
Ambiguation
Interpretative metrics
Critical adaptation. We identify projects in which, with regard to at least one visualisation, the following criteria are fulfilled: 1) avoiding the repurposing of prepackaged, generic-use, or ready-made solutions; 2) being tailored and unique to reflect the peculiarities of the phenomena at hand; 3) avoiding simplifications to embrace and depict complexity, promoting time-consuming visualisation-based inquiry. Column:
critical_adaptation (boolean)
Non-temporal visualisation techniques. We adopt and partially adapt the terminology and definitions from [7]. A column is defined for each type of visualisation and accounts for its presence within a project, also including stacked layouts and more complex variations. Columns and inclusion criteria:
plot (boolean): visual representations that map data points onto a two-dimensional coordinate system.
cluster_or_set (boolean): sets or cluster-based visualisations used to unveil possible inter-object similarities.
map (boolean): geographical maps used to show spatial insights. While we do not specify the variants of maps (e.g., pin maps, dot density maps, flow maps, etc.), we make an exception for maps where each data point is represented by another visualisation (e.g., a map where each data point is a pie chart) by accounting for the presence of both in their respective columns.
network (boolean): visual representations highlighting relational aspects through nodes connected by links or edges.
hierarchical_diagram (boolean): tree-like structures such as tree diagrams, radial trees, but also dendrograms. They differ from networks in their strictly hierarchical structure and the absence of closed connection loops.
treemap (boolean): still hierarchical, but highlighting quantities expressed by means of area size. It also includes circle packing variants.
word_cloud (boolean): clouds of words, where each instance’s size is proportional to its frequency in a related context
bars (boolean): includes bar charts, histograms, and variants. It coincides with “bar charts” in [7] but uses a more generic term to refer to all bar-based visualisations.
line_chart (boolean): the display of information as sequential data points connected by straight-line segments.
area_chart (boolean): similar to a line chart but with a filled area below the segments. It also includes density plots.
pie_chart (boolean): circular graphs divided into slices which can also use multi-level solutions.
plot_3d (boolean): plots that use a third dimension to encode an additional variable.
proportional_area (boolean): representations used to compare values through area size. Typically, using circle- or square-like shapes.
other (boolean): it includes all other types of non-temporal visualisations that do not fall into the aforementioned categories.
Temporal visualisations and encodings. In addition to non-temporal visualisations, a group of techniques to encode temporality is considered in order to enable comparisons with [7]. Columns:
timeline (boolean): the display of a list of data points or spans in chronological order. They include timelines working either with a scale or simply displaying events in sequence. As in [7], we also include structured solutions resembling Gantt chart layouts.
temporal_dimension (boolean): to report when time is mapped to any dimension of a visualisation, with the exclusion of timelines. We use the term “dimension” and not “axis” as in [7] as more appropriate for radial layouts or more complex representational choices.
animation (boolean): temporality is perceived through an animation changing the visualisation according to time flow.
visual_variable (boolean): another visual encoding strategy is used to represent any temporality-related variable (e.g., colour).
Interactions. A set of categories to assess the interactions afforded, based on the concept of user intent [8] and user-allowed perceptualisation data actions [9]. The following categories roughly match the manipulative subset of methods for “how” an interaction is performed in the conception of [10]. Only interactions that affect the aspect of the visualisation or the visual representation of its data points, symbols, and glyphs are taken into consideration. Columns:
basic_selection (boolean): the demarcation of an element either for the duration of the interaction or more permanently until the occurrence of another selection.
advanced_selection (boolean): the demarcation involves both the selected element and connected elements within the visualisation or leads to brush and link effects across views. Basic selection is tacitly implied.
navigation (boolean): interactions that allow moving, zooming, panning, rotating, and scrolling the view but only when applied to the visualisation and not to the web page. It also includes “drill” interactions (to navigate through different levels or portions of data detail, often generating a new view that replaces or accompanies the original) and “expand” interactions generating new perspectives on data by expanding and collapsing nodes.
arrangement (boolean): the organisation of visualisation elements (symbols, glyphs, etc.) or multi-visualisation layouts spatially through drag and drop or
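As a minimal sketch of how the coded dataset could be queried once loaded, the snippet below uses pandas; the CSV file name is hypothetical, and the boolean columns are assumed to be stored as values pandas can interpret as True/False.

```python
import pandas as pd

# Hypothetical file name; columns follow the codebook described above.
projects = pd.read_csv("dh_projects.csv", parse_dates=["last_access"], dayfirst=True)

# Number of projects per humanities domain.
print(projects["domain"].value_counts())

# Projects combining narrative visualisations with a critical, custom adaptation.
narrative_critical = projects[projects["narrative"] & projects["critical_adaptation"]]
print(narrative_critical[["project_id", "url", "domain"]])

# Share of projects using maps, networks, or timelines.
print(projects[["map", "network", "timeline"]].mean())
```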
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Publication
Primahadi Wijaya R., Gede. 2014. Visualisation of diachronic constructional change using Motion Chart. In Zane Goebel, J. Herudjati Purwoko, Suharno, M. Suryadi & Yusuf Al Aried (eds.). Proceedings: International Seminar on Language Maintenance and Shift IV (LAMAS IV), 267-270. Semarang: Universitas Diponegoro. doi: https://doi.org/10.4225/03/58f5c23dd8387

Description of R codes and data files in the repository
This repository is imported from its GitHub repo. Versioning of this figshare repository is associated with the GitHub repo's Releases, so check the Releases page for updates (the next version is to include the unified version of the codes in the first release with the tidyverse).

The raw input data consists of two files (i.e. will_INF.txt and go_INF.txt). They represent the co-occurrence frequency of the top-200 infinitival collocates for will and be going to respectively across the twenty decades of the Corpus of Historical American English (from the 1810s to the 2000s).

These two input files are used in the R code file 1-script-create-input-data-raw.r. The code preprocesses and combines the two files into a long-format data frame consisting of the following columns: (i) decade, (ii) coll (for "collocate"), (iii) BE going to (for frequency of the collocates with be going to), and (iv) will (for frequency of the collocates with will); it is available in input_data_raw.txt. Then, the script 2-script-create-motion-chart-input-data.R processes input_data_raw.txt, normalising the co-occurrence frequency of the collocates per million words (the COHA size and normalising base frequency are available in coha_size.txt). The output from the second script is input_data_futurate.txt.

Next, input_data_futurate.txt contains the relevant input data for generating (i) the static motion chart as an image plot in the publication (using the script 3-script-create-motion-chart-plot.R), and (ii) the dynamic motion chart (using the script 4-script-motion-chart-dynamic.R).

The repository adopts the project-oriented workflow in RStudio; double-click on the Future Constructions.Rproj file to open an RStudio session whose working directory is associated with the contents of this repository.
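The normalisation performed by the second script amounts to dividing each raw co-occurrence frequency by the corresponding decade's corpus size and multiplying by one million. A minimal illustration in Python follows (the original pipeline is in R; the separators and column names below mirror the description above but are assumptions):

```python
import pandas as pd

# Long-format co-occurrence frequencies: decade, coll, "BE going to", will.
raw = pd.read_csv("input_data_raw.txt", sep="\t")
# COHA decade sizes used as the normalising base frequency (assumed columns: decade, corpus_size).
sizes = pd.read_csv("coha_size.txt", sep="\t")

merged = raw.merge(sizes, on="decade")
for col in ["BE going to", "will"]:
    merged[col + "_pmw"] = merged[col] / merged["corpus_size"] * 1_000_000  # per million words
```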