100+ datasets found
  1. High Interactivity Visualization Software for Large Computational Data Sets,...

    • data.nasa.gov
    application/rdfxml +5
    Updated Jun 26, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2018). High Interactivity Visualization Software for Large Computational Data Sets, Phase II [Dataset]. https://data.nasa.gov/dataset/High-Interactivity-Visualization-Software-for-Larg/ttzp-wtjx
    Explore at:
    application/rdfxml, xml, csv, application/rssxml, tsv, jsonAvailable download formats
    Dataset updated
    Jun 26, 2018
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    Existing scientific visualization tools have specific limitations for large scale scientific data sets. Of these four limitations can be seen as paramount: (i) memory management, (ii) remote visualization, (iii) interactivity, and (iv) specificity. In Phase I, we proposed and successfully developed a prototype of a collection of computer tools and libraries called SciViz that overcome these limitations and enable researchers to visualize large scale data sets (greater than 200 gigabytes) on HPC resources remotely from their workstations at interactive rates. A key element of our technology is the stack oriented rather than a framework driven approach which allows it to interoperate with common existing scientific visualization software thereby eliminating the need for the user to switch and learn new software. The result is a versatile 3D visualization capability that will significantly decrease the time to knowledge discovery from large, complex data sets.

    Typical visualization activity can be organized into a simple stack of steps that leads to the visualization result. These steps can broadly be classified into data retrieval, data analysis, visual representation, and rendering. Our approach will be to continue with the technique selected in Phase I of utilizing existing visualization tools at each point in the visualization stack and to develop specific tools that address the core limitations identified and seamlessly integrate them into the visualization stack. Specifically, we intend to complete technical objectives in four areas that will complete the development of visualization tools for interactive visualization of very large data sets in each layer of the visualization stack. These four areas are: Feature Objectives, C++ Conversion and Optimization, Testing Objectives, and Domain Specifics and Integration. The technology will be developed and tested at NASA and the San Diego Supercomputer Center.

  2. Top 2500 Kaggle Datasets

    • kaggle.com
    Updated Feb 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Saket Kumar (2024). Top 2500 Kaggle Datasets [Dataset]. http://doi.org/10.34740/kaggle/dsv/7637365
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 16, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Saket Kumar
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    This dataset compiles the top 2500 datasets from Kaggle, encompassing a diverse range of topics and contributors. It provides insights into dataset creation, usability, popularity, and more, offering valuable information for researchers, analysts, and data enthusiasts.

    Research Analysis: Researchers can utilize this dataset to analyze trends in dataset creation, popularity, and usability scores across various categories.

    Contributor Insights: Kaggle contributors can explore the dataset to gain insights into factors influencing the success and engagement of their datasets, aiding in optimizing future submissions.

    Machine Learning Training: Data scientists and machine learning enthusiasts can use this dataset to train models for predicting dataset popularity or usability based on features such as creator, category, and file types.

    Market Analysis: Analysts can leverage the dataset to conduct market analysis, identifying emerging trends and popular topics within the data science community on Kaggle.

    Educational Purposes: Educators and students can use this dataset to teach and learn about data analysis, visualization, and interpretation within the context of real-world datasets and community-driven platforms like Kaggle.

    Column Definitions:

    Dataset Name: Name of the dataset. Created By: Creator(s) of the dataset. Last Updated in number of days: Time elapsed since last update. Usability Score: Score indicating the ease of use. Number of File: Quantity of files included. Type of file: Format of files (e.g., CSV, JSON). Size: Size of the dataset. Total Votes: Number of votes received. Category: Categorization of the dataset's subject matter.

  3. Data Visualization Cheat sheets and Resources

    • kaggle.com
    zip
    Updated May 31, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kash (2022). Data Visualization Cheat sheets and Resources [Dataset]. https://www.kaggle.com/kaushiksuresh147/data-visualization-cheat-cheats-and-resources
    Explore at:
    zip(133638507 bytes)Available download formats
    Dataset updated
    May 31, 2022
    Authors
    Kash
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The Data Visualization Corpus

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1430847%2F29f7950c3b7daf11175aab404725542c%2FGettyImages-1187621904-600x360.jpg?generation=1601115151722854&alt=media" alt="">

    Data Visualization

    Data visualization is the graphical representation of information and data. By using visual elements like charts, graphs, and maps, data visualization tools provide an accessible way to see and understand trends, outliers, and patterns in data.

    In the world of Big Data, data visualization tools and technologies are essential to analyze massive amounts of information and make data-driven decisions

    The Data Visualizaion Copus

    The Data Visualization corpus consists:

    • 32 cheat sheets: This includes A-Z about the techniques and tricks that can be used for visualization, Python and R visualization cheat sheets, Types of charts, and their significance, Storytelling with data, etc..

    • 32 Charts: The corpus also consists of a significant amount of data visualization charts information along with their python code, d3.js codes, and presentations relation to the respective charts explaining in a clear manner!

    • Some recommended books for data visualization every data scientist's should read:

      1. Beautiful Visualization by Julie Steele and Noah Iliinsky
      2. Information Dashboard Design by Stephen Few
      3. Knowledge is beautiful by David McCandless (Short abstract)
      4. The Functional Art: An Introduction to Information Graphics and Visualization by Alberto Cairo
      5. The Visual Display of Quantitative Information by Edward R. Tufte
      6. storytelling with data: a data visualization guide for business professionals by cole Nussbaumer knaflic
      7. Research paper - Cheat Sheets for Data Visualization Techniques by Zezhong Wang, Lovisa Sundin, Dave Murray-Rust, Benjamin Bach

    Suggestions:

    In case, if you find any books, cheat sheets, or charts missing and if you would like to suggest some new documents please let me know in the discussion sections!

    Resources:

    Request to kaggle users:

    • A kind request to kaggle users to create notebooks on different visualization charts as per their interest by choosing a dataset of their own as many beginners and other experts could find it useful!

    • To create interactive EDA using animation with a combination of data visualization charts to give an idea about how to tackle data and extract the insights from the data

    Suggestion and queries:

    Feel free to use the discussion platform of this data set to ask questions or any queries related to the data visualization corpus and data visualization techniques

    Kindly upvote the dataset if you find it useful or if you wish to appreciate the effort taken to gather this corpus! Thank you and have a great day!

  4. Online Retail Big Dataset📊📜

    • kaggle.com
    zip
    Updated May 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adil nawaz ashrafi (2024). Online Retail Big Dataset📊📜 [Dataset]. https://www.kaggle.com/datasets/adilashrafi/online-retail-big-dataset
    Explore at:
    zip(22875837 bytes)Available download formats
    Dataset updated
    May 28, 2024
    Authors
    Adil nawaz ashrafi
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Information

    This is a transnational data set which contains all the transactions occurring between 01/12/2010 and 09/12/2011 for a UK-based and registered non-store online retail. The company mainly sells unique all-occasion gifts. Many customers of the company are wholesalers.

  5. f

    Data_Sheet_1_“R” U ready?: a case study using R to analyze changes in gene...

    • frontiersin.figshare.com
    docx
    Updated Mar 22, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amy E. Pomeroy; Andrea Bixler; Stefanie H. Chen; Jennifer E. Kerr; Todd D. Levine; Elizabeth F. Ryder (2024). Data_Sheet_1_“R” U ready?: a case study using R to analyze changes in gene expression during evolution.docx [Dataset]. http://doi.org/10.3389/feduc.2024.1379910.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    Mar 22, 2024
    Dataset provided by
    Frontiers
    Authors
    Amy E. Pomeroy; Andrea Bixler; Stefanie H. Chen; Jennifer E. Kerr; Todd D. Levine; Elizabeth F. Ryder
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    As high-throughput methods become more common, training undergraduates to analyze data must include having them generate informative summaries of large datasets. This flexible case study provides an opportunity for undergraduate students to become familiar with the capabilities of R programming in the context of high-throughput evolutionary data collected using macroarrays. The story line introduces a recent graduate hired at a biotech firm and tasked with analysis and visualization of changes in gene expression from 20,000 generations of the Lenski Lab’s Long-Term Evolution Experiment (LTEE). Our main character is not familiar with R and is guided by a coworker to learn about this platform. Initially this involves a step-by-step analysis of the small Iris dataset built into R which includes sepal and petal length of three species of irises. Practice calculating summary statistics and correlations, and making histograms and scatter plots, prepares the protagonist to perform similar analyses with the LTEE dataset. In the LTEE module, students analyze gene expression data from the long-term evolutionary experiments, developing their skills in manipulating and interpreting large scientific datasets through visualizations and statistical analysis. Prerequisite knowledge is basic statistics, the Central Dogma, and basic evolutionary principles. The Iris module provides hands-on experience using R programming to explore and visualize a simple dataset; it can be used independently as an introduction to R for biological data or skipped if students already have some experience with R. Both modules emphasize understanding the utility of R, rather than creation of original code. Pilot testing showed the case study was well-received by students and faculty, who described it as a clear introduction to R and appreciated the value of R for visualizing and analyzing large datasets.

  6. m

    Dataset of development of business during the COVID-19 crisis

    • data.mendeley.com
    • narcis.nl
    Updated Nov 9, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tatiana N. Litvinova (2020). Dataset of development of business during the COVID-19 crisis [Dataset]. http://doi.org/10.17632/9vvrd34f8t.1
    Explore at:
    Dataset updated
    Nov 9, 2020
    Authors
    Tatiana N. Litvinova
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    To create the dataset, the top 10 countries leading in the incidence of COVID-19 in the world were selected as of October 22, 2020 (on the eve of the second full of pandemics), which are presented in the Global 500 ranking for 2020: USA, India, Brazil, Russia, Spain, France and Mexico. For each of these countries, no more than 10 of the largest transnational corporations included in the Global 500 rating for 2020 and 2019 were selected separately. The arithmetic averages were calculated and the change (increase) in indicators such as profitability and profitability of enterprises, their ranking position (competitiveness), asset value and number of employees. The arithmetic mean values of these indicators for all countries of the sample were found, characterizing the situation in international entrepreneurship as a whole in the context of the COVID-19 crisis in 2020 on the eve of the second wave of the pandemic. The data is collected in a general Microsoft Excel table. Dataset is a unique database that combines COVID-19 statistics and entrepreneurship statistics. The dataset is flexible data that can be supplemented with data from other countries and newer statistics on the COVID-19 pandemic. Due to the fact that the data in the dataset are not ready-made numbers, but formulas, when adding and / or changing the values in the original table at the beginning of the dataset, most of the subsequent tables will be automatically recalculated and the graphs will be updated. This allows the dataset to be used not just as an array of data, but as an analytical tool for automating scientific research on the impact of the COVID-19 pandemic and crisis on international entrepreneurship. The dataset includes not only tabular data, but also charts that provide data visualization. The dataset contains not only actual, but also forecast data on morbidity and mortality from COVID-19 for the period of the second wave of the pandemic in 2020. The forecasts are presented in the form of a normal distribution of predicted values and the probability of their occurrence in practice. This allows for a broad scenario analysis of the impact of the COVID-19 pandemic and crisis on international entrepreneurship, substituting various predicted morbidity and mortality rates in risk assessment tables and obtaining automatically calculated consequences (changes) on the characteristics of international entrepreneurship. It is also possible to substitute the actual values identified in the process and following the results of the second wave of the pandemic to check the reliability of pre-made forecasts and conduct a plan-fact analysis. The dataset contains not only the numerical values of the initial and predicted values of the set of studied indicators, but also their qualitative interpretation, reflecting the presence and level of risks of a pandemic and COVID-19 crisis for international entrepreneurship.

  7. Retail Analysis on Large Dataset

    • kaggle.com
    Updated Jun 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sahil Prajapati (2024). Retail Analysis on Large Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/8693643
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 14, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sahil Prajapati
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset Description:

    • The dataset represents retail transactional data. It contains information about customers, their purchases, products, and transaction details. The data includes various attributes such as customer ID, name, email, phone, address, city, state, zipcode, country, age, gender, income, customer segment, last purchase date, total purchases, amount spent, product category, product brand, product type, feedback, shipping method, payment method, and order status.

    Key Points:

    Customer Information:

    • Includes customer details like ID, name, email, phone, address, city, state, zipcode, country, age, and gender. Customer segments are categorized into Premium, Regular, and New. ##Transaction Details:
    • Transaction-specific data such as transaction ID, last purchase date, total purchases, amount spent, total purchase amount, feedback, shipping method, payment method, and order status. ##Product Information:
    • Contains product-related details such as product category, brand, and type. Products are categorized into electronics, clothing, grocery, books, and home decor. ##Geographic Information:
    • Contains location details including city, state, and country. Available for various countries including USA, UK, Canada, Australia, and Germany. ##Temporal Information:
    • Last purchase date is provided along with separate columns for year, month, date, and time. Allows analysis based on temporal patterns and trends. ##Data Quality:
    • Some rows contain null values, and others are duplicates, which may need to be handled during data preprocessing. Null values are randomly distributed across rows. Duplicate rows are available at different parts of the dataset. ##Potential Analysis:
    • Customer segmentation analysis based on demographics, purchase behavior, and feedback. Sales trend analysis over time to identify peak seasons or trends. Product performance analysis to determine popular categories, brands, or types. Geographic analysis to understand regional preferences and trends. Payment and shipping method analysis to optimize services. Customer satisfaction analysis based on feedback and order status. ##Data Preprocessing:
    • Handling null values and duplicates. Parsing and formatting temporal data. Encoding categorical variables. Scaling numerical variables if required. Splitting data into training and testing sets for modeling.
  8. R

    WIDEa: a Web Interface for big Data exploration, management and analysis

    • entrepot.recherche.data.gouv.fr
    Updated Sep 12, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Philippe Santenoise; Philippe Santenoise (2021). WIDEa: a Web Interface for big Data exploration, management and analysis [Dataset]. http://doi.org/10.15454/AGU4QE
    Explore at:
    Dataset updated
    Sep 12, 2021
    Dataset provided by
    Recherche Data Gouv
    Authors
    Philippe Santenoise; Philippe Santenoise
    License

    https://entrepot.recherche.data.gouv.fr/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.15454/AGU4QEhttps://entrepot.recherche.data.gouv.fr/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.15454/AGU4QE

    Description

    WIDEa is R-based software aiming to provide users with a range of functionalities to explore, manage, clean and analyse "big" environmental and (in/ex situ) experimental data. These functionalities are the following, 1. Loading/reading different data types: basic (called normal), temporal, infrared spectra of mid/near region (called IR) with frequency (wavenumber) used as unit (in cm-1); 2. Interactive data visualization from a multitude of graph representations: 2D/3D scatter-plot, box-plot, hist-plot, bar-plot, correlation matrix; 3. Manipulation of variables: concatenation of qualitative variables, transformation of quantitative variables by generic functions in R; 4. Application of mathematical/statistical methods; 5. Creation/management of data (named flag data) considered as atypical; 6. Study of normal distribution model results for different strategies: calibration (checking assumptions on residuals), validation (comparison between measured and fitted values). The model form can be more or less complex: mixed effects, main/interaction effects, weighted residuals.

  9. D

    Set Visualization Tools Market Research Report 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Oct 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). Set Visualization Tools Market Research Report 2033 [Dataset]. https://dataintelo.com/report/set-visualization-tools-market
    Explore at:
    csv, pdf, pptxAvailable download formats
    Dataset updated
    Oct 1, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Set Visualization Tools Market Outlook



    According to our latest research, the global set visualization tools market size reached USD 3.6 billion in 2024, with a robust year-over-year growth driven by the surging demand for advanced data analysis and visualization solutions across industries. The market is projected to expand at a CAGR of 11.7% from 2025 to 2033, reaching a forecasted value of USD 10.1 billion by 2033. This remarkable growth trajectory is primarily attributed to the increasing adoption of big data analytics, artificial intelligence, and digital transformation initiatives among enterprises, government bodies, and academic institutions worldwide.




    One of the primary growth factors for the set visualization tools market is the escalating volume, velocity, and variety of data generated across sectors such as business intelligence, scientific research, and education. Organizations are increasingly recognizing the value of transforming complex, multidimensional datasets into intuitive, interactive visual representations to facilitate better decision-making, uncover hidden insights, and drive operational efficiency. The proliferation of IoT devices, cloud computing, and advanced analytics platforms has further amplified the need for sophisticated set visualization tools that can seamlessly integrate with existing data ecosystems, enabling users to analyze relationships, intersections, and trends within large, heterogeneous datasets.




    Another significant driver propelling the market growth is the rapid digitalization of enterprises and the growing emphasis on data-driven strategies. Businesses are leveraging set visualization tools to enhance their business intelligence capabilities, monitor key performance indicators, and gain a competitive edge in an increasingly data-centric landscape. These tools empower organizations to visualize overlaps, gaps, and anomalies in data sets, supporting functions such as market segmentation, customer profiling, and risk management. As companies continue to invest in advanced analytics and visualization solutions, the demand for customizable, scalable, and user-friendly set visualization platforms is poised to witness sustained growth throughout the forecast period.




    Furthermore, the integration of artificial intelligence and machine learning algorithms into set visualization tools is revolutionizing the market, enabling automated pattern recognition, predictive analytics, and real-time data exploration. This technological evolution is not only enhancing the accuracy and efficiency of data analysis but also democratizing access to complex analytical capabilities for non-technical users. The growing focus on enhancing user experience, interoperability, and cross-platform compatibility is fostering innovation and differentiation among solution providers, further accelerating market expansion. Additionally, the increasing adoption of remote and hybrid work models is driving demand for cloud-based visualization tools that offer flexibility, scalability, and collaborative features.




    From a regional perspective, North America currently dominates the set visualization tools market, accounting for the largest revenue share in 2024, followed closely by Europe and Asia Pacific. The strong presence of leading technology vendors, high digital adoption rates, and significant investments in data analytics infrastructure are key factors underpinning North America's leadership. Meanwhile, Asia Pacific is emerging as the fastest-growing region, fueled by rapid digital transformation, expanding enterprise IT budgets, and a burgeoning ecosystem of startups and academic institutions. As organizations across all regions continue to prioritize data-driven decision-making, the global set visualization tools market is expected to maintain its upward momentum over the coming years.



    Component Analysis



    The set visualization tools market by component is primarily segmented into software and services, each playing a pivotal role in the overall ecosystem. Software solutions dominate the market, driven by the continuous evolution of visualization platforms that offer advanced features such as dynamic dashboards, drag-and-drop interfaces, and integration with diverse data sources. Vendors are focusing on enhancing the scalability, security, and customization capabilities of their software offerings to cater to the unique requirements of various industries. The growing trend of self-service analytics is further boo

  10. Data from: Superheat: An R Package for Creating Beautiful and Extendable...

    • tandf.figshare.com
    bin
    Updated Mar 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rebecca L. Barter; Bin Yu (2024). Superheat: An R Package for Creating Beautiful and Extendable Heatmaps for Visualizing Complex Data [Dataset]. http://doi.org/10.6084/m9.figshare.6287693.v1
    Explore at:
    binAvailable download formats
    Dataset updated
    Mar 4, 2024
    Dataset provided by
    Taylor & Francishttps://taylorandfrancis.com/
    Authors
    Rebecca L. Barter; Bin Yu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The technological advancements of the modern era have enabled the collection of huge amounts of data in science and beyond. Extracting useful information from such massive datasets is an ongoing challenge as traditional data visualization tools typically do not scale well in high-dimensional settings. An existing visualization technique that is particularly well suited to visualizing large datasets is the heatmap. Although heatmaps are extremely popular in fields such as bioinformatics, they remain a severely underutilized visualization tool in modern data analysis. This article introduces superheat, a new R package that provides an extremely flexible and customizable platform for visualizing complex datasets. Superheat produces attractive and extendable heatmaps to which the user can add a response variable as a scatterplot, model results as boxplots, correlation information as barplots, and more. The goal of this article is two-fold: (1) to demonstrate the potential of the heatmap as a core visualization method for a range of data types, and (2) to highlight the customizability and ease of implementation of the superheat R package for creating beautiful and extendable heatmaps. The capabilities and fundamental applicability of the superheat package will be explored via three reproducible case studies, each based on publicly available data sources.

  11. Scalable ParaView for Extreme Scale Visualization, Phase I

    • data.nasa.gov
    application/rdfxml +5
    Updated Jun 26, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2018). Scalable ParaView for Extreme Scale Visualization, Phase I [Dataset]. https://data.nasa.gov/dataset/Scalable-ParaView-for-Extreme-Scale-Visualization-/up7h-hkky
    Explore at:
    csv, tsv, xml, application/rssxml, application/rdfxml, jsonAvailable download formats
    Dataset updated
    Jun 26, 2018
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    Petscale computing is leading to significant breakthroughs in a number of fields and is revolutionizing the way science is conducted. Data is not knowledge, however, and the challenge has been how to analyze and gain insight from the massive quantities of data that are generated. In order to address the peta-scale visualization challenges, we propose to develop a scientific visualization software that would enable real-time visualization capability of extremely large data sets. We plan to accomplish this by extending the ParaView visualization architecture to extreme scales. ParaView is an open source software installed on all HPC sites including NASA's Pleiades and has a large user base in diverse areas of science and engineering. Our proposed solution will significantly enhance the scientific return from NASA HPC investments by providing the next generation of open source data analysis and visualization tools for very large datasets. To test our solution on real world data with complex pipeline, we have partnered with SciberQuest, who have recently performed the largest kinetic simulations of magnetosphere using 25 K cores on Pleiades and 100 K cores on Kraken. Given that IO is the main bottleneck for scientific visualization at large scales, we propose to work closely with Pleiades's systems team and provide efficient prepackaged general purpose I/O component for ParaView for structured and unstructured data across a spectrum of scales and access patterns with focus on Lustre file system used by Pleiades.

  12. d

    Frontiers of Data Visualization Workshop II: Data Wrangling Workshop Summary...

    • catalog.data.gov
    Updated May 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NCO NITRD (2025). Frontiers of Data Visualization Workshop II: Data Wrangling Workshop Summary [Dataset]. https://catalog.data.gov/dataset/frontiers-of-data-visualization-workshop-ii-data-wrangling-workshop-summary
    Explore at:
    Dataset updated
    May 14, 2025
    Dataset provided by
    NCO NITRD
    Description

    The Data Visualization Workshop II: Data Wrangling was a web-based event held on October 18, 2017. This workshop report summarizes the individual perspectives of a group of visualization experts from the public, private, and academic sectors who met online to discuss how to improve the creation and use of high-quality visualizations. The specific focus of this workshop was on the complexities of "data wrangling". Data wrangling includes finding the appropriate data sources that are both accessible and usable and then shaping and combining that data to facilitate the most accurate and meaningful analysis possible. The workshop was organized as a 3-hour web event and moderated by the members of the Human Computer Interaction and Information Management Task Force of the Networking and Information Technology Research and Development Program's Big Data Interagency Working Group. Report prepared by the Human Computer Interaction And Information Management Task Force, Big Data Interagency Working Group, Networking & Information Technology Research & Development Subcommittee, Committee On Technology Of The National Science & Technology Council...

  13. G

    Set Visualization Tools Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Aug 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). Set Visualization Tools Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/set-visualization-tools-market
    Explore at:
    pdf, csv, pptxAvailable download formats
    Dataset updated
    Aug 23, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Set Visualization Tools Market Outlook



    According to our latest research, the global set visualization tools market size reached USD 3.2 billion in 2024, driven by the increasing demand for advanced data analytics and visual representation across diverse industries. The market is expected to grow at a robust CAGR of 12.8% from 2025 to 2033, reaching a forecasted value of USD 9.1 billion by 2033. This significant growth is primarily attributed to the proliferation of big data, the rising importance of data-driven decision-making, and the expansion of digital transformation initiatives worldwide.




    One of the primary growth factors fueling the set visualization tools market is the exponential surge in data generation from numerous sources, including IoT devices, enterprise applications, and digital platforms. Organizations are increasingly seeking efficient ways to interpret complex and voluminous datasets, making advanced visualization tools indispensable for extracting actionable insights. The integration of artificial intelligence (AI) and machine learning (ML) into these tools further enhances their capability to identify patterns, trends, and anomalies, thus supporting more informed strategic decisions. As businesses across sectors recognize the value of data visualization in driving operational efficiency and innovation, the adoption of set visualization tools continues to accelerate.




    Another key driver is the growing emphasis on business intelligence (BI) and analytics within enterprises of all sizes. Modern set visualization tools are evolving to offer intuitive interfaces, real-time analytics, and seamless integration with existing IT infrastructure, making them accessible to non-technical users as well. This democratization of data analytics empowers a broader range of stakeholders to participate in data-driven processes, fostering a culture of collaboration and agility. Additionally, the increasing complexity of datasets, especially in sectors like healthcare, finance, and scientific research, necessitates sophisticated visualization solutions capable of handling multidimensional and hierarchical data structures.




    The rapid adoption of cloud computing and the shift towards remote and hybrid work environments have also played a pivotal role in the expansion of the set visualization tools market. Cloud-based deployment models offer unparalleled scalability, flexibility, and cost-effectiveness, enabling organizations to access visualization capabilities without significant upfront investments in hardware or infrastructure. Furthermore, the emergence of mobile and web-based visualization platforms ensures that users can interact with data visualizations anytime, anywhere, thereby enhancing productivity and decision-making speed. As digital transformation initiatives gain momentum globally, the demand for advanced, user-friendly, and scalable set visualization tools is expected to remain strong.




    From a regional perspective, North America currently dominates the set visualization tools market, accounting for the largest share in 2024, followed closely by Europe and the Asia Pacific. The presence of leading technology companies, a mature IT infrastructure, and high investment in analytics and business intelligence solutions contribute to North America's leadership position. However, the Asia Pacific region is witnessing the fastest growth, propelled by rapid digitalization, expanding enterprise IT budgets, and increasing awareness about the benefits of data visualization. As emerging economies in Latin America and the Middle East & Africa continue to invest in digital transformation, these regions are also expected to offer lucrative growth opportunities for market players over the forecast period.





    Component Analysis



    The set visualization tools market by component is primarily segmented into software and services, each playing a crucial role in the overall ecosystem. The software segment holds the majority share, driven by the continuous evolution of visualization platforms

  14. Z

    DEVILS: a tool for the visualization of large datasets with a high dynamic...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jul 19, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Romain Guiet; Olivier Burri; Nicolas Chiaruttini; Olivier Hagens; Arne Seitz (2024). DEVILS: a tool for the visualization of large datasets with a high dynamic range [Dataset]. https://data.niaid.nih.gov/resources?id=ZENODO_4058413
    Explore at:
    Dataset updated
    Jul 19, 2024
    Dataset provided by
    EPFL - École Polytechnique Fédérale de Lausanne
    Authors
    Romain Guiet; Olivier Burri; Nicolas Chiaruttini; Olivier Hagens; Arne Seitz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository accompanying the article “DEVILS: a tool for the visualization of large datasets with a high dynamic range” contains the following:

    Extended Material of the article

    An example raw dataset corresponding to the images shown in Fig. 3

    A workflow description that demonstrates the use of the DEVILS workflow with BigStitcher.

    Two scripts (“CLAHE_Parameters_test.ijm” and a “DEVILS_Parallel_tests.groovy”) used for Figure S2, S3 and S4.

  15. r

    1000 Empirical Time series

    • researchdata.edu.au
    • bridges.monash.edu
    • +1more
    Updated May 5, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ben Fulcher (2022). 1000 Empirical Time series [Dataset]. http://doi.org/10.6084/m9.figshare.5436136.v10
    Explore at:
    Dataset updated
    May 5, 2022
    Dataset provided by
    Monash University
    Authors
    Ben Fulcher
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A diverse selection of 1000 empirical time series, along with results of an hctsa feature extraction, using v1.06 of hctsa and Matlab 2019b, computed on a server at The University of Sydney.


    The results of the computation are in the hctsa file, HCTSA_Empirical1000.mat for use in Matlab using v1.06 of hctsa.

    The same data is also provided in .csv format for the hctsa_datamatrix.csv (results of feature computation), with information about rows (time series) in hctsa_timeseries-info.csv, information about columns (features) in hctsa_features.csv (and corresponding hctsa code used to compute each feature in hctsa_masterfeatures.csv), and the data of individual time series (each line a time series, for time series described in hctsa_timeseries-info.csv) is in hctsa_timeseries-data.csv.

    These .csv files were produced by running >>OutputToCSV(HCTSA_Empirical1000.mat,true,true); in hctsa.

    The input file, INP_Empirical1000.mat, is for use with hctsa, and contains the time-series data and metadata for the 1000 time series. For example, massive feature extraction from these data on the user's machine, using hctsa, can proceed as
    >> TS_Init('INP_Empirical1000.mat');

    Some visualizations of the dataset are in CarpetPlot.png (first 1000 samples of all time series as a carpet (color) plot) and 150TS-250samples.png (conventional time-series plots of the first 250 samples of a sample of 150 time series from the dataset). More visualizations can be performed by the user using TS_PlotTimeSeries from the hctsa package.

    See links in references for more comprehensive documentation for performing methodological comparison using this dataset, and on how to download and use v1.06 of hctsa.

  16. H

    Data from: Mapping the Milky Way, from the Inside Out, in Color

    • dataverse.harvard.edu
    Updated Jan 23, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harvard Dataverse (2020). Mapping the Milky Way, from the Inside Out, in Color [Dataset]. http://doi.org/10.7910/DVN/7IO69A
    Explore at:
    pdf(31350987), application/x-iwork-keynote-sffkey(753864503)Available download formats
    Dataset updated
    Jan 23, 2020
    Dataset provided by
    Harvard Dataverse
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Presentation Date: Friday, March 15, 2019. Location: Barnstable, MA. Abstract: A presentation to a crowd of Barnstable High "AstroJunkies," about how we use physics, statistics, and visualizations to turn information from large, public, astronomical data sets, across many wavelengths into a better understanding of the structure of the Milky Way.

  17. v

    Global Data Visualization Software Market Size By Deployment Type, By Tool...

    • verifiedmarketresearch.com
    Updated May 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    VERIFIED MARKET RESEARCH (2024). Global Data Visualization Software Market Size By Deployment Type, By Tool Type, By Business Function, By Geographic Scope And Forecast [Dataset]. https://www.verifiedmarketresearch.com/product/data-visualization-software-market/
    Explore at:
    Dataset updated
    May 21, 2024
    Dataset authored and provided by
    VERIFIED MARKET RESEARCH
    License

    https://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/

    Time period covered
    2024 - 2031
    Area covered
    Global
    Description

    Data Visualization Software Market size is growing at a moderate pace with substantial growth rates over the last few years and is estimated that the market will grow significantly in the forecasted period i.e., 2024 to 2031.

    Global Data Visualization Software Market Drivers

    The market drivers for the Data Visualization Software Market can be influenced by various factors. These may include:

    Growing Need for Analytics and Big Data:Businesses are producing enormous volumes of data, which they must analyse and visualise with the help of effective technologies to get insights that can be put to use.

    Expanding Use of Business Intelligence (BI) Products:The need for data visualisation software is being driven by the increasing use of BI tools by businesses to improve their decision-making procedures.

    Increase in Cloud-Based Solution Adoption:Businesses of all sizes are adopting cloud-based data visualisation solutions due to their affordability, scalability, and flexibility.

    IoT Device Proliferation:Massive data volumes are being generated by the proliferation of IoT devices, necessitating the use of sophisticated visualisation tools for efficient analysis and interpretation.

    Increased Requirement for Data Analysis in Real Time:The market for data visualisation software is being driven by the need for real-time data analysis to support dynamic corporate settings and quick decision-making.

    Increasing Application of Machine Learning (ML) and Artificial Intelligence (AI):When AI and ML are combined with data visualisation tools, it becomes easier to analyse large, complicated data sets, forecast trends, and offer deeper insights.

    Increase in Business Strategies Driven by Data:In order to remain competitive, businesses are embracing data-driven strategies more and more, which is driving up demand for advanced data visualisation tools.

    Raising Knowledge of the Advantages of Data Visualisation:The growth of data visualisation is being driven by a growing understanding of its benefits, which include enhanced communication and data comprehension across a range of industries.

    Improvements in Tools and Techniques for Visualisation:Market expansion is being driven by ongoing advancements in visualisation technologies as well as the creation of more feature-rich and user-friendly tools.

    Widening the Scope of Applications:The market is growing as a result of the growing application of data visualisation in industries like healthcare, finance, retail, and education.

  18. T

    pass

    • tensorflow.org
    Updated Jun 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). pass [Dataset]. https://www.tensorflow.org/datasets/catalog/pass
    Explore at:
    Dataset updated
    Jun 1, 2024
    Description

    PASS is a large-scale image dataset that does not include any humans, human parts, or other personally identifiable information. It can be used for high-quality self-supervised pretraining while significantly reducing privacy concerns.

    PASS contains 1,439,589 images without any labels sourced from YFCC-100M.

    All images in this dataset are licenced under the CC-BY licence, as is the dataset itself. For YFCC-100M see http://www.multimediacommons.org/.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('pass', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

    https://storage.googleapis.com/tfds-data/visualization/fig/pass-3.0.0.png" alt="Visualization" width="500px">

  19. D

    Data from: TB-Places: A Data Set for Visual Place Recognition in Garden...

    • dataverse.nl
    bin, text/markdown +1
    Updated Oct 14, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maria Leyva-Vallina; Maria Leyva-Vallina; Nicola Strisciuglio; Nicola Strisciuglio; Manuel López Antequera; Manuel López Antequera; Radim Tylecek; Radim Tylecek; Michael Blaich; Nicolai Petkov; Nicolai Petkov; Michael Blaich (2022). TB-Places: A Data Set for Visual Place Recognition in Garden Environments [Dataset]. http://doi.org/10.34894/VIL0EV
    Explore at:
    text/markdown(3824), zip(5778796621), bin(8388608000), bin(7340032000), zip(5553347935), zip(6713181123)Available download formats
    Dataset updated
    Oct 14, 2022
    Dataset provided by
    DataverseNL
    Authors
    Maria Leyva-Vallina; Maria Leyva-Vallina; Nicola Strisciuglio; Nicola Strisciuglio; Manuel López Antequera; Manuel López Antequera; Radim Tylecek; Radim Tylecek; Michael Blaich; Nicolai Petkov; Nicolai Petkov; Michael Blaich
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Place recognition can be achieved by identifying whether a pair of images (a labeled reference image and a query image) depict the same place, regardless of appearance changes due to different viewpoints or lighting conditions. It is an important component of systems for camera localization and for loop closure detection, and a widely studied problem for indoor or urban environments. Recently, the use of robots in agriculture and automatic gardening has created new challenges due to the highly repetitive appearance with prevalent green color and repetitive texture of garden-like scenes. The lack of available data recorded in gardens or plant fields makes difficult to improve localization algorithms for such environments. In this paper, we propose a new data set of garden images for testing algorithms for visual place recognition. It contains images with ground truth camera pose recorded in real gardens at different times, with varying light conditions. We also provide ground truth for all possible pairs of images, indicating whether they depict the same place or not. We also performed a thorough benchmark of several holistic (whole-image) descriptors and provide the results on the proposed data set. We observed that existing descriptors have difficulties with scenes with repetitive textures and large changes of camera viewpoint.

  20. G

    Data Visualization Software Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Aug 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). Data Visualization Software Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/data-visualization-software-market
    Explore at:
    pptx, csv, pdfAvailable download formats
    Dataset updated
    Aug 29, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Data Visualization Software Market Outlook



    According to our latest research, the global Data Visualization Software market size reached USD 8.2 billion in 2024, reflecting the sectorÂ’s rapid adoption across industries. With a robust CAGR of 10.8% projected from 2025 to 2033, the market is expected to grow significantly, attaining a value of USD 20.3 billion by 2033. This dynamic expansion is primarily driven by the increasing demand for actionable business insights, the proliferation of big data analytics, and the growing need for real-time decision-making tools across enterprises worldwide.




    One of the most powerful growth factors for the Data Visualization Software market is the surge in big data generation and the corresponding need for advanced analytics solutions. Organizations are increasingly dealing with massive and complex datasets that traditional reporting tools cannot handle efficiently. Modern data visualization software enables users to interpret these vast datasets quickly, presenting trends, patterns, and anomalies in intuitive graphical formats. This empowers organizations to make informed decisions faster, boosting overall operational efficiency and competitive advantage. Furthermore, the integration of artificial intelligence and machine learning capabilities into data visualization platforms is enhancing their analytical power, allowing for predictive and prescriptive insights that were previously unattainable.




    Another significant driver of the Data Visualization Software market is the widespread digital transformation initiatives across various sectors. Enterprises are investing heavily in digital technologies to streamline operations, improve customer experiences, and unlock new revenue streams. Data visualization tools have become integral to these transformations, serving as a bridge between raw data and strategic business outcomes. By offering interactive dashboards, real-time reporting, and customizable analytics, these solutions enable users at all organizational levels to engage with data meaningfully. The democratization of data access facilitated by user-friendly visualization software is fostering a data-driven culture, encouraging innovation and agility across industries such as BFSI, healthcare, retail, and manufacturing.




    The increasing adoption of cloud-based data visualization solutions is also fueling market growth. Cloud deployment offers scalability, flexibility, and cost-effectiveness, making advanced analytics accessible to organizations of all sizes, including small and medium enterprises (SMEs). Cloud-based platforms support seamless integration with other business applications, facilitate remote collaboration, and provide robust security features. As businesses continue to embrace remote and hybrid work models, the demand for cloud-based data visualization tools is expected to rise, further accelerating market expansion. Vendors are responding with enhanced offerings, including AI-driven analytics, embedded BI, and self-service visualization capabilities, catering to the evolving needs of modern enterprises.



    In the realm of warehouse management systems (WMS), the integration of WMS Data Visualization Tools is becoming increasingly vital. These tools offer a comprehensive view of warehouse operations, enabling managers to visualize data related to inventory levels, order processing, and shipment tracking in real-time. By leveraging advanced visualization techniques, WMS data visualization tools help in identifying bottlenecks, optimizing resource allocation, and improving overall efficiency. The ability to transform complex data sets into intuitive visual formats empowers warehouse managers to make informed decisions swiftly, thereby enhancing productivity and reducing operational costs. As the demand for streamlined logistics and supply chain management continues to grow, the adoption of WMS data visualization tools is expected to rise, driving further innovation in the sector.




    Regionally, North America continues to dominate the Data Visualization Software market due to early technology adoption, a strong presence of leading vendors, and a mature analytics landscape. However, the Asia Pacific region is witnessing the fastest growth, driven by rapid digitalization, increasing IT investments, and the emergence of data-centric business models in countries like China, India

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
(2018). High Interactivity Visualization Software for Large Computational Data Sets, Phase II [Dataset]. https://data.nasa.gov/dataset/High-Interactivity-Visualization-Software-for-Larg/ttzp-wtjx
Organization logo

High Interactivity Visualization Software for Large Computational Data Sets, Phase II

Explore at:
application/rdfxml, xml, csv, application/rssxml, tsv, jsonAvailable download formats
Dataset updated
Jun 26, 2018
License

U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically

Description

Existing scientific visualization tools have specific limitations for large scale scientific data sets. Of these four limitations can be seen as paramount: (i) memory management, (ii) remote visualization, (iii) interactivity, and (iv) specificity. In Phase I, we proposed and successfully developed a prototype of a collection of computer tools and libraries called SciViz that overcome these limitations and enable researchers to visualize large scale data sets (greater than 200 gigabytes) on HPC resources remotely from their workstations at interactive rates. A key element of our technology is the stack oriented rather than a framework driven approach which allows it to interoperate with common existing scientific visualization software thereby eliminating the need for the user to switch and learn new software. The result is a versatile 3D visualization capability that will significantly decrease the time to knowledge discovery from large, complex data sets.

Typical visualization activity can be organized into a simple stack of steps that leads to the visualization result. These steps can broadly be classified into data retrieval, data analysis, visual representation, and rendering. Our approach will be to continue with the technique selected in Phase I of utilizing existing visualization tools at each point in the visualization stack and to develop specific tools that address the core limitations identified and seamlessly integrate them into the visualization stack. Specifically, we intend to complete technical objectives in four areas that will complete the development of visualization tools for interactive visualization of very large data sets in each layer of the visualization stack. These four areas are: Feature Objectives, C++ Conversion and Optimization, Testing Objectives, and Domain Specifics and Integration. The technology will be developed and tested at NASA and the San Diego Supercomputer Center.

Search
Clear search
Close search
Google apps
Main menu