100+ datasets found
  1. f

    DataSheet1_Exploratory data analysis (EDA) machine learning approaches for...

    • frontiersin.figshare.com
    docx
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Victoria Da Poian; Bethany Theiling; Lily Clough; Brett McKinney; Jonathan Major; Jingyi Chen; Sarah Hörst (2023). DataSheet1_Exploratory data analysis (EDA) machine learning approaches for ocean world analog mass spectrometry.docx [Dataset]. http://doi.org/10.3389/fspas.2023.1134141.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    Frontiers
    Authors
    Victoria Da Poian; Bethany Theiling; Lily Clough; Brett McKinney; Jonathan Major; Jingyi Chen; Sarah Hörst
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    World
    Description

    Many upcoming and proposed missions to ocean worlds such as Europa, Enceladus, and Titan aim to evaluate their habitability and the existence of potential life on these moons. These missions will suffer from communication challenges and technology limitations. We review and investigate the applicability of data science and unsupervised machine learning (ML) techniques on isotope ratio mass spectrometry data (IRMS) from volatile laboratory analogs of Europa and Enceladus seawaters as a case study for development of new strategies for icy ocean world missions. Our driving science goal is to determine whether the mass spectra of volatile gases could contain information about the composition of the seawater and potential biosignatures. We implement data science and ML techniques to investigate what inherent information the spectra contain and determine whether a data science pipeline could be designed to quickly analyze data from future ocean worlds missions. In this study, we focus on the exploratory data analysis (EDA) step in the analytics pipeline. This is a crucial unsupervised learning step that allows us to understand the data in depth before subsequent steps such as predictive/supervised learning. EDA identifies and characterizes recurring patterns, significant correlation structure, and helps determine which variables are redundant and which contribute to significant variation in the lower dimensional space. In addition, EDA helps to identify irregularities such as outliers that might be due to poor data quality. We compared dimensionality reduction methods Uniform Manifold Approximation and Projection (UMAP) and Principal Component Analysis (PCA) for transforming our data from a high-dimensional space to a lower dimension, and we compared clustering algorithms for identifying data-driven groups (“clusters”) in the ocean worlds analog IRMS data and mapping these clusters to experimental conditions such as seawater composition and CO2 concentration. Such data analysis and characterization efforts are the first steps toward the longer-term science autonomy goal where similar automated ML tools could be used onboard a spacecraft to prioritize data transmissions for bandwidth-limited outer Solar System missions.

  2. E

    Exploratory Data Analysis (EDA) Tools Report

    • marketreportanalytics.com
    doc, pdf, ppt
    Updated Apr 2, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Report Analytics (2025). Exploratory Data Analysis (EDA) Tools Report [Dataset]. https://www.marketreportanalytics.com/reports/exploratory-data-analysis-eda-tools-54164
    Explore at:
    ppt, pdf, docAvailable download formats
    Dataset updated
    Apr 2, 2025
    Dataset authored and provided by
    Market Report Analytics
    License

    https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Exploratory Data Analysis (EDA) tools market is experiencing robust growth, driven by the increasing volume and complexity of data across various industries. The market, estimated at $1.5 billion in 2025, is projected to exhibit a Compound Annual Growth Rate (CAGR) of 15% from 2025 to 2033, reaching approximately $5 billion by 2033. This expansion is fueled by several key factors. Firstly, the rising adoption of big data analytics and business intelligence initiatives across large enterprises and SMEs is creating a significant demand for efficient EDA tools. Secondly, the growing need for faster, more insightful data analysis to support better decision-making is driving the preference for user-friendly graphical EDA tools over traditional non-graphical methods. Furthermore, advancements in artificial intelligence and machine learning are seamlessly integrating into EDA tools, enhancing their capabilities and broadening their appeal. The market segmentation reveals a significant portion held by large enterprises, reflecting their greater resources and data handling needs. However, the SME segment is rapidly gaining traction, driven by the increasing affordability and accessibility of cloud-based EDA solutions. Geographically, North America currently dominates the market, but regions like Asia-Pacific are exhibiting high growth potential due to increasing digitalization and technological advancements. Despite this positive outlook, certain restraints remain. The high initial investment cost associated with implementing advanced EDA solutions can be a barrier for some SMEs. Additionally, the need for skilled professionals to effectively utilize these tools can create a challenge for organizations. However, the ongoing development of user-friendly interfaces and the availability of training resources are actively mitigating these limitations. The competitive landscape is characterized by a mix of established players like IBM and emerging innovative companies offering specialized solutions. Continuous innovation in areas like automated data preparation and advanced visualization techniques will further shape the future of the EDA tools market, ensuring its sustained growth trajectory.

  3. Ecommerce Dataset for Data Analysis

    • kaggle.com
    zip
    Updated Sep 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shrishti Manja (2024). Ecommerce Dataset for Data Analysis [Dataset]. https://www.kaggle.com/datasets/shrishtimanja/ecommerce-dataset-for-data-analysis/code
    Explore at:
    zip(2028853 bytes)Available download formats
    Dataset updated
    Sep 19, 2024
    Authors
    Shrishti Manja
    Description

    This dataset contains 55,000 entries of synthetic customer transactions, generated using Python's Faker library. The goal behind creating this dataset was to provide a resource for learners like myself to explore, analyze, and apply various data analysis techniques in a context that closely mimics real-world data.

    About the Dataset: - CID (Customer ID): A unique identifier for each customer. - TID (Transaction ID): A unique identifier for each transaction. - Gender: The gender of the customer, categorized as Male or Female. - Age Group: Age group of the customer, divided into several ranges. - Purchase Date: The timestamp of when the transaction took place. - Product Category: The category of the product purchased, such as Electronics, Apparel, etc. - Discount Availed: Indicates whether the customer availed any discount (Yes/No). - Discount Name: Name of the discount applied (e.g., FESTIVE50). - Discount Amount (INR): The amount of discount availed by the customer. - Gross Amount: The total amount before applying any discount. - Net Amount: The final amount after applying the discount. - Purchase Method: The payment method used (e.g., Credit Card, Debit Card, etc.). - Location: The city where the purchase took place.

    Use Cases: 1. Exploratory Data Analysis (EDA): This dataset is ideal for conducting EDA, allowing users to practice techniques such as summary statistics, visualizations, and identifying patterns within the data. 2. Data Preprocessing and Cleaning: Learners can work on handling missing data, encoding categorical variables, and normalizing numerical values to prepare the dataset for analysis. 3. Data Visualization: Use tools like Python’s Matplotlib, Seaborn, or Power BI to visualize purchasing trends, customer demographics, or the impact of discounts on purchase amounts. 4. Machine Learning Applications: After applying feature engineering, this dataset is suitable for supervised learning models, such as predicting whether a customer will avail a discount or forecasting purchase amounts based on the input features.

    This dataset provides an excellent sandbox for honing skills in data analysis, machine learning, and visualization in a structured but flexible manner.

    This is not a real dataset. This dataset was generated using Python's Faker library for the sole purpose of learning

  4. E

    Exploratory Data Analysis (EDA) Tools Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Feb 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archive Market Research (2025). Exploratory Data Analysis (EDA) Tools Report [Dataset]. https://www.archivemarketresearch.com/reports/exploratory-data-analysis-eda-tools-21680
    Explore at:
    doc, pdf, pptAvailable download formats
    Dataset updated
    Feb 12, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global Exploratory Data Analysis (EDA) Tools market is anticipated to experience significant growth in the coming years, driven by the increasing adoption of data-driven decision-making and the growing need for efficient data exploration and analysis. The market size is valued at USD XX million in 2025 and is projected to reach USD XX million by 2033, registering a CAGR of XX% during the forecast period. The increasing complexity and volume of data generated by businesses and organizations have necessitated the use of advanced data analysis tools to derive meaningful insights and make informed decisions. Key trends driving the market include the rising adoption of AI and machine learning technologies, the growing need for self-service data analytics, and the increasing emphasis on data visualization and storytelling. Non-graphical EDA tools are gaining traction due to their ability to handle large and complex datasets. Graphical EDA tools are preferred for their intuitive and interactive user interfaces that simplify data exploration. Large enterprises are major consumers of EDA tools as they have large volumes of data to analyze. SMEs are also increasingly adopting EDA tools as they realize the importance of data-driven insights for business growth. The North American region holds a significant market share due to the presence of established technology companies and a high adoption rate of data analytics solutions. The Asia Pacific region is expected to witness substantial growth due to the rising number of businesses and organizations in emerging economies.

  5. E

    Exploratory Data Analysis (EDA) Tools Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Nov 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Exploratory Data Analysis (EDA) Tools Report [Dataset]. https://www.datainsightsmarket.com/reports/exploratory-data-analysis-eda-tools-532159
    Explore at:
    pdf, ppt, docAvailable download formats
    Dataset updated
    Nov 7, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    Explore the booming Exploratory Data Analysis (EDA) Tools market, projected to reach $10.5 billion by 2025 with a 12.5% CAGR. Discover key drivers, trends, and market share for large enterprises, SMEs, graphical & non-graphical tools across North America, Europe, APAC, and more.

  6. Walmart Data Analysis and Forcasting

    • kaggle.com
    zip
    Updated Apr 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amit Kumar Sahu (2023). Walmart Data Analysis and Forcasting [Dataset]. https://www.kaggle.com/datasets/asahu40/walmart-data-analysis-and-forcasting/code
    Explore at:
    zip(125153 bytes)Available download formats
    Dataset updated
    Apr 26, 2023
    Authors
    Amit Kumar Sahu
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    A retail store that has multiple outlets across the country are facing issues in managing the inventory - to match the demand with respect to supply. You are a data scientist, who has to come up with useful insights using the data and make prediction models to forecast the sales for X number of months/years.

  7. BI intro to data cleaning eda and machine learning

    • kaggle.com
    zip
    Updated Sep 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ashish Sharma23DLN (2025). BI intro to data cleaning eda and machine learning [Dataset]. https://www.kaggle.com/datasets/ashishsharma23dln/bi-intro-to-data-cleaning-eda-and-machine-learning
    Explore at:
    zip(301595 bytes)Available download formats
    Dataset updated
    Sep 16, 2025
    Authors
    Ashish Sharma23DLN
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Ashish Sharma23DLN

    Released under Apache 2.0

    Contents

  8. EDA on Cleaned Netflix Data

    • kaggle.com
    zip
    Updated Jul 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nikhil raman K (2025). EDA on Cleaned Netflix Data [Dataset]. https://www.kaggle.com/datasets/nikhilramank/eda-on-cleaned-netflix-data
    Explore at:
    zip(110806 bytes)Available download formats
    Dataset updated
    Jul 7, 2025
    Authors
    Nikhil raman K
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This is a cleaned version of a Netflix movies dataset originally used for exploratory data analysis (EDA). The dataset contains information such as:

    • Title
    • Release Year
    • Rating
    • Genre
    • Votes
    • Description
    • Stars

    Missing values have been handled using appropriate methods (mean, median, unknown), and new features like rating_level and popular have been added for deeper analysis.

    The dataset is ready for: - EDA - Data visualization - Machine learning tasks - Dashboard building

    Used in the accompanying notebook

  9. G

    EDA with AI Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Aug 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). EDA with AI Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/eda-with-ai-market
    Explore at:
    pdf, pptx, csvAvailable download formats
    Dataset updated
    Aug 29, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    EDA with AI Market Outlook



    According to our latest research, the global EDA with AI market size reached USD 7.9 billion in 2024, reflecting robust demand for advanced automation in electronic design automation (EDA) powered by artificial intelligence. The sector is experiencing a strong compound annual growth rate (CAGR) of 18.2% from 2025 to 2033. By the end of 2033, the market is forecasted to reach USD 37.2 billion, driven by the increasing complexity of semiconductor devices, rapid growth in AI-enabled chip design, and the need for faster, more efficient design cycles. These advancements are further supported by the proliferation of IoT devices and the expansion of high-performance computing, which are contributing significantly to the marketÂ’s expansion as per our latest research.




    One of the primary growth factors for the EDA with AI market is the escalating complexity of semiconductor designs, which demands more sophisticated solutions for verification, simulation, and optimization. Traditional EDA tools are struggling to keep pace with the miniaturization of nodes and the integration of multi-billion transistor chips. AI-powered EDA solutions are revolutionizing the industry by automating complex tasks such as floorplanning, routing, and verification, significantly reducing time-to-market and design errors. These AI-driven tools are also enabling predictive analytics and intelligent optimization, allowing design teams to anticipate bottlenecks and improve overall productivity. As chipmakers race to develop next-generation processors for applications like autonomous vehicles, 5G, and quantum computing, the adoption of AI-enhanced EDA tools is accelerating across the globe.




    Another critical growth driver is the increasing adoption of AI and machine learning across various industries, which is fueling demand for specialized hardware and custom chipsets. This trend is particularly evident in sectors such as automotive, healthcare, and consumer electronics, where smart devices and advanced driver-assistance systems (ADAS) require highly reliable and efficient silicon. The integration of AI into EDA workflows is not only improving design accuracy but also facilitating the development of application-specific integrated circuits (ASICs) and system-on-chip (SoC) solutions. Furthermore, the shift towards cloud-based EDA platforms is democratizing access to advanced design tools, enabling startups and small enterprises to compete alongside established industry players. As a result, the ecosystem for EDA with AI is becoming more vibrant and inclusive, spurring innovation at an unprecedented pace.




    The third major growth factor lies in the convergence of EDA with AI and emerging technologies such as the Internet of Things (IoT), edge computing, and 5G communications. The proliferation of connected devices is driving the need for power-efficient, high-performance chips capable of real-time data processing. AI-driven EDA solutions are uniquely positioned to address these requirements by optimizing designs for power, performance, and area (PPA) metrics. Additionally, the use of AI in verification and simulation is reducing the incidence of costly design respins, thereby lowering overall development costs. Strategic collaborations between EDA vendors, semiconductor foundries, and cloud service providers are further enhancing the capabilities of AI-powered design tools, paving the way for the next wave of semiconductor innovation.



    EDA Software plays a crucial role in the burgeoning EDA with AI market, as it forms the backbone of the design automation process. These software solutions are essential for managing the increasing complexity of chip designs, offering tools that automate routine tasks, enhance simulation accuracy, and enable predictive analytics. As the demand for custom and complex chips grows, the reliance on advanced EDA software will only intensify. The software's ability to incorporate machine learning algorithms that learn from historical design data, optimize layouts, and minimize errors is pivotal in maintaining competitive advantage in the fast-evolving semiconductor industry. As such, EDA Software is not just a tool but a strategic asset that drives innovation and efficiency in electronic design.




    From a regional perspective, Asia Pacific continues to dominate the EDA with AI market, accounting f

  10. D

    EDA With AI Market Research Report 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Oct 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). EDA With AI Market Research Report 2033 [Dataset]. https://dataintelo.com/report/eda-with-ai-market
    Explore at:
    pdf, pptx, csvAvailable download formats
    Dataset updated
    Oct 1, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    EDA with AI Market Outlook



    According to our latest research, the global EDA with AI market size reached USD 7.9 billion in 2024, demonstrating robust expansion fueled by the integration of artificial intelligence in electronic design automation. The market is poised for remarkable growth, projected to achieve USD 27.6 billion by 2033, progressing at a compelling 15.1% CAGR during the forecast period. This growth is primarily driven by the escalating demand for faster and more efficient semiconductor and PCB design, the proliferation of AI-powered verification tools, and the increasing complexity of electronic systems across industries.



    One of the foremost growth drivers for the EDA with AI market is the surging complexity and miniaturization of semiconductor devices, which has rendered traditional design methodologies insufficient. AI-driven EDA tools are revolutionizing the design landscape by automating intricate processes such as placement, routing, and verification, significantly reducing time-to-market and error rates. The adoption of AI in EDA is enabling design teams to manage the exponential increase in data and design rules, especially as the industry moves towards advanced nodes like 5nm and beyond. The integration of machine learning algorithms within EDA platforms empowers engineers to predict design bottlenecks, optimize layouts, and enhance overall chip performance, thereby fueling market growth.



    Another significant factor contributing to the expansion of the EDA with AI market is the rapid evolution of end-user industries, particularly automotive, consumer electronics, and telecommunications. The automotive sector, for instance, is witnessing a dramatic transformation with the advent of electric vehicles and autonomous driving technologies, both of which demand highly sophisticated and reliable electronic systems. AI-augmented EDA solutions are instrumental in meeting these stringent requirements by facilitating advanced verification, functional safety analysis, and design optimization. Similarly, the consumer electronics industry’s relentless pursuit of innovation and faster product cycles is driving the adoption of AI-powered EDA tools to streamline design processes and ensure first-pass success.



    Furthermore, the increasing adoption of cloud-based EDA solutions is amplifying the market’s growth trajectory. Cloud deployment not only provides scalability and accessibility but also enables collaborative design environments, which are essential for geographically dispersed engineering teams. AI integration in cloud EDA platforms enhances computational efficiency and accelerates simulation and verification tasks, making it easier for organizations to handle large-scale, complex designs. The shift towards cloud-based EDA is also lowering the entry barrier for small and medium enterprises, democratizing access to advanced design tools and fostering innovation across the ecosystem.



    Regionally, Asia Pacific stands out as the dominant market, driven by the presence of major semiconductor manufacturing hubs in China, Taiwan, South Korea, and Japan. North America follows closely, benefiting from a strong base of EDA technology providers and early adopters in sectors like aerospace, defense, and healthcare. Europe is also witnessing significant growth, particularly in automotive and industrial applications. The Middle East & Africa and Latin America are emerging markets, gradually increasing their investments in semiconductor design and AI-driven automation. This regional dynamism is expected to intensify competition and spur further advancements in the global EDA with AI market.



    Component Analysis



    The EDA with AI market is segmented by component into software, hardware, and services, each playing a pivotal role in the overall ecosystem. The software segment dominates the market, accounting for the largest share in 2024, primarily due to the increasing adoption of AI-enabled design and verification tools. AI-powered software platforms are transforming traditional EDA workflows by automating repetitive and complex tasks, such as synthesis, simulation, and timing analysis. These platforms leverage advanced algorithms and neural networks to enhance design accuracy, optimize resource utilization, and accelerate project timelines. The continuous innovation in software capabilities, including the integration of natural language processing and generative AI, is further solidifying the segment’s leadership.



    The h

  11. f

    Detailed characterization of the dataset.

    • figshare.com
    xls
    Updated Sep 26, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rodrigo Gutiérrez Benítez; Alejandra Segura Navarrete; Christian Vidal-Castro; Claudia Martínez-Araneda (2024). Detailed characterization of the dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0310707.t006
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Sep 26, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Rodrigo Gutiérrez Benítez; Alejandra Segura Navarrete; Christian Vidal-Castro; Claudia Martínez-Araneda
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Over the last ten years, social media has become a crucial data source for businesses and researchers, providing a space where people can express their opinions and emotions. To analyze this data and classify emotions and their polarity in texts, natural language processing (NLP) techniques such as emotion analysis (EA) and sentiment analysis (SA) are employed. However, the effectiveness of these tasks using machine learning (ML) and deep learning (DL) methods depends on large labeled datasets, which are scarce in languages like Spanish. To address this challenge, researchers use data augmentation (DA) techniques to artificially expand small datasets. This study aims to investigate whether DA techniques can improve classification results using ML and DL algorithms for sentiment and emotion analysis of Spanish texts. Various text manipulation techniques were applied, including transformations, paraphrasing (back-translation), and text generation using generative adversarial networks, to small datasets such as song lyrics, social media comments, headlines from national newspapers in Chile, and survey responses from higher education students. The findings show that the Convolutional Neural Network (CNN) classifier achieved the most significant improvement, with an 18% increase using the Generative Adversarial Networks for Sentiment Text (SentiGan) on the Aggressiveness (Seriousness) dataset. Additionally, the same classifier model showed an 11% improvement using the Easy Data Augmentation (EDA) on the Gender-Based Violence dataset. The performance of the Bidirectional Encoder Representations from Transformers (BETO) also improved by 10% on the back-translation augmented version of the October 18 dataset, and by 4% on the EDA augmented version of the Teaching survey dataset. These results suggest that data augmentation techniques enhance performance by transforming text and adapting it to the specific characteristics of the dataset. Through experimentation with various augmentation techniques, this research provides valuable insights into the analysis of subjectivity in Spanish texts and offers guidance for selecting algorithms and techniques based on dataset features.

  12. Orange dataset table

    • figshare.com
    xlsx
    Updated Mar 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rui Simões (2022). Orange dataset table [Dataset]. http://doi.org/10.6084/m9.figshare.19146410.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Mar 4, 2022
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Rui Simões
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The complete dataset used in the analysis comprises 36 samples, each described by 11 numeric features and 1 target. The attributes considered were caspase 3/7 activity, Mitotracker red CMXRos area and intensity (3 h and 24 h incubations with both compounds), Mitosox oxidation (3 h incubation with the referred compounds) and oxidation rate, DCFDA fluorescence (3 h and 24 h incubations with either compound) and oxidation rate, and DQ BSA hydrolysis. The target of each instance corresponds to one of the 9 possible classes (4 samples per class): Control, 6.25, 12.5, 25 and 50 µM for 6-OHDA and 0.03, 0.06, 0.125 and 0.25 µM for rotenone. The dataset is balanced, it does not contain any missing values and data was standardized across features. The small number of samples prevented a full and strong statistical analysis of the results. Nevertheless, it allowed the identification of relevant hidden patterns and trends.

    Exploratory data analysis, information gain, hierarchical clustering, and supervised predictive modeling were performed using Orange Data Mining version 3.25.1 [41]. Hierarchical clustering was performed using the Euclidean distance metric and weighted linkage. Cluster maps were plotted to relate the features with higher mutual information (in rows) with instances (in columns), with the color of each cell representing the normalized level of a particular feature in a specific instance. The information is grouped both in rows and in columns by a two-way hierarchical clustering method using the Euclidean distances and average linkage. Stratified cross-validation was used to train the supervised decision tree. A set of preliminary empirical experiments were performed to choose the best parameters for each algorithm, and we verified that, within moderate variations, there were no significant changes in the outcome. The following settings were adopted for the decision tree algorithm: minimum number of samples in leaves: 2; minimum number of samples required to split an internal node: 5; stop splitting when majority reaches: 95%; criterion: gain ratio. The performance of the supervised model was assessed using accuracy, precision, recall, F-measure and area under the ROC curve (AUC) metrics.

  13. h

    diamonds-eda-aurele

    • huggingface.co
    Updated Nov 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aurele Schwalb (2025). diamonds-eda-aurele [Dataset]. https://huggingface.co/datasets/aurele1/diamonds-eda-aurele
    Explore at:
    Dataset updated
    Nov 17, 2025
    Authors
    Aurele Schwalb
    Description

    README - Diamonds Dataset & EDA Analysis

    Dataset Overview + Data Cleaning: The diamonds dataset contains detailed information on 53,940 diamonds, including both numerical and categorical features commonly used in the gemstone and jewelry industry. It is widely used in data analysis and machine learning to study how characteristics such as carat, cut, color, clarity, depth, table, and physical dimensions (x, y, z) influence price. To prepare the dataset for analysis, I first verified… See the full description on the dataset page: https://huggingface.co/datasets/aurele1/diamonds-eda-aurele.

  14. Cars93

    • kaggle.com
    zip
    Updated Sep 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yashpal (2022). Cars93 [Dataset]. https://www.kaggle.com/datasets/yashpaloswal/cars93/discussion
    Explore at:
    zip(4879 bytes)Available download formats
    Dataset updated
    Sep 16, 2022
    Authors
    Yashpal
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    Content:- The file contains basic cars details

    Goal:- You can do multiple things using this dataset such as 1. Missing data treatment 2. Various Pandas operations (to learn; the basic concepts) 3. EDA 4. You can choose to run any machine learning algorithm, considering any features and any label.

    The basic purpose of this dataset is to get started in the field of data science and machine learning.

  15. E

    Electronic Design Automation (EDA) Software Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Aug 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Electronic Design Automation (EDA) Software Report [Dataset]. https://www.datainsightsmarket.com/reports/electronic-design-automation-eda-software-1940656
    Explore at:
    ppt, doc, pdfAvailable download formats
    Dataset updated
    Aug 22, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Electronic Design Automation (EDA) software market is booming, projected to reach [estimated 2033 market size] by 2033, growing at a CAGR of 5%. Discover key trends, drivers, restraints, and leading companies shaping this dynamic industry. Explore market segmentation and regional insights for informed strategic decision-making.

  16. Z

    Data Analysis for the Systematic Literature Review of DL4SE

    • data.niaid.nih.gov
    • data-staging.niaid.nih.gov
    Updated Jul 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cody Watson; Nathan Cooper; David Nader; Kevin Moran; Denys Poshyvanyk (2024). Data Analysis for the Systematic Literature Review of DL4SE [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4768586
    Explore at:
    Dataset updated
    Jul 19, 2024
    Dataset provided by
    College of William and Mary
    Washington and Lee University
    Authors
    Cody Watson; Nathan Cooper; David Nader; Kevin Moran; Denys Poshyvanyk
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data Analysis is the process that supports decision-making and informs arguments in empirical studies. Descriptive statistics, Exploratory Data Analysis (EDA), and Confirmatory Data Analysis (CDA) are the approaches that compose Data Analysis (Xia & Gong; 2014). An Exploratory Data Analysis (EDA) comprises a set of statistical and data mining procedures to describe data. We ran EDA to provide statistical facts and inform conclusions. The mined facts allow attaining arguments that would influence the Systematic Literature Review of DL4SE.

    The Systematic Literature Review of DL4SE requires formal statistical modeling to refine the answers for the proposed research questions and formulate new hypotheses to be addressed in the future. Hence, we introduce DL4SE-DA, a set of statistical processes and data mining pipelines that uncover hidden relationships among Deep Learning reported literature in Software Engineering. Such hidden relationships are collected and analyzed to illustrate the state-of-the-art of DL techniques employed in the software engineering context.

    Our DL4SE-DA is a simplified version of the classical Knowledge Discovery in Databases, or KDD (Fayyad, et al; 1996). The KDD process extracts knowledge from a DL4SE structured database. This structured database was the product of multiple iterations of data gathering and collection from the inspected literature. The KDD involves five stages:

    Selection. This stage was led by the taxonomy process explained in section xx of the paper. After collecting all the papers and creating the taxonomies, we organize the data into 35 features or attributes that you find in the repository. In fact, we manually engineered features from the DL4SE papers. Some of the features are venue, year published, type of paper, metrics, data-scale, type of tuning, learning algorithm, SE data, and so on.

    Preprocessing. The preprocessing applied was transforming the features into the correct type (nominal), removing outliers (papers that do not belong to the DL4SE), and re-inspecting the papers to extract missing information produced by the normalization process. For instance, we normalize the feature “metrics” into “MRR”, “ROC or AUC”, “BLEU Score”, “Accuracy”, “Precision”, “Recall”, “F1 Measure”, and “Other Metrics”. “Other Metrics” refers to unconventional metrics found during the extraction. Similarly, the same normalization was applied to other features like “SE Data” and “Reproducibility Types”. This separation into more detailed classes contributes to a better understanding and classification of the paper by the data mining tasks or methods.

    Transformation. In this stage, we omitted to use any data transformation method except for the clustering analysis. We performed a Principal Component Analysis to reduce 35 features into 2 components for visualization purposes. Furthermore, PCA also allowed us to identify the number of clusters that exhibit the maximum reduction in variance. In other words, it helped us to identify the number of clusters to be used when tuning the explainable models.

    Data Mining. In this stage, we used three distinct data mining tasks: Correlation Analysis, Association Rule Learning, and Clustering. We decided that the goal of the KDD process should be oriented to uncover hidden relationships on the extracted features (Correlations and Association Rules) and to categorize the DL4SE papers for a better segmentation of the state-of-the-art (Clustering). A clear explanation is provided in the subsection “Data Mining Tasks for the SLR od DL4SE”. 5.Interpretation/Evaluation. We used the Knowledge Discover to automatically find patterns in our papers that resemble “actionable knowledge”. This actionable knowledge was generated by conducting a reasoning process on the data mining outcomes. This reasoning process produces an argument support analysis (see this link).

    We used RapidMiner as our software tool to conduct the data analysis. The procedures and pipelines were published in our repository.

    Overview of the most meaningful Association Rules. Rectangles are both Premises and Conclusions. An arrow connecting a Premise with a Conclusion implies that given some premise, the conclusion is associated. E.g., Given that an author used Supervised Learning, we can conclude that their approach is irreproducible with a certain Support and Confidence.

    Support = Number of occurrences this statement is true divided by the amount of statements Confidence = The support of the statement divided by the number of occurrences of the premise

  17. E

    EDA Tools Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Feb 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archive Market Research (2025). EDA Tools Report [Dataset]. https://www.archivemarketresearch.com/reports/eda-tools-15253
    Explore at:
    ppt, pdf, docAvailable download formats
    Dataset updated
    Feb 9, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    Market Overview and Key Trends The Electronic Design Automation (EDA) tools market is projected to grow from USD 12.7 billion in 2025 to USD 24.3 billion by 2033, recording a CAGR of 8.2% during the forecast period. The growth is driven by factors such as the increasing complexity of integrated circuits (ICs), rising adoption of artificial intelligence (AI) and machine learning (ML) in chip design, and growing demand for electronic devices in various industries. Key trends in the market include the shift towards cloud-based EDA tools, the adoption of advanced packaging technologies, and the integration of EDA tools with AI and ML. Market Segments and Regional Analysis Based on type, the CAE segment held the largest market share in 2025, and it is expected to continue its dominance during the forecast period. By application, the electronics and manufacturing segment is expected to witness the highest growth, driven by the increasing demand for electronic devices in various industries. Regionally, North America is expected to remain the largest market, followed by Asia Pacific. The growth in Asia Pacific is attributed to the rising electronics and manufacturing industries in the region. This report provides a comprehensive overview of the global EDA tools market, with a focus on key trends, challenges, and growth drivers. It offers detailed market segmentation, regional insights, and profiles of leading players in the industry.

  18. G

    AI-Driven EDA Tool Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Aug 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). AI-Driven EDA Tool Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/ai-driven-eda-tool-market
    Explore at:
    pptx, pdf, csvAvailable download formats
    Dataset updated
    Aug 29, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    AI-Driven EDA Tool Market Outlook



    According to our latest research, the global AI-driven EDA tool market size reached USD 1.92 billion in 2024, registering a robust year-on-year growth. The market is expected to expand at a CAGR of 21.7% during the forecast period, reaching a projected value of USD 13.23 billion by 2033. This significant growth is primarily attributed to the increasing complexity of semiconductor designs, rapid adoption of advanced electronics across industries, and the integration of artificial intelligence into electronic design automation (EDA) workflows, which are revolutionizing design efficiency and time-to-market for new products.




    The AI-driven EDA tool market is propelled by several compelling growth factors. One of the most notable is the escalating demand for miniaturized and highly complex integrated circuits, especially in the domains of consumer electronics, automotive, and industrial automation. As the number of transistors on a chip continues to rise, traditional EDA tools struggle to keep pace with the intricacies of modern design. AI-powered EDA solutions can automate repetitive tasks, optimize design parameters, and predict potential design flaws early in the process, thereby reducing development cycles and improving overall productivity. This technological leap is particularly critical as industries race to deliver next-generation products with enhanced functionality and lower power consumption.




    Another significant growth driver for the AI-driven EDA tool market is the surging adoption of cloud-based deployment models. Cloud-based EDA platforms offer unparalleled scalability, collaboration, and accessibility, enabling design teams to work seamlessly across geographies and time zones. The integration of AI with cloud infrastructure allows for real-time data analysis, faster simulation, and accelerated verification processes. This is especially valuable for startups and small to medium enterprises (SMEs) that may lack the resources to invest in expensive on-premises solutions. Furthermore, cloud-based AI-driven EDA tools support agile development methodologies, allowing companies to rapidly iterate and innovate in response to market demands.




    The proliferation of AI and machine learning in the EDA ecosystem is also being fueled by the increasing investments from both public and private sectors. Governments worldwide are recognizing the strategic importance of semiconductor self-sufficiency and are pouring resources into research and development initiatives. Simultaneously, venture capital funding in AI-driven EDA startups has surged, fostering innovation and accelerating the commercialization of cutting-edge solutions. These investments are not only enhancing the capabilities of EDA tools but are also driving down costs, making advanced design automation accessible to a broader spectrum of end-users across various industries.



    In this evolving landscape, the introduction of the Carbon-Aware EDA Compute Scheduler is a noteworthy development. This innovative scheduler is designed to optimize the energy consumption of EDA workflows by dynamically adjusting compute resources based on carbon intensity signals. By doing so, it not only reduces the carbon footprint of electronic design processes but also aligns with the growing emphasis on sustainability in the tech industry. As companies strive to meet environmental regulations and corporate sustainability goals, the Carbon-Aware EDA Compute Scheduler offers a practical solution to minimize energy usage while maintaining high performance. This advancement is particularly relevant as the demand for eco-friendly technologies continues to rise, making it a significant consideration for organizations looking to enhance their green credentials.




    From a regional perspective, Asia Pacific continues to dominate the AI-driven EDA tool market, accounting for the largest share in 2024, followed closely by North America and Europe. The region's leadership is underpinned by its robust electronics manufacturing ecosystem, particularly in countries like China, Taiwan, South Korea, and Japan. The presence of leading semiconductor foundries and a thriving consumer electronics sector creates a fertile ground for the adoption of AI-driven EDA solutions. Meanwhile, North America remains a hotbed for innovation, with significant contributions from te

  19. E

    Electronic Design Automation (EDA) Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Mar 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archive Market Research (2025). Electronic Design Automation (EDA) Report [Dataset]. https://www.archivemarketresearch.com/reports/electronic-design-automation-eda-57538
    Explore at:
    pdf, doc, pptAvailable download formats
    Dataset updated
    Mar 14, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Electronic Design Automation (EDA) market is experiencing robust growth, driven by the increasing complexity of electronic systems and the rising demand for advanced functionalities across various industries. The market size in 2025 is estimated at $843.9 million. While the provided CAGR is missing, considering the strong industry trends towards miniaturization, increased integration, and the rise of AI in design processes, a conservative estimate of a 7% Compound Annual Growth Rate (CAGR) from 2025 to 2033 seems plausible. This would project a market value exceeding $1.5 billion by 2033. Key drivers include the proliferation of 5G and IoT devices requiring sophisticated EDA tools for efficient design and verification, alongside advancements in Artificial Intelligence (AI) and Machine Learning (ML) enhancing automation capabilities within the design process. Emerging trends such as system-on-chip (SoC) design, the growing adoption of cloud-based EDA solutions, and the increasing focus on verification and validation methodologies contribute to the market's expansion. However, challenges such as the high cost of EDA software and the need for skilled professionals to operate these complex tools represent potential restraints to market growth. The EDA market's segmentation reveals significant opportunities across various applications. The Aerospace & Defense, Automotive, and Telecommunications sectors are major contributors due to the increasing need for highly reliable and sophisticated electronic systems. Within the segments, Computer Aided Engineering (CAE) and IC Physical Design & Verification are expected to dominate, driven by the continuous push for higher performance and integration density in chips and systems. Key players like Cadence Design Systems, Siemens (Mentor Graphics), and Synopsys are actively shaping market trends through innovations in software and services. The geographic distribution of the market reveals strong growth across North America and Asia Pacific, fueled by technological advancements and robust manufacturing capabilities in these regions. Overall, the EDA market presents a compelling investment opportunity for companies that can provide innovative solutions to meet the increasing demands of the electronics industry.

  20. D

    EDA for Electronics Market Report | Global Forecast From 2025 To 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Jan 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). EDA for Electronics Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/global-eda-for-electronics-market
    Explore at:
    pdf, csv, pptxAvailable download formats
    Dataset updated
    Jan 7, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    EDA for Electronics Market Outlook



    The global market size for Electronic Design Automation (EDA) for Electronics was valued at approximately $12 billion in 2023 and is projected to reach around $28 billion by 2032, exhibiting a robust CAGR of 10.2% over the forecast period. This growth is predominantly driven by the increasing complexity of electronic devices and the escalating need for innovative design solutions.



    One of the primary growth factors driving the EDA for Electronics market is the rising demand for more sophisticated and efficient electronic devices. As consumer demand for high-performance gadgets continues to escalate, manufacturers are compelled to adopt advanced EDA tools to streamline the design and development process. The proliferation of smart devices, wearables, and IoT applications has necessitated the adoption of EDA solutions to manage the intricate design requirements, thereby fueling the market growth.



    Moreover, the increasing investments in the semiconductor industry play a pivotal role in propelling the EDA market. As the semiconductor sector evolves, there is a substantial need for precision and accuracy in chip design, which is achievable through advanced EDA software and tools. Governments and private enterprises are heavily investing in semiconductor R&D to stay ahead in the technology race, further bolstering the market for EDA solutions.



    Another significant growth driver is the trend towards automation and the adoption of AI and machine learning in EDA tools. These technologies enhance the capabilities of EDA software, enabling faster and more reliable designs. Companies are increasingly leveraging AI to predict potential design flaws and optimize the design process, resulting in cost savings and reduced time-to-market. This technological advancement is anticipated to significantly contribute to the market's expansion over the forecast period.



    In terms of regional outlook, Asia Pacific dominates the EDA for Electronics market, driven by the presence of a robust semiconductor manufacturing base in countries like China, South Korea, and Taiwan. North America and Europe also hold substantial market shares due to their strong technological infrastructure and significant investments in R&D. These regions exhibit a high adoption rate of advanced EDA solutions, further contributing to market growth.



    Component Analysis



    The EDA for Electronics market is broadly segmented into Software, Hardware, and Services. The Software segment holds the largest market share, driven by the high adoption rate of EDA software tools in the semiconductor and electronics industries. These tools are essential for designing complex integrated circuits and printed circuit boards, which are fundamental to modern electronics. The continuous advancements in EDA software, such as the integration of AI and machine learning, are further expanding the capabilities and efficiency of these tools, thereby driving their demand.



    The Aerospace and Defense sector is increasingly recognizing the importance of EDA in Aerospace and Defense for designing complex electronic systems that must meet stringent performance and reliability standards. EDA tools are crucial in this industry for developing avionics, radar systems, and communication devices that operate under extreme conditions. The integration of EDA solutions in aerospace and defense projects ensures that electronic systems are not only efficient but also adhere to the rigorous safety and certification requirements. As the demand for advanced defense technologies and next-generation aircraft grows, the role of EDA in Aerospace and Defense becomes even more critical, driving innovation and enhancing the capabilities of electronic systems in these sectors.



    The Hardware segment is also witnessing significant growth, primarily due to the increasing demand for specialized hardware to support the complex computations required in electronic design. This includes high-performance computing systems and specialized processors that can handle the intensive computational tasks involved in EDA. With the rise of advanced technologies like 5G and autonomous vehicles, the need for sophisticated hardware solutions in EDA is expected to grow, contributing to the overall market expansion.



    Services, encompassing consulting, maintenance, and support, are another crucial component of the EDA market. As electronic designs b

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Victoria Da Poian; Bethany Theiling; Lily Clough; Brett McKinney; Jonathan Major; Jingyi Chen; Sarah Hörst (2023). DataSheet1_Exploratory data analysis (EDA) machine learning approaches for ocean world analog mass spectrometry.docx [Dataset]. http://doi.org/10.3389/fspas.2023.1134141.s001

DataSheet1_Exploratory data analysis (EDA) machine learning approaches for ocean world analog mass spectrometry.docx

Related Article
Explore at:
docxAvailable download formats
Dataset updated
May 31, 2023
Dataset provided by
Frontiers
Authors
Victoria Da Poian; Bethany Theiling; Lily Clough; Brett McKinney; Jonathan Major; Jingyi Chen; Sarah Hörst
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Area covered
World
Description

Many upcoming and proposed missions to ocean worlds such as Europa, Enceladus, and Titan aim to evaluate their habitability and the existence of potential life on these moons. These missions will suffer from communication challenges and technology limitations. We review and investigate the applicability of data science and unsupervised machine learning (ML) techniques on isotope ratio mass spectrometry data (IRMS) from volatile laboratory analogs of Europa and Enceladus seawaters as a case study for development of new strategies for icy ocean world missions. Our driving science goal is to determine whether the mass spectra of volatile gases could contain information about the composition of the seawater and potential biosignatures. We implement data science and ML techniques to investigate what inherent information the spectra contain and determine whether a data science pipeline could be designed to quickly analyze data from future ocean worlds missions. In this study, we focus on the exploratory data analysis (EDA) step in the analytics pipeline. This is a crucial unsupervised learning step that allows us to understand the data in depth before subsequent steps such as predictive/supervised learning. EDA identifies and characterizes recurring patterns, significant correlation structure, and helps determine which variables are redundant and which contribute to significant variation in the lower dimensional space. In addition, EDA helps to identify irregularities such as outliers that might be due to poor data quality. We compared dimensionality reduction methods Uniform Manifold Approximation and Projection (UMAP) and Principal Component Analysis (PCA) for transforming our data from a high-dimensional space to a lower dimension, and we compared clustering algorithms for identifying data-driven groups (“clusters”) in the ocean worlds analog IRMS data and mapping these clusters to experimental conditions such as seawater composition and CO2 concentration. Such data analysis and characterization efforts are the first steps toward the longer-term science autonomy goal where similar automated ML tools could be used onboard a spacecraft to prioritize data transmissions for bandwidth-limited outer Solar System missions.

Search
Clear search
Close search
Google apps
Main menu