100+ datasets found
  1. f

    Orange dataset table

    • figshare.com
    xlsx
    Updated Mar 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rui Simões (2022). Orange dataset table [Dataset]. http://doi.org/10.6084/m9.figshare.19146410.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Mar 4, 2022
    Dataset provided by
    figshare
    Authors
    Rui Simões
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The complete dataset used in the analysis comprises 36 samples, each described by 11 numeric features and 1 target. The attributes considered were caspase 3/7 activity, Mitotracker red CMXRos area and intensity (3 h and 24 h incubations with both compounds), Mitosox oxidation (3 h incubation with the referred compounds) and oxidation rate, DCFDA fluorescence (3 h and 24 h incubations with either compound) and oxidation rate, and DQ BSA hydrolysis. The target of each instance corresponds to one of the 9 possible classes (4 samples per class): Control, 6.25, 12.5, 25 and 50 µM for 6-OHDA and 0.03, 0.06, 0.125 and 0.25 µM for rotenone. The dataset is balanced, it does not contain any missing values and data was standardized across features. The small number of samples prevented a full and strong statistical analysis of the results. Nevertheless, it allowed the identification of relevant hidden patterns and trends.

    Exploratory data analysis, information gain, hierarchical clustering, and supervised predictive modeling were performed using Orange Data Mining version 3.25.1 [41]. Hierarchical clustering was performed using the Euclidean distance metric and weighted linkage. Cluster maps were plotted to relate the features with higher mutual information (in rows) with instances (in columns), with the color of each cell representing the normalized level of a particular feature in a specific instance. The information is grouped both in rows and in columns by a two-way hierarchical clustering method using the Euclidean distances and average linkage. Stratified cross-validation was used to train the supervised decision tree. A set of preliminary empirical experiments were performed to choose the best parameters for each algorithm, and we verified that, within moderate variations, there were no significant changes in the outcome. The following settings were adopted for the decision tree algorithm: minimum number of samples in leaves: 2; minimum number of samples required to split an internal node: 5; stop splitting when majority reaches: 95%; criterion: gain ratio. The performance of the supervised model was assessed using accuracy, precision, recall, F-measure and area under the ROC curve (AUC) metrics.

  2. R

    Rock Analysis Dataset

    • universe.roboflow.com
    zip
    Updated Aug 21, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Atharva Desai (2023). Rock Analysis Dataset [Dataset]. https://universe.roboflow.com/atharva-desai-x1nvf/rock-analysis/dataset/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Aug 21, 2023
    Dataset authored and provided by
    Atharva Desai
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Rock Bounding Boxes
    Description

    Rock Analysis

    ## Overview
    
    Rock Analysis is a dataset for object detection tasks - it contains Rock annotations for 300 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  3. Data Analytic Market Size, Share, Trends & Insights Report, 2035

    • rootsanalysis.com
    Updated Dec 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Roots Analysis (2024). Data Analytic Market Size, Share, Trends & Insights Report, 2035 [Dataset]. https://www.rootsanalysis.com/data-analytics-market
    Explore at:
    Dataset updated
    Dec 20, 2024
    Dataset provided by
    Authors
    Roots Analysis
    License

    https://www.rootsanalysis.com/privacy.htmlhttps://www.rootsanalysis.com/privacy.html

    Time period covered
    2021 - 2031
    Area covered
    Global
    Description

    The data analytic market size is projected to grow from USD 69.40 billion in the current year to USD 877.12 billion by 2035, representing a CAGR of 25.93%, during the forecast period till 2035.

  4. R

    Face Emotion Analysis Dataset

    • universe.roboflow.com
    zip
    Updated Jul 3, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    afssfasff (2025). Face Emotion Analysis Dataset [Dataset]. https://universe.roboflow.com/afssfasff/face-emotion-analysis-1cndy
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 3, 2025
    Dataset authored and provided by
    afssfasff
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Face
    Description

    Face Emotion Analysis

    ## Overview
    
    Face Emotion Analysis is a dataset for classification tasks - it contains Face annotations for 9,961 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  5. C

    Housing Market Value Analysis 2021

    • data.wprdc.org
    • gimi9.com
    • +1more
    geojson, html, pdf +2
    Updated Jul 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Allegheny County (2025). Housing Market Value Analysis 2021 [Dataset]. https://data.wprdc.org/dataset/market-value-analysis-2021
    Explore at:
    xlsx(22669), html, zip(1996574), pdf(28782887), zip(2039140), pdf(881980), geojson(10301172)Available download formats
    Dataset updated
    Jul 8, 2025
    Dataset provided by
    Allegheny County
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    In 2021, Allegheny County Economic Development (ACED), in partnership with Urban Redevelopment Authority of Pittsburgh(URA), completed the a Market Value Analysis (MVA) for Allegheny County. This analysis services as both an update to previous MVA’s commissioned separately by ACED and the URA and combines the MVA for the whole of Allegheny County (inclusive of the City of Pittsburgh). The MVA is a unique tool for characterizing markets because it creates an internally referenced index of a municipality’s residential real estate market. It identifies areas that are the highest demand markets as well as areas of greatest distress, and the various markets types between. The MVA offers insight into the variation in market strength and weakness within and between traditional community boundaries because it uses Census block groups as the unit of analysis. Where market types abut each other on the map becomes instructive about the potential direction of market change, and ultimately, the appropriateness of types of investment or intervention strategies.

    This MVA utilized data that helps to define the local real estate market. The data used covers the 2017-2019 period, and data used in the analysis includes:

    • Residential Real Estate Sales
    • Mortgage Foreclosures
    • Residential Vacancy
    • Parcel Year Built
    • Parcel Condition
    • Building Violations
    • Owner Occupancy
    • Subsidized Housing Units

    The MVA uses a statistical technique known as cluster analysis, forming groups of areas (i.e., block groups) that are similar along the MVA descriptors, noted above. The goal is to form groups within which there is a similarity of characteristics within each group, but each group itself different from the others. Using this technique, the MVA condenses vast amounts of data for the universe of all properties to a manageable, meaningful typology of market types that can inform area-appropriate programs and decisions regarding the allocation of resources.

    Please refer to the presentation and executive summary for more information about the data, methodology, and findings.

  6. Forex News Annotated Dataset for Sentiment Analysis

    • zenodo.org
    • paperswithcode.com
    • +1more
    csv
    Updated Nov 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Georgios Fatouros; Georgios Fatouros; Kalliopi Kouroumali; Kalliopi Kouroumali (2023). Forex News Annotated Dataset for Sentiment Analysis [Dataset]. http://doi.org/10.5281/zenodo.7976208
    Explore at:
    csvAvailable download formats
    Dataset updated
    Nov 11, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Georgios Fatouros; Georgios Fatouros; Kalliopi Kouroumali; Kalliopi Kouroumali
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains news headlines relevant to key forex pairs: AUDUSD, EURCHF, EURUSD, GBPUSD, and USDJPY. The data was extracted from reputable platforms Forex Live and FXstreet over a period of 86 days, from January to May 2023. The dataset comprises 2,291 unique news headlines. Each headline includes an associated forex pair, timestamp, source, author, URL, and the corresponding article text. Data was collected using web scraping techniques executed via a custom service on a virtual machine. This service periodically retrieves the latest news for a specified forex pair (ticker) from each platform, parsing all available information. The collected data is then processed to extract details such as the article's timestamp, author, and URL. The URL is further used to retrieve the full text of each article. This data acquisition process repeats approximately every 15 minutes.

    To ensure the reliability of the dataset, we manually annotated each headline for sentiment. Instead of solely focusing on the textual content, we ascertained sentiment based on the potential short-term impact of the headline on its corresponding forex pair. This method recognizes the currency market's acute sensitivity to economic news, which significantly influences many trading strategies. As such, this dataset could serve as an invaluable resource for fine-tuning sentiment analysis models in the financial realm.

    We used three categories for annotation: 'positive', 'negative', and 'neutral', which correspond to bullish, bearish, and hold sentiments, respectively, for the forex pair linked to each headline. The following Table provides examples of annotated headlines along with brief explanations of the assigned sentiment.

    Examples of Annotated Headlines
    
    
        Forex Pair
        Headline
        Sentiment
        Explanation
    
    
    
    
        GBPUSD 
        Diminishing bets for a move to 12400 
        Neutral
        Lack of strong sentiment in either direction
    
    
        GBPUSD 
        No reasons to dislike Cable in the very near term as long as the Dollar momentum remains soft 
        Positive
        Positive sentiment towards GBPUSD (Cable) in the near term
    
    
        GBPUSD 
        When are the UK jobs and how could they affect GBPUSD 
        Neutral
        Poses a question and does not express a clear sentiment
    
    
        JPYUSD
        Appropriate to continue monetary easing to achieve 2% inflation target with wage growth 
        Positive
        Monetary easing from Bank of Japan (BoJ) could lead to a weaker JPY in the short term due to increased money supply
    
    
        USDJPY
        Dollar rebounds despite US data. Yen gains amid lower yields 
        Neutral
        Since both the USD and JPY are gaining, the effects on the USDJPY forex pair might offset each other
    
    
        USDJPY
        USDJPY to reach 124 by Q4 as the likelihood of a BoJ policy shift should accelerate Yen gains 
        Negative
        USDJPY is expected to reach a lower value, with the USD losing value against the JPY
    
    
        AUDUSD
    
        <p>RBA Governor Lowe’s Testimony High inflation is damaging and corrosive </p>
    
        Positive
        Reserve Bank of Australia (RBA) expresses concerns about inflation. Typically, central banks combat high inflation with higher interest rates, which could strengthen AUD.
    

    Moreover, the dataset includes two columns with the predicted sentiment class and score as predicted by the FinBERT model. Specifically, the FinBERT model outputs a set of probabilities for each sentiment class (positive, negative, and neutral), representing the model's confidence in associating the input headline with each sentiment category. These probabilities are used to determine the predicted class and a sentiment score for each headline. The sentiment score is computed by subtracting the negative class probability from the positive one.

  7. m

    Big Data Analytics in Retail Market - Trends & Industry Analysis

    • mordorintelligence.com
    pdf,excel,csv,ppt
    Updated Dec 11, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mordor Intelligence (2024). Big Data Analytics in Retail Market - Trends & Industry Analysis [Dataset]. https://www.mordorintelligence.com/industry-reports/big-data-analytics-in-retail-marketing-market
    Explore at:
    pdf,excel,csv,pptAvailable download formats
    Dataset updated
    Dec 11, 2024
    Dataset authored and provided by
    Mordor Intelligence
    License

    https://www.mordorintelligence.com/privacy-policyhttps://www.mordorintelligence.com/privacy-policy

    Time period covered
    2021 - 2030
    Area covered
    Global
    Description

    The Data Analytics in Retail Industry is segmented by Application (Merchandising and Supply Chain Analytics, Social Media Analytics, Customer Analytics, Operational Intelligence, Other Applications), by Business Type (Small and Medium Enterprises, Large-scale Organizations), and Geography. The market size and forecasts are provided in terms of value (USD billion) for all the above segments.

  8. P

    Capriccio Dataset

    • paperswithcode.com
    Updated Sep 15, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jie You; Jae-Won Chung; Mosharaf Chowdhury (2022). Capriccio Dataset [Dataset]. https://paperswithcode.com/dataset/capriccio
    Explore at:
    Dataset updated
    Sep 15, 2022
    Authors
    Jie You; Jae-Won Chung; Mosharaf Chowdhury
    Description

    Capriccio is a sentiment classification dataset on tweets that simulates data drift. It is created by slicing the Sentiment140 dataset (homepage, Huggingface datasets) with a sliding window of 500,000 tweets, resulting in 38 slices. Thus, each slice can be used to represent the training/validation dataset of a sentiment classification model that is re-trained every day. Each slice has 425,000 tweets for training (file named %d_train.json) and 75,000 tweets for validation (file named %d_val.json).

    The name comes from the adjective capricious.

  9. D

    Data Analysis Application Solution Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated May 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Data Analysis Application Solution Report [Dataset]. https://www.datainsightsmarket.com/reports/data-analysis-application-solution-1439900
    Explore at:
    pdf, ppt, docAvailable download formats
    Dataset updated
    May 23, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Data Analysis Application Solution market is experiencing robust growth, driven by the increasing volume and complexity of data generated across industries. The market, estimated at $15 billion in 2025, is projected to exhibit a Compound Annual Growth Rate (CAGR) of 15% from 2025 to 2033, reaching an estimated $45 billion by 2033. This expansion is fueled by several key factors, including the rising adoption of cloud-based solutions offering scalability and cost-effectiveness, the growing need for real-time data analytics to support faster decision-making, and the increasing demand for advanced analytics techniques like machine learning and AI to extract deeper insights from data. Furthermore, the market is segmented by deployment (cloud, on-premise), application (business intelligence, data visualization, predictive analytics), and industry (BFSI, healthcare, retail, manufacturing). The competitive landscape is dynamic, with established players like SAP, Microsoft, and Qlik alongside emerging innovative companies like BigID and Collibra vying for market share through continuous product development and strategic partnerships. The major restraints on market growth include the high initial investment costs associated with implementing data analysis solutions, the need for skilled professionals to manage and interpret the data, and concerns around data security and privacy. However, these challenges are being addressed by the development of user-friendly interfaces, affordable cloud-based options, and enhanced data security measures. The market is also witnessing several trends, such as the increasing adoption of self-service analytics tools, empowering business users to perform their own data analysis, and the growing integration of data analysis solutions with other business applications to streamline workflows. The geographical distribution of the market reflects a strong presence in North America and Europe, with significant growth potential in emerging markets like Asia-Pacific. The presence of companies like Sterlite Technologies and Aparavi indicates a growing focus on the development of specialized data analytics applications targeting niche market segments.

  10. d

    Data for Analysis of Endocrine Disrupting Compounds in Lake Mead National...

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Jul 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2024). Data for Analysis of Endocrine Disrupting Compounds in Lake Mead National Recreation Area near Las Vegas, Nevada [Dataset]. https://catalog.data.gov/dataset/data-for-analysis-of-endocrine-disrupting-compounds-in-lake-mead-national-recreation-area-
    Explore at:
    Dataset updated
    Jul 6, 2024
    Dataset provided by
    U.S. Geological Survey
    Area covered
    Nevada, Las Vegas, Lake Mead
    Description

    This data release presents the results of analyses of biota and water samples collected on multiple dates from 2007 to 2014 at 3 locations in Lake Mead National Recreation Area. Data are presented in 3 spreadsheets containing sample analyses for (1) stable isotopes in biota (2007-2014), (2) synthetic organic compounds in biota (2013-2014), and (3) synthetic organic compounds in water (2013-2014)

  11. R

    Test Image Analysis Dataset

    • universe.roboflow.com
    zip
    Updated Sep 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Image Measurement (2024). Test Image Analysis Dataset [Dataset]. https://universe.roboflow.com/image-measurement/test-image-analysis/model/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 25, 2024
    Dataset authored and provided by
    Image Measurement
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Variables measured
    Cells Polygons
    Description

    Test Image Analysis

    ## Overview
    
    Test Image Analysis is a dataset for instance segmentation tasks - it contains Cells annotations for 387 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [ODbL v1.0 license](https://creativecommons.org/licenses/ODbL v1.0).
    
  12. A

    Forest Inventory and Analysis Database

    • data.amerigeoss.org
    • agdatacommons.nal.usda.gov
    • +11more
    html, xml
    Updated Jan 5, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States (2022). Forest Inventory and Analysis Database [Dataset]. https://data.amerigeoss.org/dataset/forest-inventory-and-analysis-database-fb721
    Explore at:
    html, xmlAvailable download formats
    Dataset updated
    Jan 5, 2022
    Dataset provided by
    United States
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Forest Inventory and Analysis (FIA) research program has been in existence since mandated by Congress in 1928. FIA's primary objective is to determine the extent, condition, volume, growth, and depletion of timber on the Nation's forest land. Before 1999, all inventories were conducted on a periodic basis. The passage of the 1998 Farm Bill requires FIA to collect data annually on plots within each State. This kind of up-to-date information is essential to frame realistic forest policies and programs. Summary reports for individual States are published but the Forest Service also provides data collected in each inventory to those interested in further analysis. Data is distributed via the FIA DataMart in a standard format. This standard format, referred to as the Forest Inventory and Analysis Database (FIADB) structure, was developed to provide users with as much data as possible in a consistent manner among States. A number of inventories conducted prior to the implementation of the annual inventory are available in the FIADB. However, various data attributes may be empty or the items may have been collected or computed differently. Annual inventories use a common plot design and common data collection procedures nationwide, resulting in greater consistency among FIA work units than earlier inventories. Links to field collection manuals and the FIADB user's manual are provided in the FIA DataMart.

  13. d

    New Orleans 2015 Market Value Analysis - Final Report 3.17.2016

    • catalog.data.gov
    • data.nola.gov
    • +4more
    Updated Sep 15, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.nola.gov (2023). New Orleans 2015 Market Value Analysis - Final Report 3.17.2016 [Dataset]. https://catalog.data.gov/dataset/new-orleans-2015-market-value-analysis-final-report-3-17-2016
    Explore at:
    Dataset updated
    Sep 15, 2023
    Dataset provided by
    data.nola.gov
    Area covered
    New Orleans
    Description

    The Market Value Analysis (MVA) is a tool designed to assist the private market and government officials to identify and comprehend the various elements of local real estate markets. It is based fundamentally on local administrative data sources. By using an MVA, public sector officials and private market actors can more precisely craft intervention strategies in weak markets and support sustainable growth in stronger market segments.

  14. D

    Data Analysis Services Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated May 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Data Analysis Services Report [Dataset]. https://www.datainsightsmarket.com/reports/data-analysis-services-1989313
    Explore at:
    pdf, doc, pptAvailable download formats
    Dataset updated
    May 26, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Data Analysis Services market is experiencing robust growth, driven by the exponential increase in data volume and the rising demand for data-driven decision-making across various industries. The market, estimated at $150 billion in 2025, is projected to witness a Compound Annual Growth Rate (CAGR) of 15% from 2025 to 2033, reaching an impressive $450 billion by 2033. This expansion is fueled by several key factors, including the increasing adoption of cloud-based analytics platforms, the growing need for advanced analytics techniques like machine learning and AI, and the rising focus on data security and compliance. The market is segmented by service type (e.g., predictive analytics, descriptive analytics, prescriptive analytics), industry vertical (e.g., healthcare, finance, retail), and deployment model (cloud, on-premise). Key players like IBM, Accenture, Microsoft, and SAS Institute are investing heavily in research and development, expanding their service portfolios, and pursuing strategic partnerships to maintain their market leadership. The competitive landscape is characterized by both large established players and emerging niche providers offering specialized solutions. The market's growth trajectory is influenced by various trends, including the increasing adoption of big data technologies, the growing prevalence of self-service analytics tools empowering business users, and the rise of specialized data analysis service providers catering to specific industry needs. However, certain restraints, such as the lack of skilled data analysts, data security concerns, and the high cost of implementation and maintenance of advanced analytics solutions, could potentially hinder market growth. Addressing these challenges through investments in data literacy programs, enhanced security measures, and flexible pricing models will be crucial for sustaining the market's momentum and unlocking its full potential. Overall, the Data Analysis Services market presents a significant opportunity for companies offering innovative solutions and expertise in this rapidly evolving landscape.

  15. Data from: A Sensitivity Analysis of Methodological Variables Associated...

    • catalog.data.gov
    • data.nist.gov
    Updated Dec 15, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2023). A Sensitivity Analysis of Methodological Variables Associated with Microbiome Measurements [Dataset]. https://catalog.data.gov/dataset/a-sensitivity-analysis-of-methodological-variables-associated-with-microbiome-measurements-83f38
    Explore at:
    Dataset updated
    Dec 15, 2023
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    Description

    This repository provides the raw data, analysis code, and results generated during a systematic evaluation of the impact of selected experimental protocol choices on the metagenomic sequencing analysis of microbiome samples. Briefly, a full factorial experimental design was implemented varying biological sample (n=5), operator (n=2), lot (n=2), extraction kit (n=2), 16S variable region (n=2), and reference database (n=3), and the main effects were calculated and compared between parameters (bias effects) and samples (real biological differences). A full description of the effort is provided in the associated publication.

  16. D

    Big Data Analysis Platform Market Report | Global Forecast From 2025 To 2033...

    • dataintelo.com
    csv, pdf, pptx
    Updated Jan 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). Big Data Analysis Platform Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/global-big-data-analysis-platform-market
    Explore at:
    pptx, csv, pdfAvailable download formats
    Dataset updated
    Jan 7, 2025
    Authors
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Big Data Analysis Platform Market Outlook



    The global market size for Big Data Analysis Platforms is projected to grow from USD 35.5 billion in 2023 to an impressive USD 110.7 billion by 2032, reflecting a CAGR of 13.5%. This substantial growth can be attributed to the increasing adoption of data-driven decision-making processes across various industries, the rapid proliferation of IoT devices, and the ever-growing volumes of data generated globally.



    One of the primary growth factors for the Big Data Analysis Platform market is the escalating need for businesses to derive actionable insights from complex and voluminous datasets. With the advent of technologies such as artificial intelligence and machine learning, organizations are increasingly leveraging big data analytics to enhance their operational efficiency, customer experience, and competitiveness. The ability to process vast amounts of data quickly and accurately is proving to be a game-changer, enabling businesses to make more informed decisions, predict market trends, and optimize their supply chains.



    Another significant driver is the rise of digital transformation initiatives across various sectors. Companies are increasingly adopting digital technologies to improve their business processes and meet changing customer expectations. Big Data Analysis Platforms are central to these initiatives, providing the necessary tools to analyze and interpret data from diverse sources, including social media, customer transactions, and sensor data. This trend is particularly pronounced in sectors such as retail, healthcare, and BFSI (banking, financial services, and insurance), where data analytics is crucial for personalizing customer experiences, managing risks, and improving operational efficiencies.



    Moreover, the growing adoption of cloud computing is significantly influencing the market. Cloud-based Big Data Analysis Platforms offer several advantages over traditional on-premises solutions, including scalability, flexibility, and cost-effectiveness. Businesses of all sizes are increasingly turning to cloud-based analytics solutions to handle their data processing needs. The ability to scale up or down based on demand, coupled with reduced infrastructure costs, makes cloud-based solutions particularly appealing to small and medium-sized enterprises (SMEs) that may not have the resources to invest in extensive on-premises infrastructure.



    Data Science and Machine-Learning Platforms play a pivotal role in the evolution of Big Data Analysis Platforms. These platforms provide the necessary tools and frameworks for processing and analyzing vast datasets, enabling organizations to uncover hidden patterns and insights. By integrating data science techniques with machine learning algorithms, businesses can automate the analysis process, leading to more accurate predictions and efficient decision-making. This integration is particularly beneficial in sectors such as finance and healthcare, where the ability to quickly analyze complex data can lead to significant competitive advantages. As the demand for data-driven insights continues to grow, the role of data science and machine-learning platforms in enhancing big data analytics capabilities is becoming increasingly critical.



    From a regional perspective, North America currently holds the largest market share, driven by the presence of major technology companies, high adoption rates of advanced technologies, and substantial investments in data analytics infrastructure. Europe and the Asia Pacific regions are also experiencing significant growth, fueled by increasing digitalization efforts and the rising importance of data analytics in business strategy. The Asia Pacific region, in particular, is expected to witness the highest CAGR during the forecast period, propelled by rapid economic growth, a burgeoning middle class, and increasing internet and smartphone penetration.



    Component Analysis



    The Big Data Analysis Platform market can be broadly categorized into three components: Software, Hardware, and Services. The software segment includes analytics software, data management software, and visualization tools, which are crucial for analyzing and interpreting large datasets. This segment is expected to dominate the market due to the continuous advancements in analytics software and the increasing need for sophisticated data analysis tools. Analytics software enables organizations to process and analyze data from multiple sources,

  17. a

    Rangeland Analysis Platform

    • gs-portal-fws.hub.arcgis.com
    Updated Feb 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Fish & Wildlife Service (2022). Rangeland Analysis Platform [Dataset]. https://gs-portal-fws.hub.arcgis.com/content/74681c43e7b64410ab3bce63528af3b4
    Explore at:
    Dataset updated
    Feb 11, 2022
    Dataset authored and provided by
    U.S. Fish & Wildlife Service
    Description

    Click here to open Rangeland Analysis Platform website.The Rangeland Analysis Platform combines satellite imagery with thousands of on-the-ground vegetation measurements collected by BLM, NPS, and NRCS. The power of cloud computing and machine learning technology allows the RAP to easily map vegetation across the United States.

  18. Data from: SMEX04 Soil Climate Analysis Network (SCAN) Data: Arizona,...

    • data.nasa.gov
    • datasets.ai
    • +8more
    Updated Apr 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nasa.gov (2025). SMEX04 Soil Climate Analysis Network (SCAN) Data: Arizona, Version 1 [Dataset]. https://data.nasa.gov/dataset/smex04-soil-climate-analysis-network-scan-data-arizona-version-1-69d65
    Explore at:
    Dataset updated
    Apr 1, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Area covered
    Arizona
    Description

    Notice to Data Users: The documentation for this data set was provided solely by the Principal Investigator(s) and was not further developed, thoroughly reviewed, or edited by NSIDC. Thus, support for this data set may be limited.This data set contains measurements taken during the Soil Moisture Experiment 2004 (SMEX04) in southern Arizona, USA. The SCAN station houses numerous sensors which were used to automatically record the data.

  19. m

    Global Burden of Disease analysis dataset of BMI and CVD outcomes, risk...

    • data.mendeley.com
    Updated Aug 17, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    David Cundiff (2021). Global Burden of Disease analysis dataset of BMI and CVD outcomes, risk factors, and SAS codes [Dataset]. http://doi.org/10.17632/g6b39zxck4.6
    Explore at:
    Dataset updated
    Aug 17, 2021
    Authors
    David Cundiff
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This formatted dataset originates from raw data files from the Institute of Health Metrics and Evaluation Global Burden of Disease (GBD2017). It is population weighted worldwide data on male and female cohorts ages 15-69 years including body mass index (BMI) and cardiovascular disease (CVD) and associated dietary, metabolic and other risk factors. The purpose of creating this formatted database is to explore the univariate and multiple regression correlations of BMI and CVD and other health outcomes with risk factors. Our research hypothesis is that we can successfully apply artificial intelligence to model BMI and CVD risk factors and health outcomes. We derived a BMI multiple regression risk factor formula that satisfied all nine Bradford Hill causality criteria for epidemiology research. We found that animal products and added fats are negatively correlated with CVD early deaths worldwide but positively correlated with CVD early deaths in high quantities. We interpret this as showing that optimal cardiovascular outcomes come with moderate (not low and not high) intakes of animal foods and added fats.

    For questions, please email davidkcundiff@gmail.com. Thanks.

  20. D

    Freight Analysis Framework - All FAF summary datasets

    • data.transportation.gov
    • data.virginia.gov
    • +1more
    application/rdfxml +5
    Updated Dec 17, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2018). Freight Analysis Framework - All FAF summary datasets [Dataset]. https://data.transportation.gov/Roadways-and-Bridges/Freight-Analysis-Framework-All-FAF-summary-dataset/miub-cu89
    Explore at:
    application/rdfxml, xml, csv, json, tsv, application/rssxmlAvailable download formats
    Dataset updated
    Dec 17, 2018
    Description

    The Freight Analysis Framework (FAF) integrates data from a variety of sources to create a comprehensive picture of freight movement among states and major metropolitan areas by all modes of transportation. With data from the 2007 Commodity Flow Survey and additional sources, FAF version 3 (FAF3) provides estimates for tonnage, value, and domestic ton-miles by region of origin and destination, commodity type, and mode for 2007, the most recent year, and forecasts through 2040. Also included are state-to-state flows for these years plus 1997 and 2002, summary statistics, and flows by truck assigned to the highway network for 2007 and 2040.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Rui Simões (2022). Orange dataset table [Dataset]. http://doi.org/10.6084/m9.figshare.19146410.v1

Orange dataset table

Explore at:
2 scholarly articles cite this dataset (View in Google Scholar)
xlsxAvailable download formats
Dataset updated
Mar 4, 2022
Dataset provided by
figshare
Authors
Rui Simões
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The complete dataset used in the analysis comprises 36 samples, each described by 11 numeric features and 1 target. The attributes considered were caspase 3/7 activity, Mitotracker red CMXRos area and intensity (3 h and 24 h incubations with both compounds), Mitosox oxidation (3 h incubation with the referred compounds) and oxidation rate, DCFDA fluorescence (3 h and 24 h incubations with either compound) and oxidation rate, and DQ BSA hydrolysis. The target of each instance corresponds to one of the 9 possible classes (4 samples per class): Control, 6.25, 12.5, 25 and 50 µM for 6-OHDA and 0.03, 0.06, 0.125 and 0.25 µM for rotenone. The dataset is balanced, it does not contain any missing values and data was standardized across features. The small number of samples prevented a full and strong statistical analysis of the results. Nevertheless, it allowed the identification of relevant hidden patterns and trends.

Exploratory data analysis, information gain, hierarchical clustering, and supervised predictive modeling were performed using Orange Data Mining version 3.25.1 [41]. Hierarchical clustering was performed using the Euclidean distance metric and weighted linkage. Cluster maps were plotted to relate the features with higher mutual information (in rows) with instances (in columns), with the color of each cell representing the normalized level of a particular feature in a specific instance. The information is grouped both in rows and in columns by a two-way hierarchical clustering method using the Euclidean distances and average linkage. Stratified cross-validation was used to train the supervised decision tree. A set of preliminary empirical experiments were performed to choose the best parameters for each algorithm, and we verified that, within moderate variations, there were no significant changes in the outcome. The following settings were adopted for the decision tree algorithm: minimum number of samples in leaves: 2; minimum number of samples required to split an internal node: 5; stop splitting when majority reaches: 95%; criterion: gain ratio. The performance of the supervised model was assessed using accuracy, precision, recall, F-measure and area under the ROC curve (AUC) metrics.

Search
Clear search
Close search
Google apps
Main menu