52 datasets found
  1. f

    Table1_Rock slope displacement prediction based on multi-source information...

    • frontiersin.figshare.com
    • figshare.com
    xls
    Updated Jun 13, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Song Jiang; Hongsheng Liu; Minjie Lian; Caiwu Lu; Sai Zhang; Jinyuan Li; PengCheng Li (2023). Table1_Rock slope displacement prediction based on multi-source information fusion and SSA-DELM model.XLS [Dataset]. http://doi.org/10.3389/fenvs.2022.982069.s001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 13, 2023
    Dataset provided by
    Frontiers
    Authors
    Song Jiang; Hongsheng Liu; Minjie Lian; Caiwu Lu; Sai Zhang; Jinyuan Li; PengCheng Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In order to solve the inefficient use of multi-source heterogeneous data information cross fusion and the low accuracy of prediction of landslide displacement, the current research proposed a new prediction model combining variable selection, sparrow search algorithm, and deep extreme learning machine. A cement mine in Fengxiang, Shaanxi Province, was studied as a case. The study first identified the variables related to landslide displacement of rock slope, and removed redundant variables by using Pearson correlation and gray correlation analysis. To avoid the impacts of random input weights and random thresholds in the DELM model, the SSA algorithm is used to optimize the model’s parameters, which can generate the optimal parameter combinations. The results showed an enhanced generalization ability of the model by removal of redundant variables by Pearson correlation and gray correlation analysis, and higher accuracy in the prediction of landside displacement of rock slope by SSA-DELM compared to other traditional machine learning algorithms. The current study is significant in the literature on rock slope disaster analysis.

  2. Y

    Citation Network Graph

    • shibatadb.com
    Updated Jan 6, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yubetsu (2015). Citation Network Graph [Dataset]. https://www.shibatadb.com/article/bwViBjkk
    Explore at:
    Dataset updated
    Jan 6, 2015
    Dataset authored and provided by
    Yubetsu
    License

    https://www.shibatadb.com/license/data/proprietary/v1.0/license.txthttps://www.shibatadb.com/license/data/proprietary/v1.0/license.txt

    Description

    Network of 44 papers and 62 citation links related to "Calibration and multi-source consistency analysis of reconstructed precipitation series in Portugal since the early 17th century".

  3. f

    Table3_MHIF-MSEA: a novel model of miRNA set enrichment analysis based on...

    • frontiersin.figshare.com
    • datasetcatalog.nlm.nih.gov
    xlsx
    Updated Mar 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jianwei Li; Xuxu Ma; Hongxin Lin; Shisheng Zhao; Bing Li; Yan Huang (2024). Table3_MHIF-MSEA: a novel model of miRNA set enrichment analysis based on multi-source heterogeneous information fusion.XLSX [Dataset]. http://doi.org/10.3389/fgene.2024.1375148.s003
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Mar 22, 2024
    Dataset provided by
    Frontiers
    Authors
    Jianwei Li; Xuxu Ma; Hongxin Lin; Shisheng Zhao; Bing Li; Yan Huang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Introduction: MicroRNAs (miRNAs) are a class of non-coding RNA molecules that play a crucial role in the regulation of diverse biological processes across various organisms. Despite not encoding proteins, miRNAs have been found to have significant implications in the onset and progression of complex human diseases.Methods: Conventional methods for miRNA functional enrichment analysis have certain limitations, and we proposed a novel method called MiRNA Set Enrichment Analysis based on Multi-source Heterogeneous Information Fusion (MHIF-MSEA). Three miRNA similarity networks (miRSN-DA, miRSN-GOA, and miRSN-PPI) were constructed in MHIF-MSEA. These networks were built based on miRNA-disease association, gene ontology (GO) annotation of target genes, and protein-protein interaction of target genes, respectively. These miRNA similarity networks were fused into a single similarity network with the averaging method. This fused network served as the input for the random walk with restart algorithm, which expanded the original miRNA list. Finally, MHIF-MSEA performed enrichment analysis on the expanded list.Results and Discussion: To determine the optimal network fusion approach, three case studies were introduced: colon cancer, breast cancer, and hepatocellular carcinoma. The experimental results revealed that the miRNA-miRNA association network constructed using miRSN-DA and miRSN-GOA exhibited superior performance as the input network. Furthermore, the MHIF-MSEA model performed enrichment analysis on differentially expressed miRNAs in breast cancer and hepatocellular carcinoma. The achieved p-values were 2.17e(-75) and 1.50e(-77), and the hit rates improved by 39.01% and 44.68% compared to traditional enrichment analysis methods, respectively. These results confirm that the MHIF-MSEA method enhances the identification of enriched miRNA sets by leveraging multiple sources of heterogeneous information, leading to improved insights into the functional implications of miRNAs in complex diseases.

  4. AHRQ Social Determinants of Health Updated Database

    • datalumos.org
    • openicpsr.org
    Updated Feb 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AHRQ (2025). AHRQ Social Determinants of Health Updated Database [Dataset]. http://doi.org/10.3886/E220762V1
    Explore at:
    Dataset updated
    Feb 25, 2025
    Dataset provided by
    Agency for Healthcare Research and Qualityhttp://www.ahrq.gov/
    Authors
    AHRQ
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    AHRQ's database on Social Determinants of Health (SDOH) was created under a project funded by the Patient Centered Outcomes Research (PCOR) Trust Fund. The purpose of this project is to create easy to use, easily linkable SDOH-focused data to use in PCOR research, inform approaches to address emerging health issues, and ultimately contribute to improved health outcomes.The database was developed to make it easier to find a range of well documented, readily linkable SDOH variables across domains without having to access multiple source files, facilitating SDOH research and analysis.Variables in the files correspond to five key SDOH domains: social context (e.g., age, race/ethnicity, veteran status), economic context (e.g., income, unemployment rate), education, physical infrastructure (e.g, housing, crime, transportation), and healthcare context (e.g., health insurance). The files can be linked to other data by geography (county, ZIP Code, and census tract). The database includes data files and codebooks by year at three levels of geography, as well as a documentation file.The data contained in the SDOH database are drawn from multiple sources and variables may have differing availability, patterns of missing, and methodological considerations across sources, geographies, and years. Users should refer to the data source documentation and codebooks, as well as the original data sources, to help identify these patterns

  5. f

    Table2_MHIF-MSEA: a novel model of miRNA set enrichment analysis based on...

    • figshare.com
    xlsx
    Updated Mar 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jianwei Li; Xuxu Ma; Hongxin Lin; Shisheng Zhao; Bing Li; Yan Huang (2024). Table2_MHIF-MSEA: a novel model of miRNA set enrichment analysis based on multi-source heterogeneous information fusion.XLSX [Dataset]. http://doi.org/10.3389/fgene.2024.1375148.s002
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Mar 22, 2024
    Dataset provided by
    Frontiers
    Authors
    Jianwei Li; Xuxu Ma; Hongxin Lin; Shisheng Zhao; Bing Li; Yan Huang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Introduction: MicroRNAs (miRNAs) are a class of non-coding RNA molecules that play a crucial role in the regulation of diverse biological processes across various organisms. Despite not encoding proteins, miRNAs have been found to have significant implications in the onset and progression of complex human diseases.Methods: Conventional methods for miRNA functional enrichment analysis have certain limitations, and we proposed a novel method called MiRNA Set Enrichment Analysis based on Multi-source Heterogeneous Information Fusion (MHIF-MSEA). Three miRNA similarity networks (miRSN-DA, miRSN-GOA, and miRSN-PPI) were constructed in MHIF-MSEA. These networks were built based on miRNA-disease association, gene ontology (GO) annotation of target genes, and protein-protein interaction of target genes, respectively. These miRNA similarity networks were fused into a single similarity network with the averaging method. This fused network served as the input for the random walk with restart algorithm, which expanded the original miRNA list. Finally, MHIF-MSEA performed enrichment analysis on the expanded list.Results and Discussion: To determine the optimal network fusion approach, three case studies were introduced: colon cancer, breast cancer, and hepatocellular carcinoma. The experimental results revealed that the miRNA-miRNA association network constructed using miRSN-DA and miRSN-GOA exhibited superior performance as the input network. Furthermore, the MHIF-MSEA model performed enrichment analysis on differentially expressed miRNAs in breast cancer and hepatocellular carcinoma. The achieved p-values were 2.17e(-75) and 1.50e(-77), and the hit rates improved by 39.01% and 44.68% compared to traditional enrichment analysis methods, respectively. These results confirm that the MHIF-MSEA method enhances the identification of enriched miRNA sets by leveraging multiple sources of heterogeneous information, leading to improved insights into the functional implications of miRNAs in complex diseases.

  6. Multisource Heterogeneous Physical Edu Dataset

    • kaggle.com
    Updated Jul 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ziya (2025). Multisource Heterogeneous Physical Edu Dataset [Dataset]. https://www.kaggle.com/datasets/ziya07/multisource-heterogeneous-physical-edu-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 15, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Ziya
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset supports research on innovating physical education (PE) teaching through multi-source heterogeneous data fusion and deep learning. It includes sensor-based CSV data and accompanying instructional videos designed to model, analyze, and improve PE teaching practices. The dataset is ideal for projects exploring AI-powered adaptive instruction, student movement analysis, and personalized feedback systems in PE contexts.

    ✅ Description PE_multisource_dataset.csv

    Contains synthetic multi-source data simulating wearable sensors, video-derived features, environmental conditions, and student metadata.

    Columns include acceleration, gyroscope, heart rate, environmental data (temperature, humidity, air quality), 128-dimensional VGG19-style video features, timestamps, session IDs, student demographics, and a binary success label.

    Suitable for developing and testing multimodal fusion models for personalized student performance assessment.

    Dataset videos:

    12 Fun Physical Education Games.mp4

    40 Physical Education Games and Activities for School _ 40 Hula Hoop Games _ PhysEd.mp4

    Middle Years Physical Education - Teaching with a Purpose.mp4

    Rethinking Primary Physical Education.mp4

    These curated videos provide instructional content on diverse PE activities and teaching strategies. They can be used for developing video feature extraction pipelines (e.g., with VGG19), training teacher models, or enriching data fusion tasks with realistic visual PE content.

    Key Features:

    Multisource heterogeneous data for PE: sensors, video, environment

    Time-stamped session-based structure

    Includes real-world instructional video resources

    Designed for personalized, AI-powered PE teaching research

  7. Africa Surficial Lithology

    • rwanda.africageoportal.com
    • morocco.africageoportal.com
    • +2more
    Updated May 22, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Esri (2014). Africa Surficial Lithology [Dataset]. https://rwanda.africageoportal.com/datasets/34ca8e39933e4725b9a68f538abaa565
    Explore at:
    Dataset updated
    May 22, 2014
    Dataset authored and provided by
    Esrihttp://esri.com/
    Area covered
    Description

    Created as part of the USGS’s Africa Ecosystems Mapping project, the Africa Surficial Lithology layer maps the geology of Africa into 20 classes based bedrock type and the distribution of unconsolidated surface material. The data available through this layer map the key geological features that affect the distribution of plants and ecosystems in Africa.This layer provides access to a 100m cell sized raster compiled from multiple sources. The data covers Africa, Madagascar, and other Islands near Africa. It was published by the U.S. Geological Survey and The Nature Conservancy in 2009. Link to source metadata Dataset SummaryAnalysis: Restricted single source analysis. Maximum size of analysis is 24,000 x 24,000 pixels. What can you do with this layer?This layer has query, identify, and export image services available. The layer is restricted to a 24,000 x 24,000 pixel limit for these services, which represents an area roughly 2,400 kilometers on a side. The source data are available here. Restricted single source analysis means this layer has size constraints for analysis and it is not recommended for use with other layers in multisource analysis. This layer is part of a larger collection of landscape layers that you can use to perform a wide variety of mapping and analysis tasks

  8. p

    Description of the Swell Field Generated by Tropical Cyclones from...

    • pigma.org
    • sextant.ifremer.fr
    doi, www:ftp +1
    Updated Mar 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CERSAT Exploitation (2023). Description of the Swell Field Generated by Tropical Cyclones from Multi-Source Wave Observations for ESA MAXSS Project [Dataset]. https://www.pigma.org/geonetwork/srv/api/records/54766b9b-d390-4cb8-b39e-98a7cb6933ba
    Explore at:
    www:ftp, www:link, doiAvailable download formats
    Dataset updated
    Mar 30, 2023
    Dataset provided by
    CERSAT Exploitation
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 2015 - Dec 31, 2021
    Area covered
    Description

    The main objectives of this dataset is to gather the ocean swells measured by different sensors, including satellite and in-situ sources, that were generated by a given tropical cyclone (TC). This dataset aims at providing characteristics of these swells such as their direction, wavelength (or period) and energy but also the date when they left the influence of the tropical cyclone wind to propagate freely. Wave spectra in tropical cyclones vary strongly per quadrant and provide information about the current and past state of the wave field. However, inside TCs, waves measurements including the wave system direction, energy and wavelength are rare and difficult to obtain with in-situ and remote sensing technics. For this dataset, both moored and drifting buoys are considered as long as they provide wave systems measurements. For the satellite contribution, Synthetic Aperture Radar (SAR) and real aperture radar (RAR) instruments can significantly contribute to the TC-generated waves documentation. Indeed, ocean wave spectra can be derived from modulations of the backscatter in SAR and RAR signal. SAR on board European satellite and in particular the SAR series developed since ERS-1 by ESA and now ESA/Copernicus with Sentinel-1 mission (S-1) are good candidates to provide these ocean waves systems characteristics thanks to the dedicated acquisition mode : the so-called Wave Mode. The wave spectrometer SWIM developed by the French space Agency (CNES) and embedded on the Chinese-French Oceanography SATellite (CFOSAT) has been launched more recently with a new measurement concept relying on a RAR and can certainly complement the S-1 data collection. Although the reasons are different, these two systems are limited for measuring waves generation area within the TC vortex where strong rain rates and wind regimes are observed. Far enough from their source, satellite acquisitions are thus expected to be able to observe these ocean swells during more favorable met-ocean conditions for waves retrieval inversion. As a consequence, our analysis is focused on waves originating from TC but that have been able to propagate far from their source. The analysis of swell measurements far from their area of generation to locate the storm source has been firstly applied to data from one single in-situ wave station (wave energy with frequency and direction) collected 2 miles off shore from San Clemente Island, California and extended to a network of several wave stations in the sixties. More recently, the gathering of swell system observed with SAR far from a storm to characterize the waves properties across the ocean has proven to be efficient in the case of extra-tropical storms. Yet, such analysis is not adapted to Tropical Cyclone whose size is much smaller and currently existing wave datasets do not allow for an accurate monitoring of the tropical cyclones swells. This multi-sensor Level-3 tropical cyclone waves dataset intends to fill this gap and opens for an alternate way of estimating tropical cyclone waves properties over all ocean basins and for all tropical cyclones. This dataset was produced in the frame of the ESA funded Marine Atmosphere eXtreme Satellite Synergy (MAXSS) project. The primary objective of the ESA Marine Atmosphere eXtreme Satellite Synergy (MAXSS) project is to provide guidance and innovative methodologies to maximize the synergetic use of available Earth Observation data (satellite, in situ) to improve understanding about the multi-scale dynamical characteristics of extreme air-sea interaction.

  9. Cloud Analytics Market Analysis North America, Europe, APAC, Middle East and...

    • technavio.com
    pdf
    Updated Jul 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Technavio (2024). Cloud Analytics Market Analysis North America, Europe, APAC, Middle East and Africa, South America - US, China, UK, Germany, Japan - Size and Forecast 2024-2028 [Dataset]. https://www.technavio.com/report/cloud-analytics-market-industry-analysis
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jul 22, 2024
    Dataset provided by
    TechNavio
    Authors
    Technavio
    Time period covered
    2024 - 2028
    Area covered
    United Kingdom, United States
    Description

    Snapshot img

    Cloud Analytics Market Size 2024-2028

    The cloud analytics market size is forecast to increase by USD 74.08 billion at a CAGR of 24.4% between 2023 and 2028.

    The market is experiencing significant growth due to several key trends. The adoption of hybrid and multi-cloud setups is on the rise, as these configurations enhance data connectivity and flexibility. Another trend driving market growth is the increasing use of cloud security applications to safeguard sensitive data.
    However, concerns regarding confidential data security and privacy remain a challenge for market growth. Organizations must ensure robust security measures are in place to mitigate risks and maintain trust with their customers. Overall, the market is poised for continued expansion as businesses seek to leverage the benefits of cloud technologies for data processing and data analytics.
    

    What will be the Size of the Cloud Analytics Market During the Forecast Period?

    Request Free Sample

    The market is experiencing significant growth due to the increasing volume of data generated by businesses and the demand for advanced analytics solutions. Cloud-based analytics enables organizations to process and analyze large datasets from various data sources, including unstructured data, in real-time. This is crucial for businesses looking to make data-driven decisions and gain valuable insights to optimize their operations and meet customer requirements. Key industries such as sales and marketing, customer service, and finance are adopting cloud analytics to improve key performance indicators and gain a competitive edge. Both Small and Medium-sized Enterprises (SMEs) and large enterprises are embracing cloud analytics, with solutions available on private, public, and multi-cloud platforms.
    Big data technology, such as machine learning and artificial intelligence, are integral to cloud analytics, enabling advanced data analytics and business intelligence. Cloud analytics provides businesses with the flexibility to store and process data In the cloud, reducing the need for expensive on-premises data storage and computation. Hybrid environments are also gaining popularity, allowing businesses to leverage the benefits of both private and public clouds. Overall, the market is poised for continued growth as businesses increasingly rely on data-driven insights to inform their decision-making processes.
    

    How is this Cloud Analytics Industry segmented and which is the largest segment?

    The cloud analytics industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD billion' for the period 2024-2028, as well as historical data from 2017-2022 for the following segments.

    Solution
    
      Hosted data warehouse solutions
      Cloud BI tools
      Complex event processing
      Others
    
    
    Deployment
    
      Public cloud
      Hybrid cloud
      Private cloud
    
    
    Geography
    
      North America
    
        US
    
    
      Europe
    
        Germany
        UK
    
    
      APAC
    
        China
        Japan
    
    
      Middle East and Africa
    
    
    
      South America
    

    By Solution Insights

    The hosted data warehouse solutions segment is estimated to witness significant growth during the forecast period.
    

    Hosted data warehouses enable organizations to centralize and analyze large datasets from multiple sources, facilitating advanced analytics solutions and real-time insights. By utilizing cloud-based infrastructure, businesses can reduce operational costs through eliminating licensing expenses, hardware investments, and maintenance fees. Additionally, cloud solutions offer network security measures, such as Software Defined Networking and Network integration, ensuring data protection. Cloud analytics caters to diverse industries, including SMEs and large enterprises, addressing requirements for sales and marketing, customer service, and key performance indicators. Advanced analytics capabilities, including predictive analytics, automated decision making, and fraud prevention, are essential for data-driven decision making and business optimization.

    Furthermore, cloud platforms provide access to specialized talent, big data technology, and AI, enhancing customer experiences and digital business opportunities. Data connectivity and data processing in real-time are crucial for network agility and application performance. Hosted data warehouses offer computational power and storage capabilities, ensuring efficient data utilization and enterprise information management. Cloud service providers offer various cloud environments, including private, public, multi-cloud, and hybrid, catering to diverse business needs. Compliance and security concerns are addressed through cybersecurity frameworks and data security measures, ensuring data breaches and thefts are minimized.

    Get a glance at the Cloud Analytics Industry report of share of various segments Request Free Sample

    The Hosted data warehouse solutions s

  10. I

    Self-citation analysis data based on PubMed Central subset (2002-2005)

    • databank.illinois.edu
    Updated Apr 27, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shubhanshu Mishra; Brent D Fegley; Jana Diesner; Vetle I. Torvik (2018). Self-citation analysis data based on PubMed Central subset (2002-2005) [Dataset]. http://doi.org/10.13012/B2IDB-9665377_V1
    Explore at:
    Dataset updated
    Apr 27, 2018
    Authors
    Shubhanshu Mishra; Brent D Fegley; Jana Diesner; Vetle I. Torvik
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Dataset funded by
    U.S. National Institutes of Health (NIH)
    U.S. National Science Foundation (NSF)
    Description

    Self-citation analysis data based on PubMed Central subset (2002-2005) ---------------------------------------------------------------------- Created by Shubhanshu Mishra, Brent D. Fegley, Jana Diesner, and Vetle Torvik on April 5th, 2018 ## Introduction This is a dataset created as part of the publication titled: Mishra S, Fegley BD, Diesner J, Torvik VI (2018) Self-Citation is the Hallmark of Productive Authors, of Any Gender. PLOS ONE. It contains files for running the self citation analysis on articles published in PubMed Central between 2002 and 2005, collected in 2015. The dataset is distributed in the form of the following tab separated text files: * Training_data_2002_2005_pmc_pair_First.txt (1.2G) - Data for first authors * Training_data_2002_2005_pmc_pair_Last.txt (1.2G) - Data for last authors * Training_data_2002_2005_pmc_pair_Middle_2nd.txt (964M) - Data for middle 2nd authors * Training_data_2002_2005_pmc_pair_txt.header.txt - Header for the data * COLUMNS_DESC.txt file - Descriptions of all columns * model_text_files.tar.gz - Text files containing model coefficients and scores for model selection. * results_all_model.tar.gz - Model coefficient and result files in numpy format used for plotting purposes. v4.reviewer contains models for analysis done after reviewer comments. * README.txt file ## Dataset creation Our experiments relied on data from multiple sources including properitery data from Thompson Rueter's (now Clarivate Analytics) Web of Science collection of MEDLINE citations. Author's interested in reproducing our experiments should personally request from Clarivate Analytics for this data. However, we do make a similar but open dataset based on citations from PubMed Central which can be utilized to get similar results to those reported in our analysis. Furthermore, we have also freely shared our datasets which can be used along with the citation datasets from Clarivate Analytics, to re-create the datased used in our experiments. These datasets are listed below. If you wish to use any of those datasets please make sure you cite both the dataset as well as the paper introducing the dataset. * MEDLINE 2015 baseline: https://www.nlm.nih.gov/bsd/licensee/2015_stats/baseline_doc.html * Citation data from PubMed Central (original paper includes additional citations from Web of Science) * Author-ity 2009 dataset: - Dataset citation: Torvik, Vetle I.; Smalheiser, Neil R. (2018): Author-ity 2009 - PubMed author name disambiguated dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4222651_V1 - Paper citation: Torvik, V. I., & Smalheiser, N. R. (2009). Author name disambiguation in MEDLINE. ACM Transactions on Knowledge Discovery from Data, 3(3), 1–29. https://doi.org/10.1145/1552303.1552304 - Paper citation: Torvik, V. I., Weeber, M., Swanson, D. R., & Smalheiser, N. R. (2004). A probabilistic similarity metric for Medline records: A model for author name disambiguation. Journal of the American Society for Information Science and Technology, 56(2), 140–158. https://doi.org/10.1002/asi.20105 * Genni 2.0 + Ethnea for identifying author gender and ethnicity: - Dataset citation: Torvik, Vetle (2018): Genni + Ethnea for the Author-ity 2009 dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9087546_V1 - Paper citation: Smith, B. N., Singh, M., & Torvik, V. I. (2013). A search engine approach to estimating temporal changes in gender orientation of first names. In Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries - JCDL ’13. ACM Press. https://doi.org/10.1145/2467696.2467720 - Paper citation: Torvik VI, Agarwal S. Ethnea -- an instance-based ethnicity classifier based on geo-coded author names in a large-scale bibliographic database. International Symposium on Science of Science March 22-23, 2016 - Library of Congress, Washington DC, USA. http://hdl.handle.net/2142/88927 * MapAffil for identifying article country of affiliation: - Dataset citation: Torvik, Vetle I. (2018): MapAffil 2016 dataset -- PubMed author affiliations mapped to cities and their geocodes worldwide. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4354331_V1 - Paper citation: Torvik VI. MapAffil: A Bibliographic Tool for Mapping Author Affiliation Strings to Cities and Their Geocodes Worldwide. D-Lib magazine : the magazine of the Digital Library Forum. 2015;21(11-12):10.1045/november2015-torvik * IMPLICIT journal similarity: - Dataset citation: Torvik, Vetle (2018): Author-implicit journal, MeSH, title-word, and affiliation-word pairs based on Author-ity 2009. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4742014_V1 * Novelty dataset for identify article level novelty: - Dataset citation: Mishra, Shubhanshu; Torvik, Vetle I. (2018): Conceptual novelty scores for PubMed articles. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5060298_V1 - Paper citation: Mishra S, Torvik VI. Quantifying Conceptual Novelty in the Biomedical Literature. D-Lib magazine : The Magazine of the Digital Library Forum. 2016;22(9-10):10.1045/september2016-mishra - Code: https://github.com/napsternxg/Novelty * Expertise dataset for identifying author expertise on articles: * Source code provided at: https://github.com/napsternxg/PubMed_SelfCitationAnalysis Note: The dataset is based on a snapshot of PubMed (which includes Medline and PubMed-not-Medline records) taken in the first week of October, 2016. Check here for information to get PubMed/MEDLINE, and NLMs data Terms and Conditions Additional data related updates can be found at Torvik Research Group ## Acknowledgments This work was made possible in part with funding to VIT from NIH grant P01AG039347 and NSF grant 1348742. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. ## License Self-citation analysis data based on PubMed Central subset (2002-2005) by Shubhanshu Mishra, Brent D. Fegley, Jana Diesner, and Vetle Torvik is licensed under a Creative Commons Attribution 4.0 International License. Permissions beyond the scope of this license may be available at https://github.com/napsternxg/PubMed_SelfCitationAnalysis.

  11. D

    Big Data Analysis Platform Market Report | Global Forecast From 2025 To 2033...

    • dataintelo.com
    csv, pdf, pptx
    Updated Jan 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). Big Data Analysis Platform Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/global-big-data-analysis-platform-market
    Explore at:
    pptx, csv, pdfAvailable download formats
    Dataset updated
    Jan 7, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Big Data Analysis Platform Market Outlook



    The global market size for Big Data Analysis Platforms is projected to grow from USD 35.5 billion in 2023 to an impressive USD 110.7 billion by 2032, reflecting a CAGR of 13.5%. This substantial growth can be attributed to the increasing adoption of data-driven decision-making processes across various industries, the rapid proliferation of IoT devices, and the ever-growing volumes of data generated globally.



    One of the primary growth factors for the Big Data Analysis Platform market is the escalating need for businesses to derive actionable insights from complex and voluminous datasets. With the advent of technologies such as artificial intelligence and machine learning, organizations are increasingly leveraging big data analytics to enhance their operational efficiency, customer experience, and competitiveness. The ability to process vast amounts of data quickly and accurately is proving to be a game-changer, enabling businesses to make more informed decisions, predict market trends, and optimize their supply chains.



    Another significant driver is the rise of digital transformation initiatives across various sectors. Companies are increasingly adopting digital technologies to improve their business processes and meet changing customer expectations. Big Data Analysis Platforms are central to these initiatives, providing the necessary tools to analyze and interpret data from diverse sources, including social media, customer transactions, and sensor data. This trend is particularly pronounced in sectors such as retail, healthcare, and BFSI (banking, financial services, and insurance), where data analytics is crucial for personalizing customer experiences, managing risks, and improving operational efficiencies.



    Moreover, the growing adoption of cloud computing is significantly influencing the market. Cloud-based Big Data Analysis Platforms offer several advantages over traditional on-premises solutions, including scalability, flexibility, and cost-effectiveness. Businesses of all sizes are increasingly turning to cloud-based analytics solutions to handle their data processing needs. The ability to scale up or down based on demand, coupled with reduced infrastructure costs, makes cloud-based solutions particularly appealing to small and medium-sized enterprises (SMEs) that may not have the resources to invest in extensive on-premises infrastructure.



    Data Science and Machine-Learning Platforms play a pivotal role in the evolution of Big Data Analysis Platforms. These platforms provide the necessary tools and frameworks for processing and analyzing vast datasets, enabling organizations to uncover hidden patterns and insights. By integrating data science techniques with machine learning algorithms, businesses can automate the analysis process, leading to more accurate predictions and efficient decision-making. This integration is particularly beneficial in sectors such as finance and healthcare, where the ability to quickly analyze complex data can lead to significant competitive advantages. As the demand for data-driven insights continues to grow, the role of data science and machine-learning platforms in enhancing big data analytics capabilities is becoming increasingly critical.



    From a regional perspective, North America currently holds the largest market share, driven by the presence of major technology companies, high adoption rates of advanced technologies, and substantial investments in data analytics infrastructure. Europe and the Asia Pacific regions are also experiencing significant growth, fueled by increasing digitalization efforts and the rising importance of data analytics in business strategy. The Asia Pacific region, in particular, is expected to witness the highest CAGR during the forecast period, propelled by rapid economic growth, a burgeoning middle class, and increasing internet and smartphone penetration.



    Component Analysis



    The Big Data Analysis Platform market can be broadly categorized into three components: Software, Hardware, and Services. The software segment includes analytics software, data management software, and visualization tools, which are crucial for analyzing and interpreting large datasets. This segment is expected to dominate the market due to the continuous advancements in analytics software and the increasing need for sophisticated data analysis tools. Analytics software enables organizations to process and analyze data from multiple sources,

  12. World Surface Water

    • gis-support-utah-em.hub.arcgis.com
    • iwmi.africageoportal.com
    • +4more
    Updated Dec 3, 2014
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Esri (2014). World Surface Water [Dataset]. https://gis-support-utah-em.hub.arcgis.com/datasets/ddfce15a8ccd4c8c88fb125cb4f23cc9
    Explore at:
    Dataset updated
    Dec 3, 2014
    Dataset authored and provided by
    Esrihttp://esri.com/
    Area covered
    Description

    Water bodies are a key element in the landscape. This layer provides a global map of large water bodies for use inlandscape-scale analysis. Dataset SummaryThis layer provides access to a 250m cell-sized raster of surface water created by extracting pixels coded as water in the Global Lithological Map and the Global Landcover Map. The layer was created by Esri in 2014. Analysis: Restricted single source analysis. Maximum size of analysis is 16,000 x 16,000 pixels. What can you do with this layer?This layer is suitable for both visualization and analysis. It can be used in ArcGIS Online in web maps and applications and can be used in ArcGIS Desktop. Restricted single source analysis means this layer has size constraints for analysis and it is not recommended for use with other layers in multisource analysis.This layer has query, identify, and export image services available. This layer is restricted to a maximum area of 16,000 x 16,000 pixels - an area 4,000 kilometerson a side or an area approximately the size of Europe.This layer is part of a larger collection of landscape layers that you can use to perform a wide variety of mapping and analysis tasks.The Living Atlas of the World provides an easy way to explore the landscape layers and many otherbeautiful and authoritative maps on hundreds of topics. Geonetis a good resource for learning more aboutlandscape layers and the Living Atlas of the World. To get started see theLiving Atlas Discussion Group. TheEsri Insider Blogprovides an introduction to the Ecophysiographic Mapping project.

  13. d

    Multisource Field Plot Data for Studies of Vegetation Alliances:...

    • search.dataone.org
    • knb.ecoinformatics.org
    Updated Jan 6, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NCEAS 3540: Jennings: The ecology of steppe and grassland vegetation alliances of the Columbia River Basin (Hosted by NCEAS); NCEAS 2180: Peet: An information infrastructure for vegetation science (Hosted by NCEAS); NCEAS 4340: Peet: Tools for vegetation classification and analysis; NCEAS 2840: Reichman: A Knowledge Network for Biocomplexity: Building and evaluating a metadata-based framework for integrating heterogeneous scientific data (Hosted by NCEAS); National Center for Ecological Analysis and Synthesis; Michael Jennings (2015). Multisource Field Plot Data for Studies of Vegetation Alliances: Northwestern USA [Dataset]. http://doi.org/10.5063/AA/nceas.286.1
    Explore at:
    Dataset updated
    Jan 6, 2015
    Dataset provided by
    Knowledge Network for Biocomplexity
    Authors
    NCEAS 3540: Jennings: The ecology of steppe and grassland vegetation alliances of the Columbia River Basin (Hosted by NCEAS); NCEAS 2180: Peet: An information infrastructure for vegetation science (Hosted by NCEAS); NCEAS 4340: Peet: Tools for vegetation classification and analysis; NCEAS 2840: Reichman: A Knowledge Network for Biocomplexity: Building and evaluating a metadata-based framework for integrating heterogeneous scientific data (Hosted by NCEAS); National Center for Ecological Analysis and Synthesis; Michael Jennings
    Time period covered
    Jan 1, 1970 - Dec 31, 2000
    Area covered
    Description

    Although vegetation alliances defined and described in the U.S. National Vegetation Classification are used as a fundamental unit of habitat for modeling species distributions and for conservation assessments, little is known about their ecological characteristics, either generally or individually. A major barrier to understanding alliances better is the lack of primary biotic and physical data about them. In particular, few alliance or association descriptions of the USNVC are based directly on original field plot data. Such data do not exist in the quantity or over the geographic extents necessary, and new field work to acquire such data is unlikely. This study attempts to learn about the efficacy of and limitations to developing the data needed by integrating existing information from multiple sources and themes across the Inland Northwest of the USA. Almost 40,000 field plot records from 11 different sources were integrated, sorted, and evaluated to generate a single standardized database from which plots were classified a priori as members of alliances. Additional data sets of climate, biomass productivity, and morphological traits of plant species were also integrated with the field plot data. The plot records were filtered for eight univariate parameters, species names were standardized, and multivariate outliers of species composition were identified and removed. Field plot records were extracted from the data sets with SQL statements based on existing descriptions of alliances, and these subsets were tested against a null model. Ultimately 21% of the field plots were classified to 49 vegetation alliances. Field plot classifications were corroborated with a nonmetric multidimensional scaling ordination. This study resulted in a large set of primary data critical for the study of vegetation alliances. It shows that it is possible to develop synthetic vegetation field plot data sets from existing multisource information.

  14. f

    Table2_TLSEA: a tool for lncRNA set enrichment analysis based on...

    • figshare.com
    • datasetcatalog.nlm.nih.gov
    xlsx
    Updated Jun 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jianwei Li; Zhiguang Li; Yinfei Wang; Hongxin Lin; Baoqin Wu (2023). Table2_TLSEA: a tool for lncRNA set enrichment analysis based on multi-source heterogeneous information fusion.XLSX [Dataset]. http://doi.org/10.3389/fgene.2023.1181391.s002
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    Frontiers
    Authors
    Jianwei Li; Zhiguang Li; Yinfei Wang; Hongxin Lin; Baoqin Wu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Long non-coding RNAs (lncRNAs) play an important regulatory role in gene transcription and post-transcriptional modification, and lncRNA regulatory dysfunction leads to a variety of complex human diseases. Hence, it might be beneficial to detect the underlying biological pathways and functional categories of genes that encode lncRNA. This can be carried out by using gene set enrichment analysis, which is a pervasive bioinformatic technique that has been widely used. However, accurately performing gene set enrichment analysis of lncRNAs remains a challenge. Most conventional enrichment analysis methods have not exhaustively included the rich association information among genes, which usually affects the regulatory functions of genes. Here, we developed a novel tool for lncRNA set enrichment analysis (TLSEA) to improve the accuracy of the gene functional enrichment analysis, which extracted the low-dimensional vectors of lncRNAs in two functional annotation networks with the graph representation learning method. A novel lncRNA–lncRNA association network was constructed by merging lncRNA-related heterogeneous information obtained from multiple sources with the different lncRNA-related similarity networks. In addition, the random walk with restart method was adopted to effectively expand the lncRNAs submitted by users according to the lncRNA–lncRNA association network of TLSEA. In addition, a case study of breast cancer was performed, which demonstrated that TLSEA could detect breast cancer more accurately than conventional tools. The TLSEA can be accessed freely at http://www.lirmed.com:5003/tlsea.

  15. Multi-scale Ultra-high Resolution (MUR) SST Analysis fv04.1, Global, 0.01°,...

    • catalog.data.gov
    • s.cnmilf.com
    Updated Jun 10, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NOAA NMFS SWFSC ERD and NOAA NESDIS CoastWatch WCRN (Point of Contact) (2023). Multi-scale Ultra-high Resolution (MUR) SST Analysis fv04.1, Global, 0.01°, 2002-present, Monthly, Lon0360 [Dataset]. https://catalog.data.gov/dataset/multi-scale-ultra-high-resolution-mur-sst-analysis-fv04-1-global-0-01a-2002-present-monthly-lon
    Explore at:
    Dataset updated
    Jun 10, 2023
    Dataset provided by
    National Oceanic and Atmospheric Administrationhttp://www.noaa.gov/
    National Environmental Satellite, Data, and Information Service
    Description

    A monthly mean Sea Surface Temperature (SST) product created by NOAA NMFS SWFSC ERD based on the daily, global, Multi-scale, Ultra-high Resolution (MUR) Sea Surface Temperature (SST) 1-km data set, Version 4.1, which is produced at the NASA Jet Propulsion Laboratory (JPL) under the NASA MEaSUREs program. For details of the source dataset, see https://podaac.jpl.nasa.gov/dataset/MUR-JPL-L4-GLOB-v4.1 . The source dataset is part of the Group for High-Resolution Sea Surface Temperature (GHRSST) project.

  16. D

    Literature Review Software Market Report | Global Forecast From 2025 To 2033...

    • dataintelo.com
    csv, pdf, pptx
    Updated Jan 7, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). Literature Review Software Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/global-literature-review-software-market
    Explore at:
    csv, pptx, pdfAvailable download formats
    Dataset updated
    Jan 7, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Literature Review Software Market Outlook




    The global literature review software market size was valued at approximately USD 1.5 billion in 2023 and is projected to reach USD 3.2 billion by 2032, growing at a CAGR of 8.2% during the forecast period. This substantial growth is driven by various factors including the increasing need for efficient data management and the rising trend of academic and corporate research activities. The expansion of digital technologies and the increasing volume of research documentation have also significantly contributed to the growth trajectory of the literature review software market.




    One of the primary growth factors for the literature review software market is the increasing demand for efficient data organization and management solutions. With the exponential growth of academic research, the need to manage vast amounts of data in a structured and efficient manner has become paramount. Literature review software provides researchers with tools to systematically review, analyze, and synthesize existing research, significantly enhancing research efficiency and accuracy. Furthermore, the integration of artificial intelligence and machine learning algorithms into these software solutions has improved their functionality, enabling more sophisticated data analysis and literature synthesis.




    Another driving force behind the growth of this market is the increasing adoption of digital tools and technologies in academic and corporate research. As the digital transformation continues to sweep across various sectors, the academic and research communities are also embracing digital solutions to streamline their workflows. Literature review software, with its advanced features such as automated referencing, real-time collaboration, and cloud storage, is becoming an indispensable tool for researchers. This shift towards digitalization is expected to continue, further propelling the market's growth.




    In addition, the rise in interdisciplinary research activities is also fueling the demand for literature review software. Modern research often involves collaboration across different fields, requiring researchers to review and synthesize literature from diverse disciplines. Literature review software helps in managing this complexity by allowing researchers to categorize and analyze literature from multiple sources, thus facilitating comprehensive and multi-faceted research. The increasing complexity of research projects and the need for comprehensive literature reviews are significant factors driving the market's growth.



    The integration of Product Reviews Software into literature review processes is becoming increasingly valuable for researchers and organizations. This software allows users to gather and analyze feedback on various research tools and methodologies, providing insights into their effectiveness and user satisfaction. By leveraging product reviews, researchers can make informed decisions about which software solutions best meet their needs, enhancing the overall quality and efficiency of their literature reviews. The ability to access real-time feedback and ratings from other users also fosters a collaborative environment, where researchers can share experiences and recommendations. As the demand for user-centric research tools grows, the role of Product Reviews Software in shaping the literature review landscape is expected to expand significantly.




    Looking at the regional outlook, North America currently holds the largest share of the global literature review software market, driven by the presence of leading academic institutions and a strong emphasis on research and development. Europe follows closely, with substantial investments in research infrastructure and increasing adoption of digital tools in academic research. The Asia Pacific region is expected to witness the highest growth rate during the forecast period, fueled by the rapid expansion of higher education institutions and growing research activities. Latin America and the Middle East & Africa are also emerging markets, with increasing awareness and adoption of literature review software solutions.



    Component Analysis




    The literature review software market can be segmented by component into software and services. The software segment comprises various tools and platforms designed for literature review, inclu

  17. f

    Data Sheet 2_Research and analysis of the TCN-Multihead-Attention prediction...

    • datasetcatalog.nlm.nih.gov
    • frontiersin.figshare.com
    Updated Jun 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Liu, Yimin; Li, Yixuan; Chen, Huan (2025). Data Sheet 2_Research and analysis of the TCN-Multihead-Attention prediction model of landslide deformation in the Three Gorges Reservoir area, China.csv [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0002068582
    Explore at:
    Dataset updated
    Jun 3, 2025
    Authors
    Liu, Yimin; Li, Yixuan; Chen, Huan
    Description

    Landslide deformation prediction is a crucial task in geotechnical engineering and disaster prevention. Developing an accurate and reliable landslide displacement prediction model is vital for effective landslide warning systems. This paper proposes a TCN-Multihead-Attention prediction model for landslide deformation based on temporal convolutional networks (TCNs). We collected 8 years of monitoring data from the Huangniba Dengkan landslide in the Three Gorges Reservoir area, including surface deformation (horizontal displacement and elevation), rainfall, and reservoir levels. A comprehensive analysis was conducted to assess the effects of rainfall, reservoir levels, and elevation on landslide horizontal displacement. Utilizing the multi-input and single-output characteristics of the long-period time series dataset, we developed the TCN-Multihead-Attention prediction model of landslide deformation. Model evaluation demonstrated that the coefficient of determination (R2) for the test set reached 0.995, with MAPE and RMSE at only 0.482 and 7.180, respectively, indicating high accuracy. Additionally, we developed other prediction models based on single TCN, Attention-based Transformer, RNN-based LSTM, and the hybrid CNN-BiLSTM for comparison. Compared with existing models, the TCN-Multihead-Attention model integrates dilated causal convolutions from TCN with multi-head attention to effectively fuse nonlinear interactions of multi-source environmental factors, capture long-term evolutionary trends, and accurately identify local mutation patterns, demonstrating superior reliability for landslide deformation forecasting in reservoir regions.

  18. d

    HUN Analysis boundaries 20170106 v03

    • data.gov.au
    • researchdata.edu.au
    zip
    Updated Nov 20, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bioregional Assessment Program (2019). HUN Analysis boundaries 20170106 v03 [Dataset]. https://data.gov.au/data/dataset/20d25db8-75fd-46f2-a64c-c249c8b40a95
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 20, 2019
    Dataset provided by
    Bioregional Assessment Program
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Abstract

    The dataset was derived by the Bioregional Assessment Programme from multiple source datasets. The source datasets are identified in the Lineage field in this metadata statement. The processes undertaken to produce this derived dataset are described in the History field in this metadata statement.

    This dataset includes the current boundary data required for the bioregional assessment impact analysis for the Hunter (HUN) subregion. These data are (1) the current Preliminary Assessment Extent (PAE) which for Hunter is the current subregion boundary, (2) the Analysis Extent (AE) and (3) and the Analysis Domain Extent (AD).

    The PAE is defined and explained in the BA submethodology (1.3 Description of the water-dependent asset register) and, specifically for the HUN subregion in product 1.3 Water-dependent asset register for the HUN subregion. The Analysis Extent (AE) is defined as the geographic area that encompasses all the possible areas that may be reported as part of the impact analysis component of a bioregional assessment, specifically, the subregion boundary and the PAE. The Analysis Domain extent (AD) is defined as the geographic area used for geoprocessing and data preparation purposes that encompasses the Analysis Extent plus additional areas sufficient to ensure all relevant data is included for the impact analysis component of a bioregional assessment. For HUN, the ADE had at least an additional 20 km geographic buffer added to the AE boundary.

    All data are in the Australian Albers coordinate system (EPSG 3577).

    Purpose

    The purpose of the various boundary polygons are to assist in the efficient spatial analysis of the impact of coal resource development in the Hunter subregion.

    Dataset History

    This dataset includes the current boundary data required for the bioregional assessment impact analysis for the Hunter (HUN) subregion. These data are (1) the current Preliminary Assessment Extent (PAE) which for Hunter is the current subregion boundary, (2) the Analysis Extent (AE) and (3) and the Analysis Domain Extent (AD).

    The PAE is defined and explained in the BA submethodology (1.3 Description of the water-dependent asset register) and, specifically for the HUN subregion in product 1.3 Water-dependent asset register for the NAM subregion. The Analysis Extent (AE) is defined as the geographic area that encompasses all the possible areas that may be reported as part of the impact analysis component of a bioregional assessment, specifically, the subregion boundary and the PAE. The Analysis Domain extent (AD) is defined as the geographic area used for geoprocessing and data preparation purposes that encompasses the Analysis Extent plus additional areas sufficient to ensure all relevant data is included for the impact analysis component of a bioregional assessment. For HUN, the ADE had at least an additional 20 km geographic buffer added to the AE boundary.

    All data are in the Australian Albers coordinate system (EPSG 3577).

    Dataset Citation

    Bioregional Assessment Programme (XXXX) HUN Analysis boundaries 20170106 v03. Bioregional Assessment Derived Dataset. Viewed 28 August 2018, http://data.bioregionalassessments.gov.au/dataset/20d25db8-75fd-46f2-a64c-c249c8b40a95.

    Dataset Ancestors

  19. g

    Land Cover Summary Statistics Data Package for Greater Yellowstone Network...

    • gimi9.com
    • catalog.data.gov
    Updated Dec 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Land Cover Summary Statistics Data Package for Greater Yellowstone Network Park Units [Dataset]. https://gimi9.com/dataset/data-gov_land-cover-summary-statistics-data-package-for-greater-yellowstone-network-park-units/
    Explore at:
    Dataset updated
    Dec 16, 2023
    Description

    This report documents the acquisition of source data, and calculation of land cover summary statistics datasets for four National Park Service Greater Yellowstone Network park units and six custom areas of analysis: Bighorn Canyon National Recreation Area, Grand Teton National Park, John D. Rockefeller Jr. Memorial Parkway, Yellowstone National Park, and the six custom areas of analysis. The source data and land cover calculations are available for use within the National Park Service (NPS) Inventory and Monitoring Program. Land cover summary statistics datasets can be calculated for all geographic regions within the extent of the NPS; this report includes statistics calculated for the conterminous United States. The land cover summary statistics datasets are calculated from multiple sources, including Multi-Resolution Land Characteristics Consortium products in the National Land Cover Database (NLCD) and the United States Geological Survey’s (USGS) Earth Resources Observation and Science (EROS) Center products in the Land Change Monitoring, Assessment, and Projection (LCMAP) raster dataset. These summary statistics calculate land cover at up to three classification scales: Level 1, modified Anderson Level 2, and Natural versus Converted land cover. The output land cover summary statistics datasets produced here for the four Greater Yellowstone Network park units and six custom areas of analysis utilize the most recent versions of the source datasets (NLCD and LCMAP). These land cover summary statistics datasets are used in the NPS Inventory and Monitoring Program, including the NPS Environmental Settings Monitoring Protocol and may be used by networks and parks for additional efforts.

  20. The Ultimate Film Statistics Dataset - for ML🏆🎬

    • kaggle.com
    Updated Jul 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alessandro Lo Bello (2023). The Ultimate Film Statistics Dataset - for ML🏆🎬 [Dataset]. https://www.kaggle.com/datasets/alessandrolobello/the-ultimate-film-statistics-dataset-for-ml/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 9, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Alessandro Lo Bello
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Description: This dataset provides comprehensive movie statistics compiled from multiple sources, including Wikipedia, The Numbers, and IMDb. It offers a rich collection of information and insights into various aspects of movies, such as movie titles, production dates, genres, runtime minutes, director information, average ratings, number of votes, approval index, production budgets, domestic gross earnings, and worldwide gross earnings.

    The dataset combines data scraped from Wikipedia, which includes details about movie titles, production dates, genres, runtime minutes, and director information, with data from The Numbers, a reliable source for box office statistics. Additionally, IMDb data is integrated to provide information on average ratings, number of votes, and other movie-related attributes.

    With this dataset, users can analyze and explore trends in the film industry, assess the financial success of movies, identify popular genres, and investigate the relationship between average ratings and box office performance. Researchers, movie enthusiasts, and data analysts can leverage this dataset for various purposes, including data visualization, predictive modeling, and deeper understanding of the movie landscape.

    Features: - Movie_title - Production_date - Genres - Runtime_minutes - Director_name (primaryName) - Director_professions (primaryProfession) - Director_birthYear - Director_deathYear - Movie_averageRating : refers to the average rating given by online users for a particular movie - Movie_numberOfVotes : refers to the number of votes given by online users for a particular movie - Approval_Index :is a normalized indicator (on scale 0-10) calculated by multiplying the logarithm of the number of votes by the average users rating. It provides a concise measure of a movie's overall popularity and approval among online viewers, penalizing both films that got too few reviews and blockbusters that got too many. - Production_budget ( $) - Domestic_gross ($) - Worldwide_gross ($)

    Potential Applications:

    Box office analysis: Analyze the relationship between production budgets, domestic and worldwide gross earnings, and profitability. Genre analysis: Identify the most popular genres based on movie counts and analyze their performance. Rating analysis: Explore the relationship between average ratings, number of votes, and financial success. Director analysis: Investigate the impact of directors on movie ratings and financial performance. Time-based analysis: Study movie trends over different production years and observe changes in production budgets, box office earnings, and genre preferences. By utilizing this dataset, users can gain valuable insights into the movie industry and uncover patterns that can inform decision-making, market research, and creative strategies.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Song Jiang; Hongsheng Liu; Minjie Lian; Caiwu Lu; Sai Zhang; Jinyuan Li; PengCheng Li (2023). Table1_Rock slope displacement prediction based on multi-source information fusion and SSA-DELM model.XLS [Dataset]. http://doi.org/10.3389/fenvs.2022.982069.s001

Table1_Rock slope displacement prediction based on multi-source information fusion and SSA-DELM model.XLS

Related Article
Explore at:
xlsAvailable download formats
Dataset updated
Jun 13, 2023
Dataset provided by
Frontiers
Authors
Song Jiang; Hongsheng Liu; Minjie Lian; Caiwu Lu; Sai Zhang; Jinyuan Li; PengCheng Li
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

In order to solve the inefficient use of multi-source heterogeneous data information cross fusion and the low accuracy of prediction of landslide displacement, the current research proposed a new prediction model combining variable selection, sparrow search algorithm, and deep extreme learning machine. A cement mine in Fengxiang, Shaanxi Province, was studied as a case. The study first identified the variables related to landslide displacement of rock slope, and removed redundant variables by using Pearson correlation and gray correlation analysis. To avoid the impacts of random input weights and random thresholds in the DELM model, the SSA algorithm is used to optimize the model’s parameters, which can generate the optimal parameter combinations. The results showed an enhanced generalization ability of the model by removal of redundant variables by Pearson correlation and gray correlation analysis, and higher accuracy in the prediction of landside displacement of rock slope by SSA-DELM compared to other traditional machine learning algorithms. The current study is significant in the literature on rock slope disaster analysis.

Search
Clear search
Close search
Google apps
Main menu