100+ datasets found
  1. Data from: A Note on Monte Carlo Integration in High Dimensions

    • tandf.figshare.com
    rtf
    Updated Jul 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yanbo Tang (2024). A Note on Monte Carlo Integration in High Dimensions [Dataset]. http://doi.org/10.6084/m9.figshare.24271730.v1
    Explore at:
    rtfAvailable download formats
    Dataset updated
    Jul 23, 2024
    Dataset provided by
    Taylor & Francishttps://taylorandfrancis.com/
    Authors
    Yanbo Tang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Monte Carlo integration is a commonly used technique to compute intractable integrals and is typically thought to perform poorly for very high-dimensional integrals. To show that this is not always the case, we examine Monte Carlo integration using techniques from the high-dimensional statistics literature by allowing the dimension of the integral to increase. In doing so, we derive nonasymptotic bounds for the relative and absolute error of the approximation for some general classes of functions through concentration inequalities. We provide concrete examples in which the magnitude of the number of points sampled needed to guarantee a consistent estimate varies between polynomial to exponential, and show that in theory arbitrarily fast or slow rates are possible. This demonstrates that the behavior of Monte Carlo integration in high dimensions is not uniform. Through our methods we also obtain nonasymptotic confidence intervals which are valid regardless of the number of points sampled.

  2. Z

    Missing data in the analysis of multilevel and dependent data (Examples)

    • data.niaid.nih.gov
    Updated Jul 20, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Simon Grund; Oliver Lüdtke; Alexander Robitzsch (2023). Missing data in the analysis of multilevel and dependent data (Examples) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7773613
    Explore at:
    Dataset updated
    Jul 20, 2023
    Dataset provided by
    University of Hamburg
    IPN - Leibniz Institute for Science and Mathematics Education
    Authors
    Simon Grund; Oliver Lüdtke; Alexander Robitzsch
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Example data sets and computer code for the book chapter titled "Missing Data in the Analysis of Multilevel and Dependent Data" submitted for publication in the second edition of "Dependent Data in Social Science Research" (Stemmler et al., 2015). This repository includes the computer code (".R") and the data sets from both example analyses (Examples 1 and 2). The data sets are available in two file formats (binary ".rda" for use in R; plain-text ".dat").

    The data sets contain simulated data from 23,376 (Example 1) and 23,072 (Example 2) individuals from 2,000 groups on four variables:

    ID = group identifier (1-2000) x = numeric (Level 1) y = numeric (Level 1) w = binary (Level 2)

    In all data sets, missing values are coded as "NA".

  3. f

    UC_vs_US Statistic Analysis.xlsx

    • figshare.com
    xlsx
    Updated Jul 9, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    F. (Fabiano) Dalpiaz (2020). UC_vs_US Statistic Analysis.xlsx [Dataset]. http://doi.org/10.23644/uu.12631628.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jul 9, 2020
    Dataset provided by
    Utrecht University
    Authors
    F. (Fabiano) Dalpiaz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Sheet 1 (Raw-Data): The raw data of the study is provided, presenting the tagging results for the used measures described in the paper. For each subject, it includes multiple columns: A. a sequential student ID B an ID that defines a random group label and the notation C. the used notation: user Story or use Cases D. the case they were assigned to: IFA, Sim, or Hos E. the subject's exam grade (total points out of 100). Empty cells mean that the subject did not take the first exam F. a categorical representation of the grade L/M/H, where H is greater or equal to 80, M is between 65 included and 80 excluded, L otherwise G. the total number of classes in the student's conceptual model H. the total number of relationships in the student's conceptual model I. the total number of classes in the expert's conceptual model J. the total number of relationships in the expert's conceptual model K-O. the total number of encountered situations of alignment, wrong representation, system-oriented, omitted, missing (see tagging scheme below) P. the researchers' judgement on how well the derivation process explanation was explained by the student: well explained (a systematic mapping that can be easily reproduced), partially explained (vague indication of the mapping ), or not present.

    Tagging scheme:
    Aligned (AL) - A concept is represented as a class in both models, either
    

    with the same name or using synonyms or clearly linkable names; Wrongly represented (WR) - A class in the domain expert model is incorrectly represented in the student model, either (i) via an attribute, method, or relationship rather than class, or (ii) using a generic term (e.g., user'' instead ofurban planner''); System-oriented (SO) - A class in CM-Stud that denotes a technical implementation aspect, e.g., access control. Classes that represent legacy system or the system under design (portal, simulator) are legitimate; Omitted (OM) - A class in CM-Expert that does not appear in any way in CM-Stud; Missing (MI) - A class in CM-Stud that does not appear in any way in CM-Expert.

    All the calculations and information provided in the following sheets
    

    originate from that raw data.

    Sheet 2 (Descriptive-Stats): Shows a summary of statistics from the data collection,
    

    including the number of subjects per case, per notation, per process derivation rigor category, and per exam grade category.

    Sheet 3 (Size-Ratio):
    

    The number of classes within the student model divided by the number of classes within the expert model is calculated (describing the size ratio). We provide box plots to allow a visual comparison of the shape of the distribution, its central value, and its variability for each group (by case, notation, process, and exam grade) . The primary focus in this study is on the number of classes. However, we also provided the size ratio for the number of relationships between student and expert model.

    Sheet 4 (Overall):
    

    Provides an overview of all subjects regarding the encountered situations, completeness, and correctness, respectively. Correctness is defined as the ratio of classes in a student model that is fully aligned with the classes in the corresponding expert model. It is calculated by dividing the number of aligned concepts (AL) by the sum of the number of aligned concepts (AL), omitted concepts (OM), system-oriented concepts (SO), and wrong representations (WR). Completeness on the other hand, is defined as the ratio of classes in a student model that are correctly or incorrectly represented over the number of classes in the expert model. Completeness is calculated by dividing the sum of aligned concepts (AL) and wrong representations (WR) by the sum of the number of aligned concepts (AL), wrong representations (WR) and omitted concepts (OM). The overview is complemented with general diverging stacked bar charts that illustrate correctness and completeness.

    For sheet 4 as well as for the following four sheets, diverging stacked bar
    

    charts are provided to visualize the effect of each of the independent and mediated variables. The charts are based on the relative numbers of encountered situations for each student. In addition, a "Buffer" is calculated witch solely serves the purpose of constructing the diverging stacked bar charts in Excel. Finally, at the bottom of each sheet, the significance (T-test) and effect size (Hedges' g) for both completeness and correctness are provided. Hedges' g was calculated with an online tool: https://www.psychometrica.de/effect_size.html. The independent and moderating variables can be found as follows:

    Sheet 5 (By-Notation):
    

    Model correctness and model completeness is compared by notation - UC, US.

    Sheet 6 (By-Case):
    

    Model correctness and model completeness is compared by case - SIM, HOS, IFA.

    Sheet 7 (By-Process):
    

    Model correctness and model completeness is compared by how well the derivation process is explained - well explained, partially explained, not present.

    Sheet 8 (By-Grade):
    

    Model correctness and model completeness is compared by the exam grades, converted to categorical values High, Low , and Medium.

  4. c

    Sample Sales Dataset

    • cubig.ai
    zip
    Updated Jun 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). Sample Sales Dataset [Dataset]. https://cubig.ai/store/products/477/sample-sales-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 15, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
    Description

    1) Data Introduction • The Sample Sales Data is a retail sales dataset of 2,823 orders and 25 columns that includes a variety of sales-related data, including order numbers, product information, quantity, unit price, sales, order date, order status, customer and delivery information.

    2) Data Utilization (1) Sample Sales Data has characteristics that: • This dataset consists of numerical (sales, quantity, unit price, etc.), categorical (product, country, city, customer name, transaction size, etc.), and date (order date) variables, with missing values in some columns (STATE, ADDRESSLINE2, POSTALCODE, etc.). (2) Sample Sales Data can be used to: • Analysis of sales trends and performance by product: Key variables such as order date, product line, and country can be used to visualize and analyze monthly and yearly sales trends, the proportion of sales by product line, and top sales by country and region. • Segmentation and marketing strategies: Segmentation of customer groups based on customer information, transaction size, and regional data, and use them to design targeted marketing and customized promotion strategies.

  5. Z

    Sample raw MAYA simulation with reduced data

    • nde-dev.biothings.io
    • data.niaid.nih.gov
    • +2more
    Updated Feb 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ferguson, Deborah (2024). Sample raw MAYA simulation with reduced data [Dataset]. https://nde-dev.biothings.io/resources?id=zenodo_10668524
    Explore at:
    Dataset updated
    Feb 15, 2024
    Dataset authored and provided by
    Ferguson, Deborah
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is a sample of data from a MAYA numerical relativity simulation, primarily for use with the tutorials for the mayawaves python package. This simulation is of binary black holes with a mass ratio of 5 and precessing spins.

  6. d

    US B2B Phone Number Data | 148MM Phone Numbers, Verified Data

    • datarade.ai
    Updated Feb 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Salutary Data (2024). US B2B Phone Number Data | 148MM Phone Numbers, Verified Data [Dataset]. https://datarade.ai/data-products/salutary-data-b2b-data-phone-number-data-mobile-phone-72-salutary-data
    Explore at:
    .json, .csv, .xls, .txtAvailable download formats
    Dataset updated
    Feb 20, 2024
    Dataset authored and provided by
    Salutary Data
    Area covered
    United States of America
    Description

    Discover the ultimate resource for your B2B needs with our meticulously curated dataset, featuring 148MM+ highly relevant US B2B Contact Data records and associated company information.

    Very high fill rates for Phone Number, including for Mobile Phone!

    This encompasses a diverse range of fields, including Contact Name (First & Last), Work Address, Work Email, Personal Email, Mobile Phone, Direct-Dial Work Phone, Job Title, Job Function, Job Level, LinkedIn URL, Company Name, Domain, Email Domain, HQ Address, Employee Size, Revenue Size, Industry, NAICS and SIC Codes + Descriptions, ensuring you have the most detailed insights for your business endeavors.

    Key Features:

    Extensive Data Coverage: Access a vast pool of B2B Contact Data records, providing valuable information on where the contacts work now, empowering your sales, marketing, recruiting, and research efforts.

    Versatile Applications: Leverage this robust dataset for Sales Prospecting, Lead Generation, Marketing Campaigns, Recruiting initiatives, Identity Resolution, Analytics, Research, and more.

    Phone Number Data Inclusion: Benefit from our comprehensive Phone Number Data, ensuring you have direct and effective communication channels. Explore our Phone Number Datasets and Phone Number Databases for an even more enriched experience.

    Flexible Pricing Models: Tailor your investment to match your unique business needs, data use-cases, and specific requirements. Choose from targeted lists, CSV enrichment, or licensing our entire database or subsets to seamlessly integrate this data into your products, platform, or service offerings.

    Strategic Utilization of B2B Intelligence:

    Sales Prospecting: Identify and engage with the right decision-makers to drive your sales initiatives.

    Lead Generation: Generate high-quality leads with precise targeting based on specific criteria.

    Marketing Campaigns: Amplify your marketing strategies by reaching the right audience with targeted campaigns.

    Recruiting: Streamline your recruitment efforts by connecting with qualified candidates.

    Identity Resolution: Enhance your data quality and accuracy by resolving identities with our reliable dataset.

    Analytics and Research: Fuel your analytics and research endeavors with comprehensive and up-to-date B2B insights.

    Access Your Tailored B2B Data Solution:

    Reach out to us today to explore flexible pricing options and discover how Salutary Data Company Data, B2B Contact Data, B2B Marketing Data, B2B Email Data, Phone Number Data, Phone Number Datasets, and Phone Number Databases can transform your business strategies. Elevate your decision-making with top-notch B2B intelligence.

  7. S1 File -

    • plos.figshare.com
    xlsx
    Updated Oct 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kalina Hristova; William C. Wimley (2023). S1 File - [Dataset]. http://doi.org/10.1371/journal.pone.0289619.s001
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Oct 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Kalina Hristova; William C. Wimley
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We present a simple, spreadsheet-based method to determine the statistical significance of the difference between any two arbitrary curves. This modified Chi-squared method addresses two scenarios: A single measurement at each point with known standard deviation, or multiple measurements at each point averaged to produce a mean and standard error. The method includes an essential correction for the deviation from normality in measurements with small sample size, which are typical in biomedical sciences. Statistical significance is determined without regard to the functionality of the curves, or the signs of the differences. Numerical simulations are used to validate the procedure. Example experimental data are used to demonstrate its application. An Excel spreadsheet is provided for performing the calculations for either scenario.

  8. Phone Number Data | 50M+ Verified Phone Numbers for Global Professionals |...

    • datarade.ai
    Updated Jan 1, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Success.ai (2018). Phone Number Data | 50M+ Verified Phone Numbers for Global Professionals | Contact Details from 170M+ Profiles - Best Price Guarantee [Dataset]. https://datarade.ai/data-products/phone-number-data-50m-verified-phone-numbers-for-global-pr-success-ai
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Jan 1, 2018
    Dataset provided by
    Area covered
    Algeria, Panama, Mozambique, Mongolia, Tonga, Korea (Democratic People's Republic of), Timor-Leste, San Marino, Uganda, Germany
    Description

    Success.ai’s Phone Number Data offers direct access to over 50 million verified phone numbers for professionals worldwide, extracted from our expansive collection of 170 million profiles. This robust dataset includes work emails and key decision-maker profiles, making it an essential resource for companies aiming to enhance their communication strategies and outreach efficiency. Whether you're launching targeted marketing campaigns, setting up sales calls, or conducting market research, our phone number data ensures you're connected to the right professionals at the right time.

    Why Choose Success.ai’s Phone Number Data?

    Direct Communication: Reach out directly to professionals with verified phone numbers and work emails, ensuring your message gets to the right person without delay. Global Coverage: Our data spans across continents, providing phone numbers for professionals in North America, Europe, APAC, and emerging markets. Continuously Updated: We regularly refresh our dataset to maintain accuracy and relevance, reflecting changes like promotions, company moves, or industry shifts. Comprehensive Data Points:

    Verified Phone Numbers: Direct lines and mobile numbers of professionals across various industries. Work Emails: Reliable email addresses to complement phone communications. Professional Profiles: Decision-makers’ profiles including job titles, company details, and industry information. Flexible Delivery and Integration: Success.ai offers this dataset in various formats suitable for seamless integration into your CRM or sales platform. Whether you prefer API access for real-time data retrieval or static files for periodic updates, we tailor the delivery to meet your operational needs.

    Competitive Pricing with Best Price Guarantee: We provide this essential data at the most competitive prices in the industry, ensuring you receive the best value for your investment. Our best price guarantee means you can trust that you are getting the highest quality data at the lowest possible cost.

    Targeted Applications for Phone Number Data:

    Sales and Telemarketing: Enhance your telemarketing campaigns by reaching out directly to potential customers, bypassing gatekeepers. Market Research: Conduct surveys and research directly with industry professionals to gather insights that can shape your business strategy. Event Promotion: Invite prospects to webinars, conferences, and seminars directly through personal calls or SMS. Customer Support: Improve customer service by integrating accurate contact information into your support systems. Quality Assurance and Compliance:

    Data Accuracy: Our data is verified for accuracy to ensure over 99% deliverability rates. Compliance: Fully compliant with GDPR and other international data protection regulations, allowing you to use the data with confidence globally. Customization and Support:

    Tailored Data Solutions: Customize the data according to geographic, industry-specific, or job role filters to match your unique business needs. Dedicated Support: Our team is on hand to assist with data integration, usage, and any questions you may have. Start with Success.ai Today: Engage with Success.ai to leverage our Phone Number Data and connect with global professionals effectively. Schedule a consultation or request a sample through our dedicated client portal and begin transforming your outreach and communication strategies today.

    Remember, with Success.ai, you don’t just buy data; you invest in a partnership that grows with your business needs, backed by our commitment to quality and affordability.

  9. d

    Data from: Sample locations and total number of species found at each...

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Nov 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Sample locations and total number of species found at each station from Pellegrino and Hubbard (1983) [Dataset]. https://catalog.data.gov/dataset/sample-locations-and-total-number-of-species-found-at-each-station-from-pellegrino-and-hub
    Explore at:
    Dataset updated
    Nov 21, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Description

    This GIS layer provides detailed information from Pellegrino and Hubbard (1983). It shows the sample locations and provides a summary of the total number of species found at each station (species_richness).

  10. r

    Data from: M007 - A Skateboard (v1.0)

    • resodate.org
    Updated Dec 17, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Andreas Steinbrecher (2021). M007 - A Skateboard (v1.0) [Dataset]. http://doi.org/10.14279/depositonce-14649
    Explore at:
    Dataset updated
    Dec 17, 2021
    Dataset provided by
    DepositOnce
    Technische Universität Berlin
    Authors
    Andreas Steinbrecher
    Description

    The considerations in this report A Skateboard are part of the example collection which can be found in http://www3.math.tu-berlin.de/multiphysics/Examples/. The aim is to investigate different formulations, i.e., regularized formulations or also index reduced formulations, of the model equations in combination with different numerical solvers with respect to its applicability, efficiency, accuracy, and robustness.

  11. d

    Phone Number Data | USA Coverage | 765 Mil+ Numbers

    • datarade.ai
    .csv
    Updated Mar 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BIGDBM (2023). Phone Number Data | USA Coverage | 765 Mil+ Numbers [Dataset]. https://datarade.ai/data-products/bigdbm-us-consumer-phone-package-bigdbm
    Explore at:
    .csvAvailable download formats
    Dataset updated
    Mar 15, 2023
    Dataset authored and provided by
    BIGDBM
    Area covered
    United States
    Description

    The US Consumer Phone file contains phone numbers, mobile and landline, tied to an individual in the Consumer Database. The fields available include the phone number, phone type, mobile carrier, and Do Not Call registry status.

    All phone numbers can be processed and cleansed using telecom carrier data. The telecom data, including phone and texting activity, porting instances, carrier scoring, spam, and known fraud activity, comprise a proprietary Phone Quality Level (PQL), which is a data-science derived score to ensure the highest levels of deliverability at a fraction of the cost compared to competitors.

    We have developed this file to be tied to our Consumer Demographics Database so additional demographics can be applied as needed. Each record is ranked by confidence and only the highest quality data is used.

    Note - all Consumer packages can include necessary PII (address, email, phone, DOB, etc.) for merging, linking, and activation of the data.

    BIGDBM Privacy Policy: https://bigdbm.com/privacy.html

  12. R

    Data and statistics of a direct numerical simulation of adverse pressure...

    • entrepot.recherche.data.gouv.fr
    avi, mkv, nc, pdf +2
    Updated Oct 17, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    LAVAL Jean-Philippe; LAVAL Jean-Philippe (2024). Data and statistics of a direct numerical simulation of adverse pressure gradient turbulent boundary layer [Dataset]. http://doi.org/10.57745/FMZ9HP
    Explore at:
    nc(33592456192), nc(183750471), text/x-python(5402), zip(585921), nc(29140), avi(88888284), pdf(3015083), nc(41948), mkv(131819512), nc(268780), nc(23532054496), nc(23532054954), nc(17649045648)Available download formats
    Dataset updated
    Oct 17, 2024
    Dataset provided by
    Recherche Data Gouv
    Authors
    LAVAL Jean-Philippe; LAVAL Jean-Philippe
    License

    https://entrepot.recherche.data.gouv.fr/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.57745/FMZ9HPhttps://entrepot.recherche.data.gouv.fr/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.57745/FMZ9HP

    Dataset funded by
    GENCI
    Description

    Although it is a widespread phenomenon in nature, turbulence in fluids (gases, liquids) is still very poorly understood. One area of research involves analyzing data from academic flow simulations. To make progress, the scientific community needs a large amount of reliable data in various configurations. Turbulent flows near solid flat or curved walls are very interesting examples. The database is composed of the 3D raw data (velocity, pressure, time derivative of velocity) and statistics (mean, Reynolds stresses, length scales) of a direct numerical simulations of moderate adverse pressure gradient (decelerating) turbulent boundary layer on a flat plate at Reynolds number up to Reθ = 8000 .

  13. m

    Dataset of development of business during the COVID-19 crisis

    • data.mendeley.com
    • narcis.nl
    Updated Nov 9, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tatiana N. Litvinova (2020). Dataset of development of business during the COVID-19 crisis [Dataset]. http://doi.org/10.17632/9vvrd34f8t.1
    Explore at:
    Dataset updated
    Nov 9, 2020
    Authors
    Tatiana N. Litvinova
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    To create the dataset, the top 10 countries leading in the incidence of COVID-19 in the world were selected as of October 22, 2020 (on the eve of the second full of pandemics), which are presented in the Global 500 ranking for 2020: USA, India, Brazil, Russia, Spain, France and Mexico. For each of these countries, no more than 10 of the largest transnational corporations included in the Global 500 rating for 2020 and 2019 were selected separately. The arithmetic averages were calculated and the change (increase) in indicators such as profitability and profitability of enterprises, their ranking position (competitiveness), asset value and number of employees. The arithmetic mean values of these indicators for all countries of the sample were found, characterizing the situation in international entrepreneurship as a whole in the context of the COVID-19 crisis in 2020 on the eve of the second wave of the pandemic. The data is collected in a general Microsoft Excel table. Dataset is a unique database that combines COVID-19 statistics and entrepreneurship statistics. The dataset is flexible data that can be supplemented with data from other countries and newer statistics on the COVID-19 pandemic. Due to the fact that the data in the dataset are not ready-made numbers, but formulas, when adding and / or changing the values in the original table at the beginning of the dataset, most of the subsequent tables will be automatically recalculated and the graphs will be updated. This allows the dataset to be used not just as an array of data, but as an analytical tool for automating scientific research on the impact of the COVID-19 pandemic and crisis on international entrepreneurship. The dataset includes not only tabular data, but also charts that provide data visualization. The dataset contains not only actual, but also forecast data on morbidity and mortality from COVID-19 for the period of the second wave of the pandemic in 2020. The forecasts are presented in the form of a normal distribution of predicted values and the probability of their occurrence in practice. This allows for a broad scenario analysis of the impact of the COVID-19 pandemic and crisis on international entrepreneurship, substituting various predicted morbidity and mortality rates in risk assessment tables and obtaining automatically calculated consequences (changes) on the characteristics of international entrepreneurship. It is also possible to substitute the actual values identified in the process and following the results of the second wave of the pandemic to check the reliability of pre-made forecasts and conduct a plan-fact analysis. The dataset contains not only the numerical values of the initial and predicted values of the set of studied indicators, but also their qualitative interpretation, reflecting the presence and level of risks of a pandemic and COVID-19 crisis for international entrepreneurship.

  14. VAPOR Sample Data

    • data.ucar.edu
    archive
    Updated Jan 12, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Clyne, John; Jaroszynski, Stanislaw; Li, Samuel; Pearse, Scott (2022). VAPOR Sample Data [Dataset]. http://doi.org/10.5065/khh0-6nko
    Explore at:
    archiveAvailable download formats
    Dataset updated
    Jan 12, 2022
    Dataset provided by
    University Corporation for Atmospheric Research
    Authors
    Clyne, John; Jaroszynski, Stanislaw; Li, Samuel; Pearse, Scott
    Description

    VAPOR is the Visualization and Analysis Platform for Ocean, Atmosphere, and Solar Researchers. VAPOR provides an interactive 3D visualization environment that can also produce animations and still frame images. VAPOR runs on most UNIX and Windows systems equipped with modern 3D graphics cards. VAPOR is a product of the National Center for Atmospheric Research's Computational and Information Systems Lab. Support for VAPOR is provided by the U.S. National Science Foundation and by the Korea Institute of Science and Technology Information This dataset contains sample files of model outputs from numerical simulations that VAPOR is capable of directly reading. They are not related to each other aside from being sample data for VAPOR.
    To unpack the tar.gz files on Linux/OSX, issue the command tar -xzvf [myFile].tar.gz on the file you've downloaded. On Windows, a program like 7-zip can perform that operation. Once unpacked, the files can be directly imported into VAPOR, or converted to VDC. For more information see the "Getting Data Into VAPOR" Related Link below.

  15. r

    Data from: M002 - Slider Crank (v1.0)

    • resodate.org
    Updated Dec 17, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Andreas Steinbrecher (2021). M002 - Slider Crank (v1.0) [Dataset]. http://doi.org/10.14279/depositonce-14632
    Explore at:
    Dataset updated
    Dec 17, 2021
    Dataset provided by
    DepositOnce
    Technische Universität Berlin
    Authors
    Andreas Steinbrecher
    Description

    The considerations in this report Slider Crank are part of the example collection which can be found in http://www3.math.tu-berlin.de/multiphysics/Examples/. The aim is to investigate different formulations, i.e., regularized formulations or also index reduced formulations, of the model equations in combination with different numerical solvers with respect to its applicability, efficiency, accuracy, and robustness.

  16. Perfection ratio of numbers 1 to 21.5 million

    • kaggle.com
    zip
    Updated Nov 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Erick Magyar (2024). Perfection ratio of numbers 1 to 21.5 million [Dataset]. https://www.kaggle.com/datasets/erickmagyar/perfection-ratio-of-numbers-1-to-1-million/discussion
    Explore at:
    zip(222128399 bytes)Available download formats
    Dataset updated
    Nov 12, 2024
    Authors
    Erick Magyar
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The perfection ratio of a number is a concept that is related to perfect numbers and how closely a given number approximates the ideal perfection ratio, which is 2.0.

    Perfect Numbers:

    A perfect number is a positive integer that is equal to the sum of its proper divisors, excluding the number itself. For example: • 6 is a perfect number because its divisors are 1, 2, and 3, and 1 + 2 + 3 = 6 . • 28 is another perfect number because its divisors are 1, 2, 4, 7, and 14, and 1 + 2 + 4 + 7 + 14 = 28 .

    Perfection Ratio:

    The perfection ratio of a number n is a measure of how close the sum of its divisors (excluding the number itself) is to the number. It is defined as:

    \text{Perfection Ratio} = \frac{\text{Sum of Proper Divisors of } n}{n}

    •  If the perfection ratio is 2.0, the number is considered perfect.
    •  If the perfection ratio is greater than 2.0, the number is abundant (i.e., the sum of its proper divisors exceeds the number itself).
    •  If the perfection ratio is less than 2.0, the number is deficient (i.e., the sum of its proper divisors is less than the number itself).
    

    Examples:

    1. Perfect Number Example:
    •  For n = 6 :
    •  Proper divisors: 1, 2, 3 
    •  Sum of proper divisors: 1 + 2 + 3 = 6 
    •  Perfection ratio: \frac{6}{6} = 1.0 
    •  Since the perfection ratio is 2.0 for a perfect number, we see the idea of perfect numbers where the sum of divisors divides evenly.
    

    1. Near-Perfect Numbers

    • Definition: A near-perfect number is a number for which the sum of its proper divisors is close to the number itself but not exactly equal.
    • Example: Consider the number 24. Its proper divisors are 1, 2, 3, 4, 6, 8, and 12. The sum is 36, which is larger than 24, making it almost perfect in the sense that the sum of its divisors is significant but not equal to the number.

    2. Almost-Perfect Numbers

    • Definition: An almost-perfect number is a number where the sum of its proper divisors equals the number minus one.
    • Example: The number 16 is an almost-perfect number. Its proper divisors are 1, 2, 4, and 8, which sum to 15 (16 - 1).

    3. Abundant Numbers

    • Definition: A number is abundant if the sum of its proper divisors is greater than the number itself.
    • Example: The number 12 is abundant because its proper divisors (1, 2, 3, 4, and 6) sum to 16, which is greater than 12.

    4. Deficient Numbers

    • Definition: A number is deficient if the sum of its proper divisors is less than the number itself.
    • Example: The number 8 is deficient because its proper divisors (1, 2, and 4) sum to 7, which is less than 8.

    5. Semiperfect Numbers

    • Definition: A semiperfect number is a number that is equal to the sum of some (or all) of its proper divisors.
    • Example: The number 12 is semiperfect because 12 = 6 + 4 + 2 (some of its proper divisors).

    Relevance to the Heat Map

    • Density Analysis: By analyzing the heat map further, we might observe concentrations at other specific perfection ratios besides 2. These could indicate near-perfect, almost-perfect, abundant, deficient, or semiperfect numbers.
    • Patterns and Trends: Identifying where these numbers cluster can help us understand the distribution and frequency of numbers with these properties within your dataset.
  17. Orange dataset table

    • figshare.com
    xlsx
    Updated Mar 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rui Simões (2022). Orange dataset table [Dataset]. http://doi.org/10.6084/m9.figshare.19146410.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Mar 4, 2022
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Rui Simões
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The complete dataset used in the analysis comprises 36 samples, each described by 11 numeric features and 1 target. The attributes considered were caspase 3/7 activity, Mitotracker red CMXRos area and intensity (3 h and 24 h incubations with both compounds), Mitosox oxidation (3 h incubation with the referred compounds) and oxidation rate, DCFDA fluorescence (3 h and 24 h incubations with either compound) and oxidation rate, and DQ BSA hydrolysis. The target of each instance corresponds to one of the 9 possible classes (4 samples per class): Control, 6.25, 12.5, 25 and 50 µM for 6-OHDA and 0.03, 0.06, 0.125 and 0.25 µM for rotenone. The dataset is balanced, it does not contain any missing values and data was standardized across features. The small number of samples prevented a full and strong statistical analysis of the results. Nevertheless, it allowed the identification of relevant hidden patterns and trends.

    Exploratory data analysis, information gain, hierarchical clustering, and supervised predictive modeling were performed using Orange Data Mining version 3.25.1 [41]. Hierarchical clustering was performed using the Euclidean distance metric and weighted linkage. Cluster maps were plotted to relate the features with higher mutual information (in rows) with instances (in columns), with the color of each cell representing the normalized level of a particular feature in a specific instance. The information is grouped both in rows and in columns by a two-way hierarchical clustering method using the Euclidean distances and average linkage. Stratified cross-validation was used to train the supervised decision tree. A set of preliminary empirical experiments were performed to choose the best parameters for each algorithm, and we verified that, within moderate variations, there were no significant changes in the outcome. The following settings were adopted for the decision tree algorithm: minimum number of samples in leaves: 2; minimum number of samples required to split an internal node: 5; stop splitting when majority reaches: 95%; criterion: gain ratio. The performance of the supervised model was assessed using accuracy, precision, recall, F-measure and area under the ROC curve (AUC) metrics.

  18. d

    Data from: Symbolic representation of numerosity by honeybees (Apis...

    • search.dataone.org
    • data.niaid.nih.gov
    • +2more
    Updated May 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Scarlett Howard; Aurore Avarguès-Weber; Jair Garcia; Andrew Greentree; Adrian Dyer (2025). Symbolic representation of numerosity by honeybees (Apis mellifera): matching characters to small quantities [Dataset]. http://doi.org/10.5061/dryad.990m8h3
    Explore at:
    Dataset updated
    May 27, 2025
    Dataset provided by
    Dryad Digital Repository
    Authors
    Scarlett Howard; Aurore Avarguès-Weber; Jair Garcia; Andrew Greentree; Adrian Dyer
    Time period covered
    May 17, 2019
    Description

    The assignment of a symbolic representation to a specific numerosity is a fundamental requirement for humans solving complex mathematical calculations used in diverse applications such as algebra, accounting, physics, and everyday commerce. Here we show that honeybees are able to learn to match a sign to a numerosity, or a numerosity to a sign and subsequently transfer this knowledge to novel numerosity stimuli changed in colour properties, shape, and configuration. While honeybees learnt the associations between two quantities (two; three) and two signs (N-shape; inverted T-shape), they failed at reversing their specific task of sign-to-numerosity-matching to numerosity-to-sign-matching and vice-versa (i.e. a honeybee that learnt to match a sign to a number of elements was not able to invert this learning to match the numerosity of elements to a sign). Thus, while bees could learn the association between a symbol and numerosity, it was linked to the specific task and bees could not spo...

  19. Z

    The dataset for the isogeometric boundary element method for acoustic...

    • data-staging.niaid.nih.gov
    • data.niaid.nih.gov
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Inci, Emin Oguz; Coox, Laurens; Atak, Onur; Deckers, Elke; Desmet, Wim (2020). The dataset for the isogeometric boundary element method for acoustic problems [Dataset]. https://data-staging.niaid.nih.gov/resources?id=zenodo_3349904
    Explore at:
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Siemens Industry Software NV
    Katholieke Universiteit Te Leuven, DMMS Lab Flanders Make
    Katholieke Universiteit Te Leuven, DMMS Flanders Makel
    Authors
    Inci, Emin Oguz; Coox, Laurens; Atak, Onur; Deckers, Elke; Desmet, Wim
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository includes the data associated to the numerical examples in the 'Applications of an isogeometric indirect boundary element method and the importance of accurate geometrical representation in acoustic problems' article submitted to the Engineering Analysis with Boundary Elements Elsevier journal. The data comprise of the CAD models, NURBS models to be solved by the isogeometric boundary element method (IGiBEM) with the proposed method in the article, meshes to be solved indirect boundary element method (iBEM) with LMS Virtual.Lab 13.7v, and the results of the numerical examples: 1) vibrating cube, 2) car hood and 3) loudspeaker. The IGiBEM software is under private license of KU Leuven and Siemens Industry Software NV, therefore, it is not shared in this repository.

  20. f

    Prediction comparison for model based on the Lorenz system.

    • figshare.com
    • plos.figshare.com
    xls
    Updated Jun 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HsinKai Wang; Meihui Guo; Sangyeol Lee; Cheng-Han Chua (2023). Prediction comparison for model based on the Lorenz system. [Dataset]. http://doi.org/10.1371/journal.pone.0278816.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    PLOS ONE
    Authors
    HsinKai Wang; Meihui Guo; Sangyeol Lee; Cheng-Han Chua
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Prediction comparison for model based on the Lorenz system.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Yanbo Tang (2024). A Note on Monte Carlo Integration in High Dimensions [Dataset]. http://doi.org/10.6084/m9.figshare.24271730.v1
Organization logo

Data from: A Note on Monte Carlo Integration in High Dimensions

Related Article
Explore at:
rtfAvailable download formats
Dataset updated
Jul 23, 2024
Dataset provided by
Taylor & Francishttps://taylorandfrancis.com/
Authors
Yanbo Tang
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Monte Carlo integration is a commonly used technique to compute intractable integrals and is typically thought to perform poorly for very high-dimensional integrals. To show that this is not always the case, we examine Monte Carlo integration using techniques from the high-dimensional statistics literature by allowing the dimension of the integral to increase. In doing so, we derive nonasymptotic bounds for the relative and absolute error of the approximation for some general classes of functions through concentration inequalities. We provide concrete examples in which the magnitude of the number of points sampled needed to guarantee a consistent estimate varies between polynomial to exponential, and show that in theory arbitrarily fast or slow rates are possible. This demonstrates that the behavior of Monte Carlo integration in high dimensions is not uniform. Through our methods we also obtain nonasymptotic confidence intervals which are valid regardless of the number of points sampled.

Search
Clear search
Close search
Google apps
Main menu