13 datasets found
  1. Figure S1. Loci statistics boxplots for data derived from [1].

    • figshare.com
    pdf
    Updated Jan 19, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amir Szitenberg (2016). Figure S1. Loci statistics boxplots for data derived from [1]. [Dataset]. http://doi.org/10.6084/m9.figshare.1409424.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jan 19, 2016
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Amir Szitenberg
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    For each locus, the plots illustrate the distributions of (from top to bottom) per-position entropy, per-position gap score [4], per position conservation score [4], sequence length and GC content. 1. Kawahara AY, Breinholt JW. Phylogenomics provides strong evidence for relationships of butterflies and moths. Proc R Soc B. 2014;281: 20140970. 2. Robinson DF, Foulds LR. Comparison of phylogenetic trees. Math Biosci. 1981;53: 131–147. 3. Kuhner MK, Felsenstein J. A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. Mol Biol Evol. 1994;11: 459–468. 4. Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25: 1972–1973.

  2. f

    Data from: Sparse Functional Boxplots for Multivariate Curves

    • datasetcatalog.nlm.nih.gov
    • tandf.figshare.com
    Updated Apr 19, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qu, Zhuo; Genton, Marc G. (2022). Sparse Functional Boxplots for Multivariate Curves [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000238011
    Explore at:
    Dataset updated
    Apr 19, 2022
    Authors
    Qu, Zhuo; Genton, Marc G.
    Description

    This paper introduces the sparse functional boxplot and the intensity sparse functional boxplot as practical exploratory tools. Besides being available for complete functional data, they can be used in sparse univariate and multivariate functional data. The sparse functional boxplot, based on the functional boxplot, displays sparseness proportions within the 50% central region. The intensity sparse functional boxplot indicates the relative intensity of fitted sparse point patterns in the central region. The two-stage functional boxplot, which derives from the functional boxplot to detect outliers, is furthermore extended to its sparse form. We also contribute to sparse data fitting improvement and sparse multivariate functional data depth. In a simulation study, we evaluate the goodness of data fitting, several depth proposals for sparse multivariate functional data, and compare the results of outlier detection between the sparse functional boxplot and its two-stage version. The practical applications of the sparse functional boxplot and intensity sparse functional boxplot are illustrated with two public health datasets. Supplementary materials and codes are available for readers to apply our visualization tools and replicate the analysis.

  3. Prioritization of barriers that hinders Local Flexibility Market...

    • data.europa.eu
    unknown
    Updated Jun 8, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zenodo (2020). Prioritization of barriers that hinders Local Flexibility Market proliferation [Dataset]. https://data.europa.eu/data/datasets/oai-zenodo-org-3855546?locale=bg
    Explore at:
    unknown(2109374)Available download formats
    Dataset updated
    Jun 8, 2020
    Dataset authored and provided by
    Zenodohttp://zenodo.org/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains the prioritization provided by a panel of 15 experts to a set of 28 barriers categories for 8 different roles of the future energy system. A Delphi method was followed and the scores provided in the three rounds carried out are included. The dataset also contains the scripts used to assess the results and the output of this assessment. A list of the information contained in this file is: data folder: this folders includes the scores given by the 15 experts in the 3 rounds. Every round is in an individual folder. There is a file per expert that has the scores between -5 (not relevant at all) to 5 (completely relevant) per barrier (rows) and actor (columns). There is also a file with the description of the experts in terms of their position in the company, the type of company and the country. fig folder: this folder includes the figures created to assess the information provided by the experts. For each round, the following figures are created (in each respective folder): Boxplot with the distribution of scores per barriers and roles. Heatmap with the mean scores per barriers and roles. Boxplots with the comparison of the different distributions provided by the experts of each group (depending on the keywords) per barrier and role. Heatmap with the mean score per barrier and use case and with the prioritization per barrier and use case. Finally, bar plots with the mean scores differences between rounds and boxplot with comparisons of the scores distributions are also provided. stat folder: this folder includes the files with the results of the different statistical assessment carried out. For each round, the following figures are created (in each respective folder): The statistics used to assess the scores (Intraclass correlation coefficient, Inter-rater agreement, Inter-rater agreement p-value, Homogeneity of Variances, Average interquartile range, Standard Deviation of interquartile ranges, Friedman test p-value Average power post hoc) per barrier and per role. The results of the post hoc of the Friedman Test per berries and per roles. The average score per barrier and per role. The mean value of the scores provided by the experts grouped by the keywords per barrier and role. P-value of the comparison of these two values. The end prioritization of the barrier for the use case (averaging the scores or merging the critical sets) Finally, the differences between the mean and standard deviations of the scores between two consecutive rounds are provided.

  4. Predict Term Deposit

    • kaggle.com
    zip
    Updated Nov 29, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aslan Ahmedov (2021). Predict Term Deposit [Dataset]. https://www.kaggle.com/aslanahmedov/predict-term-deposit
    Explore at:
    zip(588608 bytes)Available download formats
    Dataset updated
    Nov 29, 2021
    Authors
    Aslan Ahmedov
    Description

    Predict Term Deposit

    Introduction

    Bank has multiple banking products that it sells to customer such as saving account, credit cards, investments etc. It wants to which customer will purchase its credit cards. For the same it has various kind of information regarding the demographic details of the customer, their banking behavior etc. Once it can predict the chances that customer will purchase a product, it wants to use the same to make pre-payment to the authors.

    In this part I will demonstrate how to build a model, to predict which clients will subscribing to a term deposit, with inception of machine learning. In the first part we will deal with the description and visualization of the analysed data, and in the second we will go to data classification models.

    Strategy

    -Desire target -Data Understanding -Preprocessing Data -Machine learning Model -Prediction -Comparing Results

    Desire Target

    Predict if a client will subscribe (yes/no) to a term deposit — this is defined as a classification problem.

    Data

    The dataset (Assignment-2_data.csv) used in this assignment contains bank customers’ data. File name: Assignment-2_Data File format: . csv Numbers of Row: 45212 Numbers of Attributes: 17 non- empty conditional attributes attributes and one decision attribute.

    imagehttps://user-images.githubusercontent.com/91852182/143783430-eafd25b0-6d40-40b8-ac5b-1c4f67ca9e02.png"> imagehttps://user-images.githubusercontent.com/91852182/143783451-3e49b817-29a6-4108-b597-ce35897dda4a.png">

    Exploratory Data Analysis (EDA)

    Data pre-processing is a main step in Machine Learning as the useful information which can be derived it from data set directly affects the model quality so it is extremely important to do at least necessary preprocess for our data before feeding it into our model.

    In this assignment, we are going to utilize python to develop a predictive machine learning model. First, we will import some important and necessary libraries.

    Below we are can see that there are various numerical and categorical columns. The most important column here is y, which is the output variable (desired target): this will tell us if the client subscribed to a term deposit(binary: ‘yes’,’no’).

    imagehttps://user-images.githubusercontent.com/91852182/143783456-78c22016-149b-4218-a4a5-765ca348f069.png">

    We must to check missing values in our dataset if we do have any and do, we have any duplicated values or not.

    imagehttps://user-images.githubusercontent.com/91852182/143783471-a8656640-ec57-4f38-8905-35ef6f3e7f30.png">

    We can see that in 'age' 9 missing values and 'balance' as well 3 values missed. In this case based that our dataset it has around 45k row I will remove them from dataset. on Pic 1 and 2 you will see before and after.

    imagehttps://user-images.githubusercontent.com/91852182/143783474-b3898011-98e3-43c8-bd06-2cfcde714694.png">

    From the above analysis we can see that only 5289 people out of 45200 have subscribed which is roughly 12%. We can see that our dataset highly unbalanced. we need to take it as a note.

    imagehttps://user-images.githubusercontent.com/91852182/143783534-a05020a8-611d-4da1-98cf-4fec811cb5d8.png">

    Our list of categorical variables.

    imagehttps://user-images.githubusercontent.com/91852182/143783542-d40006cd-4086-4707-a683-f654a8cb2205.png">

    Our list of numerical variables.

    imagehttps://user-images.githubusercontent.com/91852182/143783551-6b220f99-2c4d-47d0-90ab-18ede42a4ae5.png">

    "Age" Q-Q Plots and Box Plot.

    In above boxplot we can see that some point in very young age and as well impossible age. So,

    imagehttps://user-images.githubusercontent.com/91852182/143783564-ad0e2a27-5df5-4e04-b5d7-6d218cabd405.png"> imagehttps://user-images.githubusercontent.com/91852182/143783589-5abf0a0b-8bab-4192-98c8-d2e04f32a5c5.png">

    Now, we don’t have issues on this feature so we can use it

    imagehttps://user-images.githubusercontent.com/91852182/143783599-5205eddb-a0f5-446d-9f45-cc1adbfcce67.png"> imagehttps://user-images.githubusercontent.com/91852182/143783601-e520d59c-3b21-4627-a9bb-cac06f415a1e.png">

    "Duration" Q-Q Plots and Box Plot

    imagehttps://user-images.githubusercontent.com/91852182/143783634-03e5a584-a6fb-4bcb-8dc5-1f3cc50f9507.png"> imagehttps://user-images.githubusercontent.com/91852182/143783640-f6e71323-abbe-49c1-9935-35ffb2d10569.png">

    This attribute highly affects the output target (e.g., if duration=0 then y=’no’). Yet, the duration is not known before a call is performed. Also, after the end of the call y is obviously known. Thus, this input should only be included for benchmark purposes...

  5. Appendix B. Three boxplots comparing phenotypic trait measures between...

    • wiley.figshare.com
    • figshare.com
    html
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matteo Garbelotto; Gianni Della Rocca; Todd Osmundson; Vincenzo di Lonardo; Roberto Danti (2023). Appendix B. Three boxplots comparing phenotypic trait measures between populations; comparisons correspond to tests 1–3 as shown in Fig. 1 in text. [Dataset]. http://doi.org/10.6084/m9.figshare.3564105.v1
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Wileyhttps://www.wiley.com/
    Authors
    Matteo Garbelotto; Gianni Della Rocca; Todd Osmundson; Vincenzo di Lonardo; Roberto Danti
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Three boxplots comparing phenotypic trait measures between populations; comparisons correspond to tests 1–3 as shown in Fig. 1 in text.

  6. Boxplots comparing Bray-Curtis dissimilarity distances for sites sampled in...

    • plos.figshare.com
    tiff
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kristina Cervantes-Yoshida; Robert A. Leidy; Stephanie M. Carlson (2023). Boxplots comparing Bray-Curtis dissimilarity distances for sites sampled in both time periods, presented separately for low-impacted sites and urbanized sites. [Dataset]. http://doi.org/10.1371/journal.pone.0141707.g004
    Explore at:
    tiffAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Kristina Cervantes-Yoshida; Robert A. Leidy; Stephanie M. Carlson
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Boxplots comparing Bray-Curtis dissimilarity distances for sites sampled in both time periods, presented separately for low-impacted sites and urbanized sites.

  7. The comparison results of different algorithms on CEC2017 functions with...

    • plos.figshare.com
    xls
    Updated Jun 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). The comparison results of different algorithms on CEC2017 functions with D=30. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 6, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The comparison results of different algorithms on CEC2017 functions with D=30.

  8. Comparison of result on welded beam design problem.

    • plos.figshare.com
    xls
    Updated Jun 13, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). Comparison of result on welded beam design problem. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t011
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 13, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comparison of result on welded beam design problem.

  9. Comparison of result on three-bar truss design problem.

    • plos.figshare.com
    xls
    Updated Jun 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). Comparison of result on three-bar truss design problem. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t013
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 13, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comparison of result on three-bar truss design problem.

  10. Comparison of result on speed reducer design problem.

    • plos.figshare.com
    xls
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). Comparison of result on speed reducer design problem. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t014
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comparison of result on speed reducer design problem.

  11. Comparison of result on pressure vessel design problem.

    • plos.figshare.com
    xls
    Updated Jun 13, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). Comparison of result on pressure vessel design problem. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t010
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 13, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comparison of result on pressure vessel design problem.

  12. The comparison results of different algorithms on CEC2019 functions.

    • plos.figshare.com
    xls
    Updated Jun 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). The comparison results of different algorithms on CEC2019 functions. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t006
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 13, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The comparison results of different algorithms on CEC2019 functions.

  13. The comparison results of different algorithms on 23 benchmark functions...

    • plos.figshare.com
    xls
    Updated Jun 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). The comparison results of different algorithms on 23 benchmark functions with D=30. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 6, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The comparison results of different algorithms on 23 benchmark functions with D=30.

  14. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Amir Szitenberg (2016). Figure S1. Loci statistics boxplots for data derived from [1]. [Dataset]. http://doi.org/10.6084/m9.figshare.1409424.v1
Organization logoOrganization logo

Figure S1. Loci statistics boxplots for data derived from [1].

Explore at:
pdfAvailable download formats
Dataset updated
Jan 19, 2016
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Amir Szitenberg
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

For each locus, the plots illustrate the distributions of (from top to bottom) per-position entropy, per-position gap score [4], per position conservation score [4], sequence length and GC content. 1. Kawahara AY, Breinholt JW. Phylogenomics provides strong evidence for relationships of butterflies and moths. Proc R Soc B. 2014;281: 20140970. 2. Robinson DF, Foulds LR. Comparison of phylogenetic trees. Math Biosci. 1981;53: 131–147. 3. Kuhner MK, Felsenstein J. A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. Mol Biol Evol. 1994;11: 459–468. 4. Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25: 1972–1973.

Search
Clear search
Close search
Google apps
Main menu