13 datasets found

Figure S1. Loci statistics boxplots for data derived from [1].
figshare.com
pdf
Updated Jan 19, 2016
Cite
Amir Szitenberg (2016). Figure S1. Loci statistics boxplots for data derived from [1]. [Dataset]. http://doi.org/10.6084/m9.figshare.1409424.v1
Available download formats: pdf
Unique identifier
https://doi.org/10.6084/m9.figshare.1409424.v1
Dataset updated
Jan 19, 2016
Dataset provided by
Figshare (http://figshare.com/)
Authors
Amir Szitenberg
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
For each locus, the plots illustrate the distributions of (from top to bottom) per-position entropy, per-position gap score [4], per-position conservation score [4], sequence length and GC content.
1. Kawahara AY, Breinholt JW. Phylogenomics provides strong evidence for relationships of butterflies and moths. Proc R Soc B. 2014;281: 20140970.
2. Robinson DF, Foulds LR. Comparison of phylogenetic trees. Math Biosci. 1981;53: 131–147.
3. Kuhner MK, Felsenstein J. A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. Mol Biol Evol. 1994;11: 459–468.
4. Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25: 1972–1973.
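Two of the per-locus statistics named above, per-position entropy and GC content, can be sketched as follows. This is a minimal illustration, not the author's pipeline: it assumes Shannon entropy in bits with gap characters ignored, and the toy `alignment` and the function names are hypothetical. The gap and conservation scores come from trimAl [4] and are not reproduced here.

```python
import math
from collections import Counter


def column_entropy(column, gap="-"):
    """Shannon entropy (in bits) of one alignment column, ignoring gaps (assumption)."""
    residues = [c for c in column.upper() if c != gap]
    if not residues:
        return 0.0
    total = len(residues)
    counts = Counter(residues)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def gc_content(sequence, gap="-"):
    """Fraction of G and C among the non-gap characters of one sequence."""
    residues = [c for c in sequence.upper() if c != gap]
    if not residues:
        return 0.0
    return sum(c in "GC" for c in residues) / len(residues)


# Hypothetical toy alignment for one locus: one aligned sequence per row.
alignment = ["ACGT-A", "ACGTTA", "ACCTTA", "ATGTTA"]

# Transpose rows into columns to get one entropy value per position.
columns = ["".join(col) for col in zip(*alignment)]
entropies = [column_entropy(col) for col in columns]
gc_values = [gc_content(seq) for seq in alignment]
```

The lists `entropies` and `gc_values` are the kind of per-locus distributions a boxplot would then summarize.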
Data from: Sparse Functional Boxplots for Multivariate Curves
datasetcatalog.nlm.nih.gov
tandf.figshare.com
Updated Apr 19, 2022
Cite
Qu, Zhuo; Genton, Marc G. (2022). Sparse Functional Boxplots for Multivariate Curves [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000238011
Dataset updated
Apr 19, 2022
Authors
Qu, Zhuo; Genton, Marc G.
Description
This paper introduces the sparse functional boxplot and the intensity sparse functional boxplot as practical exploratory tools. Besides being available for complete functional data, they can be used in sparse univariate and multivariate functional data. The sparse functional boxplot, based on the functional boxplot, displays sparseness proportions within the 50% central region. The intensity sparse functional boxplot indicates the relative intensity of fitted sparse point patterns in the central region. The two-stage functional boxplot, which derives from the functional boxplot to detect outliers, is furthermore extended to its sparse form. We also contribute to sparse data fitting improvement and sparse multivariate functional data depth. In a simulation study, we evaluate the goodness of data fitting, several depth proposals for sparse multivariate functional data, and compare the results of outlier detection between the sparse functional boxplot and its two-stage version. The practical applications of the sparse functional boxplot and intensity sparse functional boxplot are illustrated with two public health datasets. Supplementary materials and codes are available for readers to apply our visualization tools and replicate the analysis.
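The 50% central region mentioned above can be illustrated with a small sketch of the ordinary functional boxplot, on which the sparse variants build. It assumes complete curves observed on a common grid and ranks them by modified band depth over pairs of curves; the sparse extensions, the data fitting step, and the depths proposed in the paper are not implemented. Function names and the toy curves are hypothetical.

```python
import math
from itertools import combinations


def modified_band_depth(curves):
    """For each curve, the average share of grid points lying inside the band
    spanned by a pair of curves, taken over all pairs."""
    n_points = len(curves[0])
    pairs = list(combinations(range(len(curves)), 2))
    depths = []
    for curve in curves:
        inside = 0
        for i, j in pairs:
            for t in range(n_points):
                lo = min(curves[i][t], curves[j][t])
                hi = max(curves[i][t], curves[j][t])
                if lo <= curve[t] <= hi:
                    inside += 1
        depths.append(inside / (len(pairs) * n_points))
    return depths


def central_region(curves, fraction=0.5):
    """Pointwise envelope of the deepest `fraction` of curves, plus the index
    of the deepest (median) curve."""
    depths = modified_band_depth(curves)
    order = sorted(range(len(curves)), key=lambda k: depths[k], reverse=True)
    keep = order[: math.ceil(fraction * len(curves))]
    n_points = len(curves[0])
    lower = [min(curves[k][t] for k in keep) for t in range(n_points)]
    upper = [max(curves[k][t] for k in keep) for t in range(n_points)]
    return lower, upper, order[0]


# Hypothetical toy data: five constant curves at levels 0..4 on three grid points.
curves = [[float(k)] * 3 for k in range(5)]
depths = modified_band_depth(curves)
lower, upper, median_index = central_region(curves)
```

In this toy case the middle curve is the deepest, and the central region spans the three innermost curves.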
Prioritization of barriers that hinders Local Flexibility Market...
data.europa.eu
unknown
Updated Jun 8, 2020
Cite
Zenodo (2020). Prioritization of barriers that hinders Local Flexibility Market proliferation [Dataset]. https://data.europa.eu/data/datasets/oai-zenodo-org-3855546?locale=bg
Available download formats: unknown (2109374)
Dataset updated
Jun 8, 2020
Dataset authored and provided by
Zenodo (http://zenodo.org/)
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
This dataset contains the prioritization provided by a panel of 15 experts for a set of 28 barrier categories and 8 different roles of the future energy system. A Delphi method was followed, and the scores provided in the three rounds carried out are included. The dataset also contains the scripts used to assess the results and the output of this assessment. The file contains the following:
- data folder: the scores given by the 15 experts in the 3 rounds. Every round is in an individual folder. There is one file per expert, with scores from -5 (not relevant at all) to 5 (completely relevant) per barrier (rows) and actor (columns). There is also a file describing the experts in terms of their position in the company, the type of company and the country.
- fig folder: the figures created to assess the information provided by the experts. For each round, the following figures are created (in each respective folder): a boxplot with the distribution of scores per barrier and role; a heatmap with the mean scores per barrier and role; boxplots comparing the distributions provided by the experts of each group (depending on the keywords) per barrier and role; and a heatmap with the mean score per barrier and use case and with the prioritization per barrier and use case. Finally, bar plots with the mean score differences between rounds and boxplots comparing the score distributions are also provided.
- stat folder: the files with the results of the different statistical assessments carried out. For each round, the following results are created (in each respective folder): the statistics used to assess the scores (intraclass correlation coefficient, inter-rater agreement, inter-rater agreement p-value, homogeneity of variances, average interquartile range, standard deviation of interquartile ranges, Friedman test p-value, average power post hoc) per barrier and per role; the results of the post hoc of the Friedman test per barrier and per role; the average score per barrier and per role; the mean value of the scores provided by the experts grouped by the keywords per barrier and role; the p-value of the comparison of these two values; and the end prioritization of the barrier for the use case (averaging the scores or merging the critical sets). Finally, the differences between the means and standard deviations of the scores between two consecutive rounds are provided.
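A few of the simpler statistics listed above (average score per barrier, interquartile range per barrier, and the average and standard deviation of those interquartile ranges) can be sketched as below. This is only an illustration under assumptions: the barrier names and scores are invented, the quartile convention is the default of Python's `statistics.quantiles`, and the dataset's own scripts may compute these quantities differently.

```python
import statistics

# Hypothetical scores from five experts on the -5..5 scale, one list per barrier.
scores = {
    "barrier_A": [1, 2, 3, 4, 5],
    "barrier_B": [-5, -5, -5, -5, -5],
}


def interquartile_range(values):
    """Q3 - Q1 using the default (exclusive) quartile method of statistics.quantiles."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    return q3 - q1


mean_scores = {name: statistics.mean(vals) for name, vals in scores.items()}
iqrs = {name: interquartile_range(vals) for name, vals in scores.items()}

# Summary of expert agreement across barriers: a small IQR means closer agreement.
average_iqr = statistics.mean(list(iqrs.values()))
sd_iqr = statistics.stdev(list(iqrs.values()))

# A simple prioritization: barriers ordered by mean score, highest first.
ranking = sorted(mean_scores, key=mean_scores.get, reverse=True)
```

Ranking by mean score is one of the two prioritization options the description mentions (averaging the scores); merging critical sets is not shown.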
Predict Term Deposit
kaggle.com
zip
Updated Nov 29, 2021
Cite
Aslan Ahmedov (2021). Predict Term Deposit [Dataset]. https://www.kaggle.com/aslanahmedov/predict-term-deposit
Available download formats: zip (588608 bytes)
Dataset updated
Nov 29, 2021
Authors
Aslan Ahmedov
Description
Predict Term Deposit

Introduction

A bank has multiple banking products that it sells to customers, such as savings accounts, credit cards, investments etc. It wants to know which customers will purchase its credit cards. For this it has various kinds of information about the customers' demographic details, their banking behavior etc. Once it can predict the chances that a customer will purchase a product, it wants to use the same to make pre-payment to the authors.

In this part I will demonstrate how to build a model to predict which clients will subscribe to a term deposit, using machine learning. In the first part we will deal with the description and visualization of the analysed data, and in the second we will move on to data classification models.

Strategy

- Desire target
- Data Understanding
- Preprocessing Data
- Machine learning Model
- Prediction
- Comparing Results

Desire Target

Predict if a client will subscribe (yes/no) to a term deposit — this is defined as a classification problem.

Data

The dataset (Assignment-2_data.csv) used in this assignment contains bank customers’ data. File name: Assignment-2_Data. File format: .csv. Number of rows: 45212. Number of attributes: 17 non-empty conditional attributes and one decision attribute.

https://user-images.githubusercontent.com/91852182/143783430-eafd25b0-6d40-40b8-ac5b-1c4f67ca9e02.png https://user-images.githubusercontent.com/91852182/143783451-3e49b817-29a6-4108-b597-ce35897dda4a.png

Exploratory Data Analysis (EDA)

Data pre-processing is a key step in machine learning: the useful information that can be derived from the data set directly affects model quality, so it is extremely important to do at least the necessary preprocessing before feeding the data into our model.

In this assignment, we are going to use Python to develop a predictive machine learning model. First, we will import some important and necessary libraries.

Below we can see that there are various numerical and categorical columns. The most important column here is y, the output variable (desired target): it tells us whether the client subscribed to a term deposit (binary: ‘yes’, ‘no’).

https://user-images.githubusercontent.com/91852182/143783456-78c22016-149b-4218-a4a5-765ca348f069.png

We must check whether our dataset has any missing values and whether it has any duplicated values.

https://user-images.githubusercontent.com/91852182/143783471-a8656640-ec57-4f38-8905-35ef6f3e7f30.png

We can see that 'age' has 9 missing values and 'balance' has 3. Since the dataset has around 45k rows, I will remove these rows from the dataset. Pictures 1 and 2 show the data before and after.
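The missing-value check and row removal described above might look like the following sketch. It uses plain Python dictionaries rather than the author's actual code (which is only shown as screenshots), the sample `rows` are invented, and dropping exact duplicates is an added assumption: the text only says duplicates are checked.

```python
# Hypothetical sample rows; empty strings stand in for missing CSV fields.
rows = [
    {"age": "35", "balance": "1200", "y": "no"},
    {"age": "", "balance": "300", "y": "yes"},
    {"age": "41", "balance": "", "y": "no"},
    {"age": "35", "balance": "1200", "y": "no"},  # exact duplicate of the first row
    {"age": "52", "balance": "-50", "y": "yes"},
]


def is_missing(value):
    """Treat None and blank strings as missing."""
    return value is None or value.strip() == ""


def count_missing(rows):
    """Number of missing values per column (columns without gaps are omitted)."""
    counts = {}
    for row in rows:
        for column, value in row.items():
            if is_missing(value):
                counts[column] = counts.get(column, 0) + 1
    return counts


def drop_missing_and_duplicates(rows):
    """Keep rows with no missing values, and only the first copy of each row."""
    seen = set()
    cleaned = []
    for row in rows:
        if any(is_missing(value) for value in row.values()):
            continue
        key = tuple(sorted(row.items()))
        if key in seen:
            continue
        seen.add(key)
        cleaned.append(row)
    return cleaned


missing = count_missing(rows)
cleaned = drop_missing_and_duplicates(rows)
```

With only a handful of incomplete rows out of roughly 45k, dropping them loses little data; with a larger share of gaps, imputation would be worth considering instead.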

https://user-images.githubusercontent.com/91852182/143783474-b3898011-98e3-43c8-bd06-2cfcde714694.png

From the above analysis we can see that only 5289 people out of 45200 have subscribed, which is roughly 12%. The dataset is highly unbalanced, which we need to keep in mind.
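The imbalance figures can be checked with a few lines of arithmetic. The counts are taken from the text; the variable names are illustrative, and the baseline comparison is an added observation rather than part of the original analysis.

```python
subscribed = 5289   # clients with y = 'yes' (from the text)
total = 45200       # rows after cleaning (from the text)

positive_share = subscribed / total                   # fraction of 'yes'
imbalance_ratio = (total - subscribed) / subscribed   # 'no' rows per 'yes' row

# A model that always predicts 'no' would already reach this accuracy,
# which is why accuracy alone is a weak metric on this dataset.
majority_baseline_accuracy = 1 - positive_share

print(f"positive share: {positive_share:.1%}")
print(f"no:yes ratio: {imbalance_ratio:.2f}")
print(f"majority baseline accuracy: {majority_baseline_accuracy:.1%}")
```

Because of this, metrics such as precision, recall or F1 on the 'yes' class are usually more informative than raw accuracy when comparing models.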

https://user-images.githubusercontent.com/91852182/143783534-a05020a8-611d-4da1-98cf-4fec811cb5d8.png

Our list of categorical variables.

https://user-images.githubusercontent.com/91852182/143783542-d40006cd-4086-4707-a683-f654a8cb2205.png

Our list of numerical variables.

https://user-images.githubusercontent.com/91852182/143783551-6b220f99-2c4d-47d0-90ab-18ede42a4ae5.png

"Age" Q-Q Plots and Box Plot.

In the boxplot above we can see some points at very young ages as well as at impossible ages. So,

https://user-images.githubusercontent.com/91852182/143783564-ad0e2a27-5df5-4e04-b5d7-6d218cabd405.png https://user-images.githubusercontent.com/91852182/143783589-5abf0a0b-8bab-4192-98c8-d2e04f32a5c5.png

Now we don’t have issues with this feature, so we can use it.

https://user-images.githubusercontent.com/91852182/143783599-5205eddb-a0f5-446d-9f45-cc1adbfcce67.png https://user-images.githubusercontent.com/91852182/143783601-e520d59c-3b21-4627-a9bb-cac06f415a1e.png
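The text does not say how the implausible ages were handled, so the following is only one possible sketch. It shows the usual boxplot whisker rule (Tukey fences at 1.5 times the interquartile range) next to a simple plausibility filter. The sample ages and the bounds `MIN_AGE` and `MAX_AGE` are hypothetical.

```python
import statistics


def tukey_fences(values, k=1.5):
    """Lower and upper fences used by a standard boxplot to flag outliers."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Hypothetical ages, including one very young and one impossible value.
ages = [2, 25, 30, 35, 40, 45, 50, 55, 150]

low, high = tukey_fences(ages)
flagged = [a for a in ages if a < low or a > high]

# The fences alone do not catch the very young value here, so a domain rule helps.
MIN_AGE, MAX_AGE = 18, 100  # assumed plausible range for bank clients
plausible = [a for a in ages if MIN_AGE <= a <= MAX_AGE]
```

In this toy sample the lower fence is negative, so only the domain rule removes the age of 2. That is one reason to combine a statistical rule with knowledge of the data.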

"Duration" Q-Q Plots and Box Plot

https://user-images.githubusercontent.com/91852182/143783634-03e5a584-a6fb-4bcb-8dc5-1f3cc50f9507.png https://user-images.githubusercontent.com/91852182/143783640-f6e71323-abbe-49c1-9935-35ffb2d10569.png

This attribute highly affects the output target (e.g., if duration=0 then y=’no’). Yet, the duration is not known before a call is performed. Also, after the end of the call y is obviously known. Thus, this input should only be included for benchmark purposes...
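The caution about duration can be made concrete with a small sketch that keeps the column out of the feature set for a realistic model. The helper name, the `leaky` parameter and the sample row are hypothetical; only the column names `duration` and `y` come from the text.

```python
def split_features(row, target="y", leaky=("duration",)):
    """Separate the target from the inputs and drop columns that are only
    known after the call has taken place."""
    features = {
        name: value
        for name, value in row.items()
        if name != target and name not in leaky
    }
    return features, row[target]


# Hypothetical example row.
row = {"age": 35, "balance": 1200, "duration": 0, "y": "no"}

features, label = split_features(row)                     # realistic model inputs
benchmark_features, _ = split_features(row, leaky=())     # benchmark: keeps duration
```

Keeping a second, benchmark-only feature set makes it easy to report both numbers without accidentally training the deployable model on leaked information.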
Appendix B. Three boxplots comparing phenotypic trait measures between...
wiley.figshare.com
figshare.com
html
Updated Jun 1, 2023
Cite
Matteo Garbelotto; Gianni Della Rocca; Todd Osmundson; Vincenzo di Lonardo; Roberto Danti (2023). Appendix B. Three boxplots comparing phenotypic trait measures between populations; comparisons correspond to tests 1–3 as shown in Fig. 1 in text. [Dataset]. http://doi.org/10.6084/m9.figshare.3564105.v1
Available download formats: html
Unique identifier
https://doi.org/10.6084/m9.figshare.3564105.v1
Dataset updated
Jun 1, 2023
Dataset provided by
Wiley (https://www.wiley.com/)
Authors
Matteo Garbelotto; Gianni Della Rocca; Todd Osmundson; Vincenzo di Lonardo; Roberto Danti
License
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
Description
Three boxplots comparing phenotypic trait measures between populations; comparisons correspond to tests 1–3 as shown in Fig. 1 in text.
Boxplots comparing Bray-Curtis dissimilarity distances for sites sampled in...
plos.figshare.com
tiff
Updated May 31, 2023
Cite
Kristina Cervantes-Yoshida; Robert A. Leidy; Stephanie M. Carlson (2023). Boxplots comparing Bray-Curtis dissimilarity distances for sites sampled in both time periods, presented separately for low-impacted sites and urbanized sites. [Dataset]. http://doi.org/10.1371/journal.pone.0141707.g004
Available download formats: tiff
Unique identifier
https://doi.org/10.1371/journal.pone.0141707.g004
Dataset updated
May 31, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Kristina Cervantes-Yoshida; Robert A. Leidy; Stephanie M. Carlson
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
Boxplots comparing Bray-Curtis dissimilarity distances for sites sampled in both time periods, presented separately for low-impacted sites and urbanized sites.
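For readers unfamiliar with the measure, the Bray-Curtis dissimilarity between two sites is the sum of absolute differences in species abundances divided by the total abundance at both sites. A minimal sketch with invented abundance vectors (not data from the study):

```python
def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two non-negative abundance vectors:
    sum(|a_i - b_i|) / sum(a_i + b_i). 0 means identical, 1 means no shared species."""
    if len(a) != len(b):
        raise ValueError("abundance vectors must have the same length")
    total = sum(a) + sum(b)
    if total == 0:
        return 0.0  # convention chosen here for two empty samples
    return sum(abs(x - y) for x, y in zip(a, b)) / total


# Hypothetical counts of three species at one site in two time periods.
period_1 = [10, 0, 5]
period_2 = [5, 5, 5]

dissimilarity = bray_curtis(period_1, period_2)
identical = bray_curtis(period_1, period_1)
disjoint = bray_curtis([1, 0], [0, 1])
```

Computing this value for each site sampled in both periods gives one distribution per site group, which is what the boxplots compare.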
The comparison results of different algorithms on CEC2017 functions with...
plos.figshare.com
xls
Updated Jun 6, 2023
Cite
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). The comparison results of different algorithms on CEC2017 functions with D=30. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t004
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0276210.t004
Dataset updated
Jun 6, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
The comparison results of different algorithms on CEC2017 functions with D=30.
Comparison of result on welded beam design problem.
plos.figshare.com
xls
Updated Jun 13, 2023
Cite
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). Comparison of result on welded beam design problem. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t011
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0276210.t011
Dataset updated
Jun 13, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
Comparison of result on welded beam design problem.
Comparison of result on three-bar truss design problem.
plos.figshare.com
xls
Updated Jun 13, 2023
Cite
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). Comparison of result on three-bar truss design problem. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t013
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0276210.t013
Dataset updated
Jun 13, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
Comparison of result on three-bar truss design problem.
Comparison of result on speed reducer design problem.
plos.figshare.com
xls
Updated Jun 2, 2023
Cite
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). Comparison of result on speed reducer design problem. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t014
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0276210.t014
Dataset updated
Jun 2, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
Comparison of result on speed reducer design problem.
Comparison of result on pressure vessel design problem.
plos.figshare.com
xls
Updated Jun 13, 2023
Cite
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). Comparison of result on pressure vessel design problem. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t010
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0276210.t010
Dataset updated
Jun 13, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
Comparison of result on pressure vessel design problem.
The comparison results of different algorithms on CEC2019 functions.
plos.figshare.com
xls
Updated Jun 13, 2023
Cite
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). The comparison results of different algorithms on CEC2019 functions. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t006
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0276210.t006
Dataset updated
Jun 13, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
The comparison results of different algorithms on CEC2019 functions.
The comparison results of different algorithms on 23 benchmark functions...
plos.figshare.com
xls
Updated Jun 6, 2023
Cite
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou (2023). The comparison results of different algorithms on 23 benchmark functions with D=30. [Dataset]. http://doi.org/10.1371/journal.pone.0276210.t002
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0276210.t002
Dataset updated
Jun 6, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Yu Li; Xiao Liang; Jingsen Liu; Huan Zhou
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
The comparison results of different algorithms on 23 benchmark functions with D=30.