4 datasets found

f
Definition of important variables.
plos.figshare.com
bin
Updated Jun 4, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Wen Kun; Xiangxiang Hu; Fajiang Liu (2023). Definition of important variables. [Dataset]. http://doi.org/10.1371/journal.pone.0272983.t001
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0272983.t001
Dataset updated
Jun 4, 2023
Dataset provided by
PLOS ONE
Authors
Wen Kun; Xiangxiang Hu; Fajiang Liu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Definition of important variables.
Data from: Bike Sharing Dataset
kaggle.com
Updated Sep 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ram Vishnu R (2024). Bike Sharing Dataset [Dataset]. https://www.kaggle.com/datasets/ramvishnur/bike-sharing-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 10, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Ram Vishnu R
Description
Problem Statement:

A bike-sharing system is a service in which bikes are made available for shared use to individuals on a short term basis for a price or free. Many bike share systems allow people to borrow a bike from a "dock" which is usually computer-controlled wherein the user enters the payment information, and the system unlocks it. This bike can then be returned to another dock belonging to the same system.

A US bike-sharing provider BoomBikes has recently suffered considerable dip in their revenue due to the Corona pandemic. The company is finding it very difficult to sustain in the current market scenario. So, it has decided to come up with a mindful business plan to be able to accelerate its revenue.

In such an attempt, BoomBikes aspires to understand the demand for shared bikes among the people. They have planned this to prepare themselves to cater to the people's needs once the situation gets better all around and stand out from other service providers and make huge profits.

They have contracted a consulting company to understand the factors on which the demand for these shared bikes depends. Specifically, they want to understand the factors affecting the demand for these shared bikes in the American market. The company wants to know:

Which variables are significant in predicting the demand for shared bikes.

How well those variables describe the bike demands

Based on various meteorological surveys and people's styles, the service provider firm has gathered a large dataset on daily bike demands across the American market based on some factors.

Business Goal:

You are required to model the demand for shared bikes with the available independent variables. It will be used by the management to understand how exactly the demands vary with different features. They can accordingly manipulate the business strategy to meet the demand levels and meet the customer's expectations. Further, the model will be a good way for management to understand the demand dynamics of a new market.

Data Preparation:

You can observe in the dataset that some of the variables like 'weathersit' and 'season' have values as 1, 2, 3, 4 which have specific labels associated with them (as can be seen in the data dictionary). These numeric values associated with the labels may indicate that there is some order to them - which is actually not the case (Check the data dictionary and think why). So, it is advisable to convert such feature values into categorical string values before proceeding with model building. Please refer the data dictionary to get a better understanding of all the independent variables.

You might notice the column 'yr' with two values 0 and 1 indicating the years 2018 and 2019 respectively. At the first instinct, you might think it is a good idea to drop this column as it only has two values so it might not be a value-add to the model. But in reality, since these bike-sharing systems are slowly gaining popularity, the demand for these bikes is increasing every year proving that the column 'yr' might be a good variable for prediction. So think twice before dropping it.

Model Building:

In the dataset provided, you will notice that there are three columns named 'casual', 'registered', and 'cnt'. The variable 'casual' indicates the number casual users who have made a rental. The variable 'registered' on the other hand shows the total number of registered users who have made a booking on a given day. Finally, the 'cnt' variable indicates the total number of bike rentals, including both casual and registered. The model should be built taking this 'cnt' as the target variable.

Model Evaluation:

When you're done with model building and residual analysis and have made predictions on the test set, just make sure you use the following two lines of code to calculate the R-squared score on the test set. python from sklearn.metrics import r2_score r2_score(y_test, y_pred) - where y_test is the test data set for the target variable, and y_pred is the variable containing the predicted values of the target variable on the test set. - Please perform this step as the R-squared score on the test set holds as a benchmark for your model.
Data from: Rapid evolution of thermal tolerance and phenotypic plasticity in...
zenodo.org
bin, csv
Updated Mar 30, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
C-Elisa Schaum; C-Elisa Schaum; Angus Buckling; Smirnoff; Yvon-Durocher; Angus Buckling; Smirnoff; Yvon-Durocher (2022). Rapid evolution of thermal tolerance and phenotypic plasticity in variable environments [Dataset]. http://doi.org/10.5281/zenodo.6394982
Explore at:
csv, binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.6394982
Dataset updated
Mar 30, 2022
Dataset provided by
Zenodohttp://zenodo.org/
Authors
C-Elisa Schaum; C-Elisa Schaum; Angus Buckling; Smirnoff; Yvon-Durocher; Angus Buckling; Smirnoff; Yvon-Durocher
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
These are the data and code to go with "Rapid evolution of thermal tolerance and phenotypic plasticity in variable environments"

Figure 01 takes the following data/scripts:

20210610_Thally02_Figure01_plot_and_stats.R with track_keeper.csv and corr_Response growth .csv . These files contain growth rates per transfers for all selection environments throughout the experiment and growth rates in correlated environments during reciprocal transplants, respectively.

Figure 02 takes the following data/scripts:

20210610_Thally02_Figure02_plot_and_stats.R with 20161120_res_logis_t000.csv and 20161123_resloglint300.csv . These files contain the output of the shapes of the growth curves (i.e. information on lag time , growth at µmax, K etc) for all samples in all selection environments at t0 and t300, respectively

The remaining figures - position not clear at time of submission - take the following data/script.

For plasticity in FRRF data, the script 20181204_FRRF_plasticity.R takes the FRRF raw data contained in allfvfmdata_thally_t300_t000.csv . Extracted parameters are in files CvaluesThally02.csv, psiPSI_slope_intercept.csv, and rP_extracted_values.csv and can be analysed using the R script extracted parameter plots.R . R script FRRF visualisation only .R is for visualisation only, as the title suggests.

For comparing plasticity/growth , the data are in 20170327_giantbigtable.csv , and can be visualised/analysed in plast vs growth.R

In order to recreate the AMOVAS based on SNVs, use all_variants_fixed-only_using_5x_depth_threshold.cvs with amova thally02.R

For additional information, please contact elisa.schaum@uni-hamburg.de
o
Whole body T1 mapping of small animals using prospective gating and variable...
ora.ox.ac.uk
octet-stream, plain
Updated Jan 1, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kinchesh, P; Smart, S (2016). Whole body T1 mapping of small animals using prospective gating and variable flip angle imaging [Dataset]. http://doi.org/10.5287/bodleian:a4awG5r7R
Explore at:
octet-stream(4194656), octet-stream(83886432), octet-stream(100663648), octet-stream(67109216), octet-stream(184549728), plain(3619)Available download formats
Unique identifier
https://doi.org/10.5287/bodleian:a4awG5r7R
Dataset updated
Jan 1, 2016
Dataset provided by
University of Oxford
Authors
Kinchesh, P; Smart, S
License
https://ora.ox.ac.uk/terms_of_usehttps://ora.ox.ac.uk/terms_of_use
Description
See text in Content.txt Prospective gating and automatic reacquisition of data corrupted by respiration motion were implemented in variable flip angle (VFA) and actual flip angle imaging (AFI) MRI scans to enable cardio-respiratory synchronised T1 mapping of the whole mouse. Stability tests of cardio-respiratory gating (CR-gating) and respiratory gating (R-gating) with and without reacquisition were compared with un-gated scans in 4 mice. The automatic and immediate reacquisition of data corrupted by respiration motion is observed to properly eliminate respiration motion artefact. CR-gated VFA scans with 16 flip angles and 32 k-lines per cardiac R-wave were acquired with R-gated AFI scans in a total scan time of less than 14 minutes. The VFA data were acquired with a voxel size of 0.075 mm3. T1 was calculated in the whole mouse with a robust and efficient nonlinear least squares fit of data. The standard deviation in the T1 measurement is conservatively estimated to be less than 6.2%. The T1 values measured from VFA scans with 32 k-lines per R-wave are in very good agreement with those measured from VFA scans with 8 k-lines per R-wave, even for myocardium. As such, it is demonstrated that prospective gating and reacquisition enables fast and accurate T1 mapping of small animals.
Not seeing a result you expected?
Learn how you can add new datasets to our index.