100+ datasets found

f
Dataset for: Simulation and data-generation for random-effects network...
wiley.figshare.com
figshare.com
txt
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Svenja Elisabeth Seide; Katrin Jensen; Meinhard Kieser (2023). Dataset for: Simulation and data-generation for random-effects network meta-analysis of binary outcome [Dataset]. http://doi.org/10.6084/m9.figshare.8001863.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.8001863.v1
Dataset updated
Jun 1, 2023
Dataset provided by
Wiley
Authors
Svenja Elisabeth Seide; Katrin Jensen; Meinhard Kieser
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
The performance of statistical methods is frequently evaluated by means of simulation studies. In case of network meta-analysis of binary data, however, available data- generating models are restricted to either inclusion of two-armed trials or the fixed-effect model. Based on data-generation in the pairwise case, we propose a framework for the simulation of random-effect network meta-analyses including multi-arm trials with binary outcome. The only of the common data-generating models which is directly applicable to a random-effects network setting uses strongly restrictive assumptions. To overcome these limitations, we modify this approach and derive a related simulation procedure using odds ratios as effect measure. The performance of this procedure is evaluated with synthetic data and in an empirical example.
i
Grant Giving Statistics for Funny River Crafters
instrumentl.com
Updated Jul 6, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2021). Grant Giving Statistics for Funny River Crafters [Dataset]. https://www.instrumentl.com/990-report/funny-river-crafters
Explore at:
Dataset updated
Jul 6, 2021
Area covered
Funny River
Variables measured
Total Assets
Description
Financial overview and grant giving statistics of Funny River Crafters
d
Community Survey: 2021 Random Sample Results
catalog.data.gov
data.bloomington.in.gov
Updated May 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data.bloomington.in.gov (2023). Community Survey: 2021 Random Sample Results [Dataset]. https://catalog.data.gov/dataset/community-survey-2021-random-sample-results-69942
Explore at:
Dataset updated
May 20, 2023
Dataset provided by
data.bloomington.in.gov
Description
A random sample of households were invited to participate in this survey. In the dataset, you will find the respondent level data in each row with the questions in each column. The numbers represent a scale option from the survey, such as 1=Excellent, 2=Good, 3=Fair, 4=Poor. The question stem, response option, and scale information for each field can be found in the var "variable labels" and "value labels" sheets. VERY IMPORTANT NOTE: The scientific survey data were weighted, meaning that the demographic profile of respondents was compared to the demographic profile of adults in Bloomington from US Census data. Statistical adjustments were made to bring the respondent profile into balance with the population profile. This means that some records were given more "weight" and some records were given less weight. The weights that were applied are found in the field "wt". If you do not apply these weights, you will not obtain the same results as can be found in the report delivered to the Bloomington. The easiest way to replicate these results is likely to create pivot tables, and use the sum of the "wt" field rather than a count of responses.
Data from: Funny Jokes
kaggle.com
Updated Aug 11, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cody Helscel (2023). Funny Jokes [Dataset]. https://www.kaggle.com/codyhelscel/funny-jokes/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 11, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Cody Helscel
Description
This is a dataset containing jokes. The dataset is already in a SQL file to import into MySQL or other tools.
i
Grant Giving Statistics for Funny Farm Early Learning Center Incorporated
instrumentl.com
Updated Dec 17, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). Grant Giving Statistics for Funny Farm Early Learning Center Incorporated [Dataset]. https://www.instrumentl.com/990-report/funny-farm-early-learning-center-incorporated
Explore at:
Dataset updated
Dec 17, 2023
Variables measured
Total Assets, Total Giving
Description
Financial overview and grant giving statistics of Funny Farm Early Learning Center Incorporated
Average daily time spent on social media worldwide 2012-2025
statista.com
Updated Jun 19, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Average daily time spent on social media worldwide 2012-2025 [Dataset]. https://www.statista.com/statistics/433871/daily-social-media-usage-worldwide/
Explore at:
Dataset updated
Jun 19, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description
How much time do people spend on social media? As of 2025, the average daily social media usage of internet users worldwide amounted to 141 minutes per day, down from 143 minutes in the previous year. Currently, the country with the most time spent on social media per day is Brazil, with online users spending an average of 3 hours and 49 minutes on social media each day. In comparison, the daily time spent with social media in the U.S. was just 2 hours and 16 minutes. Global social media usageCurrently, the global social network penetration rate is 62.3 percent. Northern Europe had an 81.7 percent social media penetration rate, topping the ranking of global social media usage by region. Eastern and Middle Africa closed the ranking with 10.1 and 9.6 percent usage reach, respectively. People access social media for a variety of reasons. Users like to find funny or entertaining content and enjoy sharing photos and videos with friends, but mainly use social media to stay in touch with current events friends. Global impact of social mediaSocial media has a wide-reaching and significant impact on not only online activities but also offline behavior and life in general. During a global online user survey in February 2019, a significant share of respondents stated that social media had increased their access to information, ease of communication, and freedom of expression. On the flip side, respondents also felt that social media had worsened their personal privacy, increased a polarization in politics and heightened everyday distractions.
S
Amazing eBay Statistics By Users, Revenue And Facts (2025)
sci-tech-today.com
Updated May 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sci-Tech Today (2025). Amazing eBay Statistics By Users, Revenue And Facts (2025) [Dataset]. https://www.sci-tech-today.com/stats/amazing-ebay-statistics-updated/
Explore at:
Dataset updated
May 15, 2025
Dataset authored and provided by
Sci-Tech Today
License
https://www.sci-tech-today.com/privacy-policyhttps://www.sci-tech-today.com/privacy-policy
Time period covered
2022 - 2032
Area covered
Global
Description
Introduction

Amazing eBay Statistics: eBay, founded in 1995, is one of the worldâ€™s largest online marketplaces, connecting millions of buyers and sellers across the globe. Known for its wide range of products, from rare collectibles to everyday essentials, eBay has become a household name in e-commerce. With its unique auction-style format alongside fixed-price listings, eBay has transformed the way people shop online.

In addition to its massive user base, the platform attracts more than 109 million unique visitors each month, highlighting its significant influence in the online retail market. This introduction provides a glimpse into the impressive scale and reach of eBay, showcasing why it remains a dominant force in the global e-commerce industry.
q
Analyzing interesting images motivates mathematics and statistics learning
qubeshub.org
Updated Feb 28, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jeremy Wojdak (2018). Analyzing interesting images motivates mathematics and statistics learning [Dataset]. http://doi.org/10.25334/Q4Z66C
Explore at:
Unique identifier
https://doi.org/10.25334/Q4Z66C
Dataset updated
Feb 28, 2018
Dataset provided by
QUBES
Authors
Jeremy Wojdak
Description
Presentation by Jeremy Wojdak made as part of the "Bringing Research Data to the Ecology Classroom: Opportunities, Barriers, and Next Steps” Session at the Ecological Society of America annual meeting, August 8th, 2017, Portland Oregon
S
Eros Now Statistics By Fun Facts, Subscription Plans, Region, Revenue,...
sci-tech-today.com
Updated Mar 20, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sci-Tech Today (2025). Eros Now Statistics By Fun Facts, Subscription Plans, Region, Revenue, Demographics, Referral, Users, Content Library And Engagement [Dataset]. https://www.sci-tech-today.com/stats/eros-now-statistics/
Explore at:
Dataset updated
Mar 20, 2025
Dataset authored and provided by
Sci-Tech Today
License
https://www.sci-tech-today.com/privacy-policyhttps://www.sci-tech-today.com/privacy-policy
Time period covered
2022 - 2032
Area covered
Global
Description
Introduction

Eros Now Statistics: Established in 2012, Eros Now is one of Indiaâ€™s leading streaming services that operates heavily in the digital entertainment industry. The platform is managed by Eros International, which provides numerous Bollywood films, regional cinema from India, and original content that appeals to a large audience throughout India and around the world. In this digital streaming age where media consumption is being reshaped, Eros Now has strategically positioned itself as a formidable player through an extensive film library and exclusive offerings as it seeks to attract and retain subscribers.

The latest Eros Now statistics show that Eros Now has acquired a significant user base, thus indicating its stature and popularity among the players involved in the Indian entertainment sector. The growth experienced by this platform has been attributed to partnerships within different regions, such as focusing on regionalised content and enhancing customer experience, among others.

This article presents Eros Now statistics and trends on Eros Nowâ€™s current subscribers, the content it offers, and its market position. By exploring these metrics, we are able to understand the role played by Eros now in current transformations taking place in terms of digital media and how it affects the wider streaming industry altogether.
N
Random Lake, WI Population Breakdown by Gender Dataset: Male and Female...
neilsberg.com
csv, json
Updated Feb 24, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2025). Random Lake, WI Population Breakdown by Gender Dataset: Male and Female Population Distribution // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/random-lake-wi-population-by-gender/
Explore at:
csv, jsonAvailable download formats
Dataset updated
Feb 24, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Wisconsin, Random Lake
Variables measured
Male Population, Female Population, Male Population as Percent of Total Population, Female Population as Percent of Total Population
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the two variables, namely (a) population and (b) population as a percentage of the total population, we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the population of Random Lake by gender, including both male and female populations. This dataset can be utilized to understand the population distribution of Random Lake across both sexes and to determine which sex constitutes the majority.

Key observations

There is a slight majority of male population, with 52.41% of total population being male. Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Scope of gender :

Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis. No further analysis is done on the data reported from the Census Bureau.

Variables / Data Columns

Gender: This column displays the Gender (Male / Female)

Population: The population of the gender in the Random Lake is shown in this column.

% of Total Population: This column displays the percentage distribution of each gender as a proportion of Random Lake total population. Please note that the sum of all percentages may not equal one due to rounding of values.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Random Lake Population by Race & Ethnicity. You can refer the same here
N
Random Lake, WI Population Breakdown by Gender and Age Dataset: Male and...
neilsberg.com
csv, json
Updated Feb 24, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2025). Random Lake, WI Population Breakdown by Gender and Age Dataset: Male and Female Population Distribution Across 18 Age Groups // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/e1fb5554-f25d-11ef-8c1b-3860777c1fe6/
Explore at:
csv, jsonAvailable download formats
Dataset updated
Feb 24, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Wisconsin, Random Lake
Variables measured
Male and Female Population Under 5 Years, Male and Female Population over 85 years, Male and Female Population Between 5 and 9 years, Male and Female Population Between 10 and 14 years, Male and Female Population Between 15 and 19 years, Male and Female Population Between 20 and 24 years, Male and Female Population Between 25 and 29 years, Male and Female Population Between 30 and 34 years, Male and Female Population Between 35 and 39 years, Male and Female Population Between 40 and 44 years, and 8 more
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. To measure the three variables, namely (a) Population (Male), (b) Population (Female), and (c) Gender Ratio (Males per 100 Females), we initially analyzed and categorized the data for each of the gender classifications (biological sex) reported by the US Census Bureau across 18 age groups, ranging from under 5 years to 85 years and above. These age groups are described above in the variables section. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the population of Random Lake by gender across 18 age groups. It lists the male and female population in each age group along with the gender ratio for Random Lake. The dataset can be utilized to understand the population distribution of Random Lake by gender and age. For example, using this dataset, we can identify the largest age group for both Men and Women in Random Lake. Additionally, it can be used to see how the gender ratio changes from birth to senior most age group and male to female ratio across each age group for Random Lake.

Key observations

Largest age group (population): Male # 15-19 years (102) | Female # 30-34 years (80). Source: U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Age groups:

Under 5 years

5 to 9 years

10 to 14 years

15 to 19 years

20 to 24 years

25 to 29 years

30 to 34 years

35 to 39 years

40 to 44 years

45 to 49 years

50 to 54 years

55 to 59 years

60 to 64 years

65 to 69 years

70 to 74 years

75 to 79 years

80 to 84 years

85 years and over

Scope of gender :

Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis.

Variables / Data Columns

Age Group: This column displays the age group for the Random Lake population analysis. Total expected values are 18 and are define above in the age groups section.

Population (Male): The male population in the Random Lake is shown in the following column.

Population (Female): The female population in the Random Lake is shown in the following column.

Gender Ratio: Also known as the sex ratio, this column displays the number of males per 100 females in Random Lake for each age group.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Random Lake Population by Gender. You can refer the same here
w
Portable pseudo-random reference sequences with Mersenne Twister using GNU...
data.wu.ac.at
datahub.io
bin, html, png, txt
Updated Oct 10, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Global (2013). Portable pseudo-random reference sequences with Mersenne Twister using GNU Octave [Dataset]. https://data.wu.ac.at/odso/datahub_io/ZTUzNDkwNmItOTgyYy00ZGE2LTliYjctNDI2NDA3ODY3NDhm
Explore at:
png(1182325.0), txt(35000000.0), bin(20558.0), html(20.0), txt(350000.0), bin(2560.0), bin(17816349.0), txt(350.0), txt(399.0), txt(3500000.0), bin(1787130.0), txt(351.0), txt(3500.0), bin(558.0), bin(181307.0), txt(35000.0)Available download formats
Dataset updated
Oct 10, 2013
Dataset provided by
Global
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
de Rigo, D. (2012). Portable pseudo-random reference sequences with Mersenne Twister using GNU Octave. Mastrave project technical report. FigShare Digital Science. doi: 10.6084/m9.figshare.94593

Portable pseudo-random reference sequences with Mersenne Twister using GNU Octave

Mastrave project technical report

Daniele de Rigo

Abstract: Computationally intensive numerical tasks such as those involving statistical resampling, evolutionary techniques or Monte Carlo based applications are known to require robust algorithms for generating large sequences of pseudo-random numbers (PRN). While several languages, libraries and computing environments offer suitable PRN generators, the underlying algorithms and parametrization widely differ. Therefore, easily replicating a certain PRN sequence generally implies forcing researchers to use a very specific language or computing environment, also paying attention to its version, possible critical dependencies or even operating system and computer architecture.

Despite the awareness of the benefits of reproducible research is rapidly growing, the definition itself of “reproducibility” for PRN based applications may lead to diverging interpretations and expectations. Where the cardinality of PRN sequences needed for data to be processed is relatively moderate, the paradigm of reproducible research is in principle suitable to be applied not only to algorithms, free software, data and metadata (classic reproducible research, CRR), but also to the involved pseudo-random sequences themselves (deep reproducible research, DRR). This would allow not only the “typical” scientific results to be reproducible “except for PRN-related statistical fluctuations”, but also the exact results published by a research team to be independently reproduced by other scientists - without of course preventing sensitivity analysis with different PRN sequences, as even classic reproducible research should easily allow.

However, finding reference sequences of pseudo random numbers suitable to enable such a deep reproducibility may be surprisingly difficult. Here, sequences eligible to be used as reference dataset of uniformly distributed pseudo-random numbers are presented. The dataset of sequences has been generated using Mersenne Twister with a period of 2^19937-1, as implemented in GNU Octave (version 3.6.1) with the Mastrave modelling library. The sequences are available in plain text format and also in the format MATLAB version 7, which is portable in both GNU Octave and MATLAB computing environments. The plain text format uses a fixed number of characters per each PRN so allowing random access to sparse PRNs to be easily done in constant time without needing a whole file to be loaded. This straightforward solution is language neutral, with the advantage of enabling wide and immediate portability for the presented reference PRN dataset, irrespective of the language, libraries, computing environment of choice for the users.
w
Dataset of books in the Weird true facts series
workwithdata.com
Updated Apr 17, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Work With Data (2025). Dataset of books in the Weird true facts series [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=j0-book_series&fop0=%3D&fval0=Weird+true+facts&j=1&j0=book_series
Explore at:
Dataset updated
Apr 17, 2025
Dataset authored and provided by
Work With Data
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is about books. It has 8 rows and is filtered where the book series is Weird true facts. It features 9 columns including author, publication date, language, and book publisher.
C
Global Funny Cat Stick Market Economic and Social Impact 2025-2032
statsndata.org
excel, pdf
Updated May 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stats N Data (2025). Global Funny Cat Stick Market Economic and Social Impact 2025-2032 [Dataset]. https://www.statsndata.org/report/global-164808
Explore at:
excel, pdfAvailable download formats
Dataset updated
May 2025
Dataset authored and provided by
Stats N Data
License
https://www.statsndata.org/how-to-orderhttps://www.statsndata.org/how-to-order
Area covered
Global
Description
The Funny Cat Stick market has emerged as a delightful segment within the pet accessories industry, capturing the love and attention of both feline enthusiasts and pet owners alike. This quirky product, designed to engage and entertain cats through a variety of interactive features, including bright colors, amusing
o
OLAF PROJECT DATA SET
ordo.open.ac.uk
figshare.com
xlsx
Updated Nov 20, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alexandra Okada (2020). OLAF PROJECT DATA SET [Dataset]. http://doi.org/10.21954/ou.rd.12670949.v2
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.21954/ou.rd.12670949.v2
Dataset updated
Nov 20, 2020
Dataset provided by
The Open University
Authors
Alexandra Okada
License
Attribution-ShareAlike 2.0 (CC BY-SA 2.0)https://creativecommons.org/licenses/by-sa/2.0/
License information was derived automatically
Description
Subject: EducationSpecific: Online Learning and FunType: Questionnaire survey data (csv / excel)Date: February - March 2020Content: Students' views about online learning and fun Data Source: Project OLAFValue: These data provide students' beliefs about how learning occurs and correlations with fun. Participants were 206 students from the OU
f
Data from: Random integers
figshare.com
txt
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Feilong Wu (2023). Random integers [Dataset]. http://doi.org/10.6084/m9.figshare.12949601.v2
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.12949601.v2
Dataset updated
May 31, 2023
Dataset provided by
figshare
Authors
Feilong Wu
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This dataset is a text containing 1000 random integers from 0 to 100. Each number takes a line.
What consumers find creepy or cool about targeted ads & personalization 2024...
statista.com
ai-chatbox.pro
Updated Jun 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). What consumers find creepy or cool about targeted ads & personalization 2024 [Dataset]. https://www.statista.com/statistics/1411110/consumer-perceptions-targeted-ads-and-personalization-worldwide/
Explore at:
Dataset updated
Jun 23, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Jul 2024 - Aug 2024
Area covered
France, United States, United Kingdom
Description
Around *************** consumers interviewed in a 2024 survey considered getting recommendations from a brand based on past purchases cool. Shoppers also liked to receive personalized offers and email reminders about an abandoned shopping cart. On the other hand, ** percent of respondents stated that ads based on location data were creepy.

NBA Player Dataset & Prediction Model Artifacts

test.researchdata.tuwien.ac.at

bin, csv, json, png +2

Updated Apr 28, 2025

Facebook

Twitter

Click to copy link

Link copied

Cite

Burak Baltali; Burak Baltali (2025). NBA Player Dataset & Prediction Model Artifacts [Dataset]. http://doi.org/10.70124/ymgzs-z3s43

Explore at:

csv, text/markdown, png, bin, txt, jsonAvailable download formats

Unique identifier

https://doi.org/10.70124/ymgzs-z3s43

Dataset updated

Apr 28, 2025

Dataset provided by

TU Wien

Authors

Burak Baltali; Burak Baltali

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

This dataset contains end-of-season box-score aggregates for NBA players over the 2012–13 through 2023–24 seasons, split into training and test sets for both regular season and playoffs. Each CSV has one row per player per season with columns for points, rebounds, steals, turnovers, 3-pt attempts, FG attempts, plus identifiers.

Brief overview of Files

end-of-season box-score aggregates (2012–13 – 2023–24) split into train/test;
the Jupyter notebook (Analysis.ipynb); All the code can be executed in there
the trained model binary (nba_model.pkl); Serialized Random Forest model artifact
Evaluation plots (LAL vs. whole‐league) for regular & playoff predictions are given as png outputs and uploaded in here
FAIR4ML metadata (fair4ml_metadata.jsonld);
see README.md and abbreviations.txt for file details.”
For further information you can go to the github site (Link below)

File Details

Notebook

Analysis.ipynb: Involves the graphica output of the trained and tested data.

Trained/ Test csv Data

Name	Description	PID
regular_train.csv	For training purposes, the seasons 2012-2013 through 2021-2022 were selected as training purpose	4421e56c-4cd3-4ec1-a566-a89d7ec0bced
regular_test.csv:	For testing purpose of the regular season, the 2022-2023 season was selected	f9d84d5e-db01-4475-b7d1-80cfe9fe0e61
playoff_train.csv	For training purposes of the playoff season, the seasons 2012-2013 through 2022-2023 were selected	bcb3cf2b-27df-48cc-8b76-9e49254783d0
playoff_test.csv	For testing purpose of the playoff season, 2023-2024 season was selected	de37d568-e97f-4cb9-bc05-2e600cc97102

Others

abbrevations.txt: Involves the fundemental abbrevations of the columns in csv data

Additional Notes

Raw csv files are taken from Kaggle (Source: https://www.kaggle.com/datasets/shivamkumar121215/nba-stats-dataset-for-last-10-years/data)

Some preprocessing has to be done before uploading into dbrepo

Plots have also been uploaded as an output for visual purposes.

A more detailed version can be found on github (Link: https://github.com/bubaltali/nba-prediction-analysis/)

Revenue share of Penguin Random House 2017-2023, by region
statista.com
Updated Apr 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2024). Revenue share of Penguin Random House 2017-2023, by region [Dataset]. https://www.statista.com/statistics/693284/revenue-pengiun-random-house-region/
Explore at:
Dataset updated
Apr 10, 2024
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description
In 2023, 58.3 percent of Penguin Random House's revenue was generated in the United States, up from 57 percent the previous year. The share of revenue coming from the United Kingdom decreased slightly, as well as the figure for European countries other than the UK, Germany, and France, which tends to bring in less than nine percent of the company's total revenue each year.
f
Data_Sheet_1_The Optimal Machine Learning-Based Missing Data Imputation for...
frontiersin.figshare.com
docx
Updated Jun 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Chao-Yu Guo; Ying-Chen Yang; Yi-Hau Chen (2023). Data_Sheet_1_The Optimal Machine Learning-Based Missing Data Imputation for the Cox Proportional Hazard Model.docx [Dataset]. http://doi.org/10.3389/fpubh.2021.680054.s001
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.3389/fpubh.2021.680054.s001
Dataset updated
Jun 3, 2023
Dataset provided by
Frontiers
Authors
Chao-Yu Guo; Ying-Chen Yang; Yi-Hau Chen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
An adequate imputation of missing data would significantly preserve the statistical power and avoid erroneous conclusions. In the era of big data, machine learning is a great tool to infer the missing values. The root means square error (RMSE) and the proportion of falsely classified entries (PFC) are two standard statistics to evaluate imputation accuracy. However, the Cox proportional hazards model using various types requires deliberate study, and the validity under different missing mechanisms is unknown. In this research, we propose supervised and unsupervised imputations and examine four machine learning-based imputation strategies. We conducted a simulation study under various scenarios with several parameters, such as sample size, missing rate, and different missing mechanisms. The results revealed the type-I errors according to different imputation techniques in the survival data. The simulation results show that the non-parametric “missForest” based on the unsupervised imputation is the only robust method without inflated type-I errors under all missing mechanisms. In contrast, other methods are not valid to test when the missing pattern is informative. Statistical analysis, which is improperly conducted, with missing data may lead to erroneous conclusions. This research provides a clear guideline for a valid survival analysis using the Cox proportional hazard model with machine learning-based imputations.

Facebook

Twitter

Click to copy link

Link copied

Cite

Svenja Elisabeth Seide; Katrin Jensen; Meinhard Kieser (2023). Dataset for: Simulation and data-generation for random-effects network meta-analysis of binary outcome [Dataset]. http://doi.org/10.6084/m9.figshare.8001863.v1

Dataset for: Simulation and data-generation for random-effects network meta-analysis of binary outcome

Explore at:

txtAvailable download formats

Unique identifier

https://doi.org/10.6084/m9.figshare.8001863.v1

Dataset updated

Jun 1, 2023

Dataset provided by

Wiley

Authors

Svenja Elisabeth Seide; Katrin Jensen; Meinhard Kieser

License

CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically

Description

The performance of statistical methods is frequently evaluated by means of simulation studies. In case of network meta-analysis of binary data, however, available data- generating models are restricted to either inclusion of two-armed trials or the fixed-effect model. Based on data-generation in the pairwise case, we propose a framework for the simulation of random-effect network meta-analyses including multi-arm trials with binary outcome. The only of the common data-generating models which is directly applicable to a random-effects network setting uses strongly restrictive assumptions. To overcome these limitations, we modify this approach and derive a related simulation procedure using odds ratios as effect measure. The performance of this procedure is evaluated with synthetic data and in an empirical example.

Clear search

Close search

Google apps

Main menu

Dataset for: Simulation and data-generation for random-effects network...

Grant Giving Statistics for Funny River Crafters

Community Survey: 2021 Random Sample Results

Data from: Funny Jokes

Grant Giving Statistics for Funny Farm Early Learning Center Incorporated

Average daily time spent on social media worldwide 2012-2025

Amazing eBay Statistics By Users, Revenue And Facts (2025)

Introduction

Analyzing interesting images motivates mathematics and statistics learning

Eros Now Statistics By Fun Facts, Subscription Plans, Region, Revenue,...

Introduction

Random Lake, WI Population Breakdown by Gender Dataset: Male and Female...

About this dataset

Content

Inspiration

Recommended for further research

Random Lake, WI Population Breakdown by Gender and Age Dataset: Male and...

About this dataset

Content

Inspiration

Recommended for further research

Portable pseudo-random reference sequences with Mersenne Twister using GNU...

Portable pseudo-random reference sequences with Mersenne Twister using GNU Octave

Mastrave project technical report

Daniele de Rigo

Dataset of books in the Weird true facts series

Global Funny Cat Stick Market Economic and Social Impact 2025-2032

OLAF PROJECT DATA SET

Data from: Random integers

What consumers find creepy or cool about targeted ads & personalization 2024...

NBA Player Dataset & Prediction Model Artifacts

Description

Brief overview of Files

File Details

Revenue share of Penguin Random House 2017-2023, by region

Data_Sheet_1_The Optimal Machine Learning-Based Missing Data Imputation for...

Dataset for: Simulation and data-generation for random-effects network meta-analysis of binary outcome