20 datasets found

Quantitative Trading Strategy Development (APPLE)
kaggle.com
Updated Jun 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pranshu Jayswal (2024). Quantitative Trading Strategy Development (APPLE) [Dataset]. https://www.kaggle.com/datasets/pranshujayswal/quantitative-trading-strategy-development-apple/data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 27, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Pranshu Jayswal
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
Dataset

This dataset was created by Pranshu Jayswal

Released under CC BY-NC-SA 4.0

Contents
QwQ-32B-Preview quant
kaggle.com
Updated Nov 28, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kɔuq Wang (2024). QwQ-32B-Preview quant [Dataset]. https://www.kaggle.com/gmhost/qwq-32b-preview-quant/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 28, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Kɔuq Wang
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset

This dataset was created by kwang

Released under CC0: Public Domain

Contents
salary_data
kaggle.com
zip
Updated Nov 1, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
quAnt (2019). salary_data [Dataset]. https://www.kaggle.com/rookequant/salary-data
Explore at:
zip(378 bytes)Available download formats
Dataset updated
Nov 1, 2019
Authors
quAnt
Description
Dataset

This dataset was created by quAnt

Contents
Ubi Quant Future Market Prediction's Dataset.
kaggle.com
Updated Mar 23, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
JITENDRA PYLA. (2022). Ubi Quant Future Market Prediction's Dataset. [Dataset]. https://www.kaggle.com/datasets/pylajitendra/ubi-quant-future-market-predictions-dataset/data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 23, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
JITENDRA PYLA.
Description
Dataset

This dataset was created by JITENDRA PYLA.

Released under Data files © Original Authors

Contents
Tokka Labs Quant July 2023 to Feb 2024 Dataset
kaggle.com
Updated Feb 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Po Yan Chuan (2024). Tokka Labs Quant July 2023 to Feb 2024 Dataset [Dataset]. https://www.kaggle.com/poyanchuan/tokka-labs-quant-july-2023-to-feb-2024-dataset/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 29, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Po Yan Chuan
Description
Dataset

This dataset was created by Po Yan Chuan

Contents
NIFTY 50 OHLC DATA ||09-01-2015 to 25-04-2025||
kaggle.com
Updated May 3, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Swapnil Chaitanya (2025). NIFTY 50 OHLC DATA ||09-01-2015 to 25-04-2025|| [Dataset]. https://www.kaggle.com/datasets/swapnilchaitanya/nifty-50-ohlc-data-09-01-2015-to-25-04-2025
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 3, 2025
Dataset provided by
Kaggle
Authors
Swapnil Chaitanya
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
his dataset contains cleaned and time-synced OHLC (Open, High, Low, Close) data for the NIFTY 50 index, covering the period from 9th January 2015 to 25th April 2025.

It includes:

5-minute timeframe data (intraday)

25-minute aggregated interval (useful for trend and momentum strategies)

Daily candles for long-term technical setups

This dataset is ideal for:

Quantitative trading research

Algorithmic strategy backtesting (MACD, RSI, Price Action, etc.)

Time-series analysis & forecasting

No forward-filled or synthetic values were used — all data is from real market trading sessions.
Quant_Quest
kaggle.com
zip
Updated Mar 6, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Abhinesh reddy (2020). Quant_Quest [Dataset]. https://www.kaggle.com/abhineshreddy/quant-quest
Explore at:
zip(1918399 bytes)Available download formats
Dataset updated
Mar 6, 2020
Authors
Abhinesh reddy
Description
Dataset

This dataset was created by Abhinesh reddy

Contents
Finalcial analysis quantitative data
kaggle.com
Updated Apr 7, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Foridur (2021). Finalcial analysis quantitative data [Dataset]. https://www.kaggle.com/datasets/foridurrahman/finalcial-analysis-quantitative-data/suggestions?status=pending&yourSuggestions=true
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 7, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Foridur
Description
Dataset

This dataset was created by Foridur

Contents
QSAR Fish Toxicity Data Set
kaggle.com
Updated Jun 14, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ishan Dutta (2020). QSAR Fish Toxicity Data Set [Dataset]. https://www.kaggle.com/ishandutta/qsar-fish-toxicity-data-set/metadata
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 14, 2020
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Ishan Dutta
License
Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
Description
Data Set Information:

This dataset was used to develop quantitative regression QSAR models to predict acute aquatic toxicity towards the fish Pimephales promelas (fathead minnow) on a set of 908 chemicals. LC50 data, which is the concentration that causes death in 50% of test fish over a test duration of 96 hours, was used as model response. The model comprised 6 molecular descriptors: MLOGP (molecular properties), CIC0 (information indices), GATS1i (2D autocorrelations), NdssC (atom-type counts), NdsCH ((atom-type counts), SM1_Dz(Z) (2D matrix-based descriptors). Details can be found in the quoted reference: M. Cassotti, D. Ballabio, R. Todeschini, V. Consonni. A similarity-based QSAR model for predicting acute toxicity towards the fathead minnow (Pimephales promelas), SAR and QSAR in Environmental Research (2015), 26, 217-243; doi: 10.1080/1062936X.2015.1018938

Attribute Information:

6 molecular descriptors and 1 quantitative experimental response: 1) CIC0 2) SM1_Dz(Z) 3) GATS1i 4) NdsCH 5) NdssC 6) MLOGP 7) quantitative response, LC50 [-LOG(mol/L)]

Relevant Papers:

M. Cassotti, D. Ballabio, R. Todeschini, V. Consonni. A similarity-based QSAR model for predicting acute toxicity towards the fathead minnow (Pimephales promelas), SAR and QSAR in Environmental Research (2015), 26, 217-243; doi: 10.1080/1062936X.2015.1018938
KPIsUX
kaggle.com
Updated Aug 28, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gustavo Jota (2021). KPIsUX [Dataset]. https://www.kaggle.com/gustavojota/ux-kpis/tasks
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 28, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Gustavo Jota
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

Data to understand how to transform raw Google Analytic data into qualitative usability metrics.

Google Analytics data (Title and Duration in seconds) Hotjar (NPS) Survey Monkey (NPS)

Acknowledgements

Bebideria.com.br´s developers and its readers Google´s developers and communities that keep it running and free Hotjar´s team and CEO who supported me exporting raw data and even improving the app towards my experiment´s needs. Finally UDESC´s PPGDesign professors who guided and oriented this study

Inspiration

Let´s make better products!
Breast Cancer Coimbra
kaggle.com
Updated Jan 7, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vivek Agrawal (2024). Breast Cancer Coimbra [Dataset]. https://www.kaggle.com/datasets/atom1991/breast-cancer-coimbra
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 7, 2024
Dataset provided by
Kaggle
Authors
Vivek Agrawal
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset originates from a deep learning model trained on the "Coimbra Breast Cancer" dataset, with feature distributions closely resembling the original. The original data includes clinical observations from 64 patients with breast cancer and 52 healthy controls, encompassing 10 quantitative predictors and a binary dependent variable indicating the presence or absence of breast cancer.

Quantitative Attributes:

Age (years): Represents the age of individuals in the dataset.

BMI (kg/m²): Body Mass Index, a measure of body fat based on weight and height.

Glucose (mg/dL): Reflects blood glucose levels, a vital metabolic indicator.

Insulin (µU/mL): Indicates insulin levels, a hormone associated with glucose regulation.

HOMA: Homeostatic Model Assessment, a method assessing insulin resistance and beta-cell function.

Leptin (ng/mL): Represents leptin levels, a hormone involved in appetite and energy balance regulation.

Adiponectin (µg/mL): Reflects adiponectin levels, a protein associated with metabolic regulation.

Resistin (ng/mL): Indicates resistin levels, a protein implicated in insulin resistance.

MCP-1 (pg/dL): Reflects Monocyte Chemoattractant Protein-1 levels, a cytokine involved in inflammation.

Labels:

1: Healthy controls

2: Patients with breast cancer

These quantitative attributes, including anthropometric data and parameters gathered from routine blood analysis, serve as the foundation for potential biomarkers of breast cancer. The dataset presents an opportunity for developing accurate prediction models, aiding in the identification and understanding of factors associated with breast cancer.
Chinese Macroeconomic Data (2005 - 2022)
kaggle.com
Updated May 14, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Francisco Feng (2022). Chinese Macroeconomic Data (2005 - 2022) [Dataset]. https://www.kaggle.com/datasets/franciscofeng/chinese-macroeconomic-data-20052022
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 14, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Francisco Feng
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
Macroeconomic data is an important source for both institutions and companies to have a rough sense of what government's policies and economy will head to. This dataset can help macroeconomic and fundamental analysts to do research on Chinese market or macroeconomics. Quantitative researchers can also use this dataset as a reference to assist them making better strategies. The SHIBOR rate of different maturities is recorded at daily frequency. Users can construct the yield curve for economic research. Quantitative researchers can use it to see how SHIBOR influences the overall Chinese stock & fixed income market and etc. Many Chinese Indices are also very important in conducting research about Chinese market & economy. These data are also at daily frequency. Other macroeconomic data are recorded in monthly frequency and thus can be used to conduct broader area of economic and financial research and etc.

#Janatahack: Independence Day 2020 ML Hackathon

kaggle.com

Updated Aug 15, 2020

Facebook

Twitter

Click to copy link

Link copied

Cite

VinayVikram (2020). #Janatahack: Independence Day 2020 ML Hackathon [Dataset]. https://www.kaggle.com/vin1234/janatahack-independence-day-2020-ml-hackathon/code

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Aug 15, 2020

Dataset provided by

Kagglehttp://kaggle.com/

Authors

VinayVikram

Description

Problem Statement

Topic Modeling for Research Articles Researchers have access to large online archives of scientific articles. As a consequence, finding relevant articles has become more difficult. Tagging or topic modelling provides a way to give token of identification to research articles which facilitates recommendation and search process.

Given the abstract and title for a set of research articles, predict the topics for each article included in the test set.

Note that a research article can possibly have more than 1 topic. The research article abstracts and titles are sourced from the following 6 topics:

Computer Science
Physics
Mathematics
Statistics
Quantitative Biology
Quantitative Finance

Data Dictionary

train.csv

Column	Description
ID	Unique ID for each article
TITLE	Title of the research article
ABSTRACT	Abstract of the research article
Computer Science	Whether article belongs to topic computer science (1/0)
Physics	Whether article belongs to topic physics (1/0)
Mathematics	Whether article belongs to topic Mathematics (1/0)
Statistics	Whether article belongs to topic Statistics (1/0)
Quantitative Biology	Whether article belongs to topic Quantitative Biology (1/0)
Quantitative Finance	Whether article belongs to topic Quantitative Finance (1/0)

ID	Unique ID for each article
TITLE	Title of the research article
ABSTRACT	Abstract of the research article

ID	Unique ID for each article
TITLE	Title of the research article
ABSTRACT	Abstract of the research article
Computer Science	Whether article belongs to topic computer science (1/0)
Physics	Whether article belongs to topic physics (1/0)
Mathematics	Whether article belongs to topic Mathematics (1/0)
Statistics	Whether article belongs to topic Statistics (1/0)
Quantitative Biology	Whether article belongs to topic Quantitative Biology (1/0)
Quantitative Finance	Whether article belongs to topic Quantitative Finance (1/0)

Evaluation Metric

Submissions are evaluated on micro F1 Score between the predicted and observed topics for each article in the test set.

Inspiration

Your data will be in front of the world's largest data science community. What questions do you want to see answered?

Janatahack: Independence Day 2020 ML Hackathon

kaggle.com

zip

Updated Aug 15, 2020

Facebook

Twitter

Click to copy link

Link copied

Cite

Anmol Kumar (2020). Janatahack: Independence Day 2020 ML Hackathon [Dataset]. https://www.kaggle.com/anmolkumar/janatahack-independence-day-2020-ml-hackathon

Explore at:

zip(420422235 bytes)Available download formats

Dataset updated

Aug 15, 2020

Authors

Anmol Kumar

Description

Topic Modeling for Research Articles

Researchers have access to large online archives of scientific articles. As a consequence, finding relevant articles has become more difficult. Tagging or topic modelling provides a way to give token of identification to research articles which facilitates recommendation and search process.

Given the abstract and title for a set of research articles, predict the topics for each article included in the test set.

Note that a research article can possibly have more than 1 topic. The research article abstracts and titles are sourced from the following 6 topics:

Computer Science
Physics
Mathematics
Statistics
Quantitative Biology
Quantitative Finance

Data Dictionary

train.csv

Column	Description
ID	Unique ID for each article
TITLE	Title of the research article
ABSTRACT	Abstract of the research article
Computer Science	Whether article belongs to topic computer science (1/0)
Physics	Whether article belongs to topic physics (1/0)
Mathematics	Whether article belongs to topic Mathematics (1/0)
Statistics	Whether article belongs to topic Statistics (1/0)
Quantitative Biology	Whether article belongs to topic Quantitative Biology (1/0)
Quantitative Finance	Whether article belongs to topic Quantitative Finance (1/0)

What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

test.csv

Column	Description
ID	Unique ID for each article
TITLE	Title of the research article
ABSTRACT	Abstract of the research article

Evaluation Metric

Submissions are evaluated on micro F1 Score between the predicted and observed topics for each article in the test set

### Public and Private Split Test reviews are further divided into Public (40%) and Private (60%)

Your initial responses will be checked and scored on the Public data. The final rankings would be based on your private score which will be published once the competition is over.

Guidelines for Final Submission

Please ensure that your final submission includes the following:
Solution file containing the predicted 1/0 for each of the 6 topics for every research article in the test set
Code file for reproducing the submission, note that it is mandatory to submit your code for a valid final submission

QBI Image Segmentation
kaggle.com
zip
Updated Mar 15, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
K Scott Mader (2017). QBI Image Segmentation [Dataset]. https://www.kaggle.com/kmader/qbi-image-segmentation
Explore at:
zip(20669114 bytes)Available download formats
Dataset updated
Mar 15, 2017
Authors
K Scott Mader
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Image Segmentation

The data and kernels here are connected to the lecture material on image segmentation from the Quantitative Big Imaging Course at ETH Zurich (kmader.github.io/Quantitative-Big-Imaging-2017). The specific description of the exercise tasks can be found (https://github.com/kmader/Quantitative-Big-Imaging-2016/blob/master/Exercises/03-Description.md)
126 RNA-Seq datasets of COVID-19
kaggle.com
Updated Mar 24, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Octopus210 (2022). 126 RNA-Seq datasets of COVID-19 [Dataset]. http://doi.org/10.34740/kaggle/ds/972946
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/ds/972946
Dataset updated
Mar 24, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Octopus210
Description
The data was updated on March 13, 2022.

We have uploaded the complete data analysis as a txt.file. In addition, we uploaded information on age, sex, hospital room, and mapping rate as supplemental data.

Changes from last time! Use salmon's --validateMapping option for quantification. In addition, the --quantmerge option was used in place of tximport.

126 RNA-Seq datasets of COVID19 ( 100 COVID19, 26 Control cases ) Let’s find important transcripts in COVID-19 by using machine learning👍

RNA-Seq is a particular technology-based sequencing technique which uses next-generation sequencing (NGS) to reveal the presence and quantity of RNA in a biological sample, analyzing the continuously changing cellular transcriptome.

https://en.wikipedia.org/wiki/RNA-Seq

We download SRA and FASTQ files and prepared 126 RNA-Seq datasets. The known transcriptome (GRCh38) was used as a reference for quantitative analysis. The abundances of individual transcripts were quantified by Salmon. Finally, We applied tximport to create an output file for analysis.

Content Source : https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE157103

Let’s find important transcripts in COVID-19 by using machine learning👍
Compressive Strength of Concrete
kaggle.com
zip
Updated Sep 21, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Divya Chatty (2020). Compressive Strength of Concrete [Dataset]. https://www.kaggle.com/divyachatty/compressive-strength-of-concrete
Explore at:
zip(10845 bytes)Available download formats
Dataset updated
Sep 21, 2020
Authors
Divya Chatty
Description
This dataset deals with the factors that influence the compressive strength of concrete. There are a few input parameters and an output parameter.

Name -------------------------------Data Type --- Measurement--------Description

Cement (component 1) quantitative kg in a m3 mixture Input Variable Blast Furnace Slag (component 2) quantitative kg in a m3 mixture Input Variable Fly Ash (component 3) quantitative kg in a m3 mixture Input Variable Water (component 4) quantitative kg in a m3 mixture Input Variable Superplasticizer (component 5) quantitative kg in a m3 mixture Input Variable Coarse Aggregate (component 6) quantitative kg in a m3 mixture Input Variable Fine Aggregate (component 7) quantitative kg in a m3 mixture Input Variable Age quantitative Day (1~365) Input Variable Concrete compressive strength quantitative MPa Output Variable

Data has been sourced from : http://archive.ics.uci.edu/ml/datasets/Concrete+Compressive+Strength
Revisiting a Concrete Strength regression
kaggle.com
zip
Updated Jun 15, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
maajdl (2018). Revisiting a Concrete Strength regression [Dataset]. https://www.kaggle.com/maajdl/yeh-concret-data
Explore at:
zip(10477 bytes)Available download formats
Dataset updated
Jun 15, 2018
Authors
maajdl
Description
Context

Abstract:

Concrete is the most important material in civil engineering.
The concrete compressive strength is a highly nonlinear function of age and ingredients.

Content

Concrete Compressive Strength Data Set

Data Set Information:

Number of instances 1030 Number of Attributes 9 Attribute breakdown 8 quantitative input variables, and 1 quantitative output variable Missing Attribute Values None

Attribute Information:

Given are the variable name, variable type, the measurement unit and a brief description. The concrete compressive strength is the regression problem. The order of this listing corresponds to the order of numerals along the rows of the database.

Name -- Data Type -- Measurement -- Description

Cement (component 1) -- quantitative -- kg in a m3 mixture -- Input Variable
Blast Furnace Slag (component 2) -- quantitative -- kg in a m3 mixture -- Input Variable
Fly Ash (component 3) -- quantitative -- kg in a m3 mixture -- Input Variable
Water (component 4) -- quantitative -- kg in a m3 mixture -- Input Variable
Superplasticizer (component 5) -- quantitative -- kg in a m3 mixture -- Input Variable
Coarse Aggregate (component 6) -- quantitative -- kg in a m3 mixture -- Input Variable
Fine Aggregate (component 7) -- quantitative -- kg in a m3 mixture -- Input Variable
Age -- quantitative -- Day (1~365) -- Input Variable
Concrete compressive strength -- quantitative -- MPa -- Output Variable

Acknowledgements

Source:

Original Owner and Donor
Prof. I-Cheng Yeh
Department of Information Management
Chung-Hua University,
Hsin Chu, Taiwan 30067, R.O.C.
e-mail:icyeh '@' chu.edu.tw
TEL:886-3-5186511

Date Donated: August 3, 2007

From: https://archive.ics.uci.edu/ml/datasets/Concrete+Compressive+Strength

Relevant Papers:

Main
1) I-Cheng Yeh, "Modeling of strength of high performance concrete using artificial neural networks," Cement and Concrete Research, Vol. 28, No. 12, pp. 1797-1808 (1998).

Others

2) I-Cheng Yeh, "Modeling Concrete Strength with Augment-Neuron Networks," J. of Materials in Civil Engineering, ASCE, Vol. 10, No. 4, pp. 263-268 (1998).

3) I-Cheng Yeh, "Design of High Performance Concrete Mixture Using Neural Networks," J. of Computing in Civil Engineering, ASCE, Vol. 13, No. 1, pp. 36-42 (1999).

4) I-Cheng Yeh, "Prediction of Strength of Fly Ash and Slag Concrete By The Use of Artificial Neural Networks," Journal of the Chinese Institute of Civil and Hydraulic Engineering, Vol. 15, No. 4, pp. 659-663 (2003).

5) I-Cheng Yeh, "A mix Proportioning Methodology for Fly Ash and Slag Concrete Using Artificial Neural Networks," Chung Hua Journal of Science and Engineering, Vol. 1, No. 1, pp. 77-84 (2003).

6) Yeh, I-Cheng, "Analysis of strength of concrete using design of experiments and neural networks," Journal of Materials in Civil Engineering, ASCE, Vol.18, No.4, pp.597-604 (2006).

Citation Request:

NOTE: Reuse of this database is unlimited with retention of copyright notice for Prof. I-Cheng Yeh and the following published paper:

I-Cheng Yeh, "Modeling of strength of high performance concrete using artificial neural networks," Cement and Concrete Research, Vol. 28, No. 12, pp. 1797-1808 (1998).

Inspiration

Can you predict the strength of concrete?
ML Competition on Cryptocurrency Market Data
kaggle.com
Updated Nov 23, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
YIEDL (2021). ML Competition on Cryptocurrency Market Data [Dataset]. https://www.kaggle.com/datasets/rocketcapital/ml-competition-on-cryptocurrency-market-data/data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 23, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
YIEDL
Description
Context

The world of Asset Management today, from a technological point of view, is mainly linked to mature but inefficient supply chains, which merge discretionary and quantitative forecasting models. The financial industry has been working in the shadows for years to overcome this paradigm, pushing beyond technology, making use not only of automated models (trading systems and dynamic asset allocation systems) but also of the most modern Machine Learning techniques for Time Series Forecasting and Unsupervised Learning for the classification of financial instruments. However, in most cases, it uses proprietary technologies that are limited by definition (workforce, technology investment, scalability). Numerai, an offshoot of Jim Simons’ Renaissance Technologies, was the first to blaze a new path by building a first centralized machine learning competition, in order to gather a swarm of predictors outside the company, to integrate with internal intelligence. The discretionary contribution was therefore eliminated, and the information content generated internally was enriched by thousands of external contributors, in many cases linked to sectors unrelated to the financial industry, such as energy, aerospace, or biotechnology. In fact, the concept that to obtain good market forecasts, it is necessary to have only skills related to the financial world is overcome. What we have just described is the starting point of Rocket Capital Investment. To overcome the limit imposed by Numerai, a new competition has been engineered, which has the ambition to make this project even more “democratic”. How? Decentralizing, thanks to the Blockchain, the entire chain of participant management, collection, and validation of forecasts, as well as decisions relating to the evaluation and remuneration of the participants themselves. In this way, it is possible to make every aspect of the competition completely transparent and inviolable. Everything is managed by a Smart Contract, whose rules are known and shared. Let’s find out in more detail what it is.

Starting from the idea of Numerai, we have completely re-engineered all aspects related to the management of participants, Scoring, and Reward, following the concept of decentralization of the production chain. To this end, a proprietary token (MUSA token) has been created which acts as an exchange currency and which integrates a smart contract that acts as an autonomous competition manager. The communication interface between the users and the smart contract is a DApp (“Decentralized Application”). But let’s see in more detail how all these elements combine with each other, like in a puzzle.

Competition Technicalities

A suitably normalized dataset is issued every week, containing data from over 400 cryptocurrencies. For each asset, the data relating to prices, volumes traded, quantitative elements, as well as alternative data (information on the blockchain and on the sentiment of the various providers) are aggregated. Another difference with Numerai is the ability to distinguish assets for each row (the first column shows the related ticker). The last column instead contains the question to which the Data Scientists are asked to give an answer: the relative strength ranking of each asset, built on the forecast of the percentage change expected in the following week.

Registration for the Competition takes place by providing, in a completely anonymous way, the address of a crypto wallet on which the MUSA tokens are loaded. From that moment on, the MUSAs become, to all intents and purposes, the currency of exchange between participants and organizers. Every Monday a new Challenge opens, and all Data Scientists registered in the Contest are asked to use their models to generate predictions. By accessing the DApp, the participant can download the new dataset, complete with the history of the previous weeks and the last useful week. At this point the participant can perform two actions in sequence directly from the DApp: - Staking: MUSA tokens are placed on your prediction. - Submission: the forecast for the following week is uploaded to the blockchain.

Since the forecast consists of a series of numbers between 0 and 1 associated with each asset, it is very easy, the following week, to calculate the error committed in terms of RMSE (“Root Mean Square Error”). This allows creating a ranking on the participants, to be able to reward them accordingly with additional MUSA tokens. But let’s see in more detail how the Smart Contract, which was created, allows us to differentiate the reward based on different items (all, again, in a completely transparent and verifiable way): - Staking Reward: the mere fact of participating in the competition is remunerated. In future versions, it will also be possible to bet on the goodness of the other participants’ predictions. - Challenge Rew...
m
Stated Preferences for Car Choice
data.mendeley.com
kaggle.com
Updated May 6, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Steven Taylor (2019). Stated Preferences for Car Choice [Dataset]. http://doi.org/10.17632/25gf67c96f.5
Explore at:
Unique identifier
https://doi.org/10.17632/25gf67c96f.5
Dataset updated
May 6, 2019
Authors
Steven Taylor
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This Data set including all the Variables (choice, college,hsg2, coml5, typez, fuelz, pricez, speedz, pollutionz, sizez) from 2016 to 2018.I scrapped this data from www.qed.econ.queensu.ca
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Pranshu Jayswal (2024). Quantitative Trading Strategy Development (APPLE) [Dataset]. https://www.kaggle.com/datasets/pranshujayswal/quantitative-trading-strategy-development-apple/data

Quantitative Trading Strategy Development (APPLE)

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Jun 27, 2024

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Pranshu Jayswal

License

Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically

Description

Dataset

This dataset was created by Pranshu Jayswal

Released under CC BY-NC-SA 4.0

Clear search

Close search

Google apps

Main menu

Quantitative Trading Strategy Development (APPLE)

Dataset

Contents

QwQ-32B-Preview quant

Dataset

Contents

salary_data

Dataset

Contents

Ubi Quant Future Market Prediction's Dataset.

Dataset

Contents

Tokka Labs Quant July 2023 to Feb 2024 Dataset

Dataset

Contents

NIFTY 50 OHLC DATA ||09-01-2015 to 25-04-2025||

Quant_Quest

Dataset

Contents

Finalcial analysis quantitative data

Dataset

Contents

QSAR Fish Toxicity Data Set

Data Set Information:

Attribute Information:

Relevant Papers:

KPIsUX

Context

Acknowledgements

Inspiration

Breast Cancer Coimbra

Chinese Macroeconomic Data (2005 - 2022)

#Janatahack: Independence Day 2020 ML Hackathon

Problem Statement

Data Dictionary

Evaluation Metric

Inspiration

Janatahack: Independence Day 2020 ML Hackathon

Topic Modeling for Research Articles

Data Dictionary

train.csv

test.csv

Evaluation Metric

Guidelines for Final Submission

QBI Image Segmentation

Image Segmentation

126 RNA-Seq datasets of COVID-19

Compressive Strength of Concrete

Revisiting a Concrete Strength regression

Context

Abstract:

Content

Concrete Compressive Strength Data Set

Data Set Information:

Attribute Information:

Acknowledgements

Source:

Relevant Papers:

Citation Request:

Inspiration

ML Competition on Cryptocurrency Market Data

Context

Competition Technicalities

Stated Preferences for Car Choice

Quantitative Trading Strategy Development (APPLE)

Dataset

Contents