15 datasets found
  1. Integrated Cryptocurrency Historical Data for a Predictive Data-Driven...

    • cryptodata.center
    Updated Dec 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    cryptodata.center (2024). Integrated Cryptocurrency Historical Data for a Predictive Data-Driven Decision-Making Algorithm - Dataset - CryptoData Hub [Dataset]. https://cryptodata.center/dataset/integrated-cryptocurrency-historical-data-for-a-predictive-data-driven-decision-making-algorithm
    Explore at:
    Dataset updated
    Dec 4, 2024
    Dataset provided by
    CryptoDATA
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Cryptocurrency historical datasets from January 2012 (if available) to October 2021 were obtained and integrated from various sources and Application Programming Interfaces (APIs) including Yahoo Finance, Cryptodownload, CoinMarketCap, various Kaggle datasets, and multiple APIs. While these datasets used various formats of time (e.g., minutes, hours, days), in order to integrate the datasets days format was used for in this research study. The integrated cryptocurrency historical datasets for 80 cryptocurrencies including but not limited to Bitcoin (BTC), Ethereum (ETH), Binance Coin (BNB), Cardano (ADA), Tether (USDT), Ripple (XRP), Solana (SOL), Polkadot (DOT), USD Coin (USDC), Dogecoin (DOGE), Tron (TRX), Bitcoin Cash (BCH), Litecoin (LTC), EOS (EOS), Cosmos (ATOM), Stellar (XLM), Wrapped Bitcoin (WBTC), Uniswap (UNI), Terra (LUNA), SHIBA INU (SHIB), and 60 more cryptocurrencies were uploaded in this online Mendeley data repository. Although the primary attribute of including the mentioned cryptocurrencies was the Market Capitalization, a subject matter expert i.e., a professional trader has also guided the initial selection of the cryptocurrencies by analyzing various indicators such as Relative Strength Index (RSI), Moving Average Convergence/Divergence (MACD), MYC Signals, Bollinger Bands, Fibonacci Retracement, Stochastic Oscillator and Ichimoku Cloud. The primary features of this dataset that were used as the decision-making criteria of the CLUS-MCDA II approach are Timestamps, Open, High, Low, Closed, Volume (Currency), % Change (7 days and 24 hours), Market Cap and Weighted Price values. The available excel and CSV files in this data set are just part of the integrated data and other databases, datasets and API References that was used in this study are as follows: [1] https://finance.yahoo.com/ [2] https://coinmarketcap.com/historical/ [3] https://cryptodatadownload.com/ [4] https://kaggle.com/philmohun/cryptocurrency-financial-data [5] https://kaggle.com/deepshah16/meme-cryptocurrency-historical-data [6] https://kaggle.com/sudalairajkumar/cryptocurrencypricehistory [7] https://min-api.cryptocompare.com/data/price?fsym=BTC&tsyms=USD [8] https://min-api.cryptocompare.com/ [9] https://p.nomics.com/cryptocurrency-bitcoin-api [10] https://www.coinapi.io/ [11] https://www.coingecko.com/en/api [12] https://cryptowat.ch/ [13] https://www.alphavantage.co/ This dataset is part of the CLUS-MCDA (Cluster analysis for improving Multiple Criteria Decision Analysis) and CLUS-MCDAII Project: https://aimaghsoodi.github.io/CLUSMCDA-R-Package/ https://github.com/Aimaghsoodi/CLUS-MCDA-II https://github.com/azadkavian/CLUS-MCDA

  2. A

    ‘Crypto Fear and Greed Index’ analyzed by Analyst-2

    • analyst-2.ai
    Updated May 28, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2018). ‘Crypto Fear and Greed Index’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-crypto-fear-and-greed-index-e01d/63c3ed46/?iid=001-519&v=presentation
    Explore at:
    Dataset updated
    May 28, 2018
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ‘Crypto Fear and Greed Index’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/adelsondias/crypto-fear-and-greed-index on 13 February 2022.

    --- Dataset description provided by original source is as follows ---

    Crypto Fear and Greed Index

    Each day, the website https://alternative.me/crypto/fear-and-greed-index/ publishes this index based on analysis of emotions and sentiments from different sources crunched into one simple number: The Fear & Greed Index for Bitcoin and other large cryptocurrencies.

    Why Measure Fear and Greed?

    The crypto market behaviour is very emotional. People tend to get greedy when the market is rising which results in FOMO (Fear of missing out). Also, people often sell their coins in irrational reaction of seeing red numbers. With our Fear and Greed Index, we try to save you from your own emotional overreactions. There are two simple assumptions:

    • Extreme fear can be a sign that investors are too worried. That could be a buying opportunity.
    • When Investors are getting too greedy, that means the market is due for a correction.

    Therefore, we analyze the current sentiment of the Bitcoin market and crunch the numbers into a simple meter from 0 to 100. Zero means "Extreme Fear", while 100 means "Extreme Greed". See below for further information on our data sources.

    Data Sources

    We are gathering data from the five following sources. Each data point is valued the same as the day before in order to visualize a meaningful progress in sentiment change of the crypto market.

    First of all, the current index is for bitcoin only (we offer separate indices for large alt coins soon), because a big part of it is the volatility of the coin price.

    But let’s list all the different factors we’re including in the current index:

    Volatility (25 %)

    We’re measuring the current volatility and max. drawdowns of bitcoin and compare it with the corresponding average values of the last 30 days and 90 days. We argue that an unusual rise in volatility is a sign of a fearful market.

    Market Momentum/Volume (25%)

    Also, we’re measuring the current volume and market momentum (again in comparison with the last 30/90 day average values) and put those two values together. Generally, when we see high buying volumes in a positive market on a daily basis, we conclude that the market acts overly greedy / too bullish.

    Social Media (15%)

    While our reddit sentiment analysis is still not in the live index (we’re still experimenting some market-related key words in the text processing algorithm), our twitter analysis is running. There, we gather and count posts on various hashtags for each coin (publicly, we show only those for Bitcoin) and check how fast and how many interactions they receive in certain time frames). A unusual high interaction rate results in a grown public interest in the coin and in our eyes, corresponds to a greedy market behaviour.

    Surveys (15%) currently paused

    Together with strawpoll.com (disclaimer: we own this site, too), quite a large public polling platform, we’re conducting weekly crypto polls and ask people how they see the market. Usually, we’re seeing 2,000 - 3,000 votes on each poll, so we do get a picture of the sentiment of a group of crypto investors. We don’t give those results too much attention, but it was quite useful in the beginning of our studies. You can see some recent results here.

    Dominance (10%)

    The dominance of a coin resembles the market cap share of the whole crypto market. Especially for Bitcoin, we think that a rise in Bitcoin dominance is caused by a fear of (and thus a reduction of) too speculative alt-coin investments, since Bitcoin is becoming more and more the safe haven of crypto. On the other side, when Bitcoin dominance shrinks, people are getting more greedy by investing in more risky alt-coins, dreaming of their chance in next big bull run. Anyhow, analyzing the dominance for a coin other than Bitcoin, you could argue the other way round, since more interest in an alt-coin may conclude a bullish/greedy behaviour for that specific coin.

    Trends (10%)

    We pull Google Trends data for various Bitcoin related search queries and crunch those numbers, especially the change of search volumes as well as recommended other currently popular searches. For example, if you check Google Trends for "Bitcoin", you can’t get much information from the search volume. But currently, you can see that there is currently a +1,550% rise of the query „bitcoin price manipulation“ in the box of related search queries (as of 05/29/2018). This is clearly a sign of fear in the market, and we use that for our index.

    There's a story behind every dataset and here's your opportunity to share yours.

    Copyright disclaimer

    This dataset is produced and maintained by the administrators of https://alternative.me/crypto/fear-and-greed-index/.

    This published version is an unofficial copy of their data, which can be also collected using their API (e.g., GET https://api.alternative.me/fng/?limit=10&format=csv&date_format=us).

    --- Original source retains full ownership of the source dataset ---

  3. Cryptocurrency extra data - Litecoin

    • kaggle.com
    Updated Jan 20, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yam Peleg (2022). Cryptocurrency extra data - Litecoin [Dataset]. http://doi.org/10.34740/kaggle/dsv/3066229
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 20, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Yam Peleg
    Description

    Context:

    This dataset is an extra updating dataset for the G-Research Crypto Forecasting competition.

    Introduction

    This is a daily updated dataset, automaticlly collecting market data for G-Research crypto forecasting competition. The data is of the 1-minute resolution, collected for all competition assets and both retrieval and uploading are fully automated. see discussion topic.

    The Data

    For every asset in the competition, the following fields from Binance's official API endpoint for historical candlestick data are collected, saved, and processed.

    
    1. **timestamp** - A timestamp for the minute covered by the row.
    2. **Asset_ID** - An ID code for the cryptoasset.
    3. **Count** - The number of trades that took place this minute.
    4. **Open** - The USD price at the beginning of the minute.
    5. **High** - The highest USD price during the minute.
    6. **Low** - The lowest USD price during the minute.
    7. **Close** - The USD price at the end of the minute.
    8. **Volume** - The number of cryptoasset u units traded during the minute.
    9. **VWAP** - The volume-weighted average price for the minute.
    10. **Target** - 15 minute residualized returns. See the 'Prediction and Evaluation section of this notebook for details of how the target is calculated.
    11. **Weight** - Weight, defined by the competition hosts [here](https://www.kaggle.com/cstein06/tutorial-to-the-g-research-crypto-competition)
    12. **Asset_Name** - Human readable Asset name.
    

    Indexing

    The dataframe is indexed by timestamp and sorted from oldest to newest. The first row starts at the first timestamp available on the exchange, which is July 2017 for the longest-running pairs.

    Usage Example

    The following is a collection of simple starter notebooks for Kaggle's Crypto Comp showing PurgedTimeSeries in use with the collected dataset. Purged TimesSeries is explained here. There are many configuration variables below to allow you to experiment. Use either GPU or TPU. You can control which years are loaded, which neural networks are used, and whether to use feature engineering. You can experiment with different data preprocessing, model architecture, loss, optimizers, and learning rate schedules. The extra datasets contain the full history of the assets in the same format as the competition, so you can input that into your model too.

    Baseline Example Notebooks:

    These notebooks follow the ideas presented in my "Initial Thoughts" here. Some code sections have been reused from Chris' great (great) notebook series on SIIM ISIC melanoma detection competition here

    Loose-ends:

    This is a work in progress and will be updated constantly throughout the competition. At the moment, there are some known issues that still needed to be addressed:

    • VWAP: - At the moment VWAP calculation formula is still unclear. Currently the dataset uses an approximation calculated from the Open, High, Low, Close, Volume candlesticks. [Waiting for competition hosts input]
    • Target Labeling: There exist some mismatches to the original target provided by the hosts at some time intervals. On all the others - it is the same. The labeling code can be seen here. [Waiting for competition hosts] input]
    • Filtering: No filtration of 0 volume data is taken place.

    Example Visualisations

    Opening price with an added indicator (MA50): https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fb8664e6f26dc84e9a40d5a3d915c9640%2Fdownload.png?generation=1582053879538546&alt=media" alt="">

    Volume and number of trades: https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fcd04ed586b08c1576a7b67d163ad9889%2Fdownload-1.png?generation=1582053899082078&alt=media" alt="">

    License

    This data is being collected automatically from the crypto exchange Binance.

  4. Global Stock, ETF, and Index data

    • datarade.ai
    .json, .csv
    Updated Jul 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Twelve Data (2023). Global Stock, ETF, and Index data [Dataset]. https://datarade.ai/data-products/twelve-data-world-stock-forex-crypto-data-via-api-and-webs-twelve-data
    Explore at:
    .json, .csvAvailable download formats
    Dataset updated
    Jul 7, 2023
    Dataset authored and provided by
    Twelve Data
    Area covered
    Christmas Island, Burundi, Afghanistan, Egypt, Mozambique, Iran (Islamic Republic of), United States Minor Outlying Islands, Costa Rica, Belarus, Micronesia (Federated States of)
    Description

    Twelve Data is a technology-driven company that provides financial market data, financial tools, and dedicated solutions. Large audiences - from individuals to financial institutions - use our products to stay ahead of the competition and success.

    At Twelve Data we feel responsible for where the markets are going and how people are able to explore them. Coming from different technological backgrounds, we see how the world is lacking the unique and simple place where financial data can be accessed by anyone, at any time. This is what distinguishes us from others, we do not only supply the financial data but instead, we want you to benefit from it, by using the convenient format, tools, and special solutions.

    We believe that the human factor is still a very important aspect of our work and therefore our ethics guides us on how to treat people, with convenient and understandable resources. This includes world-class documentation, human support, and dedicated solutions.

  5. Cryptocurrency extra data - Ethereum Classic

    • kaggle.com
    Updated Jan 19, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yam Peleg (2022). Cryptocurrency extra data - Ethereum Classic [Dataset]. http://doi.org/10.34740/kaggle/dsv/3066021
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 19, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Yam Peleg
    Description

    Context:

    This dataset is an extra updating dataset for the G-Research Crypto Forecasting competition.

    Introduction

    This is a daily updated dataset, automaticlly collecting market data for G-Research crypto forecasting competition. The data is of the 1-minute resolution, collected for all competition assets and both retrieval and uploading are fully automated. see discussion topic.

    The Data

    For every asset in the competition, the following fields from Binance's official API endpoint for historical candlestick data are collected, saved, and processed.

    
    1. **timestamp** - A timestamp for the minute covered by the row.
    2. **Asset_ID** - An ID code for the cryptoasset.
    3. **Count** - The number of trades that took place this minute.
    4. **Open** - The USD price at the beginning of the minute.
    5. **High** - The highest USD price during the minute.
    6. **Low** - The lowest USD price during the minute.
    7. **Close** - The USD price at the end of the minute.
    8. **Volume** - The number of cryptoasset u units traded during the minute.
    9. **VWAP** - The volume-weighted average price for the minute.
    10. **Target** - 15 minute residualized returns. See the 'Prediction and Evaluation section of this notebook for details of how the target is calculated.
    11. **Weight** - Weight, defined by the competition hosts [here](https://www.kaggle.com/cstein06/tutorial-to-the-g-research-crypto-competition)
    12. **Asset_Name** - Human readable Asset name.
    

    Indexing

    The dataframe is indexed by timestamp and sorted from oldest to newest. The first row starts at the first timestamp available on the exchange, which is July 2017 for the longest-running pairs.

    Usage Example

    The following is a collection of simple starter notebooks for Kaggle's Crypto Comp showing PurgedTimeSeries in use with the collected dataset. Purged TimesSeries is explained here. There are many configuration variables below to allow you to experiment. Use either GPU or TPU. You can control which years are loaded, which neural networks are used, and whether to use feature engineering. You can experiment with different data preprocessing, model architecture, loss, optimizers, and learning rate schedules. The extra datasets contain the full history of the assets in the same format as the competition, so you can input that into your model too.

    Baseline Example Notebooks:

    These notebooks follow the ideas presented in my "Initial Thoughts" here. Some code sections have been reused from Chris' great (great) notebook series on SIIM ISIC melanoma detection competition here

    Loose-ends:

    This is a work in progress and will be updated constantly throughout the competition. At the moment, there are some known issues that still needed to be addressed:

    • VWAP: - At the moment VWAP calculation formula is still unclear. Currently the dataset uses an approximation calculated from the Open, High, Low, Close, Volume candlesticks. [Waiting for competition hosts input]
    • Target Labeling: There exist some mismatches to the original target provided by the hosts at some time intervals. On all the others - it is the same. The labeling code can be seen here. [Waiting for competition hosts] input]
    • Filtering: No filtration of 0 volume data is taken place.

    Example Visualisations

    Opening price with an added indicator (MA50): https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fb8664e6f26dc84e9a40d5a3d915c9640%2Fdownload.png?generation=1582053879538546&alt=media" alt="">

    Volume and number of trades: https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fcd04ed586b08c1576a7b67d163ad9889%2Fdownload-1.png?generation=1582053899082078&alt=media" alt="">

    License

    This data is being collected automatically from the crypto exchange Binance.

  6. RDA Pricing: Fundamental Pricing Data for Cryptoassets

    • datarade.ai
    .json
    Updated Feb 3, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    RDA Index (2021). RDA Pricing: Fundamental Pricing Data for Cryptoassets [Dataset]. https://datarade.ai/data-products/rda-pricing-fundamental-pricing-data-for-cryptoassets-rda-index
    Explore at:
    .jsonAvailable download formats
    Dataset updated
    Feb 3, 2021
    Dataset provided by
    Xtant Real Limited
    Authors
    RDA Index
    Area covered
    Kyrgyzstan, Egypt, Ghana, Cyprus, Timor-Leste, Saint Pierre and Miquelon, American Samoa, Cameroon, Afghanistan, Iran (Islamic Republic of)
    Description

    Despite their libertarian use cases to enable peer-to-peer, trustless, decentralised peer-to-peer transactions, behaviour consistent with speculative trading accounts for the majority of cryptoasset uses.

    The FCA cryptoasset consumer research 2020 concluded that 47% of people considered buying cryptoassets as a gamble that could make or lose money, 25% sees it as part of their wider investment portfolio, 22% don't want to miss out on a money making opportunity, 17% classifies it as part of their long term savings plan (e.g. pension), and 7% invest in it because they don't trust the current financial system. Majority of people buy them on the expectation that the asset will appreciate in value over time simply because more people are buying it which subsequently creates risks for investors at all levels of the pyramid.

    The RDA Price data stands in contrast with the market price to reveal the impact of speculative trading on each asset. The fundamental-market price ratio (FMr) is a key data point in this product. The FMr enables crypto users and investors to determine over-pricing and and manage risks upside and downside.

  7. Data Set: Python Crypto Misuses in the Wild

    • figshare.com
    zip
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anna-Katharina Wickert; Lars Baumgärtner; Florian Breitfelder; Mira Mezini (2023). Data Set: Python Crypto Misuses in the Wild [Dataset]. http://doi.org/10.6084/m9.figshare.16499085.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Anna-Katharina Wickert; Lars Baumgärtner; Florian Breitfelder; Mira Mezini
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Study results and scripts to obtain the results for our paper "Python Crypto Misuses in the Wild" [@akwick @gh0st42 @Breitfelder @miramezini]The archives in this folder contains the following:- evaluations.tar.gz contains the evaluation folder from the GitHub project linked in References. - tools.tar.gz contains the tools folder from the GitHub project linked in References.- repos-py-with-dep-only-src-files.zip contains the source files and their dependencies of the Python projects analyzed.- repos-micropy-with-dep-only-src-files.zip contains the sources files and their depedencies of the MicroPython projects analyzed.

  8. RDA Ratings: A 5-Tier Fundamental Ratings for Cryptoassets

    • datarade.ai
    .json, .csv
    Updated Mar 22, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    RDA Index (2021). RDA Ratings: A 5-Tier Fundamental Ratings for Cryptoassets [Dataset]. https://datarade.ai/data-products/rda-intrinsic-value-ratings-a-5-tier-fundamental-ratings-for-cryptoassets-rda-index
    Explore at:
    .json, .csvAvailable download formats
    Dataset updated
    Mar 22, 2021
    Dataset provided by
    Xtant Real Limited
    Authors
    RDA Index
    Area covered
    Macedonia (the former Yugoslav Republic of), Korea (Republic of), Israel, Norfolk Island, Greece, Switzerland, Iraq, Vanuatu, Chad, Turkey
    Description

    A 5-star rating system to simplify comparison between investability of cryptoassets.

    RDA Intrinsic Value (IV) Ratings enable crypto users and investors to identify risks related to asset integrity and systemic issues.

    When used in conjunction with RDA IV Ranking, RDA IV Ratings empower investors to move from the speculative to knowledge-based crypto portfolio construction and investment strategies.

    IV Ratings enable the setting of cryptoassets standards including compliance and consumer protection policies.

  9. Cryptocurrency extra data - TRON

    • kaggle.com
    Updated Jan 20, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yam Peleg (2022). Cryptocurrency extra data - TRON [Dataset]. http://doi.org/10.34740/kaggle/dsv/3066485
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 20, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Yam Peleg
    Description

    Context:

    This dataset is an extra updating dataset for the G-Research Crypto Forecasting competition.

    Introduction

    This is a daily updated dataset, automaticlly collecting market data for G-Research crypto forecasting competition. The data is of the 1-minute resolution, collected for all competition assets and both retrieval and uploading are fully automated. see discussion topic.

    The Data

    For every asset in the competition, the following fields from Binance's official API endpoint for historical candlestick data are collected, saved, and processed.

    
    1. **timestamp** - A timestamp for the minute covered by the row.
    2. **Asset_ID** - An ID code for the cryptoasset.
    3. **Count** - The number of trades that took place this minute.
    4. **Open** - The USD price at the beginning of the minute.
    5. **High** - The highest USD price during the minute.
    6. **Low** - The lowest USD price during the minute.
    7. **Close** - The USD price at the end of the minute.
    8. **Volume** - The number of cryptoasset u units traded during the minute.
    9. **VWAP** - The volume-weighted average price for the minute.
    10. **Target** - 15 minute residualized returns. See the 'Prediction and Evaluation section of this notebook for details of how the target is calculated.
    11. **Weight** - Weight, defined by the competition hosts [here](https://www.kaggle.com/cstein06/tutorial-to-the-g-research-crypto-competition)
    12. **Asset_Name** - Human readable Asset name.
    

    Indexing

    The dataframe is indexed by timestamp and sorted from oldest to newest. The first row starts at the first timestamp available on the exchange, which is July 2017 for the longest-running pairs.

    Usage Example

    The following is a collection of simple starter notebooks for Kaggle's Crypto Comp showing PurgedTimeSeries in use with the collected dataset. Purged TimesSeries is explained here. There are many configuration variables below to allow you to experiment. Use either GPU or TPU. You can control which years are loaded, which neural networks are used, and whether to use feature engineering. You can experiment with different data preprocessing, model architecture, loss, optimizers, and learning rate schedules. The extra datasets contain the full history of the assets in the same format as the competition, so you can input that into your model too.

    Baseline Example Notebooks:

    These notebooks follow the ideas presented in my "Initial Thoughts" here. Some code sections have been reused from Chris' great (great) notebook series on SIIM ISIC melanoma detection competition here

    Loose-ends:

    This is a work in progress and will be updated constantly throughout the competition. At the moment, there are some known issues that still needed to be addressed:

    • VWAP: - At the moment VWAP calculation formula is still unclear. Currently the dataset uses an approximation calculated from the Open, High, Low, Close, Volume candlesticks. [Waiting for competition hosts input]
    • Target Labeling: There exist some mismatches to the original target provided by the hosts at some time intervals. On all the others - it is the same. The labeling code can be seen here. [Waiting for competition hosts] input]
    • Filtering: No filtration of 0 volume data is taken place.

    Example Visualisations

    Opening price with an added indicator (MA50): https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fb8664e6f26dc84e9a40d5a3d915c9640%2Fdownload.png?generation=1582053879538546&alt=media" alt="">

    Volume and number of trades: https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fcd04ed586b08c1576a7b67d163ad9889%2Fdownload-1.png?generation=1582053899082078&alt=media" alt="">

    License

    This data is being collected automatically from the crypto exchange Binance.

  10. Dataset for Multivariate Bitcoin Price Forecasting.

    • figshare.com
    txt
    Updated Apr 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anny Mardjo; Chidchanok Choksuchat (2023). Dataset for Multivariate Bitcoin Price Forecasting. [Dataset]. http://doi.org/10.6084/m9.figshare.22678540.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Apr 22, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Anny Mardjo; Chidchanok Choksuchat
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset was collected for the period spanning between 01/07/2019 and 31/12/2022.The historical Twitter volume were retrieved using ‘‘Bitcoin’’ (case insensitive) as the keyword from bitinfocharts.com. Google search volume was retrieved using library Gtrends. 2000 tweets per day using 4 times interval were crawled by employing Twitter API with the keyword “Bitcoin. The daily closing prices of Bitcoin, oil price, gold price, and U.S stock market indexes (S&P 500, NASDAQ, and Dow Jones Industrial Average) were collected using R libraries either Quantmod or Quandl.

  11. Binance Futures Full History

    • kaggle.com
    zip
    Updated Dec 26, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nicolae (2020). Binance Futures Full History [Dataset]. https://www.kaggle.com/nicolaes/binance-futures
    Explore at:
    zip(604497840 bytes)Available download formats
    Dataset updated
    Dec 26, 2020
    Authors
    Nicolae
    Description

    Introduction

    This is a collection of all 1 minute candlesticks of all cryptocurrency pairs on Binance.com. All 80 of them are included. Both retrieval and uploading the data is fully automated—see this GitHub repo.

    Content

    For every trading pair, the following fields from Binance's official API endpoint for historical candlestick data are saved into a Parquet file:

     #  Column            Dtype     
    --- ------            -----     
     0  open_time           datetime64[ns]
     1  open             float32    
     2  high             float32    
     3  low              float32    
     4  close             float32    
     5  volume            float32    
     6  quote_asset_volume      float32    
     7  number_of_trades       uint16    
     8  taker_buy_base_asset_volume  float32    
     9  taker_buy_quote_asset_volume float32    
    dtypes: datetime64[ns](1), float32(8), uint16(1)
    

    The dataframe is indexed by open_time and sorted from oldest to newest. The first row starts at the first timestamp available on the exchange, which is July 2017 for the longest running pairs.

    Here are two simple plots based on a single file; one of the opening price with an added indicator (MA50) and one of the volume and number of trades:

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fb8664e6f26dc84e9a40d5a3d915c9640%2Fdownload.png?generation=1582053879538546&alt=media" alt=""> https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fcd04ed586b08c1576a7b67d163ad9889%2Fdownload-1.png?generation=1582053899082078&alt=media" alt="">

    Inspiration

    One obvious use-case for this data could be technical analysis by adding indicators such as moving averages, MACD, RSI, etc. Other approaches could include backtesting trading algorithms or computing arbitrage potential with other exchanges.

    License

    This data is being collected automatically from crypto exchange Binance.

  12. Cryptocurrencies Price

    • kaggle.com
    zip
    Updated Jan 2, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    amrrs (2018). Cryptocurrencies Price [Dataset]. https://www.kaggle.com/nulldata/cryptocurrencies-price
    Explore at:
    zip(77341 bytes)Available download formats
    Dataset updated
    Jan 2, 2018
    Authors
    amrrs
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Context

    Cryptocurrencies have become more than just a computational challenge with the recent Bitcoin Future listing on NASDAQ, hence it becomes an interesting spot for analysts to get their hands dirty. This data even though is minimal, help analysts get started in the world of cryptocurrenices analysis.

    Content

    Column Information:

    • id
    • name
    • symbol
    • rank
    • price_usd
    • price_btc
    • 24h_volume_usd
    • market_cap_usd
    • available_supply
    • total_supply
    • max_supply
    • percent_change_1h
    • percent_change_24h
    • percent_change_7d
    • last_updated

    Acknowledgements

    This data is an extract from the R-package coinmarketcapr which is an R binding of the coinmarketcap api. Courtesy: coinmarketcap.com

  13. Ethereum ETH, 7 Exchanges, 1m Full Historical Data

    • kaggle.com
    Updated Jul 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Imran Bukhari (2025). Ethereum ETH, 7 Exchanges, 1m Full Historical Data [Dataset]. https://www.kaggle.com/datasets/imranbukhari/comprehensive-ethusd-1m-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 7, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Imran Bukhari
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    I am a new developer and I would greatly appreciate your support. If you find this dataset helpful, please consider giving it an upvote!

    Key Features:

    Complete 1m Data: Raw 1m historical data from multiple exchanges, covering the entire trading history of ETHUSD available through their API endpoints. This dataset is updated daily to ensure up-to-date coverage.

    Combined Index Dataset: A unique feature of this dataset is the combined index, which is derived by averaging all other datasets into one, please see attached notebook. This creates the longest continuous, unbroken ETHUSD dataset available on Kaggle, with no gaps and no erroneous values. It gives a much more comprehensive view of the market i.e. total volume across multiple exchanges.

    Superior Performance: The combined index dataset has demonstrated superior 'mean average error' (MAE) metric performance when training machine learning models, compared to single-source datasets by a whole order of MAE magnitude.

    Unbroken History: The combined dataset's continuous history is a valuable asset for researchers and traders who require accurate and uninterrupted time series data for modeling or back-testing.

    https://i.imgur.com/5ti89wM.png" alt="ETHUSD Dataset Summary">

    https://i.imgur.com/DnpNF9R.png" alt="Combined Dataset Close Plot"> This plot illustrates the continuity of the dataset over time, with no gaps in data, making it ideal for time series analysis.

    Included Resources:

    Two Notebooks:

    Dataset Usage and Diagnostics: This notebook demonstrates how to use the dataset and includes a powerful data diagnostics function, which is useful for all time series analyses.

    Aggregating Multiple Data Sources: This notebook walks you through the process of combining multiple exchange datasets into a single, clean dataset. (Currently unavailable, will be added shortly)

  14. Ethereum ETH, 7 Exchanges, 1d Full Historical Data

    • kaggle.com
    Updated Aug 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Imran Bukhari (2025). Ethereum ETH, 7 Exchanges, 1d Full Historical Data [Dataset]. https://www.kaggle.com/datasets/imranbukhari/comprehensive-ethusd-1d-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 8, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Imran Bukhari
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    I am a new developer and I would greatly appreciate your support. If you find this dataset helpful, please consider giving it an upvote!

    Key Features:

    Complete 1d Data: Raw 1d historical data from multiple exchanges, covering the entire trading history of ETHUSD available through their API endpoints. This dataset is updated daily to ensure up-to-date coverage.

    Combined Index Dataset: A unique feature of this dataset is the combined index, which is derived by averaging all other datasets into one, please see attached notebook. This creates the longest continuous, unbroken ETHUSD dataset available on Kaggle, with no gaps and no erroneous values. It gives a much more comprehensive view of the market i.e. total volume across multiple exchanges.

    Superior Performance: The combined index dataset has demonstrated superior 'mean average error' (MAE) metric performance when training machine learning models, compared to single-source datasets by a whole order of MAE magnitude.

    Unbroken History: The combined dataset's continuous history is a valuable asset for researchers and traders who require accurate and uninterrupted time series data for modeling or back-testing.

    https://i.imgur.com/l1JzL0Z.png" alt="ETHUSD Dataset Summary">

    https://i.imgur.com/GgREheF.png" alt="Combined Dataset Close Plot"> This plot illustrates the continuity of the dataset over time, with no gaps in data, making it ideal for time series analysis.

    Included Resources:

    Two Notebooks:

    Dataset Usage and Diagnostics: This notebook demonstrates how to use the dataset and includes a powerful data diagnostics function, which is useful for all time series analyses.

    Aggregating Multiple Data Sources: This notebook walks you through the process of combining multiple exchange datasets into a single, clean dataset. (Currently unavailable, will be added shortly)

  15. NFT Dataset of 1000 NFTS

    • kaggle.com
    Updated Apr 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jatin Sinha (2025). NFT Dataset of 1000 NFTS [Dataset]. https://www.kaggle.com/datasets/jatinsinha/nft-dataset-of-1000-nfts/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 9, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Jatin Sinha
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    About the Dataset

    This dataset was generated using the BitsCrunch API to provide insights and analytics for 1000 NFT collections. It focuses on metrics related to collection performance, trading activity, and market dynamics, enabling comprehensive analysis of the NFT ecosystem. Dataset Generation Process:

    Collection Retrieval:
      The Collection by Chain API was used to retrieve a dataset of 1000 NFT collections, including their contract addresses and names.
    
    Profile Data Enrichment:
      The Collection Profile API was utilized to fetch detailed metrics for each collection, including performance scores, trading activity, liquidity, and sentiment.
    
    Data Cleaning and Structuring:
      The retrieved data was cleaned to ensure consistency and completeness.
      The data was structured into an Excel file format, where each row represents an NFT collection and each column provides a specific attribute or metric.
    

    Key Features in the Dataset:

    Collection Details:
      contract_address: The unique contract address of the NFT collection.
      collection_name: The name of each NFT collection.
      blockchain: The blockchain platform where the collection exists (e.g., Ethereum).
      chain_id: The identifier of the blockchain network.
    
    Performance Metrics:
      collection_score: A score representing the overall performance of the collection.
      token_distribution_score: A score indicating the distribution of tokens within the collection.
      metadata_score: A score evaluating the quality of the metadata for the collection.
      holder_metrics_score: A score based on the holding patterns of users.
    
    Trading Metrics:
      profitable_trades: The total number of profitable trades.
      loss_making_trades: The total number of loss-making trades.
      profitable_trades_percentage: The percentage of trades that were profitable.
      loss_making_trades_percentage: The percentage of trades that resulted in a loss.
      zero_profit_trades: The number of trades with zero profit.
    
    Volume Metrics:
      profitable_volume: The total volume of profitable trades.
      loss_making_volume: The total volume of loss-making trades.
    
    Market Dynamics:
      fear_and_greed_index: An index representing market sentiment for the collection.
      washtrade_index: An index indicating the presence of wash trading activity.
      market_dominance_score: A score representing the market dominance of the collection.
    
    Investor Behavior:
      diamond_hands: The metric reflecting long-term holders in the collection.
      liquidity_score: A score representing the liquidity of the collection.
    

    This dataset provides a robust foundation for data-driven insights, enabling visualization, statistical analysis, and machine learning applications related to NFT collections. It is particularly valuable for understanding market trends, evaluating collection performance, and detecting manipulative trading activities in the NFT space.

  16. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
cryptodata.center (2024). Integrated Cryptocurrency Historical Data for a Predictive Data-Driven Decision-Making Algorithm - Dataset - CryptoData Hub [Dataset]. https://cryptodata.center/dataset/integrated-cryptocurrency-historical-data-for-a-predictive-data-driven-decision-making-algorithm
Organization logo

Integrated Cryptocurrency Historical Data for a Predictive Data-Driven Decision-Making Algorithm - Dataset - CryptoData Hub

Explore at:
Dataset updated
Dec 4, 2024
Dataset provided by
CryptoDATA
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Cryptocurrency historical datasets from January 2012 (if available) to October 2021 were obtained and integrated from various sources and Application Programming Interfaces (APIs) including Yahoo Finance, Cryptodownload, CoinMarketCap, various Kaggle datasets, and multiple APIs. While these datasets used various formats of time (e.g., minutes, hours, days), in order to integrate the datasets days format was used for in this research study. The integrated cryptocurrency historical datasets for 80 cryptocurrencies including but not limited to Bitcoin (BTC), Ethereum (ETH), Binance Coin (BNB), Cardano (ADA), Tether (USDT), Ripple (XRP), Solana (SOL), Polkadot (DOT), USD Coin (USDC), Dogecoin (DOGE), Tron (TRX), Bitcoin Cash (BCH), Litecoin (LTC), EOS (EOS), Cosmos (ATOM), Stellar (XLM), Wrapped Bitcoin (WBTC), Uniswap (UNI), Terra (LUNA), SHIBA INU (SHIB), and 60 more cryptocurrencies were uploaded in this online Mendeley data repository. Although the primary attribute of including the mentioned cryptocurrencies was the Market Capitalization, a subject matter expert i.e., a professional trader has also guided the initial selection of the cryptocurrencies by analyzing various indicators such as Relative Strength Index (RSI), Moving Average Convergence/Divergence (MACD), MYC Signals, Bollinger Bands, Fibonacci Retracement, Stochastic Oscillator and Ichimoku Cloud. The primary features of this dataset that were used as the decision-making criteria of the CLUS-MCDA II approach are Timestamps, Open, High, Low, Closed, Volume (Currency), % Change (7 days and 24 hours), Market Cap and Weighted Price values. The available excel and CSV files in this data set are just part of the integrated data and other databases, datasets and API References that was used in this study are as follows: [1] https://finance.yahoo.com/ [2] https://coinmarketcap.com/historical/ [3] https://cryptodatadownload.com/ [4] https://kaggle.com/philmohun/cryptocurrency-financial-data [5] https://kaggle.com/deepshah16/meme-cryptocurrency-historical-data [6] https://kaggle.com/sudalairajkumar/cryptocurrencypricehistory [7] https://min-api.cryptocompare.com/data/price?fsym=BTC&tsyms=USD [8] https://min-api.cryptocompare.com/ [9] https://p.nomics.com/cryptocurrency-bitcoin-api [10] https://www.coinapi.io/ [11] https://www.coingecko.com/en/api [12] https://cryptowat.ch/ [13] https://www.alphavantage.co/ This dataset is part of the CLUS-MCDA (Cluster analysis for improving Multiple Criteria Decision Analysis) and CLUS-MCDAII Project: https://aimaghsoodi.github.io/CLUSMCDA-R-Package/ https://github.com/Aimaghsoodi/CLUS-MCDA-II https://github.com/azadkavian/CLUS-MCDA

Search
Clear search
Close search
Google apps
Main menu