Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Cryptocurrency historical datasets from January 2012 (if available) to October 2021 were obtained and integrated from various sources and Application Programming Interfaces (APIs) including Yahoo Finance, Cryptodownload, CoinMarketCap, various Kaggle datasets, and multiple APIs. While these datasets used various formats of time (e.g., minutes, hours, days), in order to integrate the datasets days format was used for in this research study. The integrated cryptocurrency historical datasets for 80 cryptocurrencies including but not limited to Bitcoin (BTC), Ethereum (ETH), Binance Coin (BNB), Cardano (ADA), Tether (USDT), Ripple (XRP), Solana (SOL), Polkadot (DOT), USD Coin (USDC), Dogecoin (DOGE), Tron (TRX), Bitcoin Cash (BCH), Litecoin (LTC), EOS (EOS), Cosmos (ATOM), Stellar (XLM), Wrapped Bitcoin (WBTC), Uniswap (UNI), Terra (LUNA), SHIBA INU (SHIB), and 60 more cryptocurrencies were uploaded in this online Mendeley data repository. Although the primary attribute of including the mentioned cryptocurrencies was the Market Capitalization, a subject matter expert i.e., a professional trader has also guided the initial selection of the cryptocurrencies by analyzing various indicators such as Relative Strength Index (RSI), Moving Average Convergence/Divergence (MACD), MYC Signals, Bollinger Bands, Fibonacci Retracement, Stochastic Oscillator and Ichimoku Cloud. The primary features of this dataset that were used as the decision-making criteria of the CLUS-MCDA II approach are Timestamps, Open, High, Low, Closed, Volume (Currency), % Change (7 days and 24 hours), Market Cap and Weighted Price values. The available excel and CSV files in this data set are just part of the integrated data and other databases, datasets and API References that was used in this study are as follows: [1] https://finance.yahoo.com/ [2] https://coinmarketcap.com/historical/ [3] https://cryptodatadownload.com/ [4] https://kaggle.com/philmohun/cryptocurrency-financial-data [5] https://kaggle.com/deepshah16/meme-cryptocurrency-historical-data [6] https://kaggle.com/sudalairajkumar/cryptocurrencypricehistory [7] https://min-api.cryptocompare.com/data/price?fsym=BTC&tsyms=USD [8] https://min-api.cryptocompare.com/ [9] https://p.nomics.com/cryptocurrency-bitcoin-api [10] https://www.coinapi.io/ [11] https://www.coingecko.com/en/api [12] https://cryptowat.ch/ [13] https://www.alphavantage.co/ This dataset is part of the CLUS-MCDA (Cluster analysis for improving Multiple Criteria Decision Analysis) and CLUS-MCDAII Project: https://aimaghsoodi.github.io/CLUSMCDA-R-Package/ https://github.com/Aimaghsoodi/CLUS-MCDA-II https://github.com/azadkavian/CLUS-MCDA
CoinAPI's Level 1 Crypto Quote Data delivers essential digital asset market intelligence, capturing real-time bid/ask prices and volumes across 350+ exchanges including both CEX and DEX platforms.
This comprehensive data stream provides precise market snapshots with microsecond-accurate timestamps, perfect for applications demanding rapid price discovery and effective market monitoring.
Designed for minimal latency and maximum update frequency, our feed powers everything from sophisticated trading algorithms and responsive price widgets to in-depth market analysis tools.
You can access data through FIX or WebSocket for instant streaming or REST API for historical analysis and backtesting.
Why work with us?
Market Coverage & Data Types: - Real-time and historical data since 2010 (for chosen assets) - Full order book depth (L2/L3) - Tick-by-tick data - OHLCV across multiple timeframes - Market indexes (VWAP, PRIMKT) - Exchange rates with fiat pairs - Spot, futures, options, and perpetual contracts - Coverage of 90%+ global trading volume - Full Cryptocurrency Investor Data
Technical Excellence: - 99,9% uptime guarantee - Multiple delivery methods: REST, WebSocket, FIX, S3 - Standardized data format across exchanges - Ultra-low latency data streaming - Detailed documentation - Custom integration assistance
CoinAPI is trusted by financial institutions, trading firms, hedge funds, researchers, and technology developers worldwide. We provide reliable cryptocurrency market data through our commitment to quality and technical performance.
https://www.futuremarketinsights.com/privacy-policyhttps://www.futuremarketinsights.com/privacy-policy
The market is projected to reach USD 1,074 Million in 2025 and is expected to grow to USD 7,975.7 Million by 2035, registering a CAGR of 22.2% over the forecast period. The expansion of Web3 infrastructure, advancements in multi-chain API solutions, and increasing demand for secure and scalable blockchain integrations are fueling market expansion. Additionally, rising adoption of tokenization, cross-chain interoperability, and API-driven NFT marketplaces is shaping the industry's future.
Metric | Value |
---|---|
Market Size (2025E) | USD 1,074 Million |
Market Value (2035F) | USD 7,975.7 Million |
CAGR (2025 to 2035) | 22.2% |
Country-wise Insights
Country | CAGR (2025 to 2035) |
---|---|
USA | 22.5% |
Country | CAGR (2025 to 2035) |
---|---|
UK | 21.8% |
Region | CAGR (2025 to 2035) |
---|---|
European Union (EU) | 22.2% |
Country | CAGR (2025 to 2035) |
---|---|
Japan | 22.4% |
Country | CAGR (2025 to 2035) |
---|---|
South Korea | 22.7% |
Competitive Outlook
Company Name | Estimated Market Share (%) |
---|---|
Coinbase Cloud | 18-22% |
Binance API | 12-16% |
Chainalysis | 10-14% |
Alchemy | 8-12% |
CryptoAPIs | 6-10% |
Other Companies (combined) | 30-40% |
This dataset provides comprehensive access to financial market data from Google Finance in real-time. Get detailed information on stocks, market quotes, trends, ETFs, international exchanges, forex, crypto, and related news. Perfect for financial applications, trading platforms, and market analysis tools. The dataset is delivered in a JSON format via REST API.
Cryptocurrencies
Finage offers you more than 1700+ cryptocurrency data in real time.
With Finage, you can react to the cryptocurrency data in Real-Time via WebSocket or unlimited API calls. Also, we offer you a 7-year historical data API.
You can view the full Cryptocurrency market coverage with the link given below. https://finage.s3.eu-west-2.amazonaws.com/Finage_Crypto_Coverage.pdf
When you need to analyze crypto market history, batch processing often beats streaming APIs. That's why we built the Flat Files S3 API - giving analysts and researchers direct access to structured historical cryptocurrency data without the integration complexity of traditional APIs.
Pull comprehensive historical data across 800+ cryptocurrencies and their trading pairs, delivered in clean, ready-to-use CSV formats that drop straight into your analysis tools. Whether you're building backtest environments, training machine learning models, or running complex market studies, our flat file approach gives you the flexibility to work with massive datasets efficiently.
Why work with us?
Market Coverage & Data Types: - Comprehensive historical data since 2010 (for chosen assets) - Comprehensive order book snapshots and updates - Trade-by-trade data
Technical Excellence: - 99,9% uptime guarantee - Standardized data format across exchanges - Flexible Integration - Detailed documentation - Scalable Architecture
CoinAPI serves hundreds of institutions worldwide, from trading firms and hedge funds to research organizations and technology providers. Our S3 delivery method easily integrates with your existing workflows, offering familiar access patterns, reliable downloads, and straightforward automation for your data team. Our commitment to data quality and technical excellence, combined with accessible delivery options, makes us the trusted choice for institutions that demand both comprehensive historical data and real-time market intelligence
CoinAPI delivers institutional-grade Historical Crypto Data for backtesting and analysis. Our cryptocurrency archive powers research across Bitcoin, Ethereum and all markets through one reliable API—transforming strategies with precision data that matters.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Bitcoin is the longest running and most well known cryptocurrency, first released as open source in 2009 by the anonymous Satoshi Nakamoto. Bitcoin serves as a decentralized medium of digital exchange, with transactions verified and recorded in a public distributed ledger (the blockchain) without the need for a trusted record keeping authority or central intermediary. Transaction blocks contain a SHA-256 cryptographic hash of previous transaction blocks, and are thus "chained" together, serving as an immutable record of all transactions that have ever occurred. As with any currency/commodity on the market, bitcoin trading and financial instruments soon followed public adoption of bitcoin and continue to grow. Included here is historical bitcoin market data for select bitcoin exchanges where trading takes place. Happy (data) mining!
CSV files for select bitcoin exchanges for the time period of September 2011 to June 2024, with updates of OHLC (Open, High, Low, Close), Volume in BTC and indicated currency, and weighted bitcoin price. Timestamps are in Unix time. Timestamps without any trades or activity have their data fields filled with NaNs. If a timestamp is missing, or if there are jumps, this may be because the exchange (or its API) was down, the exchange (or its API) did not exist, or some other unforeseen technical error in data reporting or gathering. All effort has been made to deduplicate entries and verify the contents are correct and complete to the best of my ability, but obviously trust at your own risk.
Bitcoin charts for the data. The various exchange APIs, for making it difficult or unintuitive enough to get OHLC and volume data that I set out on this data scraping project. Satoshi Nakamoto and the novel core concept of the blockchain, as well as its first execution via the bitcoin protocol. I'd also like to thank viewers like you! Can't wait to see what code or insights you all have to share.
Cryptocurrency options markets have grown increasingly sophisticated, requiring reliable data infrastructure to support trading and analysis. Our platform gives you direct access to comprehensive crypto options data through straightforward API connections.
We capture the complete options chain across major crypto derivatives exchanges, delivering real-time and historical cryptocurrency market data that shows exactly what's happening in these complex markets. Each options contract is tracked with precision - strikes, expiration dates, premiums, open interest, and volume metrics all accessible through our standardized data feeds.
The data is available through multiple integration methods depending on your needs. Use our REST API for flexible queries and historical analysis, WebSocket for real-time market monitoring, or FIX protocol for institutional-grade connectivity with minimal latency.
Why work with us?
Market Coverage & Data Types: - Real-time and historical data since 2010 (for chosen assets) - Full order book depth (L2/L3) - Tick-by-tick data - OHLCV across multiple timeframes - Market indexes (VWAP, PRIMKT) - Exchange rates with fiat pairs - Spot, futures, options, and perpetual contracts - Coverage of 90%+ global trading volume
Technical Excellence: - 99% uptime guarantee - Multiple delivery methods: REST, WebSocket, FIX, S3 - Standardized data format across exchanges - Ultra-low latency data streaming - Detailed documentation - Custom integration assistance
When options traders need reliable market intelligence, they don't leave it to chance. That's why trading desks across five continents, quantitative hedge funds managing billions, and fintech innovators building tomorrow's trading platforms all rely on our data infrastructure. We've established ourselves as a dependable source in a market where accuracy isn't just preferred - it's essential. While others promise comprehensive coverage, we deliver it consistently, trade after trade, day after day.
This dataset is an extra updating dataset for the G-Research Crypto Forecasting competition.
This is a daily updated dataset, automaticlly collecting market data for G-Research crypto forecasting competition. The data is of the 1-minute resolution, collected for all competition assets and both retrieval and uploading are fully automated. see discussion topic.
For every asset in the competition, the following fields from Binance's official API endpoint for historical candlestick data are collected, saved, and processed.
1. **timestamp** - A timestamp for the minute covered by the row.
2. **Asset_ID** - An ID code for the cryptoasset.
3. **Count** - The number of trades that took place this minute.
4. **Open** - The USD price at the beginning of the minute.
5. **High** - The highest USD price during the minute.
6. **Low** - The lowest USD price during the minute.
7. **Close** - The USD price at the end of the minute.
8. **Volume** - The number of cryptoasset u units traded during the minute.
9. **VWAP** - The volume-weighted average price for the minute.
10. **Target** - 15 minute residualized returns. See the 'Prediction and Evaluation section of this notebook for details of how the target is calculated.
11. **Weight** - Weight, defined by the competition hosts [here](https://www.kaggle.com/cstein06/tutorial-to-the-g-research-crypto-competition)
12. **Asset_Name** - Human readable Asset name.
The dataframe is indexed by timestamp
and sorted from oldest to newest.
The first row starts at the first timestamp available on the exchange, which is July 2017 for the longest-running pairs.
The following is a collection of simple starter notebooks for Kaggle's Crypto Comp showing PurgedTimeSeries in use with the collected dataset. Purged TimesSeries is explained here. There are many configuration variables below to allow you to experiment. Use either GPU or TPU. You can control which years are loaded, which neural networks are used, and whether to use feature engineering. You can experiment with different data preprocessing, model architecture, loss, optimizers, and learning rate schedules. The extra datasets contain the full history of the assets in the same format as the competition, so you can input that into your model too.
These notebooks follow the ideas presented in my "Initial Thoughts" here. Some code sections have been reused from Chris' great (great) notebook series on SIIM ISIC melanoma detection competition here
This is a work in progress and will be updated constantly throughout the competition. At the moment, there are some known issues that still needed to be addressed:
Opening price with an added indicator (MA50):
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fb8664e6f26dc84e9a40d5a3d915c9640%2Fdownload.png?generation=1582053879538546&alt=media" alt="">
Volume and number of trades:
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fcd04ed586b08c1576a7b67d163ad9889%2Fdownload-1.png?generation=1582053899082078&alt=media" alt="">
This data is being collected automatically from the crypto exchange Binance.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Study results and scripts to obtain the results for our paper "Python Crypto Misuses in the Wild" [@akwick @gh0st42 @Breitfelder @miramezini]The archives in this folder contains the following:- evaluations.tar.gz contains the evaluation folder from the GitHub project linked in References. - tools.tar.gz contains the tools folder from the GitHub project linked in References.- repos-py-with-dep-only-src-files.zip contains the source files and their dependencies of the Python projects analyzed.- repos-micropy-with-dep-only-src-files.zip contains the sources files and their depedencies of the MicroPython projects analyzed.
Access real-time and historical crypto futures data from global exchanges through CoinAPI. Our comprehensive derivatives coverage includes futures contracts and options from major trading platforms worldwide. Connect easily via REST API, WebSocket, or FIX protocols.
Get complete crypto market data from CoinAPI, featuring comprehensive price history, detailed trading volumes, and powerful analytics tools. Our service combines real-time feeds with extensive historical crypto data, giving you a complete view of market movements over time.
CoinAPI's Crypto Market Data delivers quotes, orderbooks, trades and OHLCV data through flexible delivery methods including API, WebSocket, and convenient S3 exports. This institutional-grade data is clean, normalized, and ready for immediate use in your trading platforms or analysis tools.
Whether you're tracking market trends, analyzing volume patterns, developing trading strategies, or conducting risk assessments, our reliable data foundation gives you the insights you need to make informed decisions in the crypto market.
Why work with us?
Market Coverage & Data Types: - Real-time and historical data since 2010 (for chosen assets) - Full order book depth (L2/L3) - Trading Data - Tick-by-tick data - OHLCV across multiple timeframes - Market indexes (VWAP, PRIMKT) - Exchange rates with fiat pairs - Spot, futures, options, and perpetual contracts - Coverage of 90%+ global trading volume
Technical Excellence: - 99,9% uptime guarantee - Multiple delivery methods: REST, WebSocket, FIX, S3 - Standardized data format across exchanges - Ultra-low latency data streaming - Detailed documentation - Custom integration assistance
CoinAPI works with hundreds of companies around the world - from trading desks and hedge funds to research teams and tech developers. We've built our reputation on reliable data and solid technical performance. That's why so many businesses turn to us when they need dependable crypto market information.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Through Telegram API, the authors collected this database over four months ago. These data are Telegram's comments of over eight professional Telegram channels about cryptocurrencies from December 2023 to March 2024. The theory of Behavioral economics shows that the opinions of people, especially experts, can impact the stock market trend (here, cryptocurrencies). Existing databases often cover tweets or Telegram's comments on one or more cryptocurrencies. Also, in these databases, no attention is paid to the user's expertise, and most of the data is extracted using hashtags. Failure to pay attention to the user's expertise causes the irrelevant volume to increase and the neutral polarity considerably. This database has a main table with eight columns. The columns of the main table are explained in the attached document. Researchers can use this dataset in various machine learning tasks, such as sentiment analysis and deep transfer learning with sentiment analysis. Also, this data can be used to check the impact of influencers' opinions on the cryptocurrency market trend. The use of this database is allowed by mentioning the source. Furthermore, we have added Python code to extract Telegram's comments. We used the RoBERTa pre-trained deep neural network and BiGRU deep neural network with an attention layer-based HDRB model(https://ieeexplore.ieee.org/document/10292644) for sentiment analysis.
This dataset is an extra updating dataset for the G-Research Crypto Forecasting competition.
This is a daily updated dataset, automaticlly collecting market data for G-Research crypto forecasting competition. The data is of the 1-minute resolution, collected for all competition assets and both retrieval and uploading are fully automated. see discussion topic.
For every asset in the competition, the following fields from Binance's official API endpoint for historical candlestick data are collected, saved, and processed.
1. **timestamp** - A timestamp for the minute covered by the row.
2. **Asset_ID** - An ID code for the cryptoasset.
3. **Count** - The number of trades that took place this minute.
4. **Open** - The USD price at the beginning of the minute.
5. **High** - The highest USD price during the minute.
6. **Low** - The lowest USD price during the minute.
7. **Close** - The USD price at the end of the minute.
8. **Volume** - The number of cryptoasset u units traded during the minute.
9. **VWAP** - The volume-weighted average price for the minute.
10. **Target** - 15 minute residualized returns. See the 'Prediction and Evaluation section of this notebook for details of how the target is calculated.
11. **Weight** - Weight, defined by the competition hosts [here](https://www.kaggle.com/cstein06/tutorial-to-the-g-research-crypto-competition)
12. **Asset_Name** - Human readable Asset name.
The dataframe is indexed by timestamp
and sorted from oldest to newest.
The first row starts at the first timestamp available on the exchange, which is July 2017 for the longest-running pairs.
The following is a collection of simple starter notebooks for Kaggle's Crypto Comp showing PurgedTimeSeries in use with the collected dataset. Purged TimesSeries is explained here. There are many configuration variables below to allow you to experiment. Use either GPU or TPU. You can control which years are loaded, which neural networks are used, and whether to use feature engineering. You can experiment with different data preprocessing, model architecture, loss, optimizers, and learning rate schedules. The extra datasets contain the full history of the assets in the same format as the competition, so you can input that into your model too.
These notebooks follow the ideas presented in my "Initial Thoughts" here. Some code sections have been reused from Chris' great (great) notebook series on SIIM ISIC melanoma detection competition here
This is a work in progress and will be updated constantly throughout the competition. At the moment, there are some known issues that still needed to be addressed:
Opening price with an added indicator (MA50):
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fb8664e6f26dc84e9a40d5a3d915c9640%2Fdownload.png?generation=1582053879538546&alt=media" alt="">
Volume and number of trades:
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fcd04ed586b08c1576a7b67d163ad9889%2Fdownload-1.png?generation=1582053899082078&alt=media" alt="">
This data is being collected automatically from the crypto exchange Binance.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Authors, through Twitter API, collected this database over eight months. These data are tweets of over 50 experts regarding market analysis of 40 cryptocurrencies. These experts are known as influencers on social networks such as Twitter. The theory of Behavioral economics shows that the opinions of people, especially experts, can impact the stock market trend (here, cryptocurrencies). Existing databases often cover tweets related to one or more cryptocurrencies. Also, in these databases, no attention is paid to the user's expertise, and most of the data is extracted using hashtags. Failure to pay attention to the user's expertise causes the irrelevant volume to increase and the neutral polarity to increase considerably. This database has a main table named "Tweets1" with 11 columns and 40 tables to separate comments related to each cryptocurrency. The columns of the main table and the cryptocurrency tables are explained in the attached document. Researchers can use this dataset in various machine learning tasks, such as sentiment analysis and deep transfer learning with sentiment analysis. Also, this data can be used to check the impact of influencers' opinions on the cryptocurrency market trend. The use of this database is allowed by mentioning the source. Also, in this version, we have added the excel version of the database and Python code to extract the names of influencers and tweets. in Version(3): In the new version, three datasets related to historical prices and sentiments related to Bitcoin, Ethereum, and Binance have been added as Excel files from January 1, 2023, to June 12, 2023. Also, two datasets of 52 influential tweets in cryptocurrencies have been published, along with the score and polarity of sentiments regarding more than 300 cryptocurrencies from February 2021 to June 2023. Also, two Python codes related to the sentiment analysis algorithm of tweets with Python have been published. This algorithm combines RoBERTa pre-trained deep neural network and BiGRU deep neural network with an attention layer (see code Preprocessing_and_sentiment_analysis with python).
DOGE started it. SHIB took it mainstream. BONK and PEPE brought in the crowds. Now what?
Stay on top of the entire meme coin ecosystem through CoinAPI's comprehensive data feeds. We've connected to 350+ exchanges so you don't have to, bringing together every significant market into one unified API that actually works when you need it. Dig into historical patterns that shaped today's meme coin landscape. Compare volume spikes across different tokens during viral moments. Track institutional entry points that transformed joke coins into serious market movers.
From quick price checks to in-depth research projects, our institutional-grade precision helps you navigate this volatile but opportunity-rich corner of the crypto market. With Digital Asset Data complete market coverage, you'll never miss a beat. Serious data for not-so-serious coins. That's the CoinAPI difference
➡️ Why choose us?
📊 Market Coverage & Data Types: ◦ Real-time and historical data since 2010 (for chosen assets) ◦ Full order book depth (L2/L3) ◦ Trade-by-trade data ◦ OHLCV across multiple timeframes ◦ Market indexes (VWAP, PRIMKT) ◦ Exchange rates with fiat pairs ◦ Spot, futures, options, and perpetual contracts ◦ Coverage of 90%+ global trading volume ◦ Full Crypto Trade Data
🔧 Technical Excellence: ◦ 99,9% uptime guarantee ◦ Multiple delivery methods: REST, WebSocket, FIX, S3 ◦ Standardized data format across exchanges ◦ Ultra-low latency data streaming ◦ Detailed documentation ◦ Custom integration assistance
CoinAPI represents the gold standard in cryptocurrency data, trusted by leading financial institutions, technology providers, and market makers worldwide. By combining technology with rigorous data validation protocols, we provide the foundation upon which many financial products are being built.
This dataset is an extra updating dataset for the G-Research Crypto Forecasting competition.
This is a daily updated dataset, automaticlly collecting market data for G-Research crypto forecasting competition. The data is of the 1-minute resolution, collected for all competition assets and both retrieval and uploading are fully automated. see discussion topic.
For every asset in the competition, the following fields from Binance's official API endpoint for historical candlestick data are collected, saved, and processed.
1. **timestamp** - A timestamp for the minute covered by the row.
2. **Asset_ID** - An ID code for the cryptoasset.
3. **Count** - The number of trades that took place this minute.
4. **Open** - The USD price at the beginning of the minute.
5. **High** - The highest USD price during the minute.
6. **Low** - The lowest USD price during the minute.
7. **Close** - The USD price at the end of the minute.
8. **Volume** - The number of cryptoasset u units traded during the minute.
9. **VWAP** - The volume-weighted average price for the minute.
10. **Target** - 15 minute residualized returns. See the 'Prediction and Evaluation section of this notebook for details of how the target is calculated.
11. **Weight** - Weight, defined by the competition hosts [here](https://www.kaggle.com/cstein06/tutorial-to-the-g-research-crypto-competition)
12. **Asset_Name** - Human readable Asset name.
The dataframe is indexed by timestamp
and sorted from oldest to newest.
The first row starts at the first timestamp available on the exchange, which is July 2017 for the longest-running pairs.
The following is a collection of simple starter notebooks for Kaggle's Crypto Comp showing PurgedTimeSeries in use with the collected dataset. Purged TimesSeries is explained here. There are many configuration variables below to allow you to experiment. Use either GPU or TPU. You can control which years are loaded, which neural networks are used, and whether to use feature engineering. You can experiment with different data preprocessing, model architecture, loss, optimizers, and learning rate schedules. The extra datasets contain the full history of the assets in the same format as the competition, so you can input that into your model too.
These notebooks follow the ideas presented in my "Initial Thoughts" here. Some code sections have been reused from Chris' great (great) notebook series on SIIM ISIC melanoma detection competition here
This is a work in progress and will be updated constantly throughout the competition. At the moment, there are some known issues that still needed to be addressed:
Opening price with an added indicator (MA50):
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fb8664e6f26dc84e9a40d5a3d915c9640%2Fdownload.png?generation=1582053879538546&alt=media" alt="">
Volume and number of trades:
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F2234678%2Fcd04ed586b08c1576a7b67d163ad9889%2Fdownload-1.png?generation=1582053899082078&alt=media" alt="">
This data is being collected automatically from the crypto exchange Binance.
CoinAPI's Flat Files S3 API delivers historical crypto market data through downloadable CSV files. Access trades, quotes, and order book information in a user-friendly format. Our readable data files provide everything universities and analytics companies need.
CoinAPI delivers ultra-low latency cryptocurrency market data built for professional traders who demand absolute precision. Our tick-by-tick updates capture every market movement in real-time, providing the critical insights needed for split-second decisions in volatile markets.
Our WebSocket implementation streams live data directly to your trading systems with minimal delay, giving you the edge when identifying emerging patterns and opportunities. This immediate visibility helps optimize your trading strategies and manage risk more effectively in rapidly changing conditions.
We've engineered our infrastructure specifically for reliability under pressure. When markets surge and data volumes spike, our systems maintain consistent performance and delivery - ensuring your critical operations continue without interruption. For high-frequency trading and institutional investors who can't afford to wait, CoinAPI provides real-time cryptocurrency intelligence that drives successful decision-making
Why work with us?
Market Coverage & Data Types: - Real-time and historical data since 2010 (for chosen assets) - Full order book depth (L2/L3) - Tick-by-tick data - OHLCV across multiple timeframes - Market indexes (VWAP, PRIMKT) - Exchange rates with fiat pairs - Spot, futures, options, and perpetual contracts - Coverage of 90%+ global trading volume - Full Cryptocurrency Investor Data.
Technical Excellence: - 99,9% uptime guarantee - Multiple delivery methods: REST, WebSocket, FIX, S3 - Standardized data format across exchanges - Ultra-low latency data streaming - Detailed documentation - Custom integration assistance
CoinAPI delivers mission-critical insights to financial institutions globally, enabling informed decision-making in volatile cryptocurrency markets. Our enterprise-grade infrastructure processes milions of data points daily, offering unmatched reliability.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Cryptocurrency historical datasets from January 2012 (if available) to October 2021 were obtained and integrated from various sources and Application Programming Interfaces (APIs) including Yahoo Finance, Cryptodownload, CoinMarketCap, various Kaggle datasets, and multiple APIs. While these datasets used various formats of time (e.g., minutes, hours, days), in order to integrate the datasets days format was used for in this research study. The integrated cryptocurrency historical datasets for 80 cryptocurrencies including but not limited to Bitcoin (BTC), Ethereum (ETH), Binance Coin (BNB), Cardano (ADA), Tether (USDT), Ripple (XRP), Solana (SOL), Polkadot (DOT), USD Coin (USDC), Dogecoin (DOGE), Tron (TRX), Bitcoin Cash (BCH), Litecoin (LTC), EOS (EOS), Cosmos (ATOM), Stellar (XLM), Wrapped Bitcoin (WBTC), Uniswap (UNI), Terra (LUNA), SHIBA INU (SHIB), and 60 more cryptocurrencies were uploaded in this online Mendeley data repository. Although the primary attribute of including the mentioned cryptocurrencies was the Market Capitalization, a subject matter expert i.e., a professional trader has also guided the initial selection of the cryptocurrencies by analyzing various indicators such as Relative Strength Index (RSI), Moving Average Convergence/Divergence (MACD), MYC Signals, Bollinger Bands, Fibonacci Retracement, Stochastic Oscillator and Ichimoku Cloud. The primary features of this dataset that were used as the decision-making criteria of the CLUS-MCDA II approach are Timestamps, Open, High, Low, Closed, Volume (Currency), % Change (7 days and 24 hours), Market Cap and Weighted Price values. The available excel and CSV files in this data set are just part of the integrated data and other databases, datasets and API References that was used in this study are as follows: [1] https://finance.yahoo.com/ [2] https://coinmarketcap.com/historical/ [3] https://cryptodatadownload.com/ [4] https://kaggle.com/philmohun/cryptocurrency-financial-data [5] https://kaggle.com/deepshah16/meme-cryptocurrency-historical-data [6] https://kaggle.com/sudalairajkumar/cryptocurrencypricehistory [7] https://min-api.cryptocompare.com/data/price?fsym=BTC&tsyms=USD [8] https://min-api.cryptocompare.com/ [9] https://p.nomics.com/cryptocurrency-bitcoin-api [10] https://www.coinapi.io/ [11] https://www.coingecko.com/en/api [12] https://cryptowat.ch/ [13] https://www.alphavantage.co/ This dataset is part of the CLUS-MCDA (Cluster analysis for improving Multiple Criteria Decision Analysis) and CLUS-MCDAII Project: https://aimaghsoodi.github.io/CLUSMCDA-R-Package/ https://github.com/Aimaghsoodi/CLUS-MCDA-II https://github.com/azadkavian/CLUS-MCDA