Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains detailed information on cryptocurrency transactions, specifically focusing on Bitcoin (BTC) and Ethereum (ETH). The dataset includes transaction details such as sender and receiver addresses, transaction amounts, fees, timestamps, and mining pool information. It serves as a valuable resource for analyzing market trends, identifying patterns in trading behavior, and studying blockchain transaction dynamics across different mining pools.
TX2QW62Q5XM17K0xd377b9203ad74038664c08f658c0245632651f550x4a3370c0f0b83d519ddf50892d006f64d742588011.39618058 (BTC or ETH depending on the currency type)6.226e-052025-03-30T23:32:40.589676Zf4A4D894b9Ee166B3F75F4FbEthermineETHTransferConfirmed50 (Only applicable to Ethereum transactions)Analyze transaction patterns to identify market trends and behaviors.
- Use the data to track spikes or drops in transaction volumes and correlate them with market events or price movements.
Study how mining pools and transaction fees interact with blockchain dynamics.
- Investigate how different mining pools impact transaction confirmation times and fees across Bitcoin and Ethereum networks.
Investigate the behavior of users sending or receiving cryptocurrency.
- Identify patterns such as frequent senders/receivers, average transaction amounts, and transaction frequency.
Explore transaction fees and gas price fluctuations across different mining pools and blockchains.
- Examine how Ethereum’s gas prices and Bitcoin’s transaction fees fluctuate over time and how this affects user behavior.
Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
This dataset captures YouTube's public discourse surrounding the United States government's seizure of $225 million in cryptocurrency. The topic was selected for data collection as it was trending on YouTube's News tab, indicating significant public interest and media coverage. Collected on June 19, 2025, it includes comprehensive video metadata and user comments from 453 relevant videos, totaling over 115,000 comments. This dataset provides valuable insights into public sentiment, media coverage patterns, and social media reactions to major cryptocurrency-related government actions.
Data was systematically collected using YouTube's public data with the query "United States seizes $225 million in cryptocurrency." The collection process targeted the most relevant and recent content to ensure comprehensive coverage of the topic across different YouTube channels and content creators.
Collection Date: June 19, 2025
Search Query: "United States seizes $225 million in cryptocurrency"
Total Videos Processed: 453
Total Comments Collected: 115,202
The dataset consists of three main components:
This dataset represents a comprehensive snapshot of YouTube discourse on a significant cryptocurrency news event. If you have questions about the data collection methodology, need clarification on any fields, or have suggestions for improvements, please don't hesitate to ask in the discussion forum - I'll be happy to help and provide additional context!
Facebook
TwitterA league table of the 120 cryptocurrencies with the highest market cap reveals how diverse each crypto is and potentially how much risk is involved when investing in one. Bitcoin (BTC), for instance, had a so-called "high cap" - a market cap worth more than 10 billion U.S. dollars - indicating this crypto project has a certain track record or, at the very least, is considered a major player in the cryptocurrency space. This is different in Decentralize Finance (DeFi), where Bitcoin is only a relatively new player. A concentrated market The number of existing cryptocurrencies is several thousands, even if most have a limited significance. Indeed, Bitcoin and Ethereum account for nearly 75 percent of the entire crypto market capitalization. As crypto is relatively easy to create, the range of projects varies significantly - from improving payments to solving real-world issues, but also meme coins and more speculative investments. Crypto is not considered a payment method While often talked about as an investment vehicle, cryptocurrencies have not yet established a clear use case in day-to-day life. Central bankers found that usefulness of crypto in domestic payments or remittances to be negligible. A forecast for the world's main online payment methods took a similar stance: It predicts that cryptocurrency would only take up 0.2 percent of total transaction value by 2027.
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Cryptocurrency trading analysis and algorithmic strategy development rely on high-quality, high-frequency historical data. This dataset provides clean, structured OHLCV data for one of the most liquid and popular trading pairs, ETH/USDT, sourced directly from the Bybit exchange. It is ideal for quantitative analysts, data scientists, and trading enthusiasts looking to backtest strategies, perform market analysis, or build predictive models across different time horizons.
The dataset consists of three separate CSV files, each corresponding to a different time frame:
BYBIT_ETHUSDT_15m.csv: Historical data in 15-minute intervals. BYBIT_ETHUSDT_1h.csv: Historical data in 1-hour intervals. BYBIT_ETHUSDT_4h.csv: Historical data in 4-hour intervals.
Each file contains the same six columns:
This dataset is made possible by the publicly available data from the Bybit exchange. Please consider this when using the data for your projects.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset was obtained by myself from the top crypto Telegram groups using the Telegram API and the Telethon Python module. More details on Telethon can be found here: https://docs.telethon.dev/en/latest/
The dataset is in JSON format. This particular JSON is easy to read and load into a Pandas data frame. Refer to my notebook here: https://www.kaggle.com/aagghh/trends-in-crypto-space-telegram-analysis
The size of the dataset is 2.8GB
This dataset contains records of approximately 3+ million messages in the official top crypto exchanges Telegram groups from various users. Also, it contains data such as dates, number of views, shares, and many more.
| Telegram Group Name | Link | File Name | Number of messages |
|---|---|---|---|
| Binance official group | https://t.me/binanceexchange | group_messages_binance.json | 650k |
| Kucoin official group | https://t.me/Kucoin_Exchange | group_messages_kucoin.json | 860k |
| OKEx official group | https://t.me/OKExOfficial_English | group_messages_okex.json | 1m |
| Huobi official group | https://t.me/huobiglobalofficial | group_messages_huobi.json | 550k |
| Bittrex official group | https://t.me/BittrexGlobalEnglish | group_messages_bittrex.json | 70k |
A potential use-case for this data could be a practice in some sentiment analysis in order to predict the popularity/price of some crypto assets
Also good to try using this dataset along with various time-series price data in an attempt to see if there is a correlation between the price of an asset and how the community sees it (Bitcoin for instance)
Some other EDA can be applied to this dataset. For instance, I am gonna use this data for the crypto space trends analysis (hello GameStop)
Please refer to my notebook on the trends here: https://www.kaggle.com/aagghh/trends-in-crypto-space-telegram-analysis
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
🧠 Overview This dataset offers a meticulously collected and organized set of Ethereum (ETH) OHLCV data against Tether (USDT), spanning over 4 years — from December 1, 2019 to January 1, 2024. It is derived from Binance, one of the most trusted and liquid cryptocurrency exchanges globally. The dataset is offered in multiple timeframes — from 1-minute to 1-day intervals — allowing it to serve as a comprehensive base for a wide variety of quantitative, financial, machine learning, and research use cases.
ETH/USDT is one of the most traded crypto pairs and reflects significant market sentiment, liquidity, and volatility patterns across all time horizons.
🎯 Who is This Dataset For? This dataset was built with a wide range of users in mind:
Quant researchers and traders building trading strategies and backtests
Data scientists and ML practitioners training deep learning models (LSTM, CNN, Transformer) on price prediction, volatility forecasting, or anomaly detection
Finance students and educators looking for clean, real-world time-series data
Crypto developers and bot builders designing arbitrage, scalping, or HFT strategies
Technical analysts and chartists needing high-quality historical data for visual analysis
⚙️ Why This Dataset? (The Motivation) While there are countless crypto datasets available, most have limitations:
They cover only one timeframe (e.g., just 1d or 1h)
They include missing or irregular data
They are not structured for use in modeling pipelines or financial toolkits
This dataset solves all of that by:
Providing 12 distinct timeframes (1m to 1d)
Ensuring clean, sorted, and complete records
Using a standard OHLCV schema that aligns with trading systems and model inputs
Covering an extensive historical range, long enough for statistically valid backtests and long-term model training
Being stored in easy-to-read Excel files, compatible with Python, R, Excel, Power BI, etc.
🔍 Data Breakdown Each record (row) represents one candlestick (also called a "kline") for a specific time interval.
Columns (Features): Column Description Timestamp Start time of the candlestick interval (UTC). It's the reference time for the open of that candle. Format: YYYY-MM-DD HH:MM:SS. Open ETH/USDT price at the start of the interval (i.e., opening price). High Maximum price reached by ETH during the interval. Low Minimum price reached by ETH during the interval. Close ETH/USDT price at the end of the interval (i.e., closing price). Volume The total volume of ETH traded during the interval (base asset volume, not USDT).
All values are as recorded from the Binance exchange without transformation, preserving their integrity for accurate modeling and testing.
🗃️ Timeframes Provided The dataset is divided into 12 separate Excel files, each corresponding to a distinct time resolution:
Filename Interval Description ETHUSDT_1m.xlsx 1 Minute High-frequency data for scalping or HFT models ETHUSDT_3m.xlsx 3 Minute Good compromise between noise and resolution ETHUSDT_5m.xlsx 5 Minute Widely used in intraday strategies ETHUSDT_15m.xlsx 15 Minute Ideal for swing trading and signal generation ETHUSDT_30m.xlsx 30 Minute Captures broader intraday patterns ETHUSDT_1h.xlsx 1 Hour Balanced view of short and mid-term trends ETHUSDT_2h.xlsx 2 Hour Smooths volatility for multi-hour trend analysis ETHUSDT_4h.xlsx 4 Hour Macro movement visualization across days ETHUSDT_6h.xlsx 6 Hour Suited for overnight or global market overlap patterns ETHUSDT_8h.xlsx 8 Hour Large-window analysis for swing traders ETHUSDT_12h.xlsx 12 Hour Captures morning-evening cycle globally ETHUSDT_1d.xlsx 1 Day Ideal for backtesting, long-term trends, feature creation
Each file is structured with consistent formatting and identical columns, making merging, resampling, or comparative analysis simple.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains detailed information on cryptocurrency transactions, specifically focusing on Bitcoin (BTC) and Ethereum (ETH). The dataset includes transaction details such as sender and receiver addresses, transaction amounts, fees, timestamps, and mining pool information. It serves as a valuable resource for analyzing market trends, identifying patterns in trading behavior, and studying blockchain transaction dynamics across different mining pools.
TX2QW62Q5XM17K0xd377b9203ad74038664c08f658c0245632651f550x4a3370c0f0b83d519ddf50892d006f64d742588011.39618058 (BTC or ETH depending on the currency type)6.226e-052025-03-30T23:32:40.589676Zf4A4D894b9Ee166B3F75F4FbEthermineETHTransferConfirmed50 (Only applicable to Ethereum transactions)Analyze transaction patterns to identify market trends and behaviors.
- Use the data to track spikes or drops in transaction volumes and correlate them with market events or price movements.
Study how mining pools and transaction fees interact with blockchain dynamics.
- Investigate how different mining pools impact transaction confirmation times and fees across Bitcoin and Ethereum networks.
Investigate the behavior of users sending or receiving cryptocurrency.
- Identify patterns such as frequent senders/receivers, average transaction amounts, and transaction frequency.
Explore transaction fees and gas price fluctuations across different mining pools and blockchains.
- Examine how Ethereum’s gas prices and Bitcoin’s transaction fees fluctuate over time and how this affects user behavior.