99 datasets found

Co-purchase Graphs
kaggle.com
zip
Updated Nov 11, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Subhajit Sahu (2021). Co-purchase Graphs [Dataset]. https://www.kaggle.com/datasets/wolfram77/graphs-co-purchase
Explore at:
zip(251051772 bytes)Available download formats
Dataset updated
Nov 11, 2021
Authors
Subhajit Sahu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Network was collected by crawling Amazon website. It is based on Customers Who Bought This Item Also Bought feature of the Amazon website. If a product i is frequently co-purchased with product j, the graph contains a directed edge from i to j.

The data was collected by crawling Amazon website and contains product metadata and review information about 548,552 different products (Books, music CDs, DVDs and VHS video tapes).

For each product the following information is available:

Title Salesrank List of similar products (that get co-purchased with the current product) Detailed product categorization Product reviews: time, customer, rating, number of votes, number of people that found the review helpful

Stanford Network Analysis Platform (SNAP) is a general purpose, high performance system for analysis and manipulation of large networks. Graphs consists of nodes and directed/undirected/multiple edges between the graph nodes. Networks are graphs with data on nodes and/or edges of the network.

The core SNAP library is written in C++ and optimized for maximum performance and compact graph representation. It easily scales to massive networks with hundreds of millions of nodes, and billions of edges. It efficiently manipulates large graphs, calculates structural properties, generates regular and random graphs, and supports attributes on nodes and edges. Besides scalability to large graphs, an additional strength of SNAP is that nodes, edges and attributes in a graph or a network can be changed dynamically during the computation.

SNAP was originally developed by Jure Leskovec in the course of his PhD studies. The first release was made available in Nov, 2009. SNAP uses a general purpose STL (Standard Template Library)-like library GLib developed at Jozef Stefan Institute. SNAP and GLib are being actively developed and used in numerous academic and industrial projects.

http://snap.stanford.edu/data/index.html#amazon
t
Amazon co-purchasing graph - Dataset - LDM
service.tib.eu
Updated Dec 16, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Amazon co-purchasing graph - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/amazon-co-purchasing-graph
Explore at:
Dataset updated
Dec 16, 2024
Description
The dataset used in the paper is the Amazon co-purchasing graph.
u
Amazon review data 2018
cseweb.ucsd.edu
nijianmo.github.io
+1more
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
UCSD CSE Research Project, Amazon review data 2018 [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets/amazon_v2/
Explore at:
Dataset authored and provided by
UCSD CSE Research Project
Description
Context

This Dataset is an updated version of the Amazon review dataset released in 2014. As in the previous version, this dataset includes reviews (ratings, text, helpfulness votes), product metadata (descriptions, category information, price, brand, and image features), and links (also viewed/also bought graphs). In addition, this version provides the following features:

More reviews:

The total number of reviews is 233.1 million (142.8 million in 2014).

New reviews:

Current data includes reviews in the range May 1996 - Oct 2018.

Metadata: - We have added transaction metadata for each review shown on the review page.

Added more detailed metadata of the product landing page.

Acknowledgements

If you publish articles based on this dataset, please cite the following paper:

Jianmo Ni, Jiacheng Li, Julian McAuley. Justifying recommendations using distantly-labeled reviews and fined-grained aspects. EMNLP, 2019.
Amazon Product Co-purchasing Network (SNAP)
kaggle.com
zip
Updated Dec 16, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Subhajit Sahu (2021). Amazon Product Co-purchasing Network (SNAP) [Dataset]. https://www.kaggle.com/wolfram77/graphs-snap-com-amazon
Explore at:
zip(12922748 bytes)Available download formats
Dataset updated
Dec 16, 2021
Authors
Subhajit Sahu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Amazon product co-purchasing network and ground-truth communities

https://snap.stanford.edu/data/com-Amazon.html

Dataset information

Network was collected by crawling the Amazon.com website. It is based on
Customers Who Bought This Item Also Bought feature of the Amazon website.
If a product i is frequently co-purchased with product j, the graph
contains an undirected edge from i to j. Each product category provided by Amazon defines each ground-truth community.

We regard each connected component in a product category as a separate
ground-truth community. We remove the ground-truth communities which have
less than 3 nodes. We also provide the top 5,000 communities with highest
quality which are described in our paper (http://arxiv.org/abs/1205.6233). As for the network, we provide the largest connected component.

Dataset statistics
Nodes 334863
Edges 925872
Nodes in largest WCC 334863 (1.000)
Edges in largest WCC 925872 (1.000)
Nodes in largest SCC 334863 (1.000)
Edges in largest SCC 925872 (1.000)
Average clustering coefficient 0.3967
Number of triangles 667129
Fraction of closed triangles 0.07925
Diameter (longest shortest path) 44
90-percentile effective diameter 15

Source (citation) J. Yang and J. Leskovec. Defining and Evaluating Network Communities based on Ground-truth. ICDM, 2012.
http://arxiv.org/abs/1205.6233

Files
File Description
com-amazon.ungraph.txt.gz Undirected Amazon product co-purchasing network com-amazon.all.dedup.cmty.txt.gz Amazon communities
com-amazon.top5000.cmty.txt.gz Amazon communities (Top 5,000)

Notes on inclusion into the SuiteSparse Matrix Collection, July 2018:

The graph in the SNAP data set is 1-based, with nodes numbered 1 to
548,551.

In the SuiteSparse Matrix Collection, Problem.A is the undirected Amazon
product co-purchasing network, a matrix of size n-by-n with n=334,863,
which is the number of unique product id's appearing in any edge.
Problem.aux.nodeid is a list of the node id's that appear in the SNAP data set. A(i,j)=1 if the product nodeid(i) is co-purchased with product
nodeid(j). The node id's are the same as the SNAP data set (1-based).

C = Problem.aux.Communities_all is a sparse matrix of size n by 75,149,
which holds the 75,149 categories in the com-amazon.all.dedup.cmty.txt
file. The kth line in that file defines the kth community, and is the
column C(:,k), where C(i,k)=1 if product nodeid(i) is in the kth
community. Row C(i,:) and row/column i of the A matrix thus refer to the
same product, nodeid(i).

Ctop = Problem.aux.Communities_top5000 is n-by-5000, with the same
structure as the C array above, with the content of the
com-amazon.top5000.cmty.txt.
t
Amazon Product Dataset - Dataset - LDM
service.tib.eu
Updated Dec 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Amazon Product Dataset - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/amazon-product-dataset
Explore at:
Dataset updated
Dec 3, 2024
Description
The dataset used in the paper is a large-scale graph dataset, consisting of users and shows with multi-attribute edges. The graph is constructed by selecting user IDs and side information combinations of shows as nodes, and click/co-click relations and view time as edges.
OGBN-Products (Processed for PyG)
kaggle.com
zip
Updated Feb 27, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Redao da Taupl (2021). OGBN-Products (Processed for PyG) [Dataset]. https://www.kaggle.com/datasets/dataup1/ogbn-products/code
Explore at:
zip(3699538358 bytes)Available download formats
Dataset updated
Feb 27, 2021
Authors
Redao da Taupl
Description
OGBN-Products

Webpage: https://ogb.stanford.edu/docs/nodeprop/#ogbn-products

Usage in Python

import os.path as osp import pandas as pd import datatable as dt import torch import torch_geometric as pyg from ogb.nodeproppred import PygNodePropPredDataset class PygOgbnProducts(PygNodePropPredDataset): def _init_(self, meta_csv = None): root, name, transform = '/kaggle/input', 'ogbn-products', None if meta_csv is None: meta_csv = osp.join(root, name, 'ogbn-master.csv') master = pd.read_csv(meta_csv, index_col = 0) meta_dict = master[name] meta_dict['dir_path'] = osp.join(root, name) super()._init_(name = name, root = root, transform = transform, meta_dict = meta_dict) def get_idx_split(self, split_type = None): if split_type is None: split_type = self.meta_info['split'] path = osp.join(self.root, 'split', split_type) if osp.isfile(os.path.join(path, 'split_dict.pt')): return torch.load(os.path.join(path, 'split_dict.pt')) if self.is_hetero: train_idx_dict, valid_idx_dict, test_idx_dict = read_nodesplitidx_split_hetero(path) for nodetype in train_idx_dict.keys(): train_idx_dict[nodetype] = torch.from_numpy(train_idx_dict[nodetype]).to(torch.long) valid_idx_dict[nodetype] = torch.from_numpy(valid_idx_dict[nodetype]).to(torch.long) test_idx_dict[nodetype] = torch.from_numpy(test_idx_dict[nodetype]).to(torch.long) return {'train': train_idx_dict, 'valid': valid_idx_dict, 'test': test_idx_dict} else: train_idx = dt.fread(osp.join(path, 'train.csv'), header = None).to_numpy().T[0] train_idx = torch.from_numpy(train_idx).to(torch.long) valid_idx = dt.fread(osp.join(path, 'valid.csv'), header = None).to_numpy().T[0] valid_idx = torch.from_numpy(valid_idx).to(torch.long) test_idx = dt.fread(osp.join(path, 'test.csv'), header = None).to_numpy().T[0] test_idx = torch.from_numpy(test_idx).to(torch.long) return {'train': train_idx, 'valid': valid_idx, 'test': test_idx}

dataset = PygOgbnProducts() split_idx = dataset.get_idx_split() train_idx, valid_idx, test_idx = split_idx['train'], split_idx['valid'], split_idx['test'] graph = dataset[0] # PyG Graph object

Description

Graph: The ogbn-products dataset is an undirected and unweighted graph, representing an Amazon product co-purchasing network [1]. Nodes represent products sold in Amazon, and edges between two products indicate that the products are purchased together. The authors follow [2] to process node features and target categories. Specifically, node features are generated by extracting bag-of-words features from the product descriptions followed by a Principal Component Analysis to reduce the dimension to 100.

Prediction task: The task is to predict the category of a product in a multi-class classification setup, where the 47 top-level categories are used for target labels.

Dataset splitting: The authors consider a more challenging and realistic dataset splitting that differs from the one used in [2] Instead of randomly assigning 90% of the nodes for training and 10% of the nodes for testing (without use of a validation set), use the sales ranking (popularity) to split nodes into training/validation/test sets. Specifically, the authors sort the products according to their sales ranking and use the top 8% for training, next top 2% for validation, and the rest for testing. This is a more challenging splitting procedure that closely matches the real-world application where labels are first assigned to important nodes in the network and ML models are subsequently used to make predictions on less important ones.

Note 1: A very small number of self-connecting edges are repeated (see here); you may remove them if necessary.

Note 2: For undirected graphs, the loaded graphs will have the doubled number of edges because the bidirectional edges will be added automatically.

Summary

Package #Nodes #Edges Split Type Task Type Metric
ogb>=1.1.1 2,449,029 61,859,140 Sales rank Multi-class classification Accuracy

Open Graph Benchmark

Website: https://ogb.stanford.edu

The Open Graph Benchmark (OGB) [3] is a collection of realistic, large-scale, and diverse benchmark datasets for machine learning on graphs. OGB datasets are automatically downloaded, processed, and split using the OGB Data Loader. The model performance can be evaluated using the OGB Evaluator in a unified manner.

References

[1] http://manikvarma.org/downloads/XC/XMLRepository.html [2] Wei-Lin Chiang, ...
Z
Large-scale attributed graph & hypergraph datasets: TWeibo, Amazon2M,...
data-staging.niaid.nih.gov
data.niaid.nih.gov
Updated Dec 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Li, Yiran; Guo, Gongyao; Yang, Renchi; Shi, Jieming (2023). Large-scale attributed graph & hypergraph datasets: TWeibo, Amazon2M, Amazon, MAG-PM [Dataset]. https://data-staging.niaid.nih.gov/resources?id=zenodo_10426623
Explore at:
Dataset updated
Dec 23, 2023
Dataset provided by
Hong Kong Baptist University
Hong Kong Polytechnic University
Authors
Li, Yiran; Guo, Gongyao; Yang, Renchi; Shi, Jieming
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Here we provide additional large-scale datasets used in our work "A Versatile Framework for Attributed Network Clustering via K-Nearest Neighbor Augmentation", along with the index files for constructing KNN graphs using ScaNN and Faiss.

Usage:

cd ANCKA/

unzip ~/Download_path/ANCKA_data.zip -d data/
u
Amazon Question and Answer Data
cseweb.ucsd.edu
json
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
UCSD CSE Research Project, Amazon Question and Answer Data [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets.html
Explore at:
jsonAvailable download formats
Dataset authored and provided by
UCSD CSE Research Project
Description
These datasets contain 1.48 million question and answer pairs about products from Amazon.

Metadata includes

question and answer text

is the question binary (yes/no), and if so does it have a yes/no answer?

timestamps

product ID (to reference the review dataset)

Basic Statistics:

Questions: 1.48 million

Answers: 4,019,744

Labeled yes/no questions: 309,419

Number of unique products with questions: 191,185
h
pareto-amazon-photo
huggingface.co
Updated Feb 14, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Saurav Maheshkar (2024). pareto-amazon-photo [Dataset]. https://huggingface.co/datasets/SauravMaheshkar/pareto-amazon-photo
Explore at:
Dataset updated
Feb 14, 2024
Authors
Saurav Maheshkar
License
https://choosealicense.com/licenses/cc/https://choosealicense.com/licenses/cc/
Description
Dataset Information

Nodes

Edges

Features

7,650 119,043 745

Pre-processed as per the official codebase of https://arxiv.org/abs/2210.02016

Citations

@article{ju2023multi, title={Multi-task Self-supervised Graph Neural Networks Enable Stronger Task Generalization}, author={Ju, Mingxuan and Zhao, Tong and Wen, Qianlong and Yu, Wenhao and Shah, Neil and Ye, Yanfang and Zhang, Chuxu}, booktitle={International Conference on Learning… See the full description on the dataset page: https://huggingface.co/datasets/SauravMaheshkar/pareto-amazon-photo.
m
Amazon.com Inc - Ebitda
macro-rankings.com
csv, excel
Updated Mar 18, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
macro-rankings (2025). Amazon.com Inc - Ebitda [Dataset]. https://www.macro-rankings.com/markets/stocks/amzn-nasdaq/income-statement/ebitda
Explore at:
csv, excelAvailable download formats
Dataset updated
Mar 18, 2025
Dataset authored and provided by
macro-rankings
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
united states
Description
Ebitda Time Series for Amazon.com Inc. Amazon.com, Inc. engages in the retail sale of consumer products, advertising, and subscriptions service through online and physical stores in North America and internationally. The company operates through three segments: North America, International, and Amazon Web Services (AWS). It also manufactures and sells electronic devices, including Kindle, fire tablets, fire TVs, echo, ring, blink, and eero; and develops and produces media content. In addition, the company offers programs that enable sellers to sell their products in its stores; and programs that allow authors, independent publishers, musicians, filmmakers, Twitch streamers, skill and app developers, and others to publish and sell content. Further, it provides compute, storage, database, analytics, machine learning, and other services, as well as advertising services through programs, such as sponsored ads, display, and video advertising. Additionally, the company offers Amazon Prime, a membership program. The company's products offered through its stores include merchandise and content purchased for resale and products offered by third-party sellers. It also provides AgentCore services, such as AgentCore Runtime, AgentCore Memory, AgentCore Observability, AgentCore Identity, AgentCore Gateway, AgentCore Browser, and AgentCore Code Interpreter. It serves consumers, sellers, developers, enterprises, content creators, advertisers, and employees. Amazon.com, Inc. was incorporated in 1994 and is headquartered in Seattle, Washington.
G
Graph Technology Report
datainsightsmarket.com
doc, pdf, ppt
Updated Aug 7, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Data Insights Market (2025). Graph Technology Report [Dataset]. https://www.datainsightsmarket.com/reports/graph-technology-1956854
Explore at:
ppt, pdf, docAvailable download formats
Dataset updated
Aug 7, 2025
Dataset authored and provided by
Data Insights Market
License
https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The graph technology market is experiencing robust growth, driven by the increasing need for advanced data analytics and the rising adoption of artificial intelligence (AI) and machine learning (ML) applications. The market's expansion is fueled by the ability of graph databases to handle complex, interconnected data more efficiently than traditional relational databases. This is particularly crucial in industries like finance (fraud detection, risk management), healthcare (patient relationship mapping, drug discovery), and e-commerce (recommendation systems, personalized marketing). Key trends include the move towards cloud-based graph solutions, the integration of graph technology with other data management systems, and the development of more sophisticated graph algorithms for advanced analytics. While challenges remain, such as the need for skilled professionals and the complexity of implementing graph databases, the overall market outlook remains positive, with a projected Compound Annual Growth Rate (CAGR) – let's conservatively estimate this at 25% – for the forecast period 2025-2033. This growth will be driven by ongoing digital transformation initiatives across various sectors, leading to an increased demand for efficient data management and analytics capabilities. We can expect to see continued innovation in both open-source and commercial graph database solutions, further fueling the market's expansion. The competitive landscape is characterized by a mix of established players like Oracle, IBM, and Microsoft, alongside emerging innovative companies such as Neo4j, TigerGraph, and Amazon Web Services. These companies are constantly vying for market share through product innovation, strategic partnerships, and acquisitions. The presence of both open-source and proprietary solutions caters to a diverse range of needs and budgets. The market segmentation, while not explicitly detailed, likely includes categories based on deployment (cloud, on-premise), database type (property graph, RDF), and industry vertical. The regional distribution will likely show strong growth in North America and Europe, reflecting the higher adoption of advanced technologies in these regions, followed by a steady rise in Asia-Pacific and other developing markets. Looking ahead, the convergence of graph technology with other emerging technologies like blockchain and the Internet of Things (IoT) promises to unlock even greater opportunities for growth and innovation in the years to come.
Amazon Financial Dataset
kaggle.com
zip
Updated Dec 18, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Krishna Yadu (2024). Amazon Financial Dataset [Dataset]. https://www.kaggle.com/datasets/krishnayadav456wrsty/amazon-financial-dataset
Explore at:
zip(7415 bytes)Available download formats
Dataset updated
Dec 18, 2024
Authors
Krishna Yadu
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Title:

Amazon Financial Dataset: R&D, Marketing, Campaigns, and Profit

Description:

This dataset provides fictional yet insightful financial data of Amazon's business activities across all 50 states of the USA. It is specifically designed to help students, researchers, and practitioners perform various data analysis tasks such as log normalization, Gaussian distribution visualization, and financial performance comparisons.

Each row represents a state and contains the following columns:
- R&D Amount (in $): The investment made in research and development.
- Marketing Amount (in $): The expenditure on marketing activities.
- Campaign Amount (in $): The costs associated with promotional campaigns.
- State: The state in which the data is recorded.
- Profit (in $): The net profit generated from the state.

Additional features include log-normalized and Z-score transformations for advanced analysis.

Use Cases:

This dataset is ideal for practicing:
1. Log Transformation: Normalize skewed data for better modeling and analysis.
2. Statistical Analysis: Explore relationships between financial investments and profit.
3. Visualization: Create compelling graphs such as Gaussian distributions and standard normal distributions.
4. Machine Learning Projects: Build regression models to predict profits based on R&D and marketing spend.

File Information:

File Format: Excel (.xlsx)

Number of Records: 50 (one for each state of the USA)

Columns: 5 primary financial columns and additional preprocessed columns for normalization and Z-scores.

Important Note:

This dataset is synthetically generated and is not based on actual Amazon financial records. It is created solely for educational and practice purposes.

Tags:

Financial Analysis

Data Visualization

Machine Learning

Statistical Analysis

Educational Dataset
m
Amazon.com Inc - Change-Receivables
macro-rankings.com
csv, excel
Updated Sep 27, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
macro-rankings (2025). Amazon.com Inc - Change-Receivables [Dataset]. https://www.macro-rankings.com/markets/stocks/amzn-nasdaq/cashflow-statement/change-receivables
Explore at:
excel, csvAvailable download formats
Dataset updated
Sep 27, 2025
Dataset authored and provided by
macro-rankings
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
united states
Description
Change-Receivables Time Series for Amazon.com Inc. Amazon.com, Inc. engages in the retail sale of consumer products, advertising, and subscriptions service through online and physical stores in North America and internationally. The company operates through three segments: North America, International, and Amazon Web Services (AWS). It also manufactures and sells electronic devices, including Kindle, fire tablets, fire TVs, echo, ring, blink, and eero; and develops and produces media content. In addition, the company offers programs that enable sellers to sell their products in its stores; and programs that allow authors, independent publishers, musicians, filmmakers, Twitch streamers, skill and app developers, and others to publish and sell content. Further, it provides compute, storage, database, analytics, machine learning, and other services, as well as advertising services through programs, such as sponsored ads, display, and video advertising. Additionally, the company offers Amazon Prime, a membership program. The company's products offered through its stores include merchandise and content purchased for resale and products offered by third-party sellers. It also provides AgentCore services, such as AgentCore Runtime, AgentCore Memory, AgentCore Observability, AgentCore Identity, AgentCore Gateway, AgentCore Browser, and AgentCore Code Interpreter. It serves consumers, sellers, developers, enterprises, content creators, advertisers, and employees. Amazon.com, Inc. was incorporated in 1994 and is headquartered in Seattle, Washington.
Amazon product co-purchasing network and ground-truth communities
berd-platform.de
txt
Updated Jul 31, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jaewon Yang; Jure Leskovec; Jaewon Yang; Jure Leskovec (2025). Amazon product co-purchasing network and ground-truth communities [Dataset]. http://doi.org/10.82939/8eq5f-e1186
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.82939/8eq5f-e1186
Dataset updated
Jul 31, 2025
Dataset provided by
Institute of Electrical and Electronics Engineershttp://www.ieee.ro/
Authors
Jaewon Yang; Jure Leskovec; Jaewon Yang; Jure Leskovec
Description
Network was collected by crawling Amazon website. It is based on Customers Who Bought This Item Also Bought feature of the Amazon website. If a product i is frequently co-purchased with product j, the graph contains an undirected edge from i to j. Each product category provided by Amazon defines each ground-truth community. We regard each connected component in a product category as a separate ground-truth community. We remove the ground-truth communities which have less than 3 nodes. We also provide the top 5,000 communities with highest quality which are described in our paper. As for the network, we provide the largest connected component. The dataset contains 334,863 nodes and 925,872 edges.
G
Graph Database Market Report
marketreportanalytics.com
doc, pdf, ppt
Updated Mar 19, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Market Report Analytics (2025). Graph Database Market Report [Dataset]. https://www.marketreportanalytics.com/reports/graph-database-market-10714
Explore at:
pdf, doc, pptAvailable download formats
Dataset updated
Mar 19, 2025
Dataset authored and provided by
Market Report Analytics
License
https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The graph database market is booming, projected to reach $5.97 billion by 2025 with a 24.4% CAGR. Discover key drivers, trends, and regional insights in our comprehensive market analysis, including leading companies like Neo4j and Amazon. Explore the future of data management with this in-depth report.
b
Amazon Statistics (2025)
businessofapps.com
Updated Jul 20, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Business of Apps (2025). Amazon Statistics (2025) [Dataset]. https://www.businessofapps.com/data/amazon-statistics/
Explore at:
Dataset updated
Jul 20, 2025
Dataset authored and provided by
Business of Apps
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Description
Amazon is one of the most recognisable brands in the world, and the third largest by revenue. It was the fourth tech company to reach a $1 trillion market cap, and a market leader in e-commerce,...
T
United States - CBOE Equity VIX on Amazon
tradingeconomics.com
csv, excel, json, xml
Updated Feb 3, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
TRADING ECONOMICS (2020). United States - CBOE Equity VIX on Amazon [Dataset]. https://tradingeconomics.com/united-states/cboe-equity-vix-on-amazon-fed-data.html
Explore at:
xml, json, excel, csvAvailable download formats
Dataset updated
Feb 3, 2020
Dataset authored and provided by
TRADING ECONOMICS
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 1976 - Dec 31, 2025
Area covered
United States
Description
United States - CBOE Equity VIX on Amazon was 30.99000 Index in November of 2025, according to the United States Federal Reserve. Historically, United States - CBOE Equity VIX on Amazon reached a record high of 72.66000 in March of 2020 and a record low of 5.13000 in March of 2017. Trading Economics provides the current actual value, an historical data chart and related indicators for United States - CBOE Equity VIX on Amazon - last updated from the United States Federal Reserve on December of 2025.
Amazon Reviews Data 2023
kaggle.com
zip
Updated Jul 25, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Wajahat Waheed (2024). Amazon Reviews Data 2023 [Dataset]. https://www.kaggle.com/datasets/wajahat1064/amazon-reviews-data-2023/versions/2
Explore at:
zip(283902356 bytes)Available download formats
Dataset updated
Jul 25, 2024
Authors
Wajahat Waheed
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
2 useful files:

all_categories.txt: 34 lines (33 categories + "Unknown"), each line contains a category name.

asin2category.json: A mapping between parent_asin (item ID) to its corresponding category name.

This is a large-scale Amazon Reviews dataset, collected in 2023 by McAuley Lab, and it includes rich features such as:

- User Reviews (ratings, text, helpfulness votes, etc.); - Item Metadata (descriptions, price, raw image, etc.); - Links (user-item / bought together graphs).

What's New? In the Amazon Reviews'23, we provide:

Larger Dataset: We collected 571.54M reviews, **245.2% **larger than the last version; - Newer Interactions: Current interactions range from May. 1996 to Sep. 2023; Richer Metadata: More descriptive features in item metadata; Fine-grained Timestamp: Interaction timestamp at the second or finer level; Cleaner Processing: Cleaner item metadata than previous versions; Standard Splitting: Standard data splits to encourage RecSys benchmarking.
u
Goodreads Book Reviews
cseweb.ucsd.edu
json
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
UCSD CSE Research Project, Goodreads Book Reviews [Dataset]. https://cseweb.ucsd.edu/~jmcauley/datasets.html
Explore at:
jsonAvailable download formats
Dataset authored and provided by
UCSD CSE Research Project
Description
These datasets contain reviews from the Goodreads book review website, and a variety of attributes describing the items. Critically, these datasets have multiple levels of user interaction, raging from adding to a shelf, rating, and reading.

Metadata includes

reviews

add-to-shelf, read, review actions

book attributes: title, isbn

graph of similar books

Basic Statistics:

Items: 1,561,465

Users: 808,749

Interactions: 225,394,930
e
Amazon Robotics - citations
exaly.com
csv, json
Updated Nov 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Amazon Robotics - citations [Dataset]. https://exaly.com/institution/149356/amazon-robotics
Explore at:
csv, jsonAvailable download formats
Dataset updated
Nov 1, 2025
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
The graph shows the citations of ^'s papers published in each year.

Package	#Nodes	#Edges	Split Type	Task Type	Metric
`ogb>=1.1.1`	2,449,029	61,859,140	Sales rank	Multi-class classification	Accuracy

Facebook

Twitter

Click to copy link

Link copied

Cite

Subhajit Sahu (2021). Co-purchase Graphs [Dataset]. https://www.kaggle.com/datasets/wolfram77/graphs-co-purchase

Co-purchase Graphs

Co-purchasing networks from the Stanford Network Analysis Platform (SNAP)

Explore at:

167 scholarly articles cite this dataset (View in Google Scholar)

zip(251051772 bytes)Available download formats

Dataset updated

Nov 11, 2021

Authors

Subhajit Sahu

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Network was collected by crawling Amazon website. It is based on Customers Who Bought This Item Also Bought feature of the Amazon website. If a product i is frequently co-purchased with product j, the graph contains a directed edge from i to j.

The data was collected by crawling Amazon website and contains product metadata and review information about 548,552 different products (Books, music CDs, DVDs and VHS video tapes).

For each product the following information is available:

Title Salesrank List of similar products (that get co-purchased with the current product) Detailed product categorization Product reviews: time, customer, rating, number of votes, number of people that found the review helpful

Stanford Network Analysis Platform (SNAP) is a general purpose, high performance system for analysis and manipulation of large networks. Graphs consists of nodes and directed/undirected/multiple edges between the graph nodes. Networks are graphs with data on nodes and/or edges of the network.

The core SNAP library is written in C++ and optimized for maximum performance and compact graph representation. It easily scales to massive networks with hundreds of millions of nodes, and billions of edges. It efficiently manipulates large graphs, calculates structural properties, generates regular and random graphs, and supports attributes on nodes and edges. Besides scalability to large graphs, an additional strength of SNAP is that nodes, edges and attributes in a graph or a network can be changed dynamically during the computation.

SNAP was originally developed by Jure Leskovec in the course of his PhD studies. The first release was made available in Nov, 2009. SNAP uses a general purpose STL (Standard Template Library)-like library GLib developed at Jozef Stefan Institute. SNAP and GLib are being actively developed and used in numerous academic and industrial projects.

http://snap.stanford.edu/data/index.html#amazon

Clear search

Close search

Google apps

Main menu

Co-purchase Graphs

Amazon co-purchasing graph - Dataset - LDM

Amazon review data 2018

Context

Acknowledgements

Amazon Product Co-purchasing Network (SNAP)

Amazon product co-purchasing network and ground-truth communities

Notes on inclusion into the SuiteSparse Matrix Collection, July 2018:

Amazon Product Dataset - Dataset - LDM

OGBN-Products (Processed for PyG)

OGBN-Products

Usage in Python

Description

Summary

Open Graph Benchmark

References

Large-scale attributed graph & hypergraph datasets: TWeibo, Amazon2M,...

Amazon Question and Answer Data

pareto-amazon-photo

Nodes

Edges

Features

Amazon.com Inc - Ebitda

Graph Technology Report

Amazon Financial Dataset

Title:

Description:

Use Cases:

File Information:

Important Note:

Tags:

Amazon.com Inc - Change-Receivables

Amazon product co-purchasing network and ground-truth communities

Graph Database Market Report

Amazon Statistics (2025)

United States - CBOE Equity VIX on Amazon

Amazon Reviews Data 2023

Goodreads Book Reviews

Amazon Robotics - citations

Co-purchase Graphs

Co-purchasing networks from the Stanford Network Analysis Platform (SNAP)