Demo to save data from a Space to a Dataset. The goal is to provide reusable code snippets.
Documentation: https://huggingface.co/docs/huggingface_hub/main/en/guides/upload#scheduled-uploads
Space: https://huggingface.co/spaces/Wauplin/space_to_dataset_saver/
JSON dataset: https://huggingface.co/datasets/Wauplin/example-space-to-dataset-json
Image dataset: https://huggingface.co/datasets/Wauplin/example-space-to-dataset-image
Image (zipped) dataset: … See the full description on the dataset page: https://huggingface.co/datasets/Wauplin/example-space-to-dataset-json.
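A minimal sketch of the scheduled-upload pattern from the linked guide, using huggingface_hub's CommitScheduler; the repo and folder names below are placeholders, not the demo Space's actual configuration:

```python
import json
from pathlib import Path
from uuid import uuid4

from huggingface_hub import CommitScheduler

# Local folder whose contents get committed to the dataset repo on a schedule.
json_folder = Path("json_data")
json_folder.mkdir(exist_ok=True)
data_file = json_folder / f"data_{uuid4()}.json"

# Placeholder repo_id; pushes the folder to the Hub every 5 minutes.
scheduler = CommitScheduler(
    repo_id="your-username/example-space-to-dataset-json",
    repo_type="dataset",
    folder_path=json_folder,
    path_in_repo="data",
    every=5,  # minutes
)

def save_record(record: dict) -> None:
    # Hold the scheduler's lock so a commit never reads a half-written file.
    with scheduler.lock:
        with data_file.open("a") as f:
            f.write(json.dumps(record) + "\n")

save_record({"greeting": "hello", "count": 1})
```

Appending one JSON object per line keeps each write cheap and lets the scheduler upload the file incrementally between commits.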
Database Contents License (DbCL) v1.0: http://opendatacommons.org/licenses/dbcl/1.0/
This dataset contains more than 50,000 records of sales and order data from an online store.
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by Noura Aly
Released under Apache 2.0
Dhdb/example-space-to-dataset-json is a dataset hosted on Hugging Face and contributed by the HF Datasets community.
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
This dataset was created by TheWiseO
Released under MIT
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Gzipped JSON file of the output of the benchmarking pipeline. For each sample, it contains the resistance calls made by each tool. It is the input file needed to generate all the results in the publication.
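A minimal sketch for reading the file in Python; the filename is a placeholder and the per-sample structure is an assumption, not documented here:

```python
import gzip
import json

# Placeholder filename; substitute the actual gzipped JSON from this record.
with gzip.open("benchmark_output.json.gz", "rt", encoding="utf-8") as f:
    calls = json.load(f)

# Assumed structure: a mapping from sample to per-tool resistance calls.
sample, tool_calls = next(iter(calls.items()))
print(sample, tool_calls)
```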
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Blockchain data query: JSON example
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
This dataset was created by hung hoang 31
Released under MIT
https://crawlfeeds.com/privacy_policy
We have extracted a comprehensive news dataset from CNBC, covering not only financial updates but also a wide range of news categories relevant to audiences in Europe, the US, and the UK. The dataset includes over 500,000 records, structured in JSON format for seamless integration and analysis.
This extraction spans multiple news segments.
Each record in the dataset is enriched with metadata tags, enabling precise filtering by region, sector, topic, and publication date.
The dataset provides real-time insights into global developments, corporate strategies, leadership changes, and sector-specific trends. Designed for media analysts, research firms, and businesses, it supports a wide range of media and market analyses.
Additionally, the JSON format ensures easy integration with analytics platforms for advanced processing.
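As a sketch of that metadata-based filtering in Python: every field name below (region, sector, published_at) and the filename are assumptions for illustration, not the published schema:

```python
import json

# Placeholder filename and field names; check the actual schema in the sample.
with open("cnbc_articles.json", encoding="utf-8") as f:
    articles = json.load(f)

eu_market_news = [
    a for a in articles
    if a.get("region") == "Europe"
    and a.get("sector") == "Markets"
    and a.get("published_at", "") >= "2024-01-01"
]
print(f"{len(eu_market_news)} matching articles")
```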
Looking for a rich repository of structured news data? Visit our news dataset collection to explore additional offerings tailored to your analysis needs.
To get a preview, check out the CSV sample of the CNBC economy articles dataset.
This dataset was created by Neal Magee
Database Contents License (DbCL) v1.0: http://opendatacommons.org/licenses/dbcl/1.0/
This dataset contains inventory data for a pharmacy e-commerce website in JSON format, designed for easy integration into MongoDB databases, making it ideal for MERN stack projects. It includes 10 fields.
This dataset is useful for developing pharmacy-related web applications, inventory management systems, or online medical stores using the MERN stack.
Do not use for production-level purposes; use for project development only. Feel free to contribute if you find any mistakes or have suggestions.
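A minimal import sketch, assuming the inventory is a JSON array of documents and a local MongoDB instance; the file, database, and collection names are placeholders:

```python
import json

from pymongo import MongoClient

# Placeholder file name; assumed to hold a JSON array of inventory documents.
with open("pharmacy_inventory.json", encoding="utf-8") as f:
    items = json.load(f)

# Placeholder database and collection names; adjust to your setup.
client = MongoClient("mongodb://localhost:27017")
collection = client["pharmacy_store"]["inventory"]
collection.insert_many(items)
print(f"Inserted {len(items)} documents")
```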
This dataset was created by Jeong Hoon Lee
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Interoperability in systems-of-systems is a difficult problem due to the abundance of data standards and formats. Current approaches to interoperability rely on hand-made adapters or methods using ontological metadata. This dataset was created to facilitate research on data-driven interoperability solutions. The data comes from a simulation of a building heating system, and the messages sent within control systems-of-systems. For more information see attached data documentation.
The data comes in two semicolon-separated (;) CSV files, training.csv and test.csv. The train/test split is not random; training data comes from the first 80% of simulated timesteps, and the test data is the last 20%. There is no specific validation dataset; validation data should instead be randomly selected from the training data. The simulation runs for as many time steps as there are outside temperature values available. The original SMHI data only samples once every hour, which we linearly interpolate to get one temperature sample every ten seconds. The data saved at each time step consists of 34 JSON messages (four per room and two temperature readings from the outside), 9 temperature values (one per room and outside), 8 setpoint values, and 8 actuator outputs. The data associated with each of those 34 JSON messages is stored as a single row in the tables. This means that much data is duplicated, a choice made to make the data easier to use.
The simulation data is not meant to be opened and analyzed in spreadsheet software; it is meant for training machine learning models. It is recommended to open the data with the pandas library for Python, available at https://pypi.org/project/pandas/.
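A minimal loading sketch following that recommendation; the semicolon separator, file names, and random validation split come from the description above, while the 10% validation fraction is an arbitrary choice:

```python
import pandas as pd

# The files are semicolon-separated, per the dataset description.
train = pd.read_csv("training.csv", sep=";")
test = pd.read_csv("test.csv", sep=";")

# No predefined validation set: sample it randomly from the training data,
# as the description recommends (10% is an arbitrary illustrative fraction).
val = train.sample(frac=0.1, random_state=42)
train = train.drop(val.index)

print(train.shape, val.shape, test.shape)
```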
The data file with temperatures (smhi-july-23-29-2018.csv) acts as input for the thermodynamic building simulation found on GitHub, where it is used to get the outside temperature and corresponding timestamps. Temperature data for Luleå in the summer of 2018 were downloaded from SMHI.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Blockchain data query: V2 Parse JSON String sample
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by Ahmed Sleem
Released under Apache 2.0
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by prateek khandelwal
Released under Apache 2.0
Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
The Modified Swiss Dwellings (MSD) JSON dataset is an ML-ready dataset for floor plan generation and analysis at building scale. It is derived from the Modified Swiss Dwellings database (v6) and contains 4572 room-based geometries together with their topological dual graphs in JSON format, plus 250 sample colour-coded images for visual reference. The dataset (geometries and graphs) can be imported into TopologicPy for further analysis and for use in ML workflows. The original attributes are stored within the nodes and edges of the graphs as well as in the faces of the geometries.
- building_id (int)
- floor_id (int)
- plan_id (int)
- site_id (int)
- elevation (float)
- height (float)
- ml_type (str)
- unit_usage (str)

Each vertex represents a spatial unit (room, corridor, balcony, etc.).
Core classification & IDs
- entity_type (str) = "area"
- entity_subtype ∈ {ROOM, BATHROOM, CORRIDOR, KITCHEN, BALCONY, STAIRCASE, STOREROOM, LIVING_DINING, …}
- area_id (float|int)
- apartment_id (str|null)
- unit_id (float|int|null)

Geometry

- geom (WKT polygon string)
- geometry (array of [x, y] points)
- x, y, z; plus per-vertex height, elevation, area

Semantic & visual attributes

- roomtype
- node_name
- node_type
- unit_usage
- zoning
- zone_name
- zone_type
- node_color
- apartment_color
- zone_color

Edges encode connectivity between vertices (by vertex IDs).

- connectivity (str): e.g., "door", "entrance", "passage"
- edge_width (number): e.g., 4
- source (str), target (str): vertex IDs like "Vertex_0000"

A single JSON file encodes a floor plan as Topologic geometry. The file is an array of topology objects (Vertex, Edge, Wire, and Face). Each object carries a uuid, a type, a dictionary (metadata), and (optionally) an apertures array.
type: "Vertex" | "Edge" | "Wire" | "Face".uuid: globally unique identifier (string).dictionary: per-object metadata. Faces include rich room/zone attributes here (see Room/Zone attributes below). :contentReference[oaicite:1]apertures: array (often empty) reserved for openings/voids.coordinates: [x, y, z] (z is often 0.0 for floor plans).Example ```json { "type": "Vertex", "uuid": "4097fc7d-a38c-11f0-82e9-e8c8299204ae", "dictionary": { "toplevel": false, "uuid": "4097fc7d-a38c-11f0-82e9-e8c8299204ae" }, "apertures": [], "coordinates": [5.529276, 2.15043, 0.0] }
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
JSON file with a list of port calls from vessels arriving at the ports of Valencia. The data was used inside the INTER-IoT project as an example dataset provided by a legacy IoT platform.
*NOTE: Due to a bug in the system, it is not possible to upload files with a .json extension, so the file is uploaded with a ._json extension instead. Please rename it after download.
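For example, restoring the extension in Python (the filename is a placeholder):

```python
from pathlib import Path

# Placeholder filename; rename the downloaded ._json file back to .json.
Path("valencia_portcalls._json").rename("valencia_portcalls.json")
```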
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset containing features extracted from 211 DNS tunneling packet captures. The packet capture samples are classified by the protocols tunneled within the DNS tunnel. The features are stored in JSON files, one per packet capture. The features in each file include the IP Packet Length, the DNS Query Name Length, and the DNS Query Name entropy. In this "slightly unclean" version of the feature set, the DNS Query Name field values are also present, although they are not actually necessary.
This feature set may be used to apply machine learning techniques to DNS tunneling traffic and discover new insights without having to reconstruct and analyze the equivalent full packet captures.
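A sketch of loading the per-capture feature files into a single table for ML; the directory name and the internal JSON structure are assumptions:

```python
import json
from pathlib import Path

import pandas as pd

rows = []
# Assumed layout: one JSON feature file per packet capture, each holding
# either a single record or a list of per-packet records.
for path in Path("dns_tunneling_features").glob("*.json"):
    with path.open(encoding="utf-8") as f:
        data = json.load(f)
    records = data if isinstance(data, list) else [data]
    for record in records:
        record["capture"] = path.stem  # keep track of the source capture
        rows.append(record)

df = pd.DataFrame(rows)
print(df.head())
```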