97 datasets found

My NASA Data
data.nasa.gov
s.cnmilf.com
+4more
Updated Mar 31, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
nasa.gov (2025). My NASA Data [Dataset]. https://data.nasa.gov/dataset/my-nasa-data
Explore at:
Dataset updated
Mar 31, 2025
Dataset provided by
NASAhttp://nasa.gov/
Description
MY NASA DATA (MND) is a tool that allows anyone to make use of satellite data that was previously unavailable.Through the use of MND’s Live Access Server (LAS) a multitude of charts, plots and graphs can be generated using a wide variety of constraints. This site provides a large number of lesson plans with a wide variety of topics, all with the students in mind. Not only can you use our lesson plans, you can use the LAS to improve the ones that you are currently implementing in your classroom.
COVID-19 Diagnostic Laboratory Testing (PCR Testing) Time Series
healthdata.gov
data.virginia.gov
+2more
Updated Dec 14, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Department of Health & Human Services (2020). COVID-19 Diagnostic Laboratory Testing (PCR Testing) Time Series [Dataset]. https://healthdata.gov/dataset/COVID-19-Diagnostic-Laboratory-Testing-PCR-Testing/j8mb-icvb
Explore at:
application/rdfxml, tsv, csv, xml, application/rssxml, kmz, application/geo+json, kmlAvailable download formats
Dataset updated
Dec 14, 2020
Dataset provided by
United States Department of Health and Human Serviceshttp://www.hhs.gov/
Authors
U.S. Department of Health & Human Services
License
Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
License information was derived automatically
Description
After May 3, 2024, this dataset and webpage will no longer be updated because hospitals are no longer required to report data on COVID-19 hospital admissions, and hospital capacity and occupancy data, to HHS through CDC’s National Healthcare Safety Network. Data voluntarily reported to NHSN after May 1, 2024, will be available starting May 10, 2024, at COVID Data Tracker Hospitalizations.

This time series dataset includes viral COVID-19 laboratory test [Polymerase chain reaction (PCR)] results from over 1,000 U.S. laboratories and testing locations including commercial and reference laboratories, public health laboratories, hospital laboratories, and other testing locations. Data are reported to state and jurisdictional health departments in accordance with applicable state or local law and in accordance with the Coronavirus Aid, Relief, and Economic Security (CARES) Act (CARES Act Section 18115).

Data are provisional and subject to change.

Data presented here is representative of diagnostic specimens being tested - not individual people - and excludes serology tests where possible. Data presented might not represent the most current counts for the most recent 3 days due to the time it takes to report testing information. The data may also not include results from all potential testing sites within the jurisdiction (e.g., non-laboratory or point of care test sites) and therefore reflect the majority, but not all, of COVID-19 testing being conducted in the United States.

Sources: CDC COVID-19 Electronic Laboratory Reporting (CELR), Commercial Laboratories, State Public Health Labs, In-House Hospital Labs

Data for each state is sourced from either data submitted directly by the state health department via COVID-19 electronic laboratory reporting (CELR), or a combination of commercial labs, public health labs, and in-house hospital labs. Data is taken from CELR for states that either submit line level data or submit aggregate counts which do not include serology tests.
d
Indicators of Coastal Water Quality: Ancillary Data
catalog.data.gov
s.cnmilf.com
+3more
Updated Apr 24, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
SEDAC (2025). Indicators of Coastal Water Quality: Ancillary Data [Dataset]. https://catalog.data.gov/dataset/indicators-of-coastal-water-quality-ancillary-data
Explore at:
Dataset updated
Apr 24, 2025
Dataset provided by
SEDAC
Description
The Ancillary Data component of the Indicators of Coastal Water Quality Collection includes a 5 arc-minute (approximately 9 x 9 km at the equator) sequence grid, grid cell centroids that relate to the grid cells in the tabular "Indicators of Coastal Water Quality: Change in Chlorophyll-a Concentration 1998-2007" data set, and a country buffer data set that is divided by exclusive economic zones (EEZ). The data are produced by the Columbia University Center for International Earth Science Information Network (CIESIN).
R
Head Data Set 2 Dataset
universe.roboflow.com
zip
Updated Oct 1, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Innovateitt (2024). Head Data Set 2 Dataset [Dataset]. https://universe.roboflow.com/innovateitt/head-data-set-2
Explore at:
zipAvailable download formats
Dataset updated
Oct 1, 2024
Dataset authored and provided by
Innovateitt
Variables measured
Heads QiDz Bounding Boxes
Description
Head Data Set 2

## Overview Head Data Set 2 is a dataset for object detection tasks - it contains Heads QiDz annotations for 2,342 images. ## Getting Started You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
f
Orange dataset table
figshare.com
xlsx
Updated Mar 4, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rui Simões (2022). Orange dataset table [Dataset]. http://doi.org/10.6084/m9.figshare.19146410.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.19146410.v1
Dataset updated
Mar 4, 2022
Dataset provided by
figshare
Authors
Rui Simões
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The complete dataset used in the analysis comprises 36 samples, each described by 11 numeric features and 1 target. The attributes considered were caspase 3/7 activity, Mitotracker red CMXRos area and intensity (3 h and 24 h incubations with both compounds), Mitosox oxidation (3 h incubation with the referred compounds) and oxidation rate, DCFDA fluorescence (3 h and 24 h incubations with either compound) and oxidation rate, and DQ BSA hydrolysis. The target of each instance corresponds to one of the 9 possible classes (4 samples per class): Control, 6.25, 12.5, 25 and 50 µM for 6-OHDA and 0.03, 0.06, 0.125 and 0.25 µM for rotenone. The dataset is balanced, it does not contain any missing values and data was standardized across features. The small number of samples prevented a full and strong statistical analysis of the results. Nevertheless, it allowed the identification of relevant hidden patterns and trends.

Exploratory data analysis, information gain, hierarchical clustering, and supervised predictive modeling were performed using Orange Data Mining version 3.25.1 [41]. Hierarchical clustering was performed using the Euclidean distance metric and weighted linkage. Cluster maps were plotted to relate the features with higher mutual information (in rows) with instances (in columns), with the color of each cell representing the normalized level of a particular feature in a specific instance. The information is grouped both in rows and in columns by a two-way hierarchical clustering method using the Euclidean distances and average linkage. Stratified cross-validation was used to train the supervised decision tree. A set of preliminary empirical experiments were performed to choose the best parameters for each algorithm, and we verified that, within moderate variations, there were no significant changes in the outcome. The following settings were adopted for the decision tree algorithm: minimum number of samples in leaves: 2; minimum number of samples required to split an internal node: 5; stop splitting when majority reaches: 95%; criterion: gain ratio. The performance of the supervised model was assessed using accuracy, precision, recall, F-measure and area under the ROC curve (AUC) metrics.
pii-comp
kaggle.com
zip
Updated Apr 18, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Devin Anzelmo (2024). pii-comp [Dataset]. https://www.kaggle.com/datasets/devinanzelmo/pii-comp
Explore at:
zip(0 bytes)Available download formats
Dataset updated
Apr 18, 2024
Authors
Devin Anzelmo
Description
Models and external data of 3rd place efficiency solution for https://www.kaggle.com/competitions/pii-detection-removal-from-educational-data competition.

See https://www.kaggle.com/code/devinanzelmo/piidd-efficiency-3rd-process-external-data for links to external data and processing code

See https://www.kaggle.com/code/devinanzelmo/piidd-efficiency-3rd-train for training code that generated models.

See https://www.kaggle.com/code/devinanzelmo/piidd-efficiency-3rd-inference for inference code
m
Data from: Predicting Long-term Dynamics of Soil Salinity and Sodicity on a...
data.mendeley.com
Updated Nov 26, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Amirhossein Hassani (2020). Predicting Long-term Dynamics of Soil Salinity and Sodicity on a Global Scale [Dataset]. http://doi.org/10.17632/v9mgbmtnf2.1
Explore at:
Unique identifier
https://doi.org/10.17632/v9mgbmtnf2.1
Dataset updated
Nov 26, 2020
Authors
Amirhossein Hassani
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset globally (excluding frigid/polar zones) quantifies the different facets of variability in surface soil (0 – 30 cm) salinity and sodicity for the period between 1980 and 2018. This is realised by developing 4-D predictive models of Electrical Conductivity of saturated soil Extract (ECe) and soil Exchangeable Sodium Percentage (ESP) as indicators of soil salinity and sodicity. These machine learning-based models make predictions for ECe and ESP at different times, locations, and depths and by extracting meaningful statistics form those predictions, different facets of variability in the surface soil salinity and sodicity are quantified. The dataset includes 10 maps documenting different aspects of soil salinity and sodicity variations, and auxiliary data required for generation of those maps. Users are referred to the corresponding "READ_ME" file for more information about this dataset.
d
United States Census Bureau Open Data
catalog.data.gov
hub.arcgis.com
Updated Apr 19, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Sioux Falls GIS (2025). United States Census Bureau Open Data [Dataset]. https://catalog.data.gov/dataset/united-states-census-bureau-open-data-3513f
Explore at:
Dataset updated
Apr 19, 2025
Dataset provided by
City of Sioux Falls GIS
Area covered
United States
Description
Link to the Open Data site for the United States Census Bureau.
o
PG&E: Energy Usage Data
openenergyhub.ornl.gov
Updated Jun 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). PG&E: Energy Usage Data [Dataset]. https://openenergyhub.ornl.gov/explore/dataset/pg-and-e-energy-usage-data/
Explore at:
Dataset updated
Jun 10, 2025
Description
Note: Find data at source; data is continuously updated・ PG&E provides non-confidential, aggregated usage data that are available to the public and updated on a quarterly basis. These public datasets consist of monthly consumption aggregated by ZIP code and by customer segment: Residential, Commercial, Industrial and Agricultural. The public datasets must meet the standards for aggregating and anonymizing customer data pursuant to CPUC Decision 14-05-016, as follows: a minimum of 100 Residential customers; a minimum of 15 Non-Residential customers, with no single Non-Residential customer in each sector accounting for more than 15% of the total consumption. If the aggregation standard is not met, the consumption will be combined with a neighboring ZIP code until the aggregation requirements are met.
Flight Data For Tail 655
data.nasa.gov
data.staging.idas-ds1.appdat.jsc.nasa.gov
+1more
Updated Mar 31, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
nasa.gov (2025). Flight Data For Tail 655 [Dataset]. https://data.nasa.gov/dataset/flight-data-for-tail-655
Explore at:
Dataset updated
Mar 31, 2025
Dataset provided by
NASAhttp://nasa.gov/
Description
The following zip files contain individual flight recorded data in Matlab file format. There are 186 parameters each with a data structure that contains the following: -sensor recordings -sampling rate -units -parameter description -parameter ID
t
Data Policy and Governance Guide
data-academy.tempe.gov
data.tempe.gov
+6more
Updated Jun 28, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Tempe (2023). Data Policy and Governance Guide [Dataset]. https://data-academy.tempe.gov/documents/34b98451d56043b186dbb6c0a69eb49b
Explore at:
Dataset updated
Jun 28, 2023
Dataset authored and provided by
City of Tempe
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Use this guide to find information on Tempe data policy and standards.Open Data PolicyEthical Artificial Intelligence (AI) PolicyEvaluation PolicyExpedited Data Sharing PolicyData Sharing Agreement (General)Data Sharing Agreement (GIS)Data Quality Standard and ChecklistDisaggregated Data StandardsData and Analytics Service Standard
Soil Data Grevena
kaggle.com
data.mendeley.com
Updated Sep 4, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jocelyn Dumlao (2023). Soil Data Grevena [Dataset]. https://www.kaggle.com/datasets/jocelyndumlao/soil-data-grevena
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 4, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Jocelyn Dumlao
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Area covered
Grevena
Description
Description

In this dataset, there are soil data analyses with properties such as pH, organic matter (OM), salinity (EC), etc., major elements (N, P, K, Mg) as well as some microelements (Fe, Zn, Mn, Cu, B) with significant impact on plant nutrition.

Categories

Agricultural Soil

Acknowledgements & Source

Panagiotis Tziachris

Data Source

View Details

Image Source
AWS-02-I CTD Data (MATLAB Format) [Flagg, C.]
data.ucar.edu
dataone.org
+1more
matlab
Updated Dec 26, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Charles N. Flagg; James H. Swift; Louis Codispoti (2024). AWS-02-I CTD Data (MATLAB Format) [Flagg, C.] [Dataset]. http://doi.org/10.5065/D6CF9N6S
Explore at:
matlabAvailable download formats
Unique identifier
https://doi.org/10.5065/D6CF9N6S
Dataset updated
Dec 26, 2024
Dataset provided by
University Corporation for Atmospheric Research
Authors
Charles N. Flagg; James H. Swift; Louis Codispoti
Time period covered
Jul 15, 2002 - Aug 13, 2002
Area covered

Description
This data set consists of Conductivity, Temperature, Depth (CTD) data in MATLAB Format from the 2002 Polar Star Mooring Cruise (AWS-02-I). These data are provided in a single mat-file (MATLAB) for the entire cruise.
C
Raw Data for ConfLab: A Data Collection Concept, Dataset, and Benchmark for...
data.4tu.nl
Updated Jun 7, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Chirag Raman; Jose Vargas Quiros; Stephanie Tan; Ashraful Islam; Ekin Gedik; Hayley Hung (2022). Raw Data for ConfLab: A Data Collection Concept, Dataset, and Benchmark for Machine Analysis of Free-Standing Social Interactions in the Wild [Dataset]. http://doi.org/10.4121/20017748.v2
Explore at:
Unique identifier
https://doi.org/10.4121/20017748.v2
Dataset updated
Jun 7, 2022
Dataset provided by
4TU.ResearchData
Authors
Chirag Raman; Jose Vargas Quiros; Stephanie Tan; Ashraful Islam; Ekin Gedik; Hayley Hung
License
https://data.4tu.nl/info/fileadmin/user_upload/Documenten/4TU.ResearchData_Restricted_Data_2022.pdfhttps://data.4tu.nl/info/fileadmin/user_upload/Documenten/4TU.ResearchData_Restricted_Data_2022.pdf
Description
This file contains raw data for cameras and wearables of the ConfLab dataset.

./cameras

contains the overhead video recordings for 9 cameras (cam2-10) in MP4 files.

These cameras cover the whole interaction floor, with camera 2 capturing the

bottom of the scene layout, and camera 10 capturing top of the scene layout.

Note that cam5 ran out of battery before the other cameras and thus the recordings

are cut short. However, cam4 and 6 contain significant overlap with cam 5, to

reconstruct any information needed.

Note that the annotations are made and provided in 2 minute segments.

The annotated portions of the video include the last 3min38sec of x2xxx.MP4

video files, and the first 12 min of x3xxx.MP4 files for cameras (2,4,6,8,10),

with "x" being the placeholder character in the mp4 file names. If one wishes

to separate the video into 2 min segments as we did, the "video-splitting.sh"

script is provided.

./camera-calibration contains the camera instrinsic files obtained from

https://github.com/idiap/multicamera-calibration. Camera extrinsic parameters can

be calculated using the existing intrinsic parameters and the instructions in the

multicamera-calibration repo. The coordinates in the image are provided by the

crosses marked on the floor, which are visible in the video recordings.

The crosses are 1m apart (=100cm).

./wearables

subdirectory includes the IMU, proximity and audio data from each

participant at the Conflab event (48 in total). In the directory numbered

by participant ID, the following data are included:

1. raw audio file

2. proximity (bluetooth) pings (RSSI) file (raw and csv) and a visualization

3. Tri-axial accelerometer data (raw and csv) and a visualization

4. Tri-axial gyroscope data (raw and csv) and a visualization

5. Tri-axial magnetometer data (raw and csv) and a visualization

6. Game rotation vector (raw and csv), recorded in quaternions.

All files are timestamped.

The sampling frequencies are:

- audio: 1250 Hz

- rest: around 50Hz. However, the sample rate is not fixed

and instead the timestamps should be used.

For rotation, the game rotation vector's output frequency is limited by the

actual sampling frequency of the magnetometer. For more information, please refer to

https://invensense.tdk.com/wp-content/uploads/2016/06/DS-000189-ICM-20948-v1.3.pdf

Audio files in this folder are in raw binary form. The following can be used to convert

them to WAV files (1250Hz):

ffmpeg -f s16le -ar 1250 -ac 1 -i /path/to/audio/file

Synchronization of cameras and werables data

Raw videos contain timecode information which matches the timestamps of the data in

the "wearables" folder. The starting timecode of a video can be read as:

ffprobe -hide_banner -show_streams -i /path/to/video

./audio

./sync: contains wav files per each subject

./sync_files: auxiliary csv files used to sync the audio. Can be used to improve the synchronization.

The code used for syncing the audio can be found here:

https://github.com/TUDelft-SPC-Lab/conflab/tree/master/preprocessing/audio
d
SA Health Mental Health data
data.gov.au
xlsx
Updated Jun 8, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
SA Health (2018). SA Health Mental Health data [Dataset]. https://data.gov.au/dataset/ds-sa-a9372b1f-e5c0-4d03-9714-0dc707343f2f
Explore at:
xlsxAvailable download formats
Dataset updated
Jun 8, 2018
Dataset provided by
SA Healthhttps://www.sahealth.sa.gov.au/wps/wcm/connect/public+content/sa+health+internet/about+us/department+for+health+and+wellbeing/department+for+health+and+wellbeing
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Time series state level datasets showing important indicators regarding mental health. Includes data on mental health readmissions within 28 days, Mental health Community Care with within seven days …Show full descriptionTime series state level datasets showing important indicators regarding mental health. Includes data on mental health readmissions within 28 days, Mental health Community Care with within seven days of discharge and mental health average length of stay (days).
California Streams
data.ca.gov
data.cnra.ca.gov
+5more
Updated Sep 13, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
California Department of Fish and Wildlife (2023). California Streams [Dataset]. https://data.ca.gov/dataset/california-streams
Explore at:
arcgis geoservices rest api, kml, geojson, csv, zip, htmlAvailable download formats
Dataset updated
Sep 13, 2023
Dataset authored and provided by
California Department of Fish and Wildlifehttps://wildlife.ca.gov/
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
California
Description
Notes: As of June 2020 this dataset has been static for several years. Recent versions of NHD High Res may be more detailed than this dataset for some areas, while this dataset may still be more detailed than NHD High Res in other areas. This dataset is considered authoritative as used by CDFW for particular tracking purposes but may not be current or comprehensive for all streams in the state.
National Hydrography Dataset (NHD) high resolution NHDFlowline features for California were originally dissolved on common GNIS_ID or StreamLevel* attributes and routed from mouth to headwater in meters. The results are measured polyline features representing entire streams. Routes on these streams are measured upstream, i.e., the measure at the mouth of a stream is zero and at the upstream end the measure matches the total length of the stream feature. Using GIS tools, a user of this dataset can retrieve the distance in meters upstream from the mouth at any point along a stream feature.** CA_Streams_v3 Update Notes: This version includes over 200 stream modifications and additions resulting from requests for updating from CDFW staff and others***. New locator fields from the USGS Watershed Boundary Dataset (WBD) have been added for v3 to enhance user's ability to search for or extract subsets of California Streams by hydrologic area. *See the Source Citation section of this metadata for further information on NHD, WBD, NHDFlowline, GNIS_ID and StreamLevel. **See the Data Quality section of this metadata for further explanation of stream feature development. ***Some current NHD data has not yet been included in CA_Streams. The effort to synchronize CA_Streams with NHD is ongoing.
m
Data from: CNC Machining Data Repository - Geometry, NC Code &...
data.mendeley.com
Updated Apr 17, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Markus Brillinger (2025). CNC Machining Data Repository - Geometry, NC Code & High-Frequency Energy Consumption Data for Aluminum and Plastic Machining [Dataset]. http://doi.org/10.17632/gtvvwmz7r7.2
Explore at:
Unique identifier
https://doi.org/10.17632/gtvvwmz7r7.2
Dataset updated
Apr 17, 2025
Authors
Markus Brillinger
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
In the field of manufacturing, high-quality datasets are essential for optimizing production processes, improving energy efficiency, and developing predictive maintenance strategies. This repository introduces a comprehensive CNC machining data repository that includes three key data categories: (1) product geometry data, (2) NC code data, and (3) high frequency energy consumption data. This dataset is particularly valuable for researchers and engineers working in manufacturing analytics, energy-efficient machining, and machine learning applications in smart manufacturing. Potential use cases include optimizing machining parameters for energy reduction, predicting tool wear based on power consumption patterns, and enhancing digital twin models with real-world machining data. By making this dataset publicly available, we aim to support the development of data-driven solutions in modern manufacturing and facilitate benchmarking efforts across different machining strategies.
d
Data release of hydrogeologic data Hat Creek basin, Shasta County,...
catalog.data.gov
data.usgs.gov
+2more
Updated Sep 7, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2024). Data release of hydrogeologic data Hat Creek basin, Shasta County, California [Dataset]. https://catalog.data.gov/dataset/data-release-of-hydrogeologic-data-hat-creek-basin-shasta-county-california
Explore at:
Dataset updated
Sep 7, 2024
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Area covered
Shasta County, Hat Creek, California
Description
This data release contains California Department of Water Resource borehole data that were regularized by the US Geological Survey. This dataset contains borehole lithologic data, and geospatial data of water wells in the Hat Creek basin California, located east of Mount Shasta in southern California. The borehole dataset is released as an excel table and includes (1) individual borehole location, and (2) downhole lithologic interval data derived from well drillers’ lithology logs.
C
Preferred Language Spoken in California Facilities
data.chhs.ca.gov
healthdata.gov
+2more
csv, xlsx, zip
Updated Aug 29, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Health Care Access and Information (2024). Preferred Language Spoken in California Facilities [Dataset]. https://data.chhs.ca.gov/dataset/preferred-language-spoken-in-california-facilities
Explore at:
csv(902301), xlsx(10550), xlsx(9475), zipAvailable download formats
Dataset updated
Aug 29, 2024
Dataset authored and provided by
Department of Health Care Access and Information
Area covered
California
Description
The dataset contains combined counts for hospital discharges, emergency room encounters, and ambulatory surgeries by preferred language spoken at each facility. The nearly 100 languages collected in the patient-level data were combined into eight geographical or cultural groups: English Language, Spanish Language, Asian/Pacific Islander Languages, Middle Eastern Languages, European Languages, African Languages, Latin American Languages, Native American Languages, and Sign Language. See the Preferred Language Spoken Language List below to see the exact separation of languages.
h
deepseek-r1-gen-data
huggingface.co
Updated Feb 21, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
RUIJI YU (2025). deepseek-r1-gen-data [Dataset]. https://huggingface.co/datasets/Aristo2333/deepseek-r1-gen-data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 21, 2025
Authors
RUIJI YU
Description
Dataset Card for deepseek-r1-gen-data

This dataset has been created with distilabel.

Dataset Summary

This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://hf-mirror.com/datasets/Aristo2333/deepseek-r1-gen-data/raw/main/pipeline.yaml"

or explore the configuration: distilabel pipeline info --config… See the full description on the dataset page: https://huggingface.co/datasets/Aristo2333/deepseek-r1-gen-data.

Facebook

Twitter

Click to copy link

Link copied

Cite

nasa.gov (2025). My NASA Data [Dataset]. https://data.nasa.gov/dataset/my-nasa-data

My NASA Data

Explore at:

Dataset updated

Mar 31, 2025

Dataset provided by

NASAhttp://nasa.gov/

Description

MY NASA DATA (MND) is a tool that allows anyone to make use of satellite data that was previously unavailable.Through the use of MND’s Live Access Server (LAS) a multitude of charts, plots and graphs can be generated using a wide variety of constraints. This site provides a large number of lesson plans with a wide variety of topics, all with the students in mind. Not only can you use our lesson plans, you can use the LAS to improve the ones that you are currently implementing in your classroom.

Clear search

Close search

Google apps

Main menu

My NASA Data

COVID-19 Diagnostic Laboratory Testing (PCR Testing) Time Series

Indicators of Coastal Water Quality: Ancillary Data

Head Data Set 2 Dataset

Head Data Set 2

Orange dataset table

pii-comp

Data from: Predicting Long-term Dynamics of Soil Salinity and Sodicity on a...

United States Census Bureau Open Data

PG&E: Energy Usage Data

Flight Data For Tail 655

Data Policy and Governance Guide

Soil Data Grevena

Description

Categories

Acknowledgements & Source

AWS-02-I CTD Data (MATLAB Format) [Flagg, C.]

Raw Data for ConfLab: A Data Collection Concept, Dataset, and Benchmark for...

SA Health Mental Health data

California Streams

Data from: CNC Machining Data Repository - Geometry, NC Code &...

Data release of hydrogeologic data Hat Creek basin, Shasta County,...

Preferred Language Spoken in California Facilities

deepseek-r1-gen-data

My NASA Data