https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
In this dataset, there are soil data analyses with properties such as pH, organic matter (OM), salinity (EC), etc., major elements (N, P, K, Mg) as well as some microelements (Fe, Zn, Mn, Cu, B) with significant impact on plant nutrition.
Agricultural Soil
Panagiotis Tziachris
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The complete dataset used in the analysis comprises 36 samples, each described by 11 numeric features and 1 target. The attributes considered were caspase 3/7 activity, Mitotracker red CMXRos area and intensity (3 h and 24 h incubations with both compounds), Mitosox oxidation (3 h incubation with the referred compounds) and oxidation rate, DCFDA fluorescence (3 h and 24 h incubations with either compound) and oxidation rate, and DQ BSA hydrolysis. The target of each instance corresponds to one of the 9 possible classes (4 samples per class): Control, 6.25, 12.5, 25 and 50 µM for 6-OHDA and 0.03, 0.06, 0.125 and 0.25 µM for rotenone. The dataset is balanced, it does not contain any missing values and data was standardized across features. The small number of samples prevented a full and strong statistical analysis of the results. Nevertheless, it allowed the identification of relevant hidden patterns and trends.
Exploratory data analysis, information gain, hierarchical clustering, and supervised predictive modeling were performed using Orange Data Mining version 3.25.1 [41]. Hierarchical clustering was performed using the Euclidean distance metric and weighted linkage. Cluster maps were plotted to relate the features with higher mutual information (in rows) with instances (in columns), with the color of each cell representing the normalized level of a particular feature in a specific instance. The information is grouped both in rows and in columns by a two-way hierarchical clustering method using the Euclidean distances and average linkage. Stratified cross-validation was used to train the supervised decision tree. A set of preliminary empirical experiments were performed to choose the best parameters for each algorithm, and we verified that, within moderate variations, there were no significant changes in the outcome. The following settings were adopted for the decision tree algorithm: minimum number of samples in leaves: 2; minimum number of samples required to split an internal node: 5; stop splitting when majority reaches: 95%; criterion: gain ratio. The performance of the supervised model was assessed using accuracy, precision, recall, F-measure and area under the ROC curve (AUC) metrics.
This data set shows 311 service requests in the City of Pittsburgh. This data is collected from the request intake software used by the 311 Response Center in the Department of Innovation & Performance. Requests are collected from phone calls, tweets, emails, a form on the City website, and through the 311 mobile application. For more information, see the 311 Data User Guide. If you are unable to download the 311 Data table due to a 504 Gateway Timeout error, use this link instead: https://tools.wprdc.org/downstream/76fda9d0-69be-4dd5-8108-0de7907fc5a4 NOTE: The data feed for this dataset is broken as of December 21st, 2022. We're working on restoring it.
MY NASA DATA (MND) is a tool that allows anyone to make use of satellite data that was previously unavailable.Through the use of MND’s Live Access Server (LAS) a multitude of charts, plots and graphs can be generated using a wide variety of constraints. This site provides a large number of lesson plans with a wide variety of topics, all with the students in mind. Not only can you use our lesson plans, you can use the LAS to improve the ones that you are currently implementing in your classroom.
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
FluWatch is Canada's national surveillance system that monitors the spread of flu and flu-like illnesses on an on-going basis. Activity Level surveillance is a component of FluWatch that provides an overall assessment of the intensity and geographical spread of laboratory-confirmed influenza cases, influenza-like-illness (ILI) and reported outbreaks for a given surveillance region. Activity Levels are assigned and reported by Provincial and Territorial Ministries of Health. A surveillance region can be classified under one of the four following categories: no activity, sporadic, localized or widespread. For a description of the categories, see the data dictionary resource. For more information on flu activity in Canada, see the FluWatch report. (https://www.canada.ca/en/public-health/services/diseases/flu-influenza/influenza-surveillance/weekly-influenza-reports.html) Note: The reported activity levels are a reflection of the surveillance data available to FluWatch at the time of production. Delays in reporting of data may cause data to change retrospectively.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset globally (excluding frigid/polar zones) quantifies the different facets of variability in surface soil (0 – 30 cm) salinity and sodicity for the period between 1980 and 2018. This is realised by developing 4-D predictive models of Electrical Conductivity of saturated soil Extract (ECe) and soil Exchangeable Sodium Percentage (ESP) as indicators of soil salinity and sodicity. These machine learning-based models make predictions for ECe and ESP at different times, locations, and depths and by extracting meaningful statistics form those predictions, different facets of variability in the surface soil salinity and sodicity are quantified. The dataset includes 10 maps documenting different aspects of soil salinity and sodicity variations, and auxiliary data required for generation of those maps. Users are referred to the corresponding "READ_ME" file for more information about this dataset.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Notes: As of June 2020 this dataset has been static for several years. Recent versions of NHD High Res may be more detailed than this dataset for some areas, while this dataset may still be more detailed than NHD High Res in other areas. This dataset is considered authoritative as used by CDFW for particular tracking purposes but may not be current or comprehensive for all streams in the state.
National Hydrography Dataset (NHD) high resolution NHDFlowline features for California were originally dissolved on common GNIS_ID or StreamLevel* attributes and routed from mouth to headwater in meters. The results are measured polyline features representing entire streams. Routes on these streams are measured upstream, i.e., the measure at the mouth of a stream is zero and at the upstream end the measure matches the total length of the stream feature. Using GIS tools, a user of this dataset can retrieve the distance in meters upstream from the mouth at any point along a stream feature.** CA_Streams_v3 Update Notes: This version includes over 200 stream modifications and additions resulting from requests for updating from CDFW staff and others***. New locator fields from the USGS Watershed Boundary Dataset (WBD) have been added for v3 to enhance user's ability to search for or extract subsets of California Streams by hydrologic area. *See the Source Citation section of this metadata for further information on NHD, WBD, NHDFlowline, GNIS_ID and StreamLevel. **See the Data Quality section of this metadata for further explanation of stream feature development. ***Some current NHD data has not yet been included in CA_Streams. The effort to synchronize CA_Streams with NHD is ongoing.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Blackberry Data is a dataset for object detection tasks - it contains Blackberry Ripeness annotations for 781 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Journey9ni/VLM-3R-DATA dataset hosted on Hugging Face and contributed by the HF Datasets community
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
The operating expenses by North American Industry Classification System (NAICS) which include all members under industry expenditures, for software publishers (NAICS 51121), annual (percent), for five years of data.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Use this guide to find information on Tempe data policy and standards.Open Data PolicyEthical Artificial Intelligence (AI) PolicyEvaluation PolicyExpedited Data Sharing PolicyData Sharing Agreement (General)Data Sharing Agreement (GIS)Data Quality Standard and ChecklistDisaggregated Data StandardsData and Analytics Service Standard
This data release contains California Department of Water Resource borehole data that were regularized by the US Geological Survey. This dataset contains borehole lithologic data, and geospatial data of water wells in the Hat Creek basin California, located east of Mount Shasta in southern California. The borehole dataset is released as an excel table and includes (1) individual borehole location, and (2) downhole lithologic interval data derived from well drillers’ lithology logs.
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Provides information on the plans and expected performance of appropriated departments over a three year period. Departmental Plans are tabled in Parliament each spring
Wholesale trade sales, available for all members under North American Industry Classification System (NAICS) and adjustments dimensions, available monthly, for Canada, provinces and territories, in dollars X 1,000.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
600noisy Data is a dataset for object detection tasks - it contains Tumor annotations for 600 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
The Ancillary Data component of the Indicators of Coastal Water Quality Collection includes a 5 arc-minute (approximately 9 x 9 km at the equator) sequence grid, grid cell centroids that relate to the grid cells in the tabular "Indicators of Coastal Water Quality: Change in Chlorophyll-a Concentration 1998-2007" data set, and a country buffer data set that is divided by exclusive economic zones (EEZ). The data are produced by the Columbia University Center for International Earth Science Information Network (CIESIN).
Note: Find data at source; data is continuously updated・ PG&E provides non-confidential, aggregated usage data that are available to the public and updated on a quarterly basis. These public datasets consist of monthly consumption aggregated by ZIP code and by customer segment: Residential, Commercial, Industrial and Agricultural. The public datasets must meet the standards for aggregating and anonymizing customer data pursuant to CPUC Decision 14-05-016, as follows: a minimum of 100 Residential customers; a minimum of 15 Non-Residential customers, with no single Non-Residential customer in each sector accounting for more than 15% of the total consumption. If the aggregation standard is not met, the consumption will be combined with a neighboring ZIP code until the aggregation requirements are met.
Overview This dataset was produced from the raw sodar .mnd files from the Lubbock, TX site during the WFIP1 campaign. Quality control and formatting have been applied to transform the numerous raw files into a single file to provide user friendliness and improved wind resource characterization at this location. Data Details Location: 33.60279, -102.02821 Elevation: 1017 m Output heights: Every 5 meters from 10 meters to 300 meters Data Quality Data from the raw files were filtered according to the following automated and manual procedures. Missing and rejected values were flagged as -999. High precipitation events as suggested by the vertical velocity values were subjected to quality control. If any vertical velocity value at any height for a given timestamp fell below a -1.5 m/s threshold, all variables at all heights at that timestamp were rejected. On a height-by-height basis, if the signal-to- noise ratio (SNR) for any of the u, v, or w wind components reached 9 or below, all variables for that height and timestamp were rejected. The raw files were also screened for nonphysical values such as wind speeds less than zero and directions outside 0-360 degrees. Finally, the data were visually examined for events of atypical sodar retrievals, such as excessive magnitudes in oscillations or periods of stagnancy.
Current profile data from Downward-looking Teledyne RDI 38 kHz Ocean Observer ADCP, ADCP #1 on Atwood Condor, 27.85N, 90.53W, 2015/05/25 through 2015/08/08
Bolt provides system data owned by Bolt conforming to the General Bikeshare Feed Specification (GBFS) - available on request. In case of interest, please contact Bolt to discuss if and how access can be granted, and whether or not a fee will apply.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
In this dataset, there are soil data analyses with properties such as pH, organic matter (OM), salinity (EC), etc., major elements (N, P, K, Mg) as well as some microelements (Fe, Zn, Mn, Cu, B) with significant impact on plant nutrition.
Agricultural Soil
Panagiotis Tziachris