100+ datasets found
  1. Data from: Data to create and evaluate distribution models for invasive...

    • catalog.data.gov
    • data.usgs.gov
    • +1 more
    Updated Oct 1, 2025
    Cite
    U.S. Geological Survey (2025). Data to create and evaluate distribution models for invasive species for different geographic extents [Dataset]. https://catalog.data.gov/dataset/data-to-create-and-evaluate-distribution-models-for-invasive-species-for-different-geograp
    Explore at:
    Dataset updated
    Oct 1, 2025
    Dataset provided by
    United States Geological Survey (http://www.usgs.gov/)
    Description

    We developed habitat suitability models for invasive plant species selected by Department of Interior land management agencies. We applied the modeling workflow developed in Young et al. 2020 to species not included in the original case studies. Our methodology balanced trade-offs between developing highly customized models for a few species versus fitting non-specific and generic models for numerous species. We developed a national library of environmental variables known to physiologically limit plant distributions (Engelstad et al. 2022 Table S1: https://doi.org/10.1371/journal.pone.0263056) and relied on human input based on natural history knowledge to further narrow the variable set for each species before developing habitat suitability models. We developed models using five algorithms with VisTrails: Software for Assisted Habitat Modeling [SAHM 2.1.2]. We accounted for uncertainty related to sampling bias by using two alternative sources of background samples, and constructed model ensembles using the 10 models for each species (five algorithms by two background methods) for three different thresholds (conservative to targeted). The mergedDataset_regionalization.csv file contains predictor values associated with pixels underlying each presence and background point. The testStripPoints_regionalization.csv file contains the locations of the modeled species occurring in the different geographic test strips.
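    As a minimal sketch (not part of the release), the two files named above can be loaded for inspection with pandas; the file names come from the description, but no column layout is assumed:

```python
# Hedged sketch: inspect the two CSVs described above. Assumes pandas and
# local copies of the files; their column layouts are not specified in the
# description, so nothing beyond the file names is assumed.
import pandas as pd

# Predictor values for pixels under each presence and background point.
merged = pd.read_csv("mergedDataset_regionalization.csv")
# Locations of the modeled species in the geographic test strips.
strips = pd.read_csv("testStripPoints_regionalization.csv")

print(merged.shape, list(merged.columns))
print(strips.head())
```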

  2. Tutorial: How to use Google Data Studio and ArcGIS Online to create an...

    • hydroshare.org
    • dataone.org
    • +1more
    zip
    Updated Jul 31, 2020
    Cite
    Sarah Beganskas (2020). Tutorial: How to use Google Data Studio and ArcGIS Online to create an interactive data portal [Dataset]. http://doi.org/10.4211/hs.9edae0ef99224e0b85303c6d45797d56
    Explore at:
    zip (2.9 MB)
    Dataset updated
    Jul 31, 2020
    Dataset provided by
    HydroShare
    Authors
    Sarah Beganskas
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This tutorial will teach you how to take time-series data from many field sites and create a shareable online map, where clicking on a field location brings you to a page with interactive graph(s).

    The tutorial can be completed with a sample dataset (provided via a Google Drive link within the document) or with your own time-series data from multiple field sites.

    Part 1 covers how to make interactive graphs in Google Data Studio and Part 2 covers how to link data pages to an interactive map with ArcGIS Online. The tutorial will take 1-2 hours to complete.

    An example interactive map and data portal can be found at: https://temple.maps.arcgis.com/apps/View/index.html?appid=a259e4ec88c94ddfbf3528dc8a5d77e8

  3. create-health-data

    • huggingface.co
    Updated May 3, 2024
    Cite
    HealthChat (2024). create-health-data [Dataset]. https://huggingface.co/datasets/huhucheck/create-health-data
    Explore at:
    Dataset updated
    May 3, 2024
    Dataset authored and provided by
    HealthChat
    Description

    huhucheck/create-health-data dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. SQL Practice File 1

    • kaggle.com
    zip
    Updated May 10, 2024
    Cite
    Sanjana Murthy (2024). SQL Practice File 1 [Dataset]. https://www.kaggle.com/datasets/sanjanamurthy392/sql-file-1-create-database-use-create-etc
    Explore at:
    zip (431 bytes)
    Dataset updated
    May 10, 2024
    Authors
    Sanjana Murthy
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    This data contains SQL practice statements: CREATE DATABASE, USE, CREATE TABLE (int, varchar, date), DESCRIBE, ALTER TABLE (ADD, MODIFY, CHAR, VARCHAR, AFTER, RENAME COLUMN, TO, DROP COLUMN, DROP), SHOW TABLES, RENAME TABLE (TO), and DROP TABLE; a sketch of these operations follows below.
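    A hedged sketch of the statements the practice file covers, run against SQLite from Python. The file itself targets MySQL-style SQL; CREATE DATABASE, USE, DESCRIBE, and SHOW TABLES have no direct SQLite equivalent, so the nearest substitutes appear in comments, and the table and column names are invented for illustration:

```python
# Illustrative only: table/column names are hypothetical, and MySQL-only
# statements (CREATE DATABASE, USE, DESCRIBE, SHOW TABLES) are replaced
# with their closest SQLite counterparts.
import sqlite3

conn = sqlite3.connect(":memory:")   # stands in for CREATE DATABASE + USE
cur = conn.cursor()

cur.execute("CREATE TABLE staff (id INT, name VARCHAR(50), hired DATE)")
cur.execute("ALTER TABLE staff ADD COLUMN dept VARCHAR(30)")
cur.execute("ALTER TABLE staff RENAME COLUMN dept TO department")  # SQLite >= 3.25
cur.execute("ALTER TABLE staff DROP COLUMN department")            # SQLite >= 3.35

print(cur.execute("PRAGMA table_info(staff)").fetchall())          # ~ DESCRIBE staff
print(cur.execute("SELECT name FROM sqlite_master WHERE type='table'").fetchall())  # ~ SHOW TABLES

cur.execute("ALTER TABLE staff RENAME TO employees")               # RENAME TABLE ... TO
cur.execute("DROP TABLE employees")
conn.close()
```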

  5. 02.1 Integrating Data in ArcGIS Pro

    • hub.arcgis.com
    Updated Feb 16, 2017
    Cite
    Iowa Department of Transportation (2017). 02.1 Integrating Data in ArcGIS Pro [Dataset]. https://hub.arcgis.com/documents/cd5acdcc91324ea383262de3ecec17d0
    Explore at:
    Dataset updated
    Feb 16, 2017
    Dataset authored and provided by
    Iowa Department of Transportation (https://iowadot.gov/)
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    You have been assigned a new project, which you have researched, and you have identified the data that you need. The next step is to gather, organize, and potentially create the data that you need for your project analysis. In this course, you will learn how to gather and organize data using ArcGIS Pro. You will also create a file geodatabase where you will store the data that you import and create. After completing this course, you will be able to perform the following tasks:
    • Create a geodatabase in ArcGIS Pro.
    • Create feature classes in ArcGIS Pro by exporting and importing data.
    • Create a new, empty feature class in ArcGIS Pro.

  6. Data from: How to create dataset within AfricaRice Dataverse

    • dataverse.harvard.edu
    Updated Feb 20, 2019
    Cite
    Ibnou Dieng (2019). How to create dataset within AfricaRice Dataverse [Dataset]. http://doi.org/10.7910/DVN/H3O8OT
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Feb 20, 2019
    Dataset provided by
    Harvard Dataverse
    Authors
    Ibnou Dieng
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This document provides information to the user when creating a dataset within AfricaRice Dataverse.

  7. Create external links on data.wa.gov

    • catalog.data.gov
    Updated Nov 8, 2025
    Cite
    data.wa.gov (2025). Create external links on data.wa.gov [Dataset]. https://catalog.data.gov/dataset/create-external-links-on-data-wa-gov
    Explore at:
    Dataset updated
    Nov 8, 2025
    Dataset provided by
    data.wa.gov
    Description

    This page provides instructions for data.wa.gov publishers on creating an external link page to a data source.

  8. Data Engg data

    • kaggle.com
    Updated Jun 26, 2021
    Cite
    Apurba Sarkar (2021). Data Engg data [Dataset]. https://www.kaggle.com/apurbasarkar/data-engg-data/code
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Jun 26, 2021
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Apurba Sarkar
    Description

    Now based on the above two tables (UserTable and VisitorLogData), you need to create an input feature set for the Marketing Model.

    Input Feature table:

    UserID: Unique ID of the registered user.
    No_of_days_Visited_7_Days: How many days the user was active on the platform in the last 7 days.
    No_Of_Products_Viewed_15_Days: Number of products viewed by the user in the last 15 days.
    User_Vintage: Vintage (in days) of the user as of today.
    Most_Viewed_product_15_Days: Most frequently viewed (by page loads) product in the last 15 days. If multiple products have a similar number of page loads, take the most recent one; if the user has not viewed any product in the last 15 days, use Product101.
    Most_Active_OS: Most frequently used OS by the user.
    Recently_Viewed_Product: Most recently viewed (by page loads) product. If the user has not viewed any product, use Product101.
    Pageloads_last_7_days: Count of page loads by the user in the last 7 days.
    Clicks_last_7_days: Count of clicks by the user in the last 7 days.

    Process to create Input Feature:

    In the current case, you are supposed to generate the input feature set as of 28-May-2018, so the visitor table covers 07-May-2018 to 27-May-2018.

    As a data engineer, building an ETL pipeline here would be appreciated and would give you an added advantage in interviews: given the user data and log data, the pipeline should generate the input feature table automatically. A partial sketch follows below.
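    A minimal pandas sketch of a few of the features above, computed as of 28-May-2018. The VisitorLogData column names (UserID, VisitDateTime, Activity) and the file name are assumptions about the schema, not taken from the dataset itself:

```python
# Hedged sketch: derive three of the listed features from the visitor log.
# Schema assumptions: columns UserID, VisitDateTime, Activity, with
# Activity values "pageload" / "click"; the file name is hypothetical.
import pandas as pd

AS_OF = pd.Timestamp("2018-05-28")

log = pd.read_csv("VisitorLogData.csv", parse_dates=["VisitDateTime"])
last7 = log[log["VisitDateTime"] >= AS_OF - pd.Timedelta(days=7)]

features = pd.DataFrame({
    # Distinct days the user was active on the platform in the last 7 days.
    "No_of_days_Visited_7_Days": last7.groupby("UserID")["VisitDateTime"]
                                      .apply(lambda s: s.dt.date.nunique()),
    # Page loads and clicks in the last 7 days.
    "Pageloads_last_7_days": last7[last7["Activity"] == "pageload"]
                                  .groupby("UserID").size(),
    "Clicks_last_7_days": last7[last7["Activity"] == "click"]
                               .groupby("UserID").size(),
}).fillna(0).astype(int)

print(features.head())
```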

  9. How to create an Okta Account

    • researchdata.edu.au
    • data.nsw.gov.au
    Updated Oct 24, 2025
    + more versions
    Cite
    Spatial Services (DCS) (2025). How to create an Okta Account [Dataset]. https://researchdata.edu.au/how-create-an-okta-account/3403035
    Explore at:
    Dataset updated
    Oct 24, 2025
    Dataset provided by
    data.nsw.gov.au
    Authors
    Spatial Services (DCS)
    Description


    Metadata Portal Metadata Information

    Content Title: How to create an Okta Account
    Content Type: Document
    Description: Documentation on how to create an Okta Account
    Initial Publication Date: 09/07/2024
    Data Currency: 09/07/2024
    Data Update Frequency: Other
    Content Source: Data provider files
    File Type: Document
    Attribution:
    Data Theme, Classification or Relationship to other Datasets:
    Accuracy:
    Spatial Reference System (dataset): Other
    Spatial Reference System (web service): Other
    WGS84 Equivalent To: Other
    Spatial Extent:
    Content Lineage:
    Data Classification: Unclassified
    Data Access Policy: Open
    Data Quality:
    Terms and Conditions: Creative Commons
    Standard and Specification:
    Data Custodian: Customer Hub
    Point of Contact: Customer Hub
    Data Aggregator:
    Data Distributor:
    Additional Supporting Information:
    TRIM Number:

  10. R codes and dataset for Visualisation of Diachronic Constructional Change...

    • researchdata.edu.au
    • bridges.monash.edu
    Updated Apr 1, 2019
    Cite
    Gede Primahadi Wijaya Rajeg (2019). R codes and dataset for Visualisation of Diachronic Constructional Change using Motion Chart [Dataset]. http://doi.org/10.26180/5c844c7a81768
    Explore at:
    Dataset updated
    Apr 1, 2019
    Dataset provided by
    Monash University
    Authors
    Gede Primahadi Wijaya Rajeg
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Publication


    Primahadi Wijaya R., Gede. 2014. Visualisation of diachronic constructional change using Motion Chart. In Zane Goebel, J. Herudjati Purwoko, Suharno, M. Suryadi & Yusuf Al Aried (eds.). Proceedings: International Seminar on Language Maintenance and Shift IV (LAMAS IV), 267-270. Semarang: Universitas Diponegoro. doi: https://doi.org/10.4225/03/58f5c23dd8387

    Description of R codes and data files in the repository

    This repository is imported from its GitHub repo. Versioning of this figshare repository is associated with the GitHub repo's Release. So, check the Releases page for updates (the next version is to include the unified version of the codes in the first release with the tidyverse).

    The raw input data consists of two files (i.e. will_INF.txt and go_INF.txt). They represent the co-occurrence frequency of top-200 infinitival collocates for will and be going to respectively across the twenty decades of Corpus of Historical American English (from the 1810s to the 2000s).

    These two input files are used in the R code file 1-script-create-input-data-raw.r. The codes preprocess and combine the two files into a long format data frame consisting of the following columns: (i) decade, (ii) coll (for "collocate"), (iii) BE going to (for frequency of the collocates with be going to) and (iv) will (for frequency of the collocates with will); it is available in the input_data_raw.txt.

    Then, the script 2-script-create-motion-chart-input-data.R processes the input_data_raw.txt for normalising the co-occurrence frequency of the collocates per million words (the COHA size and normalising base frequency are available in coha_size.txt). The output from the second script is input_data_futurate.txt.

    Next, input_data_futurate.txt contains the relevant input data for generating (i) the static motion chart as an image plot in the publication (using the script 3-script-create-motion-chart-plot.R), and (ii) the dynamic motion chart (using the script 4-script-motion-chart-dynamic.R).

    The repository adopts the project-oriented workflow in RStudio; double-click on the Future Constructions.Rproj file to open an RStudio session whose working directory is associated with the contents of this repository.
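    The published scripts are R; purely to illustrate the first preprocessing step described above (combining the two frequency files into one long-format table), here is a schematic Python equivalent. The tab-separated layout and the column names assumed for will_INF.txt and go_INF.txt are guesses, not taken from the repository:

```python
# Schematic re-sketch of step 1 (the real pipeline is the R script
# 1-script-create-input-data-raw.r). Assumes each input file is
# tab-separated with columns: decade, coll, freq.
import pandas as pd

will = pd.read_csv("will_INF.txt", sep="\t")   # collocate frequencies of will
going = pd.read_csv("go_INF.txt", sep="\t")    # collocate frequencies of be going to

raw = (will.rename(columns={"freq": "will"})
           .merge(going.rename(columns={"freq": "BE going to"}),
                  on=["decade", "coll"], how="outer")
           .fillna(0))

raw.to_csv("input_data_raw.txt", sep="\t", index=False)
```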

  11. Coastal Ferry Routes - Create Your First Data Pipeline

    • anrgeodata.vermont.gov
    Updated Mar 21, 2023
    Cite
    GP Analysis - Prod Hive 1 (2023). Coastal Ferry Routes - Create Your First Data Pipeline [Dataset]. https://anrgeodata.vermont.gov/datasets/5516ef1c4db846fab0a34a34626c263e
    Explore at:
    Dataset updated
    Mar 21, 2023
    Dataset authored and provided by
    GP Analysis - Prod Hive 1
    License

    https://www2.gov.bc.ca/gov/content?id=A519A56BC2BF44E4A008B33FCF527F61

    Description

    Use this GeoJSON file as an input dataset in Data Pipelines. To get started, follow the steps in the Create your first data pipeline tutorial. To learn more about Data Pipelines, see Introduction to Data Pipelines.

  12. Footprints and producers of source data used to create southern portion of...

    • catalog.data.gov
    • data.usgs.gov
    • +1 more
    Updated Nov 26, 2025
    + more versions
    Cite
    U.S. Geological Survey (2025). Footprints and producers of source data used to create southern portion of the high-resolution (1 m) San Francisco Bay, California, digital elevation model (DEM) [Dataset]. https://catalog.data.gov/dataset/footprints-and-producers-of-source-data-used-to-create-southern-portion-of-the-high-resolu
    Explore at:
    Dataset updated
    Nov 26, 2025
    Dataset provided by
    U.S. Geological Survey
    Area covered
    San Francisco Bay, California
    Description

    Polygon shapefile showing the footprint boundaries, source agency origins, and resolutions of compiled bathymetric digital elevation models (DEMs) used to construct a continuous, high-resolution DEM of the southern portion of San Francisco Bay.

  13. Data from: Create a Project

    • hub.arcgis.com
    Updated Jan 17, 2019
    Cite
    State of Delaware (2019). Create a Project [Dataset]. https://hub.arcgis.com/documents/4f4c09e4004446b08826e39bd04eb418
    Explore at:
    Dataset updated
    Jan 17, 2019
    Dataset authored and provided by
    State of Delaware
    Description

    An ArcGIS Pro project may contain maps, scenes, layouts, data, tools, and other items. It may contain connections to folders, databases, and servers. Content can be added from online portals such as your ArcGIS organization or the ArcGIS Living Atlas of the World. In this tutorial, you'll create a new, blank ArcGIS Pro project. You'll add a map to the project and convert the map to a 3D scene.
    Estimated time: 10 minutes
    Software requirements: ArcGIS Pro

  14. Create Price Prediction Data

    • coinbase.com
    Updated Nov 13, 2025
    + more versions
    Cite
    (2025). Create Price Prediction Data [Dataset]. https://www.coinbase.com/price-prediction/base-create-1b07
    Explore at:
    Dataset updated
    Nov 13, 2025
    Variables measured
    Growth Rate, Predicted Price
    Measurement technique
    User-defined projections based on compound growth. This is not a formal financial forecast.
    Description

    This dataset contains the predicted prices of the asset Create over the next 16 years. The data is calculated initially using a default 5 percent annual growth rate; after page load, a sliding-scale component lets the user adjust the growth rate to their own positive or negative projections. The maximum adjustable growth rate is 100 percent and the minimum is -100 percent.
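    A worked sketch of the stated method: compound growth from a current price at a user-adjustable annual rate clipped to [-100, 100] percent. The starting price below is illustrative, not taken from the dataset:

```python
# Hedged sketch of the page's projection arithmetic; the starting price
# below is made up for illustration.
def projected_prices(price_now: float, growth_pct: float, years: int = 16) -> list[float]:
    """Project prices forward with annual compounding at growth_pct."""
    rate = max(-100.0, min(100.0, growth_pct)) / 100.0  # clip to [-100, 100] percent
    return [price_now * (1 + rate) ** y for y in range(1, years + 1)]

# Default 5 percent growth, as the page uses on load.
print(projected_prices(price_now=0.01, growth_pct=5.0)[:3])
```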

  15. Kelley Create Locations Data for United States

    • poidata.io
    csv, json
    Updated Oct 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Business Data Provider (2025). Kelley Create Locations Data for United States [Dataset]. https://poidata.io/brand-report/kelley-create/united-states
    Explore at:
    csv, json
    Dataset updated
    Oct 20, 2025
    Dataset authored and provided by
    Business Data Provider
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2025
    Area covered
    United States
    Variables measured
    Website URL, Phone Number, Review Count, Business Name, Email Address, Business Hours, Customer Rating, Business Address, Brand Affiliation, Geographic Coordinates
    Description

    Comprehensive dataset containing 30 verified Kelley Create locations in the United States with complete contact information, ratings, reviews, and location data.

  16. Simulation data and code

    • figshare.com
    • datasetcatalog.nlm.nih.gov
    zip
    Updated Feb 24, 2022
    Cite
    Charlotte de Vries; E Yagmur Erten (2022). Simulation data and code [Dataset]. http://doi.org/10.6084/m9.figshare.19232535.v1
    Explore at:
    zip
    Dataset updated
    Feb 24, 2022
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Charlotte de Vries; E Yagmur Erten
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description
    • PF_simulation_data.zip contains simulation data to create figure 2 of de Vries, Erten and Kokko.
    • Code_PF.zip contains C++ code to create the data used to create figure 2 (see PF_simulation_data.zip for the data files produced), and it also contains the R script to create figure 2 from the data (Figure2_cloud_25.R). All code files were created by Pen, I., & Flatt, T. (2021). Asymmetry, division of labour and the evolution of ageing in multicellular organisms. Philosophical Transactions of the Royal Society B, 376(1823), 20190729. The C++ code is slightly adjusted to change output. Note that the R script takes a long time to run (multiple days on our laptops) and uses a lot of swap memory; we advise running it on a server. Alternatively, you can edit the code to use fewer than the last 25 days by changing the line ddead %>% filter(t > 4975) to, for example, ddead %>% filter(t > 4998) to use the last 2 time steps only. However, note that there will be insufficient data at high ages to estimate mortality rates.
  17. Documentation of R scripts to create boxplots of change factors by NOAA...

    • datasets.ai
    • data.usgs.gov
    • +1 more
    Updated Jul 19, 2023
    + more versions
    Cite
    Department of the Interior (2023). Documentation of R scripts to create boxplots of change factors by NOAA Atlas 14 station, or for all stations in a Florida HUC-8 basin or county (Documentation_R_script_create_boxplot.docx) [Dataset]. https://datasets.ai/datasets/documentation-of-r-scripts-to-create-boxplots-of-change-factors-by-noaa-atlas-14-station-o-fa3c6
    Explore at:
    Dataset updated
    Jul 19, 2023
    Dataset authored and provided by
    Department of the Interior
    Area covered
    Florida
    Description

    The Florida Flood Hub for Applied Research and Innovation and the U.S. Geological Survey have developed projected future change factors for precipitation depth-duration-frequency (DDF) curves at 242 National Oceanic and Atmospheric Administration (NOAA) Atlas 14 stations in Florida. The change factors were computed as the ratio of projected future to historical extreme-precipitation depths fitted to extreme-precipitation data from downscaled climate datasets using a constrained maximum likelihood (CML) approach as described in https://doi.org/10.3133/sir20225093. The change factors correspond to the periods 2020-59 (centered in the year 2040) and 2050-89 (centered in the year 2070) as compared to the 1966-2005 historical period.
    An R script (create_boxplot.R) is provided which generates boxplots of change factors by NOAA Atlas 14 station, or for all NOAA Atlas 14 stations in a Florida HUC-8 basin or county. In addition, the R script basin_boxplot.R is provided as an example on how to create a wrapper function that will automate the generation of boxplots of change factors for all Florida HUC-8 basins. This Microsoft Word file (Documentation_R_script_create_boxplot.docx) serves as documentation on the code usage and available options for running the scripts. As described in the documentation, the R scripts rely on some of the Microsoft Excel spreadsheets published as part of this data release. The script uses basins defined in the "Florida Hydrologic Unit Code (HUC) Basins (areas)" from the Florida Department of Environmental Protection (FDEP; https://geodata.dep.state.fl.us/datasets/FDEP::florida-hydrologic-unit-code-huc-basins-areas/explore) and their names are listed in the file basins_list.txt provided with the script. County names are listed in the file counties_list.txt provided with the script. NOAA Atlas 14 stations located in each Florida HUC-8 basin or county are defined in the Microsoft Excel spreadsheet Datasets_station_information.xlsx which is part of this data release. Instructions are provided in code documentation (see highlighted text on page 7 of Documentation_R_script_create_boxplot.docx) so that users can modify the script to generate boxplots for basins different from the FDEP "Florida Hydrologic Unit Code (HUC) Basins (areas)."
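    The published scripts are in R and operate on the Excel spreadsheets named above; the following Python sketch only illustrates the idea of a boxplot of change factors grouped by station, assuming a hypothetical flat extract with station and change_factor columns:

```python
# Schematic analogue of create_boxplot.R, not the released script.
# Assumes a hypothetical CSV with columns: station, change_factor.
import pandas as pd
import matplotlib.pyplot as plt

cf = pd.read_csv("change_factors.csv")

cf.boxplot(column="change_factor", by="station", rot=90)
plt.ylabel("Change factor (projected / historical depth)")
plt.suptitle("")        # drop pandas' automatic "grouped by" title
plt.tight_layout()
plt.show()
```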

  18. Replication Data for: Revisiting 'The Rise and Decline' in a Population of...

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 22, 2023
    Cite
    TeBlunthuis, Nathan; Aaron Shaw; Benjamin Mako Hill (2023). Replication Data for: Revisiting 'The Rise and Decline' in a Population of Peer Production Projects [Dataset]. http://doi.org/10.7910/DVN/SG3LP1
    Explore at:
    Dataset updated
    Nov 22, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    TeBlunthuis, Nathan; Aaron Shaw; Benjamin Mako Hill
    Description

    This archive contains code and data for reproducing the analysis for “Replication Data for Revisiting ‘The Rise and Decline’ in a Population of Peer Production Projects”. Depending on what you hope to do with the data you probably do not want to download all of the files. Depending on your computation resources you may not be able to run all stages of the analysis.

    The code for all stages of the analysis, including typesetting the manuscript and running the analysis, is in code.tar. If you only want to run the final analysis or to play with datasets used in the analysis of the paper, you want intermediate_data.7z or the uncompressed tab and csv files.

    The data files are created in a four-stage process. The first stage uses the program “wikiq” to parse mediawiki xml dumps and create tsv files that have edit data for each wiki. The second stage generates the all.edits.RDS file, which combines these tsvs into a dataset of edits from all the wikis. This file is expensive to generate and at 1.5GB is pretty big. The third stage builds smaller intermediate files that contain the analytical variables from these tsv files. The fourth stage uses the intermediate files to generate smaller RDS files that contain the results. Finally, knitr and latex typeset the manuscript. A stage will only run if the outputs from the previous stages do not exist, so if the intermediate files exist they will not be regenerated and only the final analysis will run. The exception is that stage 4, fitting models and generating plots, always runs. If you only want to replicate from the second stage onward, you want wikiq_tsvs.7z. If you want to replicate everything, you want wikia_mediawiki_xml_dumps.7z.001, wikia_mediawiki_xml_dumps.7z.002, and wikia_mediawiki_xml_dumps.7z.003. These instructions work backwards from building the manuscript using knitr, loading the datasets, running the analysis, to building the intermediate datasets.

    Building the manuscript using knitr: This requires working latex, latexmk, and knitr installations. Depending on your operating system you might install these packages in different ways. On Debian Linux you can run apt install r-cran-knitr latexmk texlive-latex-extra. Alternatively, you can upload the necessary files to a project on Overleaf.com. Download code.tar; this has everything you need to typeset the manuscript. Unpack the tar archive (on a unix system, tar xf code.tar) and navigate to code/paper_source. Install R dependencies: in R, run install.packages(c("data.table","scales","ggplot2","lubridate","texreg")). On a unix system you should be able to run make to build the manuscript generalizable_wiki.pdf. Otherwise you should try uploading all of the files (including the tables, figure, and knitr folders) to a new project on Overleaf.com.

    Loading intermediate datasets: The intermediate datasets are found in the intermediate_data.7z archive. They can be extracted on a unix system using the command 7z x intermediate_data.7z. The files are 95MB uncompressed. These are RDS (R data set) files and can be loaded in R using readRDS, for example newcomer.ds <- readRDS("newcomers.RDS"). If you wish to work with these datasets using a tool other than R, you might prefer to work with the .tab files.

    Running the analysis: Fitting the models may not work on machines with less than 32GB of RAM. If you have trouble, you may find the functions in lib-01-sample-datasets.R useful to create stratified samples of data for fitting models; see line 89 of 02_model_newcomer_survival.R for an example. Download code.tar and intermediate_data.7z to your working folder and extract both archives (on a unix system, tar xf code.tar && 7z x intermediate_data.7z). Install R dependencies: install.packages(c("data.table","ggplot2","urltools","texreg","optimx","lme4","bootstrap","scales","effects","lubridate","devtools","roxygen2")). On a unix system you can simply run regen.all.sh to fit the models, build the plots and create the RDS files.

    Generating datasets, building the intermediate files: The intermediate files are generated from all.edits.RDS. This process requires about 20GB of memory. Download all.edits.RDS, userroles_data.7z, selected.wikis.csv, and code.tar. Unpack code.tar and userroles_data.7z (on a unix system, tar xf code.tar && 7z x userroles_data.7z). Install R dependencies: in R, run install.packages(c("data.table","ggplot2","urltools","texreg","optimx","lme4","bootstrap","scales","effects","lubridate","devtools","roxygen2")). Run 01_build_datasets.R.

    Building all.edits.RDS: The intermediate RDS files used in the analysis are created from all.edits.RDS. To replicate building all.edits.RDS, you only need to run 01_build_datasets.R when the int... Visit https://dataone.org/datasets/sha256%3Acfa4980c107154267d8eb6dc0753ed0fde655a73a062c0c2f5af33f237da3437 for complete metadata about this dataset.

  19. Create On Base Price Prediction Data

    • coinbase.com
    Updated Nov 15, 2025
    + more versions
    Cite
    (2025). Create On Base Price Prediction Data [Dataset]. https://www.coinbase.com/price-prediction/base-create-on-base-caaa
    Explore at:
    Dataset updated
    Nov 15, 2025
    Variables measured
    Growth Rate, Predicted Price
    Measurement technique
    User-defined projections based on compound growth. This is not a formal financial forecast.
    Description

    This dataset contains the predicted prices of the asset Create On Base over the next 16 years. The data is calculated initially using a default 5 percent annual growth rate; after page load, a sliding-scale component lets the user adjust the growth rate to their own positive or negative projections. The maximum adjustable growth rate is 100 percent and the minimum is -100 percent.

  20. Jacob Kaplan's Concatenated Files: Uniform Crime Reporting Program Data: Law...

    • openicpsr.org
    Updated Mar 25, 2018
    Cite
    Jacob Kaplan (2018). Jacob Kaplan's Concatenated Files: Uniform Crime Reporting Program Data: Law Enforcement Officers Killed and Assaulted (LEOKA) 1960-2024 [Dataset]. http://doi.org/10.3886/E102180V15
    Explore at:
    Dataset updated
    Mar 25, 2018
    Dataset provided by
    Princeton University
    Authors
    Jacob Kaplan
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    1960 - 2024
    Area covered
    United States
    Description

    For a comprehensive guide to this data and other UCR data, please see my book at ucrbook.com.
    Version 15 release notes: Adds .parquet file format.
    Version 14 release notes: Adds 2023 and 2024 data.
    Version 13 release notes: Adds 2022 data.
    Version 12 release notes: Adds 2021 data.
    Version 11 release notes: Adds 2020 data. Please note that the FBI has retired UCR data ending in 2020 data so this will (probably, I haven't seen confirmation either way) be the last LEOKA data they release. Changes .rda file to .rds.
    Version 10 release notes: Changes release notes description, does not change data.
    Version 9 release notes: Adds data for 2019.
    Version 8 release notes: Fix bug for years 1960-1971 where the number of months reported variable was incorrectly down by 1 month. I recommend caution when using these years as they only report either 0 or 12 months of the year, which differs from every other year in the data. Added the variable officers_killed_total which is the sum of officers_killed_by_felony and officers_killed_by_accident.
    Version 7 release notes: Adds data from 2018.
    Version 6 release notes: Adds data in the following formats: SPSS and Excel. Changes project name to avoid confusing this data for the ones done by NACJD.
    Version 5 release notes: Adds data for 1960-1974 and 2017. Note: many columns (including number of female officers) will always have a value of 0 for years prior to 1971. This is because those variables weren't collected prior to 1971. These should be NA, not 0, but I'm keeping it as 0 to be consistent with the raw data. Removes support for .csv and .sav files. Adds a number_of_months_reported variable for each agency-year; a month is considered reported if the month_indicator column for that month has a value of "normal update" or "reported, not data." The formatting of the monthly data has changed from wide to long, meaning that each agency-month has a single row. The old data had each agency being a single row with each month-category (e.g. jan_officers_killed_by_felony) being a column. Now there will just be a single column for each category (e.g. officers_killed_by_felony) and the month can be identified in the month column. This also results in most column names changing. As such, be careful when aggregating the monthly data, since some variables are the same every month (e.g. number of officers employed is measured annually), so aggregating will be 12 times as high as the real value for those variables. Adds a date column. This date column is always set to the first of the month. It is NOT the date that a crime occurred or was reported; it is only there to make it easier to create time-series graphs that require a date input. All the data in this version was acquired from the FBI as text/DAT files and read into R using the package asciiSetupReader. The FBI also provided a PDF file explaining how to create the setup file to read the data. Both the FBI's PDF and the setup file I made are included in the zip files. Data is the same as from NACJD but using all FBI files makes cleaning easier as all column names are already identical.
    Version 4 release notes: Add data for 2016. Order rows by year (descending) and ORI.
    Version 3 release notes: Fix bug where Philadelphia Police Department had incorrect FIPS county code.
    The LEOKA data sets contain highly detailed data about the number of officers/civilians employed by an agency and how many officers were killed or assaulted. All the data was acquired from the FBI as text/DAT files and read into R using the package asciiSetupReader. The FBI also provided a PDF file explaining how to create the setup file to read the data. Both the FBI's PDF and the setup file I made are included in the zip files. About 7% of all agencies in the data report more officers or civilians than population. As such, I removed the officers/civilians per 1,000 population variables. You should exercise caution if deciding to generate and use these variables yourself. Several agencies had impossibly large (>15) officer deaths in a single month; for those months I changed the value to NA. The UCR Handbook (https://ucr.fbi.gov/additional-ucr-publications/ucr_handbook.pdf/view) describes the LEOKA data as follows: "The UCR Program collects data from all contributing agencies ... on officer line-of-duty deaths and assaults. Reporting agencies must submit data on ... their own duly sworn officers f
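    To make the aggregation caution from the Version 5 notes concrete, a hedged pandas sketch: monthly counts can be summed to the agency-year level, but annually measured variables repeat across the 12 monthly rows and must be taken once, not summed. The file name and the officers_employed column are assumptions; ori, year, and officers_killed_total follow the description:

```python
# Hedged sketch of aggregating the long-format monthly LEOKA data to
# agency-year level without inflating annual variables 12-fold.
import pandas as pd

leoka = pd.read_csv("leoka_monthly.csv")   # hypothetical extract

yearly = leoka.groupby(["ori", "year"]).agg(
    officers_killed_total=("officers_killed_total", "sum"),  # monthly count: sum
    officers_employed=("officers_employed", "first"),        # annual value: take once
).reset_index()

print(yearly.head())
```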
