Our Brand Data dataset includes such data points as company name, location, headcount, industry, and size, among others. It offers extensive fresh and historical data, including even companies that operate in stealth mode.
For market and business analysis
Our Brand Data gives information about millions of companies, allowing you to find your competitors and see their weak and strong points.
Use cases
For Investors
We recommend Brand Data for investors to discover and evaluate businesses with the highest potential.
Gain strategic business insights, enhance decision-making, and maintain algorithms that signal investment opportunities with Coresignal’s global Brand Data.
Use cases
For sales prospecting
Brand Data saves time your employees would otherwise use it to manually find potential clients and choose the best prospects.
Use cases
Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.
The total amount of data created, captured, copied, and consumed globally is forecast to increase rapidly, reaching 149 zettabytes in 2024. Over the next five years up to 2028, global data creation is projected to grow to more than 394 zettabytes. In 2020, the amount of data created and replicated reached a new high. The growth was higher than previously expected, caused by the increased demand due to the COVID-19 pandemic, as more people worked and learned from home and used home entertainment options more often. Storage capacity also growing Only a small percentage of this newly created data is kept though, as just two percent of the data produced and consumed in 2020 was saved and retained into 2021. In line with the strong growth of the data volume, the installed base of storage capacity is forecast to increase, growing at a compound annual growth rate of 19.2 percent over the forecast period from 2020 to 2025. In 2020, the installed base of storage capacity reached 6.7 zettabytes.
Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.
Coastal, land based automatic weather stations situated in the West Coast of South Africa; Cape Columbine, Elands Bay and Port Nolloth have been actively collecting data for various project since 1982 until today at different locations. These instruments provide useful meteorological information for understanding coastal dynamics such as upwelling, ocean current movements and air sea exchange.
https://www.usa.gov/government-workshttps://www.usa.gov/government-works
The Daily Mobility Statistics were derived from a data panel constructed from several mobile data providers, a step taken to address the reduce the risks of geographic and temporal sample bias that would result from using a single data source. In turn, the merged data panel only included data from those mobile devices whose anonymized location data met a set of data quality standards, e.g., temporal frequency and spatial accuracy of anonymized location point observations, device-level temporal coverage and representativeness, spatial distribution of data at the sample and county levels. After this filtering, final mobility estimate statistics were computed using a multi-level weighting method that employed both device- and trip-level weights, thus expanding the sample represented by the devices in the data panel to the at-large populations of each state and county in the US.
Data analysis was conducted at the aggregate national, state, and county levels. To assure confidentiality and support data quality, no data were reported for a county if it had fewer than 50 devices in the sample on any given day.
Trips were defined as movements that included a stay of longer than 10 minutes at an anonymized location away from home. A movement with multiple stays of longer than 10 minutes--before returning home--was counted as multiple trips.
The Daily Mobility Statistics data on this page, which cover the COVID and Post-COVID periods, are experimental. Experimental data products are created using novel or exploratory data sources or methodologies that benefit data users in the absence of other statistically rigorous products, and they not meet all BTS data quality standards.
Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.
U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
This data release includes data provided by Colorado Springs Utilities to the U.S. Geological Survey. Two types of data are included here. First are daily average effluent discharge rates from the Las Vegas Street Wastewater Treatment Facility and JD Phillips Water Resource Recovery Facility along with the sum of rates on dates with available data. Second are daily total volumes of potable water delivered to customers as summed from multiple facilities. The data were released by Colorado Springs Utilities, which performed its own quality control checks on the data. The data span the years 2008 to 2022 but are not complete for all parameters for that time frame. The data provide context for understanding water use and water quality in the Fountain Creek watershed in Colorado.
**This data set was last updated 3:30 PM ET Monday, January 4, 2021. The last date of data in this dataset is December 31, 2020. **
Data shows that mobility declined nationally since states and localities began shelter-in-place strategies to stem the spread of COVID-19. The numbers began climbing as more people ventured out and traveled further from their homes, but in parallel with the rise of COVID-19 cases in July, travel declined again.
This distribution contains county level data for vehicle miles traveled (VMT) from StreetLight Data, Inc, updated three times a week. This data offers a detailed look at estimates of how much people are moving around in each county.
Data available has a two day lag - the most recent data is from two days prior to the update date. Going forward, this dataset will be updated by AP at 3:30pm ET on Monday, Wednesday and Friday each week.
This data has been made available to members of AP’s Data Distribution Program. To inquire about access for your organization - publishers, researchers, corporations, etc. - please click Request Access in the upper right corner of the page or email kromano@ap.org. Be sure to include your contact information and use case.
01_vmt_nation.csv - Data summarized to provide a nationwide look at vehicle miles traveled. Includes single day VMT across counties, daily percent change compared to January and seven day rolling averages to smooth out the trend lines over time.
02_vmt_state.csv - Data summarized to provide a statewide look at vehicle miles traveled. Includes single day VMT across counties, daily percent change compared to January and seven day rolling averages to smooth out the trend lines over time.
03_vmt_county.csv - Data providing a county level look at vehicle miles traveled. Includes VMT estimate, percent change compared to January and seven day rolling averages to smooth out the trend lines over time.
* Filter for specific state - filters 02_vmt_state.csv
daily data for specific state.
* Filter counties by state - filters 03_vmt_county.csv
daily data for counties in specific state.
* Filter for specific county - filters 03_vmt_county.csv
daily data for specific county.
The AP has designed an interactive map to show percent change in vehicle miles traveled by county since each counties lowest point during the pandemic:
@(https://interactives.ap.org/vmt-map/)
This data can help put your county's mobility in context with your state and over time. The data set contains different measures of change - daily comparisons and seven day rolling averages. The rolling average allows for a smoother trend line for comparison across counties and states. To get the full picture, there are also two available baselines - vehicle miles traveled in January 2020 (pre-pandemic) and vehicle miles traveled at each geography's low point during the pandemic.
A daily census of the inmates at the Allegheny County Jail (ACJ). Includes gender, race, age at booking, and current age. The records for each month contain a census for every day, therefore many inmates records are repeated each day. Another representation of this data is the County's jail population management dashboard.
hysts-bot-data/daily-papers dataset hosted on Hugging Face and contributed by the HF Datasets community
Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.
On October 20, 2022, CDC began retrieving aggregate case and death data from jurisdictional and state partners weekly instead of daily. This dataset contains archived community transmission and related data elements by county as originally displayed on the COVID Data Tracker. Although these data will continue to be publicly available, this dataset has not been updated since October 20, 2022. An archived dataset containing weekly community transmission data by county as originally posted can also be found here: Weekly COVID-19 County Level of Community Transmission as Originally Posted | Data | Centers for Disease Control and Prevention (cdc.gov).
Related data CDC has been providing the public with two versions of COVID-19 county-level community transmission level data: this dataset with the daily values as originally posted on the COVID Data Tracker, and an historical dataset with daily data as well as the updates and corrections from state and local health departments. Similar to this dataset, the original historical dataset is archived on 10/20/2022. It will continue to be publicly available but will no longer be updated. A new dataset containing historical community transmission data by county is now published weekly and can be found at: Weekly COVID-19 County Level of Community Transmission Historical Changes | Data | Centers for Disease Control and Prevention (cdc.gov).
This public use dataset has 7 data elements reflecting community transmission levels for all available counties and jurisdictions. It contains reported daily transmission levels at the county level with the same values used to display transmission maps on the COVID Data Tracker. Each day, the dataset is appended to contain the most recent day's data. Transmission level is set to low, moderate, substantial, or high using the calculation rules below.
Methods for calculating county level of community transmission indicator The County Level of Community Transmission indicator uses two metrics: (1) total new COVID-19 cases per 100,000 persons in the last 7 days and (2) percentage of positive SARS-CoV-2 diagnostic nucleic acid amplification tests (NAAT) in the last 7 days. For each of these metrics, CDC classifies transmission values as low, moderate, substantial, or high (below and here). If the values for each of these two metrics differ (e.g., one indicates moderate and the other low), then the higher of the two should be used for decision-making.
CDC core metrics of and thresholds for community transmission levels of SARS-CoV-2
Total New Case Rate Metric: "New cases per 100,000 persons in the past 7 days" is calculated by adding the number of new cases in the county (or other administrative level) in the last 7 days divided by the population in the county (or other administrative level) and multiplying by 100,000. "New cases per 100,000 persons in the past 7 days" is considered to have a transmission level of Low (0-9.99); Moderate (10.00-49.99); Substantial (50.00-99.99); and High (greater than or equal to 100.00).
Test Percent Positivity Metric: "Percentage of positive NAAT in the past 7 days" is calculated by dividing the number of positive tests in the county (or other administrative level) during the last 7 days by the total number of tests conducted over the last 7 days. "Percentage of positive NAAT in the past 7 days" is considered to have a transmission level of Low (less than 5.00); Moderate (5.00-7.99); Substantial (8.00-9.99); and High (greater than or equal to 10.00).
If
This dataset provides modeled predictions of particulate matter (PM2.5) levels from the EPA's Downscaler model. These data are used by the CDC's National Environmental Public Health Tracking Network to generate air quality measures. Data are at the county levels for 2001-2014. The dataset includes the maximum, median, mean, and population-weighted mean concentration. Please refer to the metadata attachment for more information.
Learn more about outdoor air quality on the Tracking Network's website: https://ephtracking.cdc.gov/showAirLanding.action.
By using these data, you signify your agreement to comply with the following requirements: 1. Use the data for statistical reporting and analysis only. 2. Do not attempt to learn the identity of any person included in the data and do not combine these data with other data for the purpose of matching records to identify individuals. 3. Do not disclose of or make use of the identity of any person or establishment discovered inadvertently and report the discovery to: trackingsupport@cdc.gov. 4. Do not imply or state, either in written or oral form, that interpretations based on the data are those of the original data sources and CDC unless the data user and data source are formally collaborating. 5. Acknowledge, in all reports or presentations based on these data, the original source of the data and CDC. 6. Suggested citation: Centers for Disease Control and Prevention. National Environmental Public Health Tracking Network. Web. Accessed: insert date. www.cdc.gov/ephtracking.
Problems or Questions? Email trackingsupport@cdc.gov.
https://www.usa.gov/government-workshttps://www.usa.gov/government-works
How many people are staying at home? How far are people traveling when they don’t stay home? Which states and counties have more people taking trips? The Bureau of Transportation Statistics (BTS) now provides answers to those questions through our mobility statistics program.
The "Trips by Distance" data and number of people staying home and not staying home are estimated for the Bureau of Transportation Statistics by the Maryland Transportation Institute and Center for Advanced Transportation Technology Laboratory at the University of Maryland. The travel statistics are produced from an anonymized national panel of mobile device data from multiple sources. All data sources used in the creation of the metrics contain no personal information. Data analysis is conducted at the aggregate national, state, and county levels. A weighting procedure expands the sample of millions of mobile devices, so the results are representative of the entire population in a nation, state, or county. To assure confidentiality and support data quality, no data are reported for a county if it has fewer than 50 devices in the sample on any given day.
Trips are defined as movements that include a stay of longer than 10 minutes at an anonymized location away from home. Home locations are imputed on a weekly basis. A movement with multiple stays of longer than 10 minutes before returning home is counted as multiple trips. Trips capture travel by all modes of transportation. including driving, rail, transit, and air.
The daily travel estimates are from a mobile device data panel from merged multiple data sources that address the geographic and temporal sample variation issues often observed in a single data source. The merged data panel only includes mobile devices whose anonymized location data meet a set of data quality standards, which further ensures the overall data quality and consistency. The data quality standards consider both temporal frequency and spatial accuracy of anonymized location point observations, temporal coverage and representativeness at the device level, spatial representativeness at the sample and county level, etc. A multi-level weighting method that employs both device and trip-level weights expands the sample to the underlying population at the county and state levels, before travel statistics are computed.
These data are experimental and may not meet all of our quality standards. Experimental data products are created using new data sources or methodologies that benefit data users in the absence of other relevant products. We are seeking feedback from data users and stakeholders on the quality and usefulness of these new products. Experimental data products that meet our quality standards and demonstrate sufficient user demand may enter regular production if resources permit.
These data are made available under a public domain license. Data should be attributed to the "Maryland Transportation Institute and Center for Advanced Transportation Technology Laboratory at the University of Maryland and the United States Bureau of Transportation Statistics."
Daily data for a given week will be uploaded to the BTS website within 9-10 days of the end of the week in question (e.g., data for Sunday September 17-Saturday September 23 would be updated on Tuesday, October 3). All BTS visualizations and tables that rely on these data will update at approximately 10am ET on days when new data are received, processed, and uploaded.
The methodology used to develop these data can be found at: https://rosap.ntl.bts.gov/view/dot/67520.
Datasets of basic instructions and answers for SmolLM-Instruct models trainings: it includes answers to greetings and questions such as "Who are you". This dataset was included in training of SmolLM-Instruct v0.2 but we didn't notice that it had an impact on model generations. We recommend using this generic larger dataset of multi-turn everyday conversations: https://huggingface.co/datasets/HuggingFaceTB/everyday-conversations-llama3.1-2k
Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.
https://www.usa.gov/government-workshttps://www.usa.gov/government-works
This dataset provides modeled predictions of ozone levels from the EPA's Downscaler model. These data are used by the CDC's National Environmental Public Health Tracking Network to generate air quality measures. Data are at the county levels for 2001-2014. The dataset includes the maximum, median, mean, and population-weighted mean concentration. Please refer to the metadata attachment for more information.
Learn more about outdoor air quality on the Tracking Network's website: https://ephtracking.cdc.gov/showAirLanding.action.
By using these data, you signify your agreement to comply with the following requirements: 1. Use the data for statistical reporting and analysis only. 2. Do not attempt to learn the identity of any person included in the data and do not combine these data with other data for the purpose of matching records to identify individuals. 3. Do not disclose of or make use of the identity of any person or establishment discovered inadvertently and report the discovery to: trackingsupport@cdc.gov. 4. Do not imply or state, either in written or oral form, that interpretations based on the data are those of the original data sources and CDC unless the data user and data source are formally collaborating. 5. Acknowledge, in all reports or presentations based on these data, the original source of the data and CDC. 6. Suggested citation: Centers for Disease Control and Prevention. National Environmental Public Health Tracking Network. Web. Accessed: insert date. www.cdc.gov/ephtracking.
https://www.nstauthority.co.uk/footer/access-to-information/https://www.nstauthority.co.uk/footer/access-to-information/
Under the powers of the Energy Act 2016, detailed daily production data from individual wellbores must be reported to the NSTA, for the whole life of the field, as set out in the NSTA's Reporting and Disclosure of Information and Samples Guidance. The data is reportable when permanent cessation of production occurs. This requirement has been applied to all UKCS fields that have ceased production since January 2018. The apps below provide access and insights to this reported data. The data reflects the available production history of each field and provides an insight into daily values for gas, oil and H2O; as well as the pressures and temperatures at well heads and bottom holes, where available. The datasets can be downloaded by wellbore, hydrocarbon field or production hub.
deerfieldgreen/stk-daily-data-1day dataset hosted on Hugging Face and contributed by the HF Datasets community
Our Brand Data dataset includes such data points as company name, location, headcount, industry, and size, among others. It offers extensive fresh and historical data, including even companies that operate in stealth mode.
For market and business analysis
Our Brand Data gives information about millions of companies, allowing you to find your competitors and see their weak and strong points.
Use cases
For Investors
We recommend Brand Data for investors to discover and evaluate businesses with the highest potential.
Gain strategic business insights, enhance decision-making, and maintain algorithms that signal investment opportunities with Coresignal’s global Brand Data.
Use cases
For sales prospecting
Brand Data saves time your employees would otherwise use it to manually find potential clients and choose the best prospects.
Use cases