MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for Deita 10K V0
GitHub | Paper Deita is an open-sourced project designed to facilitate Automatic Data Selection for instruction tuning in Large Language Models (LLMs). This dataset includes 10k of lightweight, high-quality alignment SFT data, mainly automatically selected from the following datasets:
ShareGPT (Apache 2.0 listed, no official repo found): Use the 58 K ShareGPT dataset for selection. UltraChat (MIT): Sample 105 K UltraChat dataset for selection.… See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/deita-10k-v0.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for Deita 10k v0
This is a formatted version of hkust-nlp/deita-10k-v0 to store the conversations in the same format as the OpenAI SDK.
Citation
If you find this dataset useful, please cite the original dataset: @misc{liu2023what, title={What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning}, author={Wei Liu and Weihao Zeng and Keqing He and Yong Jiang and Junxian He}, year={2023}… See the full description on the dataset page: https://huggingface.co/datasets/HuggingFaceH4/deita-10k-v0-sft.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for Deita 6K V0
GitHub | Paper Deita is an open-sourced project designed to facilitate Automatic Data Selection for instruction tuning in Large Language Models (LLMs). This dataset includes 6k of lightweight, high-quality alignment SFT data, mainly automatically selected from the following datasets:
ShareGPT (Apache 2.0 listed, no official repo found): Use the 58 K ShareGPT dataset for selection. UltraChat (MIT): Sample 105 K UltraChat dataset for selection. WizardLM… See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/deita-6k-v0.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
hkust-nlp/deita-redundant-pool-data dataset hosted on Hugging Face and contributed by the HF Datasets community
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for Deita Complexity Scorer Training Data
GitHub | Paper Deita is an open-sourced project designed to facilitate Automatic Data Selection for instruction tuning in Large Language Models (LLMs). This dataset includes data for training Deita Complexity Scorer. Model Family: Other models and the dataset are found in the Deita Collection
Performance
Model Align Data Size MT-Bench AlpacaEval(%) OpenLLM (Avg.)
Proprietary Models
GPT-4-Turbo… See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/deita-complexity-scorer-data.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This product is described in Chapter 5 of the 2018 DWR Delta Modeling Section annual report, produced jointly with USGS.
This product is a mutually compatible suite of DEMs covering most of the aquatic and terrestrial areas of the Bay-Delta. The product was derived from original point data collections, lidar and other DEMs. Also included in the resources are images and shapefiles describing the source data.
Changes between 4.1 and 4.2 are documented in the change log below. Changes prior to that are recorded in the 4.1 web page.
Changes in version 4 relative to prior products are limited to the region east of the Carquinez Strait (starting around Carquinez Bridge). To facilitate compatibility between products released by DWR and USGS/NOAA partners, DWR distributes the region west of the active work at 10m resolution but does not actively work in this region. The San Pablo Bay boundary of active revision in the present product in a place where its source data matches that of other Bay elevation models, e.g., the 2m seamless high-resolution bathymetric and topographic DEM of San Francisco Bay by USGS Earth Resources Observation and Science Center (EROS) (https://topotools.cr.usgs.gov/coned/sanfrancisco.php ), the 2010 San Francisco Bay DEM by National Oceanic and Atmospheric Administration (https://www.ngdc.noaa.gov/metaview/page?xml=NOAA/NESDIS/NGDC/MGG/DEM/iso/xml/741.xml&view=getDataView&header=none ) or the prior (version 3) 10m digital elevation model (https://data.cnra.ca.gov/dataset/san-francisco-bay-and-sacramento-san-joaquin-delta-dem-v3 ).The 10m DEM for the Bay-Delta is based on the first on the list, i.e. EROS’ 2m DEM for the Bay
https://okredo.com/en-lt/general-ruleshttps://okredo.com/en-lt/general-rules
UAB "DEITA" financial data: profit, annual turnover, paid taxes, sales revenue, equity, assets (long-term and short-term), profitability indicators.
This map illustrates extents and types of unconsolidated deposits and bedrock in the Big Delta A-4 Quadrangle, Alaska. This map is based on field observations begun by P�w� in 1949 and by Reger in 1976. Unit characteristics and extents were determined during field visits and by interpreting 1:40,000-scale black-and-white aerial photographs taken in August 1949 and 1:63,360-scale, false-color infrared aerial photographs taken in July 1978, August 1980, and August 1981.
https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Percent of Population Below the Poverty Level (5-year estimate) in Delta County, TX (S1701ACS048119) from 2012 to 2019 about Delta County, TX; Dallas; poverty; TX; percent; 5-year; population; Prosperity Scorecard; and USA.
Data contains historical polygons of in-channel islands within the Sacramento San Joaquin Delta. Data consists of merged datasets from 1929, 1940, 1949, 1952, 1995, 2002, and 2017. The 2017 polygons are digitized from the 2017 Delta LiDAR imagery by the Division of Engineering, Geomatics Branch, Geospatial Data Support Section. The older pre-2017 polygons were all digitized by staff in the Delta Levees Program. Data can be queried for a single year or date range using the 'Year' field. Historical data was compiled and merged from datasets provided by the Delta Levees program. Data coverage differs between years. Absences or gaps in historical data may occur. Older acquisitions generally have a smaller footprint than recent imagery acquisitions. The 2017 in-channel islands cover the Legal Delta, and also include Chipps Island.
https://fred.stlouisfed.org/legal/#copyright-pre-approvalhttps://fred.stlouisfed.org/legal/#copyright-pre-approval
Graph and download economic data for Equifax Subprime Credit Population for Delta County, CO (EQFXSUBPRIME008029) from Q2 2014 to Q2 2025 about Delta County, CO; subprime; CO; population; and USA.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the Delta County population over the last 20 plus years. It lists the population for each year, along with the year on year change in population, as well as the change in percentage terms for each year. The dataset can be utilized to understand the population change of Delta County across the last two decades. For example, using this dataset, we can identify if the population is declining or increasing. If there is a change, when the population peaked, or if it is still growing and has not reached its peak. We can also compare the trend with the overall trend of United States population over the same period of time.
Key observations
In 2023, the population of Delta County was 36,790, a 0.02% increase year-by-year from 2022. Previously, in 2022, Delta County population was 36,781, a decline of 0.12% compared to a population of 36,825 in 2021. Over the last 20 plus years, between 2000 and 2023, population of Delta County decreased by 1,753. In this period, the peak population was 38,543 in the year 2000. The numbers suggest that the population has already reached its peak and is showing a trend of decline. Source: U.S. Census Bureau Population Estimates Program (PEP).
When available, the data consists of estimates from the U.S. Census Bureau Population Estimates Program (PEP).
Data Coverage:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for Delta County Population by Year. You can refer the same here
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual distribution of students across grade levels in D.e.l.t.a. Steam Academy
https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Employed Persons in Delta County, CO (LAUCN080290000000005A) from 1990 to 2024 about Delta County, CO; CO; persons; household survey; employment; and USA.
https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Resident Population in Delta County, CO (CODELT9POP) from 1970 to 2024 about Delta County, CO; CO; residents; population; and USA.
https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Unemployment Rate in Delta County, CO (CODELT9URN) from Jan 1990 to Jun 2025 about Delta County, CO; CO; unemployment; rate; and USA.
HuggingFaceH4/deita-6k-v0-sft dataset hosted on Hugging Face and contributed by the HF Datasets community
Annual report submitted by the DWR's Delta Modeling Section
Newsletters for the DWR Delta Modeling User Group (DMUG)
This dataset provides information about the number of properties, residents, and average property values for Flintville Road cross streets in Delta, PA.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for Deita 10K V0
GitHub | Paper Deita is an open-sourced project designed to facilitate Automatic Data Selection for instruction tuning in Large Language Models (LLMs). This dataset includes 10k of lightweight, high-quality alignment SFT data, mainly automatically selected from the following datasets:
ShareGPT (Apache 2.0 listed, no official repo found): Use the 58 K ShareGPT dataset for selection. UltraChat (MIT): Sample 105 K UltraChat dataset for selection.… See the full description on the dataset page: https://huggingface.co/datasets/hkust-nlp/deita-10k-v0.