12 datasets found

National Population Projections: Projected Births by Sex, Race, and Hispanic...
catalog.data.gov
gimi9.com
Updated Jul 19, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Census Bureau (2023). National Population Projections: Projected Births by Sex, Race, and Hispanic Origin for the United States: 2016-2060 [Dataset]. https://catalog.data.gov/dataset/national-population-projections-projected-births-by-sex-race-and-hispanic-origin-for-2016-
Explore at:
Dataset updated
Jul 19, 2023
Dataset provided by
United States Census Bureauhttp://census.gov/
Area covered
United States
Description
Projected Births by Sex, Race, and Hispanic Origin for the United States: 2016-2060 // Source: U.S. Census Bureau, Population Division // There are four projection scenarios: 1. Main series, 2. High Immigration series, 3. Low Immigration series, and 4. Zero Immigration series. // Note: Hispanic origin is considered an ethnicity, not a race. Hispanics may be of any race. All projected births are considered native born. // For detailed information about the methods used to create the population projections, see https://www2.census.gov/programs-surveys/popproj/technical-documentation/methodology/methodstatement17.pdf. // Population projections are estimates of the population for future dates. They are typically based on an estimated population consistent with the most recent decennial census and are produced using the cohort-component method. Projections illustrate possible courses of population change based on assumptions about future births, deaths, net international migration, and domestic migration. The Population Estimates and Projections Program provides additional information on its website: https://www.census.gov/programs-surveys/popproj.html.
N
South Carolina Non-Hispanic Population Breakdown By Race Dataset:...
neilsberg.com
csv, json
Updated Feb 21, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2025). South Carolina Non-Hispanic Population Breakdown By Race Dataset: Non-Hispanic Population Counts and Percentages for 7 Racial Categories as Identified by the US Census Bureau // 2025 Edition [Dataset]. https://www.neilsberg.com/insights/south-carolina-population-by-race/
Explore at:
csv, jsonAvailable download formats
Dataset updated
Feb 21, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
South Carolina
Variables measured
Non-Hispanic Asian Population, Non-Hispanic Black Population, Non-Hispanic White Population, Non-Hispanic Some other race Population, Non-Hispanic Two or more races Population, Non-Hispanic American Indian and Alaska Native Population, Non-Hispanic Native Hawaiian and Other Pacific Islander Population, Non-Hispanic Asian Population as Percent of Total Non-Hispanic Population, Non-Hispanic Black Population as Percent of Total Non-Hispanic Population, Non-Hispanic White Population as Percent of Total Non-Hispanic Population, and 4 more
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. To measure the two variables, namely (a) Non-Hispanic population and (b) population as a percentage of the total Non-Hispanic population, we initially analyzed and categorized the data for each of the racial categories idetified by the US Census Bureau. It is ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories, and are part of Non-Hispanic classification. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the Non-Hispanic population of South Carolina by race. It includes the distribution of the Non-Hispanic population of South Carolina across various race categories as identified by the Census Bureau. The dataset can be utilized to understand the Non-Hispanic population distribution of South Carolina across relevant racial categories.

Key observations

Of the Non-Hispanic population in South Carolina, the largest racial group is White alone with a population of 3.24 million (66.97% of the total Non-Hispanic population).

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Racial categories include:

White

Black or African American

American Indian and Alaska Native

Asian

Native Hawaiian and Other Pacific Islander

Some other race

Two or more races (multiracial)

Variables / Data Columns

Race: This column displays the racial categories (for Non-Hispanic) for the South Carolina

Population: The population of the racial category (for Non-Hispanic) in the South Carolina is shown in this column.

% of Total Population: This column displays the percentage distribution of each race as a proportion of South Carolina total Non-Hispanic population. Please note that the sum of all percentages may not equal one due to rounding of values.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for South Carolina Population by Race & Ethnicity. You can refer the same here
F
US Spanish TTS Speech Dataset for Speech Synthesis
futurebeeai.com
wav
Updated Aug 1, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FutureBee AI (2022). US Spanish TTS Speech Dataset for Speech Synthesis [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/tts-monolgue-spanish-usa
Explore at:
wavAvailable download formats
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Dataset funded by
FutureBeeAI
Description
The Spanish TTS Monologue Speech Dataset is a professionally curated resource built to train realistic, expressive, and production-grade text-to-speech (TTS) systems. It contains studio-recorded long-form speech by trained native Spanish voice artists, each contributing 1 to 2 hours of clean, uninterrupted monologue audio.
Unlike typical prompt-based datasets with short, isolated phrases, this collection features long-form, topic-driven monologues that mirror natural human narration. It includes content types that are directly useful for real-world applications, like audiobook-style storytelling, educational lectures, health advisories, product explainers, digital how-tos, formal announcements, and more.
All recordings are captured in professional studios using high-end equipment and under the guidance of experienced voice directors.
Recording & Audio Quality
•
Audio Format: WAV, 48 kHz, available in 16-bit, 24-bit, and 32-bit depth

•
SNR: Minimum 30 dB

•
Channel: Mono

•
Recording Duration: 20-30 minutes

•
Recording Environment: Studio-controlled, acoustically treated rooms

•
Per Speaker Volume: 1–2 hours of speech per artist

•
Quality Control: Each file is reviewed and cleaned for common acoustic issues, including: reverberation, lip smacks, mouth clicks, thumping, hissing, plosives, sibilance, background noise, static interference, clipping, and other artifacts.

Only clean, production-grade audio makes it into the final dataset.
Voice Artist Selection
All voice artists are native Spanish speakers with professional training or prior experience in narration. We ensure a diverse pool in terms of age, gender, and region to bring a balanced and rich vocal dataset.
•Artist Profile:
•Gender: Male and Female
•Age Range: 20–60 years
•Regions: Native Spanish-speaking states from USA
•
Selection Process: All artists are screened, onboarded, and sample-approved using FutureBeeAI’s proprietary Yugo platform.

Script Quality & Coverage
Scripts are not generic or repetitive. Scripts are professionally authored by domain experts to reflect real-world use cases. They avoid redundancy and include modern vocabulary, emotional range, and phonetically rich sentence structures.
•
Word Count per Script: 3,000–5,000 words per 30-minute session

•Content Types:
•Storytelling
•Script and book reading
•Informational explainers
•Government service instructions
•E-commerce tutorials
•Motivational content
•Health & wellness guides
•Education & career advice
•
Linguistic Design: Balanced punctuation, emotional range, modern syntax, and vocabulary diversity

Transcripts & Alignment
While the script is used during the recording, we also provide post-recording updates to ensure the transcript reflects the final spoken audio. Minor edits are made to adjust for skipped or rephrased words.
•
Segmentation: Time-stamped at the sentence level, aligned to actual spoken delivery

•
Format: Available in plain text and JSON

•Post-processing:
•Corrected for disfluencies
<div
f
Social-group identity and population substructure in admixed populations in...
plos.figshare.com
datasetcatalog.nlm.nih.gov
tiff
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Meghan E. Healy; Deirdre Hill; Marianne Berwick; Heather Edgar; Jessica Gross; Keith Hunley (2023). Social-group identity and population substructure in admixed populations in New Mexico and Latin America [Dataset]. http://doi.org/10.1371/journal.pone.0185503
Explore at:
tiffAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0185503
Dataset updated
May 31, 2023
Dataset provided by
PLOS ONE
Authors
Meghan E. Healy; Deirdre Hill; Marianne Berwick; Heather Edgar; Jessica Gross; Keith Hunley
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
New Mexico, Latin America
Description
We examined the relationship between continental-level genetic ancestry and racial and ethnic identity in an admixed population in New Mexico with the goal of increasing our understanding of how racial and ethnic identity influence genetic substructure in admixed populations. Our sample consists of 98 New Mexicans who self-identified as Hispanic or Latino (NM-HL) and who further categorized themselves by race and ethnic subgroup membership. The genetic data consist of 270 newly-published autosomal microsatellites from the NM-HL sample and previously published data from 57 globally distributed populations, including 13 admixed samples from Central and South America. For these data, we 1) summarized the major axes of genetic variation using principal component analyses, 2) performed tests of Hardy Weinberg equilibrium, 3) compared empirical genetic ancestry distributions to those predicted under a model of admixture that lacked substructure, 4) tested the hypotheses that individuals in each sample had 100%, 0%, and the sample-mean percentage of African, European, and Native American ancestry. We found that most NM-HL identify themselves and their parents as belonging to one of two groups, conforming to a region-specific narrative that distinguishes recent immigrants from Mexico from individuals whose families have resided in New Mexico for generations and who emphasize their Spanish heritage. The “Spanish” group had significantly lower Native American ancestry and higher European ancestry than the “Mexican” group. Positive FIS values, PCA plots, and heterogeneous ancestry distributions suggest that most Central and South America admixed samples also contain substructure, and that this substructure may be related to variation in social identity. Genetic substructure appears to be common in admixed populations in the Americas and may confound attempts to identify disease-causing genes and to understand the social causes of variation in health outcomes and social inequality.
f
DataSheet_1_Association Between Tumor Mutation Profile and Clinical Outcomes...
figshare.com
frontiersin.figshare.com
docx
Updated Jun 5, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alexander Philipovskiy; Reshad Ghafouri; Alok Kumar Dwivedi; Luis Alvarado; Richard McCallum; Felipe Maegawa; Ioannis T. Konstantinidis; Nawar Hakim; Scott Shurmur; Sanjay Awasthi; Sumit Gaur; Javier Corral (2023). DataSheet_1_Association Between Tumor Mutation Profile and Clinical Outcomes Among Hispanic-Latino Patients With Metastatic Colorectal Cancer.docx [Dataset]. http://doi.org/10.3389/fonc.2021.772225.s001
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.3389/fonc.2021.772225.s001
Dataset updated
Jun 5, 2023
Dataset provided by
Frontiers
Authors
Alexander Philipovskiy; Reshad Ghafouri; Alok Kumar Dwivedi; Luis Alvarado; Richard McCallum; Felipe Maegawa; Ioannis T. Konstantinidis; Nawar Hakim; Scott Shurmur; Sanjay Awasthi; Sumit Gaur; Javier Corral
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
In the United States, CRC is the third most common type of cancer and the second leading cause of cancer-related death. Although the incidence of CRC among the Hispanic population has been declining, recently, a dramatic increase in CRC incidents among HL younger than 50 years of age has been reported. The incidence of early-onset CRC is more significant in HL population (45%) than in non-Hispanic Whites (27%) and African-Americans (15%). The reason for these racial disparities and the biology of CRC in the HL are not well understood. We performed this study to understand the biology of the disease in HL patients. We analyzed formalin-fixed paraffin-embedded tumor tissue samples from 52 HL patients with mCRC. We compared the results with individual patient clinical histories and outcomes. We identified commonly altered genes in HL patients (APC, TP53, KRAS, GNAS, and NOTCH). Importantly, mutation frequencies in the APC gene were significantly higher among HL patients. The combination of mutations in the APC, NOTCH, and KRAS genes in the same tumors was associated with a higher risk of progression after first-line of chemotherapy and overall survival. Our data support the notion that the molecular drivers of CRC might be different in HL patients.
Data from: Gangs in Rural America, 1996-1998
catalog.data.gov
datasets.ai
+1more
Updated Mar 12, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Institute of Justice (2025). Gangs in Rural America, 1996-1998 [Dataset]. https://catalog.data.gov/dataset/gangs-in-rural-america-1996-1998-9527e
Explore at:
Dataset updated
Mar 12, 2025
Dataset provided by
National Institute of Justicehttp://nij.ojp.gov/
Area covered
United States
Description
This study was undertaken to enable cross-community analysis of gang trends in all areas of the United States. It was also designed to provide a comparative analysis of social, economic, and demographic differences among non-metropolitan jurisdictions in which gangs were reported to have been persistent problems, those in which gangs had been more transitory, and those that reported no gang problems. Data were collected from four separate sources and then merged into a single dataset using the county Federal Information Processing Standards (FIPS) code as the attribute of common identification. The data sources included: (1) local police agency responses to three waves (1996, 1997, and 1998) of the National Youth Gang Survey (NYGS), (2) rural-urban classification and county-level measures of primary economic activity from the Economic Research Service (ERS) of the United States Department of Agriculture, (3) county-level economic and demographic data from the County and City Data Book, 1994, and from USA Counties, 1998, produced by the United States Department of Commerce, and (4) county-level data on access to interstate highways provided by Tom Ricketts and Randy Randolph of the University of North Carolina at Chapel Hill. Variables include the FIPS codes for state, county, county subdivision, and sub-county, population in the agency jurisdiction, type of jurisdiction, and whether the county was dependent on farming, mining, manufacturing, or government. Other variables categorizing counties include retirement destination, federal lands, commuting, persistent poverty, and transfer payments. The year gang problems began in that jurisdiction, number of youth groups, number of active gangs, number of active gang members, percent of gang members who migrated, and the number of gangs in 1996, 1997, and 1998 are also available. Rounding out the variables are unemployment rates, median household income, percent of persons in county below poverty level, percent of family households that were one-parent households, percent of housing units in the county that were vacant, had no telephone, or were renter-occupied, resident population of the county in 1990 and 1997, change in unemployment rates, land area of county, percent of persons in the county speaking Spanish at home, and whether an interstate highway intersected the county.
g
HAZUS, Race Demographics, Washington Section of the Portland Oregon MSA,...
geocommons.com
Updated Jun 2, 2008
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data (2008). HAZUS, Race Demographics, Washington Section of the Portland Oregon MSA, 2006 [Dataset]. http://geocommons.com/search.html
Explore at:
Dataset updated
Jun 2, 2008
Dataset provided by
HAZUS
data
Description
HAZUS is an abbreviation for Hazards United States, and was developed by FEMA. The HAZUS dataset was designed to estimate the potential physical, economic and social losses during hazardous events such as flooding or earthquakes. To measure the social impact of these events, HAZUS includes detailed demographic data for the United States. This dataset pulls out the racial data from the demographic files, at the census block level for the Washington portion of the Portland Metropolitan Statistic Area (MSA). Attributes include Whites, Blacks, Asians, Hispanics, Hawaiian and Pacific Islanders, Native Americans, and populations stating other race. Demographics data was recent as of May 2006.
g
Census, Basic Demographic Data by Tract, San Francisco, 2000
geocommons.com
Updated May 6, 2008
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data (2008). Census, Basic Demographic Data by Tract, San Francisco, 2000 [Dataset]. http://geocommons.com/search.html
Explore at:
Dataset updated
May 6, 2008
Dataset provided by
US Census
data
Description
This Dataset shows some basic demographic data from the US census located around the San Francisco MSA at tract level. Attributes include Average age, female and male population, white population, hispanic population, population density, and total population.
g
CARMA, Spain Power Plant Emissions, Spain, 2000/2007/Future
geocommons.com
Updated May 5, 2008
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CARMA (2008). CARMA, Spain Power Plant Emissions, Spain, 2000/2007/Future [Dataset]. http://geocommons.com/search.html
Explore at:
Dataset updated
May 5, 2008
Dataset provided by
data
CARMA
Description
All the data for this dataset is provided from CARMA: Data from CARMA (www.carma.org) This dataset provides information about Power Plant emissions in Spain. Power Plant emissions from all power plants in Spain were obtained by CARMA for the past (2000 Annual Report), the present (2007 data), and the future. CARMA determine data presented for the future to reflect planned plant construction, expansion, and retirement. The dataset provides the name, company, parent company, city, state, zip, county, metro area, lat/lon, and plant id for each individual power plant. The dataset reports for the three time periods: Intensity: Pounds of CO2 emitted per megawatt-hour of electricity produced. Energy: Annual megawatt-hours of electricity produced. Carbon: Annual carbon dioxide (CO2) emissions. The units are short or U.S. tons. Multiply by 0.907 to get metric tons. Carbon Monitoring for Action (CARMA) is a massive database containing information on the carbon emissions of over 50,000 power plants and 4,000 power companies worldwide. Power generation accounts for 40% of all carbon emissions in the United States and about one-quarter of global emissions. CARMA is the first global inventory of a major, sector of the economy. The objective of CARMA.org is to equip individuals with the information they need to forge a cleaner, low-carbon future. By providing complete information for both clean and dirty power producers, CARMA hopes to influence the opinions and decisions of consumers, investors, shareholders, managers, workers, activists, and policymakers. CARMA builds on experience with public information disclosure techniques that have proven successful in reducing traditional pollutants. Please see carma.org for more information
g
Center for Disease Control and Prevention, National Vital Statistics...
geocommons.com
Updated May 6, 2008
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Emily Sciarillo (2008). Center for Disease Control and Prevention, National Vital Statistics Reports: Births, USA, 2005 [Dataset]. http://geocommons.com/search.html
Explore at:
Dataset updated
May 6, 2008
Dataset provided by
Center for Disease Control and Prevention, National Center for Health Statistics
data
Authors
Emily Sciarillo
Description
This dataset was created from the CDC's National Vital Statistics Reports Volume 56, Number 6. The dataset includes all data available from this report by state level and includes births by race and Hispanic origin, births to unmarried women, rates of cesarean delivery, and twin and multiple birth rates. The data are final for 2005. No value is represented by a -1. "Descriptive tabulations of data reported on the birth certificates of the 4.1 million births that occurred in 2005 are presented. Denominators for population-based rates are postcensal estimates derived from the U.S. 2000 census".
g
NCES, Percentage of eighth-grade public school students and average scores...
geocommons.com
Updated May 9, 2008
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data (2008). NCES, Percentage of eighth-grade public school students and average scores in NAEP writing by race and state, USA, 2007 [Dataset]. http://geocommons.com/search.html
Explore at:
Dataset updated
May 9, 2008
Dataset provided by
U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress
data
Description
This dataset explores Percentage of eighth-grade public school students and average scores in NAEP writing by race and state, USA, 2007 Notes: Not available. The state/jurisdiction did not participate. # Rounds to zero. Reporting standards not met. Sample size is insufficient to permit a reliable estimate. NOTE: Black includes African American, Hispanic includes Latino, and Pacifi c Islander includes Native Hawaiian. Race categories exclude Hispanic origin. Results are not shown for students whose race/ethnicity was unclassified Detail may not sum to totals because of rounding. SOURCE: U.S. Department of Education, Institute of Education Sciences, National Center for Education Statistics, National Assessment of Educational Progress (NAEP), 2007 Writing Assessment.
g
Paginasamarillas, Bars, Pamplona Spain, 2008
geocommons.com
Updated Jul 8, 2008
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Emily Sciarillo (2008). Paginasamarillas, Bars, Pamplona Spain, 2008 [Dataset]. http://geocommons.com/search.html
Explore at:
Dataset updated
Jul 8, 2008
Dataset provided by
emily
http://www.paginasamarillas.es
Authors
Emily Sciarillo
Description
This dataset contains bars in Pamplona, Spain. This list does not necessarally contain all of the bars in Pamplona.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

U.S. Census Bureau (2023). National Population Projections: Projected Births by Sex, Race, and Hispanic Origin for the United States: 2016-2060 [Dataset]. https://catalog.data.gov/dataset/national-population-projections-projected-births-by-sex-race-and-hispanic-origin-for-2016-

National Population Projections: Projected Births by Sex, Race, and Hispanic Origin for the United States: 2016-2060

Explore at:

Dataset updated

Jul 19, 2023

Dataset provided by

United States Census Bureauhttp://census.gov/

Area covered

United States

Description

Projected Births by Sex, Race, and Hispanic Origin for the United States: 2016-2060 // Source: U.S. Census Bureau, Population Division // There are four projection scenarios: 1. Main series, 2. High Immigration series, 3. Low Immigration series, and 4. Zero Immigration series. // Note: Hispanic origin is considered an ethnicity, not a race. Hispanics may be of any race. All projected births are considered native born. // For detailed information about the methods used to create the population projections, see https://www2.census.gov/programs-surveys/popproj/technical-documentation/methodology/methodstatement17.pdf. // Population projections are estimates of the population for future dates. They are typically based on an estimated population consistent with the most recent decennial census and are produced using the cohort-component method. Projections illustrate possible courses of population change based on assumptions about future births, deaths, net international migration, and domestic migration. The Population Estimates and Projections Program provides additional information on its website: https://www.census.gov/programs-surveys/popproj.html.

Clear search

Close search

Google apps

Main menu

National Population Projections: Projected Births by Sex, Race, and Hispanic...

South Carolina Non-Hispanic Population Breakdown By Race Dataset:...

About this dataset

Content

Inspiration

Recommended for further research

US Spanish TTS Speech Dataset for Speech Synthesis

Recording & Audio Quality

Voice Artist Selection

Script Quality & Coverage

Transcripts & Alignment

Social-group identity and population substructure in admixed populations in...

DataSheet_1_Association Between Tumor Mutation Profile and Clinical Outcomes...

Data from: Gangs in Rural America, 1996-1998

HAZUS, Race Demographics, Washington Section of the Portland Oregon MSA,...

Census, Basic Demographic Data by Tract, San Francisco, 2000

CARMA, Spain Power Plant Emissions, Spain, 2000/2007/Future

Center for Disease Control and Prevention, National Vital Statistics...

NCES, Percentage of eighth-grade public school students and average scores...

Paginasamarillas, Bars, Pamplona Spain, 2008

National Population Projections: Projected Births by Sex, Race, and Hispanic Origin for the United States: 2016-2060See More Versions

National Population Projections: Projected Births by Sex, Race, and Hispanic Origin for the United States: 2016-2060