22 datasets found
  1. 1940 Census: Official 1940 Census Website

    • catalog.data.gov
    • data.amerigeoss.org
    Updated Nov 7, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Archives and Records Administration (2024). 1940 Census: Official 1940 Census Website [Dataset]. https://catalog.data.gov/dataset/1940-census-official-1940-census-website
    Explore at:
    Dataset updated
    Nov 7, 2024
    Dataset provided by
    National Archives and Records Administrationhttp://www.archives.gov/
    Description

    Website alows the public full access to the 1940 Census images, census maps and descriptions.

  2. Historic US Census - 1940

    • redivis.com
    application/jsonl +7
    Updated Jan 10, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford Center for Population Health Sciences (2020). Historic US Census - 1940 [Dataset]. http://doi.org/10.57761/660g-eq95
    Explore at:
    avro, arrow, sas, application/jsonl, spss, parquet, stata, csvAvailable download formats
    Dataset updated
    Jan 10, 2020
    Dataset provided by
    Redivis Inc.
    Authors
    Stanford Center for Population Health Sciences
    Time period covered
    Jan 1, 1940 - Dec 31, 1940
    Area covered
    United States
    Description

    Abstract

    The Integrated Public Use Microdata Series (IPUMS) Complete Count Data include more than 650 million individual-level and 7.5 million household-level records. The IPUMS microdata are the result of collaboration between IPUMS and the nation’s two largest genealogical organizations—Ancestry.com and FamilySearch—and provides the largest and richest source of individual level and household data.

    Before Manuscript Submission

    All manuscripts (and other items you'd like to publish) must be submitted to

    phsdatacore@stanford.edu for approval prior to journal submission.

    We will check your cell sizes and citations.

    For more information about how to cite PHS and PHS datasets, please visit:

    https:/phsdocs.developerhub.io/need-help/citing-phs-data-core

    Documentation

    Historic data are scarce and often only exists in aggregate tables. The key advantage of historic US census data is the availability of individual and household level characteristics that researchers can tabulate in ways that benefits their specific research questions. The data contain demographic variables, economic variables, migration variables and family variables. Within households, it is possible to create relational data as all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility to follow individuals over time using a historical identifier.

    In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.Historic data are scarce and often only exists in aggregate tables. The key advantage of historic US census data is the availability of individual and household level characteristics that researchers can tabulate in ways that benefits their specific research questions. The data contain demographic variables, economic variables, migration variables and family variables. Within households, it is possible to create relational data as all relations between household members are known. For example, having data on the mother and her children in a household enables researchers to calculate the mother’s age at birth. Another advantage of the Complete Count data is the possibility to follow individuals over time using a historical identifier. In sum: the historic US census data are a unique source for research on social and economic change and can provide population health researchers with information about social and economic determinants.

    The historic US 1940 census data was collected in April 1940. Enumerators collected data traveling to households and counting the residents who regularly slept at the household. Individuals lacking permanent housing were counted as residents of the place where they were when the data was collected. Household members absent on the day of data collected were either listed to the household with the help of other household members or were scheduled for the last census subdivision.

    Notes

    • We provide IPUMS household and person data separately so that it is convenient to explore the descriptive statistics on each level. In order to obtain a full dataset, merge the household and person on the variables SERIAL and SERIALP. In order to create a longitudinal dataset, merge datasets on the variable HISTID.
    • Households with more than 60 people in the original data were broken up for processing purposes. Every person in the large households are considered to be in their own household. The original large households can be identified using the variable SPLIT40, reconstructed using the variable SERIAL40, and the original count is found in the variable NUMPREC40.
    • Some variables are missing from this data set for specific enumeration districts. The enumeration districts with missing data can be identified using the variable EDMISS. These variables will be added in a future release.
    • Coded variables derived from string variables are still in progress. These variables include: occupation, industry and migration status.
    • Missing observations have been allocated and some inconsistencies have been edited for the following variables: Missing observations have been allocated and some inconsistencies have been edited for the following variables: SURSIM, SEX, SCHOOL, RELATE, RACE, OCC1950, MTONGUE, MBPL, FBPL, BPL, MARST, EMPSTAT, CITIZEN, OWNERSHP. The flag variables indicating an allocated observation for the associated variables can be included in your extract by clicking the ‘Select data quality flags’ box on the extract summary page.
    • Most inconsistent information was not edited for this release, thus there are observations outside of the universe for many variables. In particular, the variables GQ, and GQTYPE have known inconsistencies and will be improved with the next r
  3. Census of Population, 1940 [United States]: Public Use Microdata Sample

    • icpsr.umich.edu
    ascii
    Updated Jan 12, 2006
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States. Bureau of the Census (2006). Census of Population, 1940 [United States]: Public Use Microdata Sample [Dataset]. http://doi.org/10.3886/ICPSR08236.v1
    Explore at:
    asciiAvailable download formats
    Dataset updated
    Jan 12, 2006
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    Authors
    United States. Bureau of the Census
    License

    https://www.icpsr.umich.edu/web/ICPSR/studies/8236/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/8236/terms

    Time period covered
    1940
    Area covered
    United States, New Hampshire, Vermont, Hawaii, Florida, Connecticut, Washington, Maryland, New Mexico, New York (state)
    Description

    The 1940 Census Public Use Microdata Sample Project was assembled through a collaborative effort between the United States Bureau of the Census and the Center for Demography and Ecology at the University of Wisconsin. The collection contains a stratified 1-percent sample of households, with separate records for each household, for each "sample line" respondent, and for each person in the household. These records were encoded from microfilm copies of original handwritten enumeration schedules from the 1940 Census of Population. Geographic identification of the location of the sampled households includes Census regions and divisions, states (except Alaska and Hawaii), standard metropolitan areas (SMAs), and state economic areas (SEAs). Accompanying the data collection is a codebook that includes an abstract, descriptions of sample design, processing procedures and file structure, a data dictionary (record layout), category code lists, and a glossary. Also included is a procedural history of the 1940 Census. Each of the 20 subsamples contains three record types: household, sample line, and person. Household variables describe the location and condition of the household. The sample line records contain variables describing demographic characteristics such as nativity, marital status, number of children, veteran status, wage deductions for Social Security, and occupation. Person records also contain variables describing demographic characteristics including nativity, marital status, family membership, education, employment status, income, and occupation.

  4. 1940 Census Population Schedules, Enumeration District Maps, and Enumeration...

    • registry.opendata.aws
    Updated Apr 15, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Archives and Records Administration (NARA) (2021). 1940 Census Population Schedules, Enumeration District Maps, and Enumeration District Descriptions [Dataset]. https://registry.opendata.aws/nara-1940-census/
    Explore at:
    Dataset updated
    Apr 15, 2021
    Dataset provided by
    National Archives and Records Administrationhttp://www.archives.gov/
    Description

    The 1940 Census population schedules were created by the Bureau of the Census in an attempt to enumerate every person living in the United States on April 1, 1940, although some persons were missed. The 1940 census population schedules were digitized by the National Archives and Records Administration (NARA) and released publicly on April 2, 2012. The 1940 Census enumeration district maps contain maps of counties, cities, and other minor civil divisions that show enumeration districts, census tracts, and related boundaries and numbers used for each census. The coverage is nation wide and includes territorial areas. The 1940 Census enumeration district descriptions contain written descriptions of census districts, subdivisions, and enumeration districts.

  5. r

    Households

    • redivis.com
    Updated Jan 10, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford Center for Population Health Sciences (2020). Households [Dataset]. https://redivis.com/datasets/fdpr-cd26cbc9y
    Explore at:
    Dataset updated
    Jan 10, 2020
    Dataset authored and provided by
    Stanford Center for Population Health Sciences
    Time period covered
    1940
    Description

    This dataset includes all households from the 1940 US census.

  6. r

    Persons

    • redivis.com
    Updated Jan 10, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford Center for Population Health Sciences (2020). Persons [Dataset]. https://redivis.com/datasets/fdpr-cd26cbc9y
    Explore at:
    Dataset updated
    Jan 10, 2020
    Dataset authored and provided by
    Stanford Center for Population Health Sciences
    Time period covered
    1940
    Description

    This dataset includes all individuals from the 1940 US census.

  7. r

    Lookup

    • redivis.com
    Updated Jan 10, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford Center for Population Health Sciences (2020). Lookup [Dataset]. https://redivis.com/datasets/fdpr-cd26cbc9y
    Explore at:
    Dataset updated
    Jan 10, 2020
    Dataset authored and provided by
    Stanford Center for Population Health Sciences
    Description

    This dataset includes variable names, variable labels, variable values, and corresponding variable value labels for the IPUMS 1940 datasets.

  8. d

    Census Linking Project: 1920-1940 Crosswalk

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 9, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abramitzky, Ran; Boustan, Leah; Eriksson, Katherine; Rashid, Myera; Pérez, Santiago (2023). Census Linking Project: 1920-1940 Crosswalk [Dataset]. http://doi.org/10.7910/DVN/M7KNVH
    Explore at:
    Dataset updated
    Nov 9, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Abramitzky, Ran; Boustan, Leah; Eriksson, Katherine; Rashid, Myera; Pérez, Santiago
    Description

    This crosswalk consists of individuals matched between the 1920 and 1940 complete-count US Censuses. Within the crosswalk, users have the option to select the linking method with which these matches were created. This version of the crosswalk contains links made by the ABE-exact (conservative and standard) method, the ABE-NYSIIS (conservative and standard) method and the ABE-NYSIIS (conservative and standard) method where race is used as a matching variable. For any chosen method, users can merge into this crosswalk a wide set of individual- and household-level variables provided publicly by IPUMS, thereby creating a historical longitudinal dataset for analysis.

  9. d

    Census Linking Project: 1880-1940 Crosswalk

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 9, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abramitzky, Ran; Boustan, Leah; Eriksson, Katherine; Rashid, Myera; Pérez, Santiago (2023). Census Linking Project: 1880-1940 Crosswalk [Dataset]. http://doi.org/10.7910/DVN/HI60E6
    Explore at:
    Dataset updated
    Nov 9, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Abramitzky, Ran; Boustan, Leah; Eriksson, Katherine; Rashid, Myera; Pérez, Santiago
    Description

    This crosswalk consists of individuals matched between the 1880 and 1940 complete-count US Censuses. Within the crosswalk, users have the option to select the linking method with which these matches were created. This version of the crosswalk contains links made by the ABE-exact (conservative and standard) method, the ABE-NYSIIS (conservative and standard) method and the ABE-NYSIIS (conservative and standard) method where race is used as a matching variable. For any chosen method, users can merge into this crosswalk a wide set of individual- and household-level variables provided publicly by IPUMS, thereby creating a historical longitudinal dataset for analysis.

  10. d

    Baldwin-Green Study: Canada-U.S. Census of Industry 1867-1940

    • search.dataone.org
    • borealisdata.ca
    Updated Dec 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    John Baldwin; Alan Green (2023). Baldwin-Green Study: Canada-U.S. Census of Industry 1867-1940 [Dataset]. https://search.dataone.org/view/sha256%3A77bbc22a8a786a93216bb616cbe75dfe9170485ce58720426447e7cb0d0a2568
    Explore at:
    Dataset updated
    Dec 28, 2023
    Dataset provided by
    Borealis
    Authors
    John Baldwin; Alan Green
    Time period covered
    Jan 1, 1867 - Jan 1, 1940
    Description

    This study matches Canadian and US manufacturing industries at the 2-digit SIC code level for census years 1900 to 1940. Canadian figures start at 1870. Only general figures were recorded, such as number of employees, number of establishments, salary and wages, gross production, cost of input materials, gross value added. The project does have some drawbacks, such as the lack of US figures gross production, cost of materials, and lack of figures for the iron and steel industry. But for an aggregate comparison of the two countries, the numbers can be considered reliable.

  11. e

    Alaskan Population Demographic Information from Decennial and American...

    • knb.ecoinformatics.org
    • search.dataone.org
    • +1more
    Updated Apr 11, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States Census Bureau; Juliet Bachtel; John Randazzo; Erika Gavenus (2019). Alaskan Population Demographic Information from Decennial and American Community Survey Census Data, 1940-2016 [Dataset]. http://doi.org/10.5063/F10R9MPV
    Explore at:
    Dataset updated
    Apr 11, 2019
    Dataset provided by
    Knowledge Network for Biocomplexity
    Authors
    United States Census Bureau; Juliet Bachtel; John Randazzo; Erika Gavenus
    Time period covered
    Jan 1, 1940 - Dec 31, 2015
    Area covered
    Variables measured
    lat, lng, Year, city, ANVSA, Negro, Other, Place, White, Aleut., and 145 more
    Description

    These data comprise Census records relating to the Alaskan people's population demographics for the State of Alaskan Salmon and People (SASAP) Project. Decennial census data were originally extracted from IPUMS National Historic Geographic Information Systems website: https://data2.nhgis.org/main (Citation: Steven Manson, Jonathan Schroeder, David Van Riper, and Steven Ruggles. IPUMS National Historical Geographic Information System: Version 12.0 [Database]. Minneapolis: University of Minnesota. 2017. http://doi.org/10.18128/D050.V12.0). A number of relevant tables of basic demographics on age and race, household income and poverty levels, and labor force participation were extracted. These particular variables were selected as part of an effort to understand and potentially quantify various dimensions of well-being in Alaskan communities. The file "censusdata_master.csv" is a consolidation of all 21 other data files in the package. For detailed information on how the datasets vary over different years, view the file "readme.docx" available in this data package. The included .Rmd file is a script which combines the 21 files by year into a single file (censusdata_master.csv). It also cleans up place names (including typographical errors) and uses the USGS place names dataset and the SASAP regions dataset to assign latitude and longitude values and region values to each place in the dataset. Note that some places were not assigned a region or location because they do not fit well into the regional framework. Considerable heterogeneity exists between census surveys each year. While we have attempted to combine these datasets in a way that makes sense, there may be some discrepancies or unexpected values. The RMarkdown document SASAPWebsiteGraphicsCensus.Rmd is used to generate a variety of figures using these data, including the additional file Chignik_population.png. An additional set of 25 figures showing regional trends in population and income metrics are also included.

  12. d

    Old vs. new urban: U.S. national mapping of the year of “first development”...

    • catalog.data.gov
    • data.usgs.gov
    • +5more
    Updated Jul 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2024). Old vs. new urban: U.S. national mapping of the year of “first development” for urban areas, from 1940-2010. [Dataset]. https://catalog.data.gov/dataset/old-vs-new-urban-u-s-national-mapping-of-the-year-of-first-development-for-urban-area-1940
    Explore at:
    Dataset updated
    Jul 6, 2024
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Description

    This product is a 100-meter resolution raster which classifies urban lands in the conterminous United States by the year in which they were “first developed.” The classification is based on housing density by era for 1940-2010 developed by the University of Wisconsin-Madison SILVIS lab. The classification was applied by identifying a housing unit density threshold which matched current-era “development” as represented by U.S. Census Urban Areas polygons, then re-applying that threshold to former eras back to 1940. The result is a national raster (.tif format) with eight classes, for the years 1940 and before, 1950, 1960, 1970, 1980, 1990, 2000, and 2010, representing the year in which a pixel crossed the threshold of being “developed.” It is suitable for watershed mean calculations in distinguishing areas which have newly developed areas vs. older developed areas.

  13. o

    Data from: United States Microdata Samples Extract File, 1940-1980:...

    • explore.openaire.eu
    • icpsr.umich.edu
    Updated Dec 20, 1985
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Inter-University Consortium For Political And Social Research (1985). United States Microdata Samples Extract File, 1940-1980: Demographics of Aging [Dataset]. http://doi.org/10.3886/icpsr08353
    Explore at:
    Dataset updated
    Dec 20, 1985
    Authors
    Inter-University Consortium For Political And Social Research
    Area covered
    United States
    Description

    This is an extract of the decennial Public Use Microdata Sample (PUMS) released by the Bureau of the Census. Because the complete PUMS files contain several hundred thousand records, ICPSR has constructed this subset to allow for easier and less costly analysis. The collection of data at ten year increments allows the user to follow various age cohorts through the life-cycle. Data include information on the household and its occupants such as size and value of dwelling, utility costs, number of people in the household, and their relationship to the respondent. More detailed information was collected on the respondent, the head of household, and the spouse, if present. Variables include education, marital status, occupation and income. The stratified sample has unequal sampling rates across strata and requires the use of weights for analyses using more than one stratum. The epsem sample was selected in a second stage from the stratified sample and used compensating sampling rates within each stratum so that the overall probability of selection for each person is equal. The person level weight for use with the stratified sample and the household weight to be used with the epsem sample are included in the data file.Conducted by the United States Department of Commerce, Bureau of the Census. Stratified sample of adults contained in the Public Use Microdata Sample. Approximately 500 records were drawn from each of 28 sex/age/race strata. Additionally, an equal probability (epsem) sample was drawn from the stratified sample. Datasets: DS0: Study-Level Files DS1: United States Microdata Samples Extract File, 1940-1980: Demographics of Aging DS2: Frequencies, 1940-1980 For 1960-1980, all PUMS records for persons 18 and over. For 1940 and 1950, all sample line records.

  14. United States Agriculture Data, 1840 - 2012 - Archival Version

    • search.gesis.org
    Updated Aug 20, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Inter-University Consortium for Political and Social Research (2018). United States Agriculture Data, 1840 - 2012 - Archival Version [Dataset]. http://doi.org/10.3886/ICPSR35206
    Explore at:
    Dataset updated
    Aug 20, 2018
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    GESIS search
    License

    https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de451385https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de451385

    Description

    Abstract (en): This collection includes county-level data from the United States Censuses of Agriculture for the years 1840 to 2012. The files provide data about the number, types, output, and prices of various agricultural products, as well as information on the amount, expenses, sales, values, and production of machinery. Most of the basic crop output data apply to the previous harvest year. Data collected also included the population and value of livestock, the number of animals slaughtered, and the size, type, and value of farms. Part 46 of this collection contains data from 1980 through 2010. Variables in part 46 include information such as the average value of farmland, number and value of buildings per acre, food services, resident population, composition of households, and unemployment rates. ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major statistical software formats as well as standard codebooks to accompany the data. In addition to these procedures, ICPSR performed the following processing steps for this data collection: Checked for undocumented or out-of-range codes.. Response Rates: Not applicable. Datasets:DS0: Study-Level FilesDS1: Farm Land Value Data Set (County and State) 1850-1959DS2: 1840 County and StateDS3: 1850 County and StateDS4: 1860 County and StateDS5: 1870 County and StateDS6: 1880 County and StateDS7: 1890 County and StateDS8: 1900 County and StateDS9: 1910 County and StateDS10: 1920 County and State, Dataset 1DS11: 1920 County and State, Dataset 2DS12: 1925 County and StateDS13: 1930 County and State, Dataset 1DS14: 1930 County and State, Dataset 2DS15: 1935 County and StateDS16: 1940 County and State, Dataset 1DS17: 1940 County and State, Dataset 2DS18: 1940 County and State, Dataset 3DS19: 1940 County and State, Dataset 4 (Water)DS20: 1945 County and StateDS21: 1950 County and State, Dataset 1DS22: 1950 Crops, County and State, Dataset 2DS23: 1950 County, Dataset 3DS24: 1950 County and State, Dataset 4DS25: 1954 County and State, Dataset 1DS26: 1954 Crops, County and State, Dataset 2DS27: 1959 County and State, Dataset 1DS28: 1959 Crops, County and State, Dataset 2DS29: 1959 County, Dataset 3DS30: 1964 Dataset 1DS31: 1964 Crops, County and State, Dataset 2DS32: 1964 County, Dataset 3DS33: 1969 All Farms, County and State, Dataset 1DS34: 1969 Farms 2500, County and State, Dataset 2DS35: 1969 Crops, County and State, Dataset 3DS36: 1974 All Farms, County and State, Dataset 1DS37: 1974 Farms 2500, County and State, Dataset 2DS38: 1974 Crops, County and State, Dataset 3DS39: 1978 County and StateDS40: 1982 County and StateDS41: 1987 County and StateDS42: 1992 County and StateDS43: 1997 County and StateDS44: 2002 County and StateDS45: 2007 County and StateDS46: State and County Data, United States, 1980-2010DS47: 2012 County and State Farms within United States counties and states. Smallest Geographic Unit: FIPS code The sample was the universe of agricultural operating units. For 1969-2007, data were taken from computer files from the Census Bureau and the United States Department of Agriculture. 2018-08-20 The P.I. resupplied data and documentation for 1935 County and State (dataset 15) and 1997 County and State (dataset 43). Additionally, documentation updates and variable label revisions have been incorporated in datasets 22, 26, 28, 31, 35, and 38 at the request of the P.I.2016-06-29 The data and documentation for 2012 County and State (data set 47) have been added to this collection. The collection and documentation titles have been updated to reflect the new year.2015-08-05 The data, setup files, and documentation for 1964 Dataset 1 have been updated to reflect changes from the producer. Funding insitution(s): National Science Foundation (NSF-SES-0921732; 0648045). United States Department of Health and Human Services. National Institutes of Health (R01 HD057929).

  15. o

    Deep Roots of Racial Inequalities in US Healthcare: The 1906 American...

    • portal.sds.ox.ac.uk
    txt
    Updated Dec 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Benjamin Chrisinger (2023). Deep Roots of Racial Inequalities in US Healthcare: The 1906 American Medical Directory [Dataset]. http://doi.org/10.25446/oxford.24065709.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    Dec 5, 2023
    Dataset provided by
    University of Oxford
    Authors
    Benjamin Chrisinger
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    United States
    Description

    This dataset comprises physician-level entries from the 1906 American Medical Directory, the first in a series of semi-annual directories of all practicing physicians published by the American Medical Association [1]. Physicians are consistently listed by city, county, and state. Most records also include details about the place and date of medical training. From 1906-1940, Directories also identified the race of black physicians [2].This dataset comprises physician entries for a subset of US states and the District of Columbia, including all of the South and several adjacent states (Alabama, Arkansas, Delaware, Florida, Georgia, Kansas, Kentucky, Louisiana, Maryland, Mississippi, Missouri, North Carolina, Oklahoma, South Carolina, Tennessee, Texas, Virginia, West Virginia). Records were extracted via manual double-entry by professional data management company [3], and place names were matched to latitude/longitude coordinates. The main source for geolocating physician entries was the US Census. Historical Census records were sourced from IPUMS National Historical Geographic Information System [4]. Additionally, a public database of historical US Post Office locations was used to match locations that could not be found using Census records [5]. Fuzzy matching algorithms were also used to match misspelled place or county names [6].The source of geocoding match is described in the “match.source” field (Type of spatial match (census_YEAR = match to NHGIS census place-county-state for given year; census_fuzzy_YEAR = matched to NHGIS place-county-state with fuzzy matching algorithm; dc = matched to centroid for Washington, DC; post_places = place-county-state matched to Blevins & Helbock's post office dataset; post_fuzzy = matched to post office dataset with fuzzy matching algorithm; post_simp = place/state matched to post office dataset; post_confimed_missing = post office dataset confirms place and county, but could not find coordinates; osm = matched using Open Street Map geocoder; hand-match = matched by research assistants reviewing web archival sources; unmatched/hand_match_missing = place coordinates could not be found). For records where place names could not be matched, but county names could, coordinates for county centroids were used. Overall, 40,964 records were matched to places (match.type=place_point) and 931 to county centroids ( match.type=county_centroid); 76 records could not be matched (match.type=NA).Most records include information about the physician’s medical training, including the year of graduation and a code linking to a school. A key to these codes is given on Directory pages 26-27, and at the beginning of each state’s section [1]. The OSM geocoder was used to assign coordinates to each school by its listed location. Straight-line distances between physicians’ place of training and practice were calculated using the sf package in R [7], and are given in the “school.dist.km” field. Additionally, the Directory identified a handful of schools that were “fraudulent” (school.fraudulent=1), and institutions set up to train black physicians (school.black=1).AMA identified black physicians in the directory with the signifier “(col.)” following the physician’s name (race.black=1). Additionally, a number of physicians attended schools identified by AMA as serving black students, but were not otherwise identified as black; thus an expanded racial identifier was generated to identify black physicians (race.black.prob=1), including physicians who attended these schools and those directly identified (race.black=1).Approximately 10% of dataset entries were audited by trained research assistants, in addition to 100% of black physician entries. These audits demonstrated a high degree of accuracy between the original Directory and extracted records. Still, given the complexity of matching across multiple archival sources, it is possible that some errors remain; any identified errors will be periodically rectified in the dataset, with a log kept of these updates.For further information about this dataset, or to report errors, please contact Dr Ben Chrisinger (Benjamin.Chrisinger@tufts.edu). Future updates to this dataset, including additional states and Directory years, will be posted here: https://dataverse.harvard.edu/dataverse/amd.References:1. American Medical Association, 1906. American Medical Directory. American Medical Association, Chicago. Retrieved from: https://catalog.hathitrust.org/Record/000543547.2. Baker, Robert B., Harriet A. Washington, Ololade Olakanmi, Todd L. Savitt, Elizabeth A. Jacobs, Eddie Hoover, and Matthew K. Wynia. "African American physicians and organized medicine, 1846-1968: origins of a racial divide." JAMA 300, no. 3 (2008): 306-313. doi:10.1001/jama.300.3.306.3. GABS Research Consult Limited Company, https://www.gabsrcl.com.4. Steven Manson, Jonathan Schroeder, David Van Riper, Tracy Kugler, and Steven Ruggles. IPUMS National Historical Geographic Information System: Version 17.0 [GNIS, TIGER/Line & Census Maps for US Places and Counties: 1900, 1910, 1920, 1930, 1940, 1950; 1910_cPHA: ds37]. Minneapolis, MN: IPUMS. 2022. http://doi.org/10.18128/D050.V17.05. Blevins, Cameron; Helbock, Richard W., 2021, "US Post Offices", https://doi.org/10.7910/DVN/NUKCNA, Harvard Dataverse, V1, UNF:6:8ROmiI5/4qA8jHrt62PpyA== [fileUNF]6. fedmatch: Fast, Flexible, and User-Friendly Record Linkage Methods. https://cran.r-project.org/web/packages/fedmatch/index.html7. sf: Simple Features for R. https://cran.r-project.org/web/packages/sf/index.html

  16. e

    Data on Alaskan Population demographics ranging from 1940 to 2015

    • knb.ecoinformatics.org
    • dataone.org
    • +1more
    Updated Mar 14, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States Census Bureau; Juliet Bachtel; John Randazzo; Erika Gavenus (2019). Data on Alaskan Population demographics ranging from 1940 to 2015 [Dataset]. http://doi.org/10.5063/F1FQ9TV8
    Explore at:
    Dataset updated
    Mar 14, 2019
    Dataset provided by
    Knowledge Network for Biocomplexity
    Authors
    United States Census Bureau; Juliet Bachtel; John Randazzo; Erika Gavenus
    Time period covered
    Jan 1, 1940 - Dec 31, 2015
    Area covered
    Variables measured
    lat, lng, Name, Year, city, ANVSA, Negro, Other, Place, White, and 147 more
    Description

    These data comprise Census records relating to the Alaskan people's population demographics for the State of Alaskan Salmon and People (SASAP) Project. Decennial census data were originally extracted from IPUMS National Historic Geographic Information Systems website: https://data2.nhgis.org/main (Citation: Steven Manson, Jonathan Schroeder, David Van Riper, and Steven Ruggles. IPUMS National Historical Geographic Information System: Version 12.0 [Database]. Minneapolis: University of Minnesota. 2017. http://doi.org/10.18128/D050.V12.0). A number of relevant tables of basic demographics on age and race, household income and poverty levels, and labor force participation were extracted. These particular variables were selected as part of an effort to understand and potentially quantify various dimensions of well-being in Alaskan communities. The file "censusdata_master.csv" is a consolidation of all 21 other data files in the package. For detailed information on how the datasets vary over different years, view the file "readme.docx" available in this data package. The included .Rmd file is a script which combines the 21 files by year into a single file (censusdata_master.csv). It also cleans up place names (including typographical errors) and uses the USGS place names dataset and the SASAP regions dataset to assign latitude and longitude values and region values to each place in the dataset. Note that some places were not assigned a region or location because they do not fit well into the regional framework. Considerable heterogeneity exists between census surveys each year. While we have attempted to combine these datasets in a way that makes sense, there may be some discrepancies or unexpected values. The RMarkdown document SASAPWebsiteGraphicsCensus.Rmd is used to generate a variety of figures using these data, including the additional file Chignik_population.png

  17. F

    Total Households

    • fred.stlouisfed.org
    json
    Updated Nov 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Total Households [Dataset]. https://fred.stlouisfed.org/series/TTLHH
    Explore at:
    jsonAvailable download formats
    Dataset updated
    Nov 12, 2024
    License

    https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain

    Description

    Graph and download economic data for Total Households (TTLHH) from 1940 to 2024 about household survey, households, and USA.

  18. g

    Aging of Veterans of the Union Army: Surgeons' Certificates, Version S-1...

    • search.gesis.org
    Updated Jan 18, 2006
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fogel, Robert W., et al. (2006). Aging of Veterans of the Union Army: Surgeons' Certificates, Version S-1 Standardized, 1862-1940 - Archival Version [Dataset]. http://doi.org/10.3886/ICPSR03417
    Explore at:
    Dataset updated
    Jan 18, 2006
    Dataset provided by
    GESIS search
    ICPSR - Interuniversity Consortium for Political and Social Research
    Authors
    Fogel, Robert W., et al.
    License

    https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de436566https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de436566

    Area covered
    Union
    Description

    Abstract (en): This data collection constitutes a portion of the historical data collected by the project "Early Indicators of Later Work Levels, Disease, and Death." With the goal of constructing datasets suitable for longitudinal analyses of factors affecting the aging process, the project collects military, medical, and socioeconomic data on a sample of white males mustered into the Union Army during the Civil War. The surgeons' certificates contain information from examining physicians to determine eligibility for pension benefits. Also included are questions regarding the age, occupation, residence, and military experience of the veterans. These data can be linked to AGING OF VETERANS OF THE UNION ARMY: MILITARY, PENSION, AND MEDICAL RECORDS, 1820-1940 (ICPSR 6837) and AGING OF VETERANS OF THE UNION ARMY: UNITED STATES FEDERAL CENSUS RECORDS, 1850, 1860, 1900, 1910 (ICPSR 6836) using the variable "recidnum." This version of the Surgeons' Certificates differs from the previous version, AGING OF VETERANS OF THE UNION ARMY: SURGEONS' CERTIFICATES, 1860-1940 (ICPSR 2877), in that the data contain standard codes for medical variables and that 5,346 new observations have been added from Ohio veterans. This collection studies the health conditions and disabilities of Union Army veterans, identifying relationships between biomedical and socioeconomic conditions. Also examined is the impact of age at onset of disabilities, comorbidities, and rates of deterioration on waiting time to death. These data also look at the connection between the burden of diseases and the cause of death among Union Army veterans compared to that of persons dying toward the end of the twentieth century. The investigators seek to determine how the age-specific curve of chronic disease burdens after age 50 has changed over time. Union Army recruits in white volunteer infantry regiments. Commissioned officers, Black recruits, and other branches of the military were excluded from the universe. A one-stage cluster sample of Union Army companies was randomly selected from the "Regimental Books" housed at the National Archives in Washington, DC. 2006-01-18 File DOC3417.ALL.PDF was removed from any previous datasets and flagged as a study-level file, so that it will accompany all downloads.2006-01-18 File CB3417.ALL.PDF was removed from any previous datasets and flagged as a study-level file, so that it will accompany all downloads. Funding insitution(s): United States Department of Health and Human Services. National Institutes of Health (NIH-PO1-AG10120). National Science Foundation (NSF-SBR-9114981). (1) This collection contains 87,233 cases that are split into five files containing all the cases per group of variables. (2) Files can be merged by using the variables "recidnum" and "examnum." Users should refer to the Supplemental Documentation for information on merging these files.(3) The codebook and supplemental documentation are provided as Portable Document Format (PDF) files. The PDF file format was developed by Adobe Systems Incorporated and can be accessed using PDF reader software, such as the Adobe Acrobat Reader. Information on how to obtain a copy of the Acrobat Reader is provided on the ICPSR Web site.

  19. o

    HHUUD10: Historical Housing Unit and Urbanization Database 2010

    • osf.io
    Updated Feb 17, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Scott Markley; Steven Holloway; Taylor Hafley; Mathew Hauer (2023). HHUUD10: Historical Housing Unit and Urbanization Database 2010 [Dataset]. http://doi.org/10.17605/OSF.IO/FZV5E
    Explore at:
    Dataset updated
    Feb 17, 2023
    Dataset provided by
    Center For Open Science
    Authors
    Scott Markley; Steven Holloway; Taylor Hafley; Mathew Hauer
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Subcounty housing unit counts are important for studying geo-historical patterns of (sub)urbanization, land-use change, and residential loss and gain. The most commonly used subcounty geographical unit for social research in the United States is the census tract. However, their changing geometries and historically incomplete coverage present significant obstacles for longitudinal analysis that existing datasets do not adequately address. Overcoming these barriers, we provide housing unit estimates in consistent 2010 tract boundaries for every census year from 1940 to 2010 plus 2019 for the entire continental US. Moreover, we develop an “urbanization year” indicator that denotes if and when tracts became “urbanized” during this timeframe. We produce these data by blending existing interpolation techniques with a novel procedure we call “maximum reabsorption”. Conducting out-of-sample validation, we find that our hybrid approach generally produces more reliable estimates than existing alternatives. The final dataset, Historical Housing Unit and Urbanization Database 2010 (HHUUD10), has myriad potential uses for research involving housing, population, and land-use change, as well as (sub)urbanization.

  20. Historical Jewish population by region 1170-1995

    • statista.com
    Updated Jan 1, 2001
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2001). Historical Jewish population by region 1170-1995 [Dataset]. https://www.statista.com/statistics/1357607/historical-jewish-population/
    Explore at:
    Dataset updated
    Jan 1, 2001
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    The world's Jewish population has had a complex and tumultuous history over the past millennia, regularly dealing with persecution, pogroms, and even genocide. The legacy of expulsion and persecution of Jews, including bans on land ownership, meant that Jewish communities disproportionately lived in urban areas, working as artisans or traders, and often lived in their own settlements separate to the rest of the urban population. This separation contributed to the impression that events such as pandemics, famines, or economic shocks did not affect Jews as much as other populations, and such factors came to form the basis of the mistrust and stereotypes of wealth (characterized as greed) that have made up anti-Semitic rhetoric for centuries. Development since the Middle Ages The concentration of Jewish populations across the world has shifted across different centuries. In the Middle Ages, the largest Jewish populations were found in Palestine and the wider Levant region, with other sizeable populations in present-day France, Italy, and Spain. Later, however, the Jewish disapora became increasingly concentrated in Eastern Europe after waves of pogroms in the west saw Jewish communities move eastward. Poland in particular was often considered a refuge for Jews from the late-Middle Ages until the 18th century, when it was then partitioned between Austria, Prussia, and Russia, and persecution increased. Push factors such as major pogroms in the Russian Empire in the 19th century and growing oppression in the west during the interwar period then saw many Jews migrate to the United States in search of opportunity.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
National Archives and Records Administration (2024). 1940 Census: Official 1940 Census Website [Dataset]. https://catalog.data.gov/dataset/1940-census-official-1940-census-website
Organization logo

1940 Census: Official 1940 Census Website

Explore at:
4 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Nov 7, 2024
Dataset provided by
National Archives and Records Administrationhttp://www.archives.gov/
Description

Website alows the public full access to the 1940 Census images, census maps and descriptions.

Search
Clear search
Close search
Google apps
Main menu