99 datasets found
  1. Baby Names from Social Security Card Applications - National Data

    • catalog.data.gov
    • data.amerigeoss.org
    Updated May 5, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Social Security Administration (2022). Baby Names from Social Security Card Applications - National Data [Dataset]. https://catalog.data.gov/dataset/baby-names-from-social-security-card-applications-national-data
    Explore at:
    Dataset updated
    May 5, 2022
    Dataset provided by
    Social Security Administrationhttp://ssa.gov/
    Description

    The data (name, year of birth, sex, and number) are from a 100 percent sample of Social Security card applications for 1880 onward.

  2. US baby names

    • kaggle.com
    Updated Sep 25, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anurag (2020). US baby names [Dataset]. https://www.kaggle.com/zerryberry/us-baby-names/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 25, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Anurag
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    This data tells us the naming trend in US for babies from 1880s to late 2000s. You can explore different factors affecting parents - that compel them to name their baby in a certain trend.

    Content

    The source can be found here- https://github.com/wesm/pydata-book/tree/2nd-edition/datasets

    Acknowledgements

    Special thanks to Python for Data Analysis by Wes Mckinney for this.

  3. N

    MOST POPULAR NAMES NYC

    • data.cityofnewyork.us
    application/rdfxml +5
    Updated Jun 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Health and Mental Hygiene (DOHMH) (2025). MOST POPULAR NAMES NYC [Dataset]. https://data.cityofnewyork.us/Health/MOST-POPULAR-NAMES-NYC/7v44-25wq
    Explore at:
    application/rssxml, csv, tsv, application/rdfxml, json, xmlAvailable download formats
    Dataset updated
    Jun 8, 2025
    Authors
    Department of Health and Mental Hygiene (DOHMH)
    Area covered
    New York
    Description

    The most popular baby names by sex and mother's ethnicity in New York City.

  4. d

    Street Names

    • catalog.data.gov
    • datadiscoverystudio.org
    • +1more
    Updated May 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.lacity.org (2025). Street Names [Dataset]. https://catalog.data.gov/dataset/street-names-7385b
    Explore at:
    Dataset updated
    May 10, 2025
    Dataset provided by
    data.lacity.org
    Description

    Official Street Names in the City of Los Angeles created and maintained by the Bureau of Engineering.

  5. C

    Discover the most common surnames among Americans

    • surnam.es
    html
    Updated Jun 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Discover the most common surnames among Americans [Dataset]. https://surnam.es/united-states
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Jun 28, 2025
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Time period covered
    2025
    Area covered
    United States
    Variables measured
    Ranking, Frecuencia
    Description

    In the United States, cultural and ethnic diversity is reflected in a wide variety of surnames that have fascinating stories and origins. These surnames, more than simple identifiers, are a reflection of the roots, traditions and origins of their bearers. In this article, we will explore the most common surnames among the inhabitants of this country, offering a vision of how American influences have shaped and enriched the onomastic landscape. As we delve into this list, we'll see how each last name can tell a unique story about American culture and heritage, highlighting the plurality that characterizes this nation.

  6. h

    french_first_names_insee_2024

    • huggingface.co
    Updated Nov 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ronan L.M. (2024). french_first_names_insee_2024 [Dataset]. http://doi.org/10.57967/hf/3431
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 4, 2024
    Authors
    Ronan L.M.
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Area covered
    French
    Description

    French First Names from Death Records (1970-2024)

    This dataset contains French first names extracted from death records provided by INSEE (French National Institute of Statistics and Economic Studies) covering the period from 1970 to September 2024.

      Dataset Description
    
    
    
    
    
      Data Source
    

    The data is sourced from INSEE's death records database. It includes first names of deceased individuals in France, providing valuable insights into naming patterns across different… See the full description on the dataset page: https://huggingface.co/datasets/eltorio/french_first_names_insee_2024.

  7. Popular Child Name

    • kaggle.com
    Updated May 19, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Parul (2021). Popular Child Name [Dataset]. https://www.kaggle.com/tarzon/popular-name/metadata
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 19, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Parul
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Social Security Administration (SSA) of The United States published the frequency of the born a baby name in the US (United State) after 1879.

    Content

    This dataset contains raw data in txt format which include year from 1880 to 2019 with name and sex columns.

    Acknowledgements

    I have taken a dataset from U.S. Social Security, you can check out from here:https://www.ssa.gov/oact/babynames/limits.html

    Inspiration

    Use simple python code to Analyzing the name pattern in the US.

  8. e

    LinkedIn US city ID's

    • datarepository.eur.nl
    txt
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Paul Kievits (2023). LinkedIn US city ID's [Dataset]. http://doi.org/10.25397/eur.19932221.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    Erasmus University Rotterdam (EUR)
    Authors
    Paul Kievits
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Area covered
    United States
    Description

    This is a very small but useful dataset if you are ever looking to get jobs for a certain US city in LinkedIn. It contains a list of US cities and states and it's corresponding LinkedIn ID (which is usually externally hidden).

    The cities list was retreived from here: https://github.com/kelvins/US-Cities-Database and the names of the ciiadjusted to match the name used in LinkedIn (which could differ in subtle ways).

    Some cities do not have an ID, this is because the city is either too small or because there was a difference in the name on LinkedIn which I did not detect (human error). If you ever run in to one of these feel free to enhance this dataset.

  9. Domain Names with US Registrants — Registered between March 16, 2018 to...

    • dataandsons.com
    csv, zip
    Updated Apr 2, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jason Iverson Consulting (2018). Domain Names with US Registrants — Registered between March 16, 2018 to March 31, 2018 [Dataset]. https://www.dataandsons.com/data-market/lead-generation/domain-names-with-us-registrants-registered-between-march-16-2018-to-march-31-2018
    Explore at:
    csv, zipAvailable download formats
    Dataset updated
    Apr 2, 2018
    Dataset provided by
    Authors
    Jason Iverson Consulting
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Time period covered
    Mar 16, 2018 - Mar 31, 2018
    Area covered
    United States
    Description

    About this Dataset

    For sale are domain names with WHO IS information that were registered between Mar 16, 2018 and Mar 31, 2018 by registrants in United States. Domains which obfuscate registrant, administrative, and other WHO IS contact details have been omitted from this dataset. The following information is availble for download in this dataset: - Domain name, Created Date, Updated Date, Expiration Date, Registrar Name- Registrant Company, Name, Address, City, State/Province/Other, Postal Code, Country, Email, Phone #, Fax #- Administrative Company, Name, Address, City, State/Province/Other, Postal Code, Country, Email, Phone #, Fax #- Technical Company, Name, Address, City, State/Province/Other, Postal Code, Country, Email, Phone #, Fax #- Billing Company, Name, Address, City, State/Province/Other, Postal Code, Country, Email, Phone #, Fax #- NameServer1, NameServer2, NameServer3, NameServer4, - DomainStatus1, DomainStatus2, DomainStatus3, DomainStatus4 Still unsure about purchasing this dataset? View and Download a free sample dataset of global domain name registrations in the Lead Generation category Are you interested in a more targeted domain name registration dataset? Select the "Ask Seller a Question" link, send me a message, and I'll get back to you as soon as I can.

    Category

    Lead Generation

    Keywords

    usa,united-states,newly-registered-domains,who-is-data

    Row Count

    220910

    Price

    $20.00

  10. o

    Counties - United States of America

    • public.opendatasoft.com
    • bfortune.opendatasoft.com
    csv, excel, geojson +1
    Updated Jun 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Counties - United States of America [Dataset]. https://public.opendatasoft.com/explore/dataset/georef-united-states-of-america-county/
    Explore at:
    excel, json, geojson, csvAvailable download formats
    Dataset updated
    Jun 6, 2024
    License

    https://en.wikipedia.org/wiki/Public_domainhttps://en.wikipedia.org/wiki/Public_domain

    Area covered
    United States
    Description

    This dataset is part of the Geographical repository maintained by Opendatasoft. This dataset contains data for counties and equivalent entities in United States of America. The primary legal divisions of most states are termed counties. In Louisiana, these divisions are known as parishes. In Alaska, which has no counties, the equivalent entities are the organized boroughs, city and boroughs, municipalities, and for the unorganized area, census areas. The latter are delineated cooperatively for statistical purposes by the State of Alaska and the Census Bureau. In four states (Maryland, Missouri, Nevada, and Virginia), there are one or more incorporated places that are independent of any county organization and thus constitute primary divisions of their states. These incorporated places are known as independent cities and are treated as equivalent entities for purposes of data presentation. The District of Columbia and Guam have no primary divisions, and each area is considered an equivalent entity for purposes of data presentation. The Census Bureau treats the following entities as equivalents of counties for purposes of data presentation: Municipios in Puerto Rico, Districts and Islands in American Samoa, Municipalities in the Commonwealth of the Northern Mariana Islands, and Islands in the U.S. Virgin Islands. The entire area of the United States, Puerto Rico, and the Island Areas is covered by counties or equivalent entities.Processors and tools are using this data. Enhancements Add ISO 3166-3 codes. Simplify geometries to provide better performance across the services. Add administrative hierarchy.

  11. d

    Data from: Validated Names for Experimental Studies on Race and Ethnicity

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 8, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Crabtree, Charles; Kim, Jae Yeon (2023). Validated Names for Experimental Studies on Race and Ethnicity [Dataset]. http://doi.org/10.7910/DVN/LP4EAR
    Explore at:
    Dataset updated
    Nov 8, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Crabtree, Charles; Kim, Jae Yeon
    Description

    A large and fast-growing number of studies across the social sciences use experiments to better understand the role of race in human interactions, particularly in the American context. Researchers often use names to signal the race of individuals portrayed in these experiments. However, those names might also signal other attributes, such as socioeconomic status (e.g., education and income) and citizenship. If they do, researchers need pre-tested names with data on perceptions of these attributes. Such data would permit researchers to draw correct inferences about the causal effect of race in their experiments. In this paper, we provide the largest dataset of validated name perceptions based on three different surveys conducted in the United States. In total, our data include over 44,170 name evaluations from 4,026 respondents for 600 names. In addition to respondent perceptions of race, income, education, and citizenship from names, our data also include respondent characteristics. Our data will be broadly helpful for researchers conducting experiments on the manifold ways in which race shapes American life.

  12. E

    ArabLEX: Database of Arabic Place Names (DAP)

    • catalog.elra.info
    Updated Oct 7, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ELRA (European Language Resources Association) and its operational body ELDA (Evaluations and Language resources Distribution Agency) (2019). ArabLEX: Database of Arabic Place Names (DAP) [Dataset]. https://catalog.elra.info/en-us/repository/browse/ELRA-M0105/
    Explore at:
    Dataset updated
    Oct 7, 2019
    Dataset provided by
    ELRA (European Language Resources Association) and its operational body ELDA (Evaluations and Language resources Distribution Agency)
    ELRA (European Language Resources Association)
    License

    https://catalog.elra.info/static/from_media/metashare/licences/ELRA_END_USER.pdfhttps://catalog.elra.info/static/from_media/metashare/licences/ELRA_END_USER.pdf

    https://catalog.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdfhttps://catalog.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdf

    Description

    This database is part of the ArabLEX set of data which consists of the Database of Arabic General Vocabulary (DAG), Database of Arabic Place Names (DAP), Database of Foreign Names in Arabic (DAF) and Database of Arab Names (DAN) available from ELRA under references, respectively, ELRA-L0131, ELRA-M0105, ELRA-M0106 and ELRA-M0107.This full-form Arabic-English place name database of over 21,000 lemmas and nearly 6.5 million forms provides worldwide coverage of common place names, given in standard MSA orthography, and includes all inflected and cliticized forms for each place name. In addition, precise phonemic transcriptions and full vowel diacritics are designed to enhance Arabic speech technology. Orthographic variants are also extensively covered.This database is provided with three options: 1) proclitics, 2) phonetic information (CARS) and 3) orthographic variants. Subsets excluding some of the three proposed options may be provided upon demand. CARS is an accurate phonemic transcription. Optionally, phonetic transcriptions, IPA and/or SAMPA, can be provided, fine tuned to a customer's specifications.Quantity and size: 6,455,201 lines / 812 MBFile format: flat TSV text filesSamples and a specifications document available upon request.

  13. E

    Database of Foreign Names in Arabic

    • catalogue.elra.info
    • live.european-language-grid.eu
    Updated Oct 7, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ELRA (European Language Resources Association) and its operational body ELDA (Evaluations and Language resources Distribution Agency) (2019). Database of Foreign Names in Arabic [Dataset]. https://catalogue.elra.info/en-us/repository/browse/ELRA-L0124/
    Explore at:
    Dataset updated
    Oct 7, 2019
    Dataset provided by
    ELRA (European Language Resources Association) and its operational body ELDA (Evaluations and Language resources Distribution Agency)
    ELRA (European Language Resources Association)
    License

    https://catalogue.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdfhttps://catalogue.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdf

    https://catalogue.elra.info/static/from_media/metashare/licences/ELRA_END_USER.pdfhttps://catalogue.elra.info/static/from_media/metashare/licences/ELRA_END_USER.pdf

    Description

    This database covers non-Arabic names, their Arabic equivalents, and Arabic script variants for each name (with the most important variant given first).

  14. Forest Common Names (Feature Layer)

    • agdatacommons.nal.usda.gov
    • catalog.data.gov
    • +3more
    bin
    Updated Jun 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Forest Service (2025). Forest Common Names (Feature Layer) [Dataset]. https://agdatacommons.nal.usda.gov/articles/dataset/Forest_Common_Names_Feature_Layer_/25972276
    Explore at:
    binAvailable download formats
    Dataset updated
    Jun 21, 2025
    Dataset provided by
    U.S. Department of Agriculture Forest Servicehttp://fs.fed.us/
    Authors
    U.S. Forest Service
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains the common names of the national forests and grasslands and their respective FS WWW URL information that is used for both display of the national forest and national grassland boundaries on any map product and for dynamic interactivity of the map. This dataset exhibits the following characteristics: 1. Granularity of the polygon features - The spatial extent of the national forests and the grasslands match the way the agency would like to communicate with the public. 2. Preferred /Common Name of the National Forest Units - The common names of the national forest and grassland match the preferred name column that is present in the common names decision table maintained by the FS Office of Communication. 3. Hyperlinks to FS WWW Home page - This column contains the national forest and their respective FS WWW URL information. This URL could be used on any interactive map applications to link users directly to a forest's home page. Data Source - This dataset is derived from the following FS ALP (Automated Lands Program) Land Status Records System authoritative data sources: 1. Administrative Forest Boundaries 2. Proclaimed Forest Boundaries 3. Ranger District Boundaries 4. National Grassland Areas. The common names decision table maintained by the FS Office of Communication contains the common name and its respective Land Status Records System authoritative data source to be used for building the spatial polygon. The spatial polygons for every feature in this dataset comes from one or more authoritative data sources listed above. The process to create the common names dataset is reusing the already existing ALP names from the data sources listed above.This record was taken from the USDA Enterprise Data Inventory that feeds into the https://data.gov catalog. Data for this record includes the following resources: ISO-19139 metadata ArcGIS Hub Dataset ArcGIS GeoService OGC WMS CSV Shapefile GeoJSON KML https://apps.fs.usda.gov/arcx/rest/services/EDW/EDW_ForestCommonNames_01/MapServer/1 http://data.fs.usda.gov/geodata/edw/datasets.php For complete information, please visit https://data.gov.

  15. H

    GNIS (Geographic Names)

    • opendata.hawaii.gov
    kml, ogc wfs, zip
    Updated Oct 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office of Planning (2024). GNIS (Geographic Names) [Dataset]. https://opendata.hawaii.gov/dataset/gnis-geographic-names
    Explore at:
    ogc wfs, zip, kmlAvailable download formats
    Dataset updated
    Oct 9, 2024
    Dataset provided by
    Hawaii Statewide GIS Program
    Authors
    Office of Planning
    Description

    [Metadata] Geographic Names for the State of Hawaii as of September 3, 2024. (Data current / last edited in GNIS December 2023). Downloaded by the Hawaii Statewide GIS Program from the U.S. Board on Geographic Names Geographic Names Information System (GNIS) September 3, 2024 (https://www.usgs.gov/u.s.-board-on-geographic-names/download-gnis-data). The Geographic Names Information System (GNIS) is the Federal standard for geographic nomenclature. The U.S. Geological Survey developed the GNIS for the U.S. Board on Geographic Names, a Federal inter-agency body chartered by public law to maintain uniform feature name usage throughout the Government and to promulgate standard names to the public. The GNIS is the official repository of domestic geographic names data; the official vehicle for geographic names use by all departments of the Federal Government; and the source for applying geographic names to Federal electronic and printed products of all types.

    For additional information, please refer to metadata at https://files.hawaii.gov/dbedt/op/gis/data/geonames.pdf or contact Hawaii Statewide GIS Program, Office of Planning and Sustainable Development, State of Hawaii; PO Box 2359, Honolulu, Hi. 96804; (808) 587-2846; email: gis@hawaii.gov; Website: https://planning.hawaii.gov/gis.

  16. o

    US Colleges and Universities

    • public.opendatasoft.com
    • data.smartidf.services
    csv, excel, geojson +1
    Updated Jul 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). US Colleges and Universities [Dataset]. https://public.opendatasoft.com/explore/dataset/us-colleges-and-universities/
    Explore at:
    json, excel, geojson, csvAvailable download formats
    Dataset updated
    Jul 6, 2025
    License

    https://en.wikipedia.org/wiki/Public_domainhttps://en.wikipedia.org/wiki/Public_domain

    Area covered
    United States
    Description

    The Colleges and Universities feature class/shapefile is composed of all Post Secondary Education facilities as defined by the Integrated Post Secondary Education System (IPEDS, http://nces.ed.gov/ipeds/), National Center for Education Statistics (NCES, https://nces.ed.gov/), US Department of Education for the 2018-2019 school year. Included are Doctoral/Research Universities, Masters Colleges and Universities, Baccalaureate Colleges, Associates Colleges, Theological seminaries, Medical Schools and other health care professions, Schools of engineering and technology, business and management, art, music, design, Law schools, Teachers colleges, Tribal colleges, and other specialized institutions. Overall, this data layer covers all 50 states, as well as Puerto Rico and other assorted U.S. territories. This feature class contains all MEDS/MEDS+ as approved by the National Geospatial-Intelligence Agency (NGA) Homeland Security Infrastructure Program (HSIP) Team. Complete field and attribute information is available in the ”Entities and Attributes” metadata section. Geographical coverage is depicted in the thumbnail above and detailed in the "Place Keyword" section of the metadata. This feature class does not have a relationship class but is related to Supplemental Colleges. Colleges and Universities that are not included in the NCES IPEDS data are added to the Supplemental Colleges feature class when found. This release includes the addition of 175 new records, the removal of 468 no longer reported by NCES, and modifications to the spatial location and/or attribution of 6682 records.

  17. g

    Download US ZIP Code Dataset - United States of America

    • geopostcodes.com
    csv
    Updated Jun 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GeoPostcodes (2025). Download US ZIP Code Dataset - United States of America [Dataset]. https://www.geopostcodes.com/country/united-states-zip-code/
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jun 10, 2025
    Dataset authored and provided by
    GeoPostcodes
    Area covered
    United States
    Description

    Our United States zip code Database offers comprehensive postal code data for spatial analysis, including postal and administrative areas. This dataset contains accurate and up-to-date information on all administrative divisions, cities, and zip codes, making it an invaluable resource for various applications such as address capture and validation, map and visualization, reporting and business intelligence (BI), master data management, logistics and supply chain management, and sales and marketing. Our location data packages are available in various formats, including CSV, optimized for seamless integration with popular systems like Esri ArcGIS, Snowflake, QGIS, and more. Product features include fully and accurately geocoded data, multi-language support with address names in local and foreign languages, comprehensive city definitions, and the option to combine map data with UNLOCODE and IATA codes, time zones, and daylight saving times. Companies choose our location databases for their enterprise-grade service, reduction in integration time and cost by 30%, and weekly updates to ensure the highest quality.

  18. E

    Database of Japanese Name Variants

    • catalog.elra.info
    • live.european-language-grid.eu
    Updated Oct 7, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ELRA (European Language Resources Association) and its operational body ELDA (Evaluations and Language resources Distribution Agency) (2019). Database of Japanese Name Variants [Dataset]. https://catalog.elra.info/en-us/repository/browse/ELRA-L0116/
    Explore at:
    Dataset updated
    Oct 7, 2019
    Dataset provided by
    ELRA (European Language Resources Association) and its operational body ELDA (Evaluations and Language resources Distribution Agency)
    ELRA (European Language Resources Association)
    License

    https://catalog.elra.info/static/from_media/metashare/licences/ELRA_END_USER.pdfhttps://catalog.elra.info/static/from_media/metashare/licences/ELRA_END_USER.pdf

    https://catalog.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdfhttps://catalog.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdf

    Area covered
    Japan
    Description

    This resource covers four million Japanese names and their romanized variants, and includes gender codes, classification codes, and frequency rankings.

  19. K

    US Communities

    • koordinates.com
    csv, dwg, geodatabase +6
    Updated Sep 3, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    US Federal Emergency Management Agency (FEMA) (2018). US Communities [Dataset]. https://koordinates.com/layer/25566-us-communities/
    Explore at:
    mapinfo mif, csv, mapinfo tab, pdf, shapefile, dwg, kml, geopackage / sqlite, geodatabaseAvailable download formats
    Dataset updated
    Sep 3, 2018
    Dataset authored and provided by
    US Federal Emergency Management Agency (FEMA)
    Area covered
    Description

    The Census data utilized for developing the Community Layer used 2010 TIGER/Line shapefile datasets (TIGER = Topologically Integrated Geographic Encoding and Referencing). TIGER/Line shapefiles are available for free download from the US Census Bureau and include various legal and statistical geographic areas for which the Census tabulates data. The shapefiles are designed to be used in a GIS environment, with the ability to directly link the geographic areas to Census data via a unique GEOID number.The following TIGER/Line datasets should be used: - Counties and Equivalent Entities –primary legal divisions within each state (counties, parishes, etc)- County Subdivisions –includes both legal areas (Minor Civil Divisions or MCDs) and various statistical areas- Places –includes both legal areas (Incorporated Places) and statistical areas (Census Designated Places or CDPs)- Blocks –the smallest geographical area for which Census population counts are recorded; blocks never cross boundaries of any entity for which the Census Bureau tabulates data, including counties, county subdivisions, places, and American Indian, Alaska Native, and Native Hawaiian (AIANNH) areas- American Indian, Alaska Native, and Native Hawaiian (AIANNH) AreasExtracting and Formatting CIS DataA key component of the community layer is the ability to link CIS information spatially. Data from CIS cannot directly be joined with Census data. The two datasets have community name discrepancies which impede an exact match. Therefore, CIS data needs to be formatted to match Census community names. A custom report can be obtained from CIS to include a CID number, Community Name, County, State, Community Status, and Tribal status for all CIS records. Make sure all CID numbers are six digits and you follow the CIS community naming convention outlined in Table 4.2.1.1 in the Community Layer Update Technical Guide 20131206. Converting the CIS name“ADDISON, VILLAGE OF” to “ADDISON TOWN”involves removing unneeded spaces, comma, and preposition to make the join successful to the Census data. Using a comprehensive report at a national level gains efficiencies as bulk edits can be made. Data for each state should be extracted as needed by separating the CIS data into each type of community corresponding to the Census geography layers used, and a new JoinID column (e.g. ADDISON TOWN) can be created for each dataset allowing the CIS data to be joined to the Census data.

  20. US state names Codes and Abbreviations

    • kaggle.com
    Updated Feb 15, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Francesco Pettini (2022). US state names Codes and Abbreviations [Dataset]. https://www.kaggle.com/datasets/francescopettini/us-state-names-codes-and-abbreviations/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 15, 2022
    Dataset provided by
    Kaggle
    Authors
    Francesco Pettini
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    United States
    Description

    List of each U.S. state Code, Name, Abbreviation and Alpha code, useful to convert one to the other.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Social Security Administration (2022). Baby Names from Social Security Card Applications - National Data [Dataset]. https://catalog.data.gov/dataset/baby-names-from-social-security-card-applications-national-data
Organization logo

Baby Names from Social Security Card Applications - National Data

Explore at:
18 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
May 5, 2022
Dataset provided by
Social Security Administrationhttp://ssa.gov/
Description

The data (name, year of birth, sex, and number) are from a 100 percent sample of Social Security card applications for 1880 onward.

Search
Clear search
Close search
Google apps
Main menu