5 datasets found
  1. Foreign-Born Population in New York City

    • kaggle.com
    zip
    Updated Jan 3, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of New York (2019). Foreign-Born Population in New York City [Dataset]. https://www.kaggle.com/new-york-city/foreign-born-population-in-new-york-city
    Explore at:
    zip(61131 bytes)Available download formats
    Dataset updated
    Jan 3, 2019
    Dataset authored and provided by
    City of New York
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    New York
    Description

    Content

    Details of foreign-born population versus total population in New York City since 1820

    Context

    This is a dataset hosted by the City of New York. The city has an open data platform found here and they update their information according the amount of data that is brought in. Explore New York City using Kaggle and all of the data sources available through the City of New York organization page!

    • Update Frequency: This dataset is updated annually.

    Acknowledgements

    This dataset is maintained using Socrata's API and Kaggle's API. Socrata has assisted countless organizations with hosting their open data and has been an integral part of the process of bringing more data to the public.

    Cover photo by Kyle Glenn on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  2. Data from: Ethnic Albanian Organized Crime in New York City, 1975-2014

    • catalog.data.gov
    • datasets.ai
    • +1more
    Updated Nov 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Justice (2025). Ethnic Albanian Organized Crime in New York City, 1975-2014 [Dataset]. https://catalog.data.gov/dataset/ethnic-albanian-organized-crime-in-new-york-city-1975-2014-236ba
    Explore at:
    Dataset updated
    Nov 14, 2025
    Dataset provided by
    National Institute of Justicehttp://nij.ojp.gov/
    Area covered
    New York
    Description

    These data are part of NACJD's Fast Track Release and are distributed as they were received from the data depositor. The files have been zipped by NACJD for release, but not checked or processed except for the removal of direct identifiers. Users should refer to the accompanying readme file for a brief description of the files available with this collection and consult the investigator(s) if further information is needed. The main aim of this research is to study the criminal mobility of ethnic-based organized crime groups. The project examines whether organized crime groups are able to move abroad easily and to reproduce their territorial control in a foreign country, or whether these groups, and/or individual members, start a life of crime only after their arrival in the new territories, potentially as a result of social exclusion, economic strain, culture conflict and labeling. More specifically, the aim is to examine the criminal mobility of ethnic Albanian organized crime groups involved in a range of criminal markets and operating in and around New York City, area and to study the relevance of the importation/alien conspiracy model versus the deprivation model of organized crime in relation to Albanian organized crime. There are several analytical dimensions in this study: (1) reasons for going abroad; (2) the nature of the presence abroad; (3) level of support from ethnic constituencies in the new territories; (4) importance of cultural codes; (5) organizational structure; (6) selection of criminal activities; (7) economic incentives and political infiltration. This study utilizes a mixed-methods approach with a sequential exploratory design, in which qualitative data and documents are collected and analyzed first, followed by quantitative data. Demographic variables in this collection include age, gender, birth place, immigration status, nationality, ethnicity, education, religion, and employment status. Two main data sources were employed: (1) court documents, including indictments and court transcripts related to select organized crime cases (84 court documents on 29 groups, 254 offenders); (2) in-depth, face-to-face interviews with 9 ethnic Albanian offenders currently serving prison sentences in U.S. Federal Prisons for organized crime related activities, and with 79 adult ethnic Albanian immigrants in New York, including common people, undocumented migrants, offenders, and people with good knowledge of Albanian organized crime modus operandi. Sampling for these data were conducted in five phases, the first of which involved researchers examining court documents and identifying members of 29 major ethnic Albanian organized crime groups operating in the New York area between 1975 and 2013 who were or had served sentences in the U.S. Federal Prisons for organized crime related activities. In phase two researchers conducted eight in-depth interviews with law enforcement experts working in New York or New Jersey. Phase three involved interviews with members of the Albanian diaspora and filed observations from an ethnographic study. Researchers utilized snowball and respondent driven (RDS) recruitment methods to create the sample for the diaspora dataset. The self-reported criteria for recruitment to participate in the diaspora interviews were: (1) age 18 or over; (2) of ethnic Albanian origin (foreign-born or 1st/2nd generation); and (3) living in NYC area for at least 1 year. They also visited neighborhoods identified as high concentrations of ethnic Albanian individuals and conducted an ethnographic study to locate the target population. In phase four, data for the cultural advisors able to help with the project data was collected. In the fifth and final phase, researchers gathered data for the second wave of the diaspora data, and conducted interviews with offenders with ethnic Albanian immigrants with knowledge of the organized crime situation in New York City area. Researchers also approached about twenty organized crime figures currently serving a prison sentence, and were able to conduct 9 in-depth interviews.

  3. d

    Data from: “The Best Country in the World”: The Surprising Social Mobility...

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anbinder, Tyler (2023). “The Best Country in the World”: The Surprising Social Mobility of New York’s Irish-Famine Immigrants [Dataset]. http://doi.org/10.7910/DVN/KGYS74
    Explore at:
    Dataset updated
    Nov 9, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Anbinder, Tyler
    Area covered
    New York
    Description

    The main dataset ("ESB Mobility Database") contains occupational data on 1,200 Irish immigrants who arrived in the U.S. in the Famine years and could be tracked for at least a decade. We also present the most up-to-date version of our Emigrant Savings Bank Depositor Database, which contains data on all 15,000 people who opened accounts at the bank from 1850 to 1858. Also provided are data from the 1855 New York State census documenting the occupations of New York's entire Irish-born population as well as datasets documenting the occupations held by New York's Irish immigrants one year and ten years after their arrival in America,

  4. n

    American Crow SNPs and microsatellite data

    • data.niaid.nih.gov
    • nde-dev.biothings.io
    • +1more
    zip
    Updated May 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Andrea Townsend; Melissa Jones; Nancy Chen; Caroline Chivily; Casey McAndrews; Anne Clark; McGowan Kevin; John Eimes (2023). American Crow SNPs and microsatellite data [Dataset]. http://doi.org/10.5061/dryad.5dv41nsbj
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 28, 2023
    Dataset provided by
    Binghamton University
    Sungkyunkwan University
    University of Rochester
    University of California, Davis
    Cornell University
    Hamilton College
    Authors
    Andrea Townsend; Melissa Jones; Nancy Chen; Caroline Chivily; Casey McAndrews; Anne Clark; McGowan Kevin; John Eimes
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description

    Infectious diseases can cause steep declines in wildlife populations, leading to changes in genetic diversity that may affect the susceptibility of individuals to infection and the overall resilience of populations to pathogen outbreaks. Here, we examine evidence for a genetic bottleneck in a population of American crows (Corvus brachyrhynchos) before and after the emergence of West Nile virus (WNV). More than 50% of marked birds in this population were lost over the two-year period of the epizootic, representing a 10-fold increase in adult mortality. Using analyses of SNPs and microsatellite markers, we tested for evidence of a genetic bottleneck and compared levels of inbreeding and immigration in the pre- and post-WNV populations. Counter to expectations, genetic diversity (allelic diversity and the number of new alleles) increased after WNV emergence. This was likely due to increases in immigration, as the estimated membership coefficients were lower in the post-WNV population. Simultaneously, however, the frequency of inbreeding appeared to increase: mean inbreeding coefficients were higher among SNP markers, and heterozygosity-heterozygosity correlations were stronger among microsatellite markers, in the post-WNV population. These results indicate that loss of genetic diversity at the population level is not an inevitable consequence of a population decline, particularly in the presence of gene flow. The changes observed in post-WNV crows could have very different implications for their response to future pathogen risks, potentially making the population as a whole more resilient to a changing pathogen community, while increasing the frequency of inbred individuals with elevated susceptibility to disease. Methods Study population and data collection. Crows in the Ithaca, New York, population are cooperative breeders. They live in groups of up to 14 birds, including a socially bonded pair of adults as well as 0-12 auxiliary birds, which are usually offspring from previous broods). Although auxiliaries usually do not contribute offspring to the brood, molecular work in the post-WNV population indicates that auxiliary males occasionally do sire extra-pair offspring with the female breeder, arising both through incest (mothers mating with their adult auxiliary sons) and through matings between non-relatives (e.g., unrelated step-mothers and adult auxiliary males). Genetic samples were collected from crow nestlings from 1990–2011. We collected blood (~150 ul) from the brachial vein of nestlings and banded them with unique combinations of metal bands, color bands, and patagial tags on days 24–30 after hatching. DNA was extracted from samples using DNeasy tissue kits (Qiagen, Valencia, CA) following the manufacturer’s protocol. All fieldwork with American crows was carried out under protocols approved by the Institutional Animal Care and Use Committees of Binghamton University (no. 537-03 and 607-07) and Cornell University (no. 1988–0210). The pre-WNV dataset included samples collected between 1990 and 2002. The 2002 nestlings were sampled prior to WNV emergence, as nestlings fledge the nest between May and July, whereas WNV mortality typically occurs between August and October in this crow population. The post-WNV samples were collected between 2005 and 2011. Samples collected immediately after WNV emergence (2003 and 2004) were not included in the analysis to allow time for the birds to respond to the population loss. We maximized independence of the birds selected for analysis by including only one randomly chosen offspring per brood and no more than two broods per family group in the pre-and post-WNV samples, with each brood per family group separated by the maximum number of years possible within the pre- or post-WNV sampling periods (1990–2002 pre-WNV; 2005–2011 post-WNV; Figure S1). Birds were randomly and independently selected (with replacement) for the SNP and microsatellite analyses; therefore, there was little overlap among individual birds included in these marker sets. Of the 286 individual birds included in this analysis, 22 were common to both marker sets (15 pre-WNV; 7 post-WNV). The 20-year time period of this study may have encompassed 2–4 breeding cohorts (approximately 1–2 pre- and 1–2 post-WNV, with a sharp turn-over immediately after WNV emergence). Crows can produce offspring as early as two years after hatching, but most do not begin breeding independently until at least 3–4 years after hatching. Breeding initiation is limited at least in part by breeding vacancies, which are created by the death of one or both members of an established breeding pair. Such breeding vacancies likely increased in availability after the emergence of WNV. Microsatellite genotyping. A total of 222 crows (n = 113 and 109 crows pre- and post-WNV, respectively) were genotyped at 34 polymorphic microsatellite loci that were optimized for American crows. Alleles were scored using the microsatellite plugin for Geneious 9.1.8. We used GenePop version 4.7 to test for linkage disequilibrium between all pairs of loci, departures from Hardy–Weinberg equilibrium (HWE), and null allele frequency. Locus characteristics (e.g., alleles/ locus, tests of Hardy–Weinberg equilibrium and null allele frequencies) are given in the supplementary materials (Table S1). Departures from HWE expectations were observed at two loci (PnuA3w from the pre-WNV sample and Cb06 from the post-WNV sample) after Bonferroni correction (Table S1); these loci were removed from subsequent analysis. In 561 pairwise comparisons, four pairs of loci appeared to be in linkage disequilibrium (Cb20 and Cb21; Cb14 and CoBr36; CoBr22 and Cb17, and CoBr12 and Cb10), but this linkage was only apparent at both time points (the pre-WNV and post-WNV populations) for Cb20 and Cb21. We removed both Cb20 and Cb21 from the analysis but retained the other loci because apparent linkage at only a single time point was unlikely to be a result of physical linkage. Two additional loci (Cb17 and Cb10) had a high frequency of null alleles (> 0.1) and were removed from the dataset. All subsequent analyses are therefore based on 28 loci. We scored all birds at a minimum of 26 of these 28 loci, and most (>98%) were scored at all loci (mean proportion of loci typed >0.99). Mean allelic diversity at these loci was 11.25 ± 1.17 alleles/locus (range: 3–31 alleles/locus). Double Digest Restriction Associated DNA (ddRAD) sequencing. We performed ddRAD sequencing on 86 randomly selected crows (43 pre-WNV and 43 post-WNV). 100-500 ng of DNA were digested with SbfI-HF (NEB, R3642L) and MspI-HF (NEB, R016S) restriction enzymes. Samples were ligated with a P2-MspI adapter and pooled in groups of 18-20, each with a unique P1 adapter. Pooled index groups were purified using 1.5X volumes of homemade MagNA made with Sera-Mag Magnetic Speed-beads (FisherSci). Fragments 450-600 bp long were selected using BluePippin (Sage Science) by the Cornell University Biotechnology Resource Center (BRC). After size selection, unique index barcodes were added to each index group by performing 11 cycles of PCR with Phusion® DNA polymerase (NEB). Reactions were purified using 0.7X volumes of MagNA beads and pooled in equimolar ratios for sequencing on the Illumina HiSeq 2500 at the BRC, with single end reads (100 bp). The sequencing was performed with an added Illumina PhiX control (15%) due to low 5’ complexity. Pre- and post-WNV samples were library prepared together and sequenced on a single lane to avoid the introduction of a library or lane effect. We used FASTQC v0.11.9 (Babraham Bioinformatics; http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) to assess read quality. We trimmed reads to 147 bp using fastX_trimmer (FASTX-Toolkit) to exclude low-quality data at the 3’ end of reads. Next, we eliminated reads with Phred scores below 10, then eliminated reads in which 5% or more bases had Phred scores below 20 (fastq_quality_filter). The fastq files were demultiplexed using the process_radtags module in STACKS v2.52 pipeline to create a file with sequences specific to each individual. We first scaffolded the American Crow reference genome (NCBI assembly: ASM69197v1, Accession no: GCA_000691975.1) into putative pseudochromosomes using the synteny-based Chromosemble tool in Satsuma2 (Grabherr et al. 2010) and the Hooded Crow genome (NCBI assembly: ASM73873v5, Accession no: GCA_000738735.5). We aligned sequence reads to the American Crow pseudochromosome assembly using BWA-MEM (Li & Durbin 2009). We called SNPs in ANGSD (Korneliussen et al. 2014) using the GATK model, requiring SNPs to be present in 80% of the individuals (0.95 postcutoff, SNP p-value 1e-6) with a minimum allele frequency of 0.015. We removed bases with quality scores below 20 (-minQ 20), bad reads (-remove_bads), mapping quality below 20 (-minMapQ20), base alignment quality below 1 (-baq), more than two alleles (-skipTriallelic), and heterozygote bias (-hetbias_pval 1e-5), requiring the minimum depth per individual to be at least two and read depth higher than 1,800. These filters resulted in 16,200 SNPs. To reduce differences in missingness between the pre- and post-WNV populations, we excluded loci that had less than 80% called genotypes per population, resulting in 5,151 SNPs.

  5. Global City Data

    • ckan.publishing.service.gov.uk
    • data.europa.eu
    Updated Mar 23, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2017). Global City Data [Dataset]. https://ckan.publishing.service.gov.uk/dataset/global-city-data
    Explore at:
    Dataset updated
    Mar 23, 2017
    Dataset provided by
    CKANhttps://ckan.org/
    Description

    A range of indicators for a selection of cities from the New York City Global City database. Dataset includes the following: Geography City Area (km2) Metro Area (km2) People City Population (millions) Metro Population (millions) Foreign Born Annual Population Growth Economy GDP Per Capita (thousands $, PPP rates, per resident) Primary Industry Secondary Industry Share of Global 500 Companies (%) Unemployment Rate Poverty Rate Transportation Public Transportation Mass Transit Commuters Major Airports Major Ports Education Students Enrolled in Higher Education Percent of Population with Higher Education (%) Higher Education Institutions Tourism Total Tourists Annually (millions) Foreign Tourists Annually (millions) Domestic Tourists Annually (millions) Annual Tourism Revenue ($US billions) Hotel Rooms (thousands) Health Infant Mortality (Deaths per 1,000 Births) Life Expectancy in Years (Male) Life Expectancy in Years (Female) Physicians per 100,000 People Number of Hospitals Anti-Smoking Legislation Culture Number of Museums Number of Cultural and Arts Organizations Environment Green Spaces (km2) Air Quality Laws or Regulations to Improve Energy Efficiency Retrofitted City Vehicle Fleet Bike Share Program

  6. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
City of New York (2019). Foreign-Born Population in New York City [Dataset]. https://www.kaggle.com/new-york-city/foreign-born-population-in-new-york-city
Organization logo

Foreign-Born Population in New York City

From New York City Open Data

Explore at:
26 scholarly articles cite this dataset (View in Google Scholar)
zip(61131 bytes)Available download formats
Dataset updated
Jan 3, 2019
Dataset authored and provided by
City of New York
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Area covered
New York
Description

Content

Details of foreign-born population versus total population in New York City since 1820

Context

This is a dataset hosted by the City of New York. The city has an open data platform found here and they update their information according the amount of data that is brought in. Explore New York City using Kaggle and all of the data sources available through the City of New York organization page!

  • Update Frequency: This dataset is updated annually.

Acknowledgements

This dataset is maintained using Socrata's API and Kaggle's API. Socrata has assisted countless organizations with hosting their open data and has been an integral part of the process of bringing more data to the public.

Cover photo by Kyle Glenn on Unsplash
Unsplash Images are distributed under a unique Unsplash License.

Search
Clear search
Close search
Google apps
Main menu