Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Version 2 (18 March 2025) includes a further 356 service itineraries. In addition, 41 entries from the previous version were updated or expanded. Currently the database covers a total of 1,858 Jewish soldiers, 421 wives and 83 children.
ORIGINAL VERSION 1 (18 September 2024)
With more than 1,500 individual entries, this is the inaugural instalment of my research database collated in the framework of the Project Forgotten Soldiers: Jewish Military Experience in the Habsburg Monarchy. This is an open access database, and everyone is welcome to use it according to their own scholarly and personal interests. In 1,189 cases we have official documented records confirming the soldiers were Jewish. In another 313 entries I was able to identify likely Jewish soldiers based on circumstantial evidence cross-referencing names and places of birth, with the presence of confirmed Jewish soldiers drafted into the same units as part of the same recruitment drive. This dataset further includes evidence for 156 spouses and 47 children. While military records do mentions these, their number suggests that the Habsburg army preferred to enlist unmarried men.
The database is structured in a similar way to an official individual entry in the Habsburg military records. These were arranged in tables, with soldiers listed by seniority. Name, place and land of birth are followed by age and religion. This latter rubric allows identifying the bulk of the Jewish soldiers. Also included in the record is marital status, profession (if any), number, names and ages of children (if any), followed by a short summary text of the soldier’s service itinerary. While not always consistent in detail, these texts mention enlistment dates, transfers between units, promotions, desertions, periods as prisoner of war and military awards (if any). I have taken the material from the personal records and added several additional parameters:
The soldiers are entered into the database according to their date of enlistment. This is followed by a colour-coded table showing their years of service. To see the meaning of the different colours employed, scroll to the legend at the end of the dataset.
Following the years of service, we see the date when the soldier left service (final year in service for incomplete service records). When known, the reason the soldier left the army is given (discharge/ death/ desertion etc).
Then come the three most important columns within the table: service record, primary sources and units. At first glance, these columns have only a few letters and numbers, but bring your mouse courser onto the relevant field marked with red triangles. An additional window will then open:
a. Service Record: Shows the entire service record of the soldier arranged by date. I use original German as it appears in the archival records. If you see spelling differences with modern German – they are there for a reason.
b. Primary Sources: Provides the information on all the archival records consulted to reconstruct the service itinerary. The number in the field denotes the number of the archival cartons consulted.
c. Units: Number of units in which a soldier serves. Bringing the cursor on to the field will open their list. Most Jewish soldiers served in the line infantry (IR) and the Military Transport Corps (MFWK or MFK). However, there were also Jewish sharpshooters, cavalrymen, gunners and even a few members of the nascent Austrian Navy.
The next two columns provide entries of the soldier’s conduct and medical condition, which in Habsburg military jargon was referred to rather callously as Defekten. I note the original medical diagnoses verbatim. When possible to identify, I note the modern medical term.
General database-wide parameters are then noted in the next part of the table. Among others, it provides information on enlistment type (conscript/ volunteer?), main branches of service (such as Infantry/ Cavalry/ Artillery), and roles within the military (such as non-commissioned officers/ drummers/ medics).
Concluding this part of the table are columns covering desertions, periods as prisoner of war and awards of the army cannon cross (for veterans of 1813-14) and other military awards.
The last column provides the original German outtake rubric as to how the soldier left service. In special cases, additional service notes are provides on the right.
How to use this dataset
This depends on what you are looking for. Firstly, download the dataset on to your computer via the link provided below. It is a simple Excel file which is easy to work with. If you wish to find out whether one of your ancestors served in the Habsburg army, use a simple keyword search. Please note that in our period there was no single accepted orthography meaning that some letters were used interchangeably (for instance B/P; D/T). There were also various patronymic suffices used in different parts of the monarchy (-witz in German/ -wicz in Polish/ -vits in Hungarian). Habsburg military clerks were mostly German speakers who often recorded the name phonetically. For instance, Jankel/ Jankl/ Jacob/ Jacobus all denote the same name. A Jewish teenager who identified himself as Moische when first reporting to duty, may have stayed so in the military records for decades, even if he was already a non-commissioned officer whose subordinates referred to as Herr Corporal.
If you study the history of concrete Jewish communities, use the keyword search and the filter option to find entries in the database where this locality is mentioned. Some places like Prague and Lublin could be identified effortlessly. In other cases (and see the above point on German-speaking clerks), place names were recorded phonetically. The military authority usually stuck to official Polish names in Galicia, and Hungarian in the Lands of the Crown of St. Stephan. In reality, a Jewish recruit from Transcarpathian Ruthenia could have his place of birth recorded in Hungarian, Romanian or Rusin. When I could not identify the place in question, I marked it with italics. Do you think you identified something I could not? Excellent! Then please write me, and I will correct the entry in the next instalment of this database.
I should stress that, currently, the database is not statistically representative. I have worked chronologically, meaning that there are disproportionally more entries for Jewish soldiers from the Turkish War, the first two Coalition Wars, and the Wars of 1805 and 1809. If you look at some of my other databases (for instance, that of the 1st Line Infantry Regiment 'Kaiser'), you will find least as many Jews who served in the wars of 1813-15. I will cover these in due course. This said, using the filter option of the Excel sheet, you can already make some individual queries. For instance, did Jewish grenadiers meet the minimal height requirement to be eligible for transfer into the elite infantry? (Hint: they did not!) If you are interested in the historical study of nutritional standards, compare the height of the soldiers with their year and place of birth. In my other project, I made calculations of the average height of Habsburg soldiers and I can already reveal that Jewish conscripts were, on average, several centimetres smaller than their non-Jewish comrades drafted in the same annual intake. Whatever stereotypes said, most Jews in the Habsburg Monarchy around 1800 were very poor and the sad fact of malnutrition as a child is reflected in their height as adults.
I should stress that this is a cumulative database. ZENODO has an excellent feature allowing updated versions to supersede earlier files while retaining the same DOI (Digital Object Identifier) and metadata. As my research progresses, I plan to upload new versions of this database bi-annually. This includes not only adding new entries, but also expanding and correcting existing ones. It might well be that the service record of a soldier covered up to 1806 will be brought to a later date, possibly even to his discharge from the army. If you have not found whom you are looking for, or if you want to work with larger samples for your research, visit this page again in a few months’ time. And if you do use this database for scholarly research (by all means, please do), do not forget to cite it as you would cite any other item in your bibliography! If you are a museum professional and you want to employ material from your database to illustrate your exhibitions, you are welcome, but please cite this resource for others to learn. Links to this database will also be appreciated.
List of the data tables as part of the Immigration System Statistics Home Office release. Summary and detailed data tables covering the immigration system, including out-of-country and in-country visas, asylum, detention, and returns.
If you have any feedback, please email MigrationStatsEnquiries@homeoffice.gov.uk.
The Microsoft Excel .xlsx files may not be suitable for users of assistive technology.
If you use assistive technology (such as a screen reader) and need a version of these documents in a more accessible format, please email MigrationStatsEnquiries@homeoffice.gov.uk
Please tell us what format you need. It will help us if you say what assistive technology you use.
Immigration system statistics, year ending March 2025
Immigration system statistics quarterly release
Immigration system statistics user guide
Publishing detailed data tables in migration statistics
Policy and legislative changes affecting migration to the UK: timeline
Immigration statistics data archives
https://assets.publishing.service.gov.uk/media/68258d71aa3556876875ec80/passenger-arrivals-summary-mar-2025-tables.xlsx">Passenger arrivals summary tables, year ending March 2025 (MS Excel Spreadsheet, 66.5 KB)
‘Passengers refused entry at the border summary tables’ and ‘Passengers refused entry at the border detailed datasets’ have been discontinued. The latest published versions of these tables are from February 2025 and are available in the ‘Passenger refusals – release discontinued’ section. A similar data series, ‘Refused entry at port and subsequently departed’, is available within the Returns detailed and summary tables.
https://assets.publishing.service.gov.uk/media/681e406753add7d476d8187f/electronic-travel-authorisation-datasets-mar-2025.xlsx">Electronic travel authorisation detailed datasets, year ending March 2025 (MS Excel Spreadsheet, 56.7 KB)
ETA_D01: Applications for electronic travel authorisations, by nationality
ETA_D02: Outcomes of applications for electronic travel authorisations, by nationality
https://assets.publishing.service.gov.uk/media/68247953b296b83ad5262ed7/visas-summary-mar-2025-tables.xlsx">Entry clearance visas summary tables, year ending March 2025 (MS Excel Spreadsheet, 113 KB)
https://assets.publishing.service.gov.uk/media/682c4241010c5c28d1c7e820/entry-clearance-visa-outcomes-datasets-mar-2025.xlsx">Entry clearance visa applications and outcomes detailed datasets, year ending March 2025 (MS Excel Spreadsheet, 29.1 MB)
Vis_D01: Entry clearance visa applications, by nationality and visa type
Vis_D02: Outcomes of entry clearance visa applications, by nationality, visa type, and outcome
Additional dat
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
MCGD_Data_V2.2 contains all the data that we have collected on locations in modern China, plus a number of locations outside of China that we encounter frequently in historical sources on China. All further updates will appear under the name "MCGD_Data" with a time stamp (e.g., MCGD_Data2023-06-21)
You can also have access to this dataset and all the datasets that the ENP-China makes available on GitLab: https://gitlab.com/enpchina/IndexesEnp
Altogether there are 464,970 entries. The data include the name of locations and their variants in Chinese, pinyin, and any recorded transliteration; the name of the province in Chinese and in pinyin; Province ID; the latitude and longitude; the Name ID and Location ID, and NameID_Legacy. The Name IDs all start with H followed by seven digits. This is the internal ID system of MCGD (the NameID_Legacy column records the Name IDs in their original format depending on the source). Locations IDs that start with "DH" are data points extracted from China Historical GIS (Harvard University); those that start with "D" are locations extracted from the data points in Geonames; those that have only digits (8 digits) are data points we have added from various map sources.
One of the main features of the MCGD Main Dataset is the systematic collection and compilation of place names from non-Chinese language historical sources. Locations were designated in transliteration systems that are hardly comprehensible today, which makes it very difficult to find the actual locations they correspond to. This dataset allows for the conversion from these obsolete transliterations to the current names and geocoordinates.
From June 2021 onward, we have adopted a different file naming system to keep track of versions. From MCGD_Data_V1 we have moved to MCGD_Data_V2. In June 2022, we introduced time stamps, which result in the following naming convention: MCGD_Data_YYYY.MM.DD.
UPDATES
MCGD_Data2025_02_28 includes a major change with the duplication of all the locations listed under Beijing, Shanghai, Tianjin, and Chongqing (北京, 上海, 天津, 重慶) and their listing under the name of the provinces to which they belonge origially before the creation of the four special municipalities after 1949. This is meant to facilitate the matching of data from historical sources. Each location has a unique NameID. Altogether there are 472,818 entries
MCGD_Data2025_02_27 inclues an update on locations extracted from Minguo zhengfu ge yuanhui keyuan yishang zhiyuanlu 國民政府各院部會科員以上職員錄 (Directory of staff members and above in the ministries and committees of the National Government). Nanjing: Guomin zhengfu wenguanchu yinzhuju 國民政府文官處印鑄局國民政府文官處印鑄局, 1944). We also made corrections in the Prov_Py and Prov_Zh columns as there were some misalignments between the pinyin name and the name in Chines characters. The file now includes 465,128 entries.
MCGD_Data2024_03_23 includes an update on locations in Taiwan from the Asia Directories. Altogether there are 465,603 entries (of which 187 place names without geocoordinates, labelled in the Lat Long columns as "Unknown").
MCGD_Data2023.12.22 contains all the data that we have collected on locations in China, whatever the period. Altogether there are 465,603 entries (of which 187 place names without geocoordinates, labelled in the Lat Long columns as "Unknown"). The dataset also includes locations outside of China for the purpose of matching such locations to the place names extracted from historical sources. For example, one may need to locate individuals born outside of China. Rather than maintaining two separate files, we made the decision to incorporate all the place names found in historical sources in the gazetteer. Such place names can easily be removed by selecting all the entries where the 'Province' data is missing.
https://digital.nhs.uk/about-nhs-digital/terms-and-conditionshttps://digital.nhs.uk/about-nhs-digital/terms-and-conditions
This is a publication on maternity activity in English NHS hospitals. This report examines data relating to delivery and birth episodes in 2023-24, and the booking appointments for these deliveries. This annual publication covers the financial year ending March 2024. Data is included from both the Hospital Episodes Statistics (HES) data warehouse and the Maternity Services Data Set (MSDS). HES contains records of all admissions, appointments and attendances for patients admitted to NHS hospitals in England. The HES data used in this publication are called 'delivery episodes'. The MSDS collects records of each stage of the maternity service care pathway in NHS-funded maternity services, and includes information not recorded in HES. The MSDS is a maturing, national-level dataset. In April 2019, the MSDS transitioned to a new version of the dataset. This version, MSDS v2.0, is an update that introduced a new structure and content - including clinical terminology, in order to meet current clinical practice and incorporate new requirements. It is designed to meet requirements that resulted from the National Maternity Review, which led to the publication of the Better Births report in February 2016. This is the fifth publication of data from MSDS v2.0 and data from 2019-20 onwards is not directly comparable to data from previous years. This publication shows the number of HES delivery episodes during the period, with a number of breakdowns including by method of onset of labour, delivery method and place of delivery. It also shows the number of MSDS deliveries recorded during the period, with a breakdown for the mother's smoking status at the booking appointment by age group. It also provides counts of live born term babies with breakdowns for the general condition of newborns (via Apgar scores), skin-to-skin contact and baby's first feed type - all immediately after birth. There is also data available in a separate file on breastfeeding at 6 to 8 weeks. For the first time information on 'Smoking at Time of Delivery' has been presented using annual data from the MSDS. This includes national data broken down by maternal age, ethnicity and deprivation. From 2025/2026, MSDS will become the official source of 'Smoking at Time of Delivery' information and will replace the historic 'Smoking at Time of Delivery' data which is to become retired. We are currently undergoing dual collection and reporting on a quarterly basis for 2024/25 to help users compare information from the two sources. We are working with data submitters to help reconcile any discrepancies at a local level before any close down activities begin. A link to the dual reporting in the SATOD publication series can be found in the links below. Information on how all measures are constructed can be found in the HES Metadata and MSDS Metadata files provided below. In this publication we have also included an interactive Power BI dashboard to enable users to explore key NHS Maternity Statistics measures. The purpose of this publication is to inform and support strategic and policy-led processes for the benefit of patient care. This report will also be of interest to researchers, journalists and members of the public interested in NHS hospital activity in England. Any feedback on this publication or dashboard can be provided to enquiries@nhsdigital.nhs.uk, under the subject “NHS Maternity Statistics”.
https://digital.nhs.uk/about-nhs-digital/terms-and-conditionshttps://digital.nhs.uk/about-nhs-digital/terms-and-conditions
This is a publication on maternity activity in English NHS hospitals. This report examines data relating to delivery and birth episodes in 2022-23, and the booking appointments for these deliveries. This annual publication covers the financial year ending March 2023. Data is included from both the Hospital Episodes Statistics (HES) data warehouse and the Maternity Services Data Set (MSDS). HES contains records of all admissions, appointments and attendances for patients admitted to NHS hospitals in England. The HES data used in this publication are called 'delivery episodes'. The MSDS collects records of each stage of the maternity service care pathway in NHS-funded maternity services, and includes information not recorded in HES. The MSDS is a maturing, national-level dataset. In April 2019 the MSDS transitioned to a new version of the dataset. This version, MSDS v2.0, is an update that introduced a new structure and content - including clinical terminology, in order to meet current clinical practice and incorporate new requirements. It is designed to meet requirements that resulted from the National Maternity Review, which led to the publication of the Better Births report in February 2016. This is the fourth publication of data from MSDS v2.0 and data from 2019-20 onwards is not directly comparable to data from previous years. This publication shows the number of HES delivery episodes during the period, with a number of breakdowns including by method of onset of labour, delivery method and place of delivery. It also shows the number of MSDS deliveries recorded during the period, with breakdowns including the baby's first feed type, birthweight, place of birth, and breastfeeding activity; and the mothers' ethnicity and age at booking. There is also data available in a separate file on breastfeeding at 6 to 8 weeks. The count of Total Babies includes both live and still births, and previous changes to how Total Babies and Total Deliveries were calculated means that comparisons between 2019-20 MSDS data and later years should be made with care. Information on how all measures are constructed can be found in the HES Metadata and MSDS Metadata files provided below. In this publication we have also included an interactive Power BI dashboard to enable users to explore key NHS Maternity Statistics measures. The purpose of this publication is to inform and support strategic and policy-led processes for the benefit of patient care. This report will also be of interest to researchers, journalists and members of the public interested in NHS hospital activity in England. Any feedback on this publication or dashboard can be provided to enquiries@nhsdigital.nhs.uk, under the subject “NHS Maternity Statistics”.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
With more than 1,500 individual entries, this is the inaugural instalment of my research database collated in the framework of the Project Forgotten Soldiers: Jewish Military Experience in the Habsburg Monarchy. This is an open access database, and everyone is welcome to use it according to their own scholarly and personal interests. In 1,189 cases we have official documented records confirming the soldiers were Jewish. In another 313 entries I was able to identify likely Jewish soldiers based on circumstantial evidence cross-referencing names and places of birth, with the presence of confirmed Jewish soldiers drafted into the same units as part of the same recruitment drive. This dataset further includes evidence for 156 spouses and 47 children. While military records do mentions these, their number suggests that the Habsburg army preferred to enlist unmarried men.
The database is structured in a similar way to an official individual entry in the Habsburg military records. These were arranged in tables, with soldiers listed by seniority. Name, place and land of birth are followed by age and religion. This latter rubric allows identifying the bulk of the Jewish soldiers. Also included in the record is marital status, profession (if any), number, names and ages of children (if any), followed by a short summary text of the soldier’s service itinerary. While not always consistent in detail, these texts mention enlistment dates, transfers between units, promotions, desertions, periods as prisoner of war and military awards (if any). I have taken the material from the personal records and added several additional parameters:
a. Service Record: Shows the entire service record of the soldier arranged by date. I use original German as it appears in the archival records. If you see spelling differences with modern German – they are there for a reason.
b. Primary Sources: Provides the information on all the archival records consulted to reconstruct the service itinerary. The number in the field denotes the number of the archival cartons consulted.
c. Units: Number of units in which a soldier serves. Bringing the cursor on to the field will open their list. Most Jewish soldiers served in the line infantry (IR) and the Military Transport Corps (MFWK or MFK). However, there were also Jewish sharpshooters, cavalrymen, gunners and even a few members of the nascent Austrian Navy.
How to use this dataset
This depends on what you are looking for. Firstly, download the dataset on to your computer via the link provided below. It is a simple Excel file which is easy to work with. If you wish to find out whether one of your ancestors served in the Habsburg army, use a simple keyword search. Please note that in our period there was no single accepted orthography meaning that some letters were used interchangeably (for instance B/P; D/T). There were also various patronymic suffices used in different parts of the monarchy (-witz in German/ -wicz in Polish/ -vits in Hungarian). Habsburg military clerks were mostly German speakers who often recorded the name phonetically. For instance, Jankel/ Jankl/ Jacob/ Jacobus all denote the same name. A Jewish teenager who identified himself as Moische when first reporting to duty, may have stayed so in the military records for decades, even if he was already a non-commissioned officer whose subordinates referred to as Herr Corporal.
If you study the history of concrete Jewish communities, use the keyword search and the filter option to find entries in the database where this locality is mentioned. Some places like Prague and Lublin could be identified effortlessly. In other cases (and see the above point on German-speaking clerks), place names were recorded phonetically. The military authority usually stuck to official Polish names in Galicia, and Hungarian in the Lands of the Crown of St. Stephan. In reality, a Jewish recruit from Transcarpathian Ruthenia could have his place of birth recorded in Hungarian, Romanian or Rusin. When I could not identify the place in question, I marked it with italics. Do you think you identified something I could not? Excellent! Then please write me, and I will correct the entry in the next instalment of this database.
I should stress that, currently, the database is not statistically representative. I have worked chronologically, meaning that there are disproportionally more entries for Jewish soldiers from the Turkish War, the first two Coalition Wars, and the Wars of 1805 and 1809. If you look at some of my other databases (for instance, that of the 1st Line Infantry Regiment 'Kaiser'), you will find least as many Jews who served in the wars of 1813-15. I will cover these in due course. This said, using the filter option of the Excel sheet, you can already make some individual queries. For instance, did Jewish grenadiers meet the minimal height requirement to be eligible for transfer into the elite infantry? (Hint: they did not!) If you are interested in the historical study of nutritional standards, compare the height of the soldiers with their year and place of birth. In my other project, I made calculations of the average height of Habsburg soldiers and I can already reveal that Jewish conscripts were, on average, several centimetres smaller than their non-Jewish comrades drafted in the same annual intake. Whatever stereotypes said, most Jews in the Habsburg Monarchy around 1800 were very poor and the sad fact of malnutrition as a child is reflected in their height as adults.
I should stress that this is a cumulative database. ZENODO has an excellent feature allowing updated versions to supersede earlier files while retaining the same DOI (Digital Object Identifier) and metadata. As my research progresses, I plan to upload new versions of this database bi-annually. This includes not only adding new entries, but also expanding and correcting existing ones. It might well be that the service record of a soldier covered up to 1806 will be brought to a later date, possibly even to his discharge from the army. If you have not found whom you are looking for, or if you want to work with larger samples for your research, visit this page again in a few months’ time. And if you do use this database for scholarly research (by all means, please do), do not forget to cite it as you would cite any other item in your bibliography! If you are a museum professional and you want to employ material from your database to illustrate your exhibitions, you are welcome, but please cite this resource for others to learn. Links to this database will also be appreciated.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains the locations found in the Kiva datasets included in an administrative or geographical region. You can also find poverty data about this region. This facilitates answering some of the tough questions about a region's poverty.
In the interest of preserving the original names and spelling for the locations/countries/regions all the data is in Excel format and has no preview (I think only the Kaggle recommended file types have preview - if anyone can show me how to do this for an xlsx file, it will be greatly appreciated)
The Tables datasets contain the most recent analysis of the MPI on countries and regions. These datasets are updated regularly. In unique regions_names_from_google_api you will find 3 levels of inclusion for every geocode provided in Kiva datasets. (village/town, administrative region, sub-national region - which can be administrative or geographical). These are the results from the Google API Geocoding process.
Files:
Dropped multiple columns, kept all the rows from loans.csv with names, tags, descriptions and got a csv file of 390MB instead of 2.13 GB. Basically is a simplified version of loans.csv (originally included in the analysis by beluga)
This is the loan_themes_by_region left joined with Tables_5.3_Contribution_of_Deprivations. (all the original entries from loan_themes and only the entries that match from Tables_5; for the regions that lack MPI data, you will find Nan)
These are the columns in the database:
Matched the loans in loan_themes_by_region with the regions that have info regarding MPI. This dataset brings together the amount invested in a region and the biggest problems the said region has to deal with. It is a join between the loan_themes_by_region provided by Kiva and Tables 5.3 Contribution_of_Deprivations.
It is a subset of the all_loan_theme_merged_with_geo_mpi_regions.xlsx, which contains only the entries that I could match with poverty decomposition data. It has the same columns.
Multidimensional poverty index decomposition for over 1000 regions part of 79 countries.
Table 5.3: Contribution of deprivations to the MPI, by sub-national regions
This table shows which dimensions and indicators contribute most to a region's MPI, which is useful for understanding the major source(s) of deprivation in a sub-national region.
Source: http://ophi.org.uk/multidimensional-poverty-index/global-mpi-2016/
MPI decomposition for 120 countries.
Table 7 All Published MPI Results since 2010
The table presents an archive of all MPI estimations published over the past 5 years, together with MPI, H, A and censored headcount ratios. For comparisons over time please use Table 6, which is strictly harmonised. The full set of data tables for each year published (Column A), is found on the 'data tables' page under 'Archive'.
The data in this file is shown in interactive plots on Oxford Poverty and Human Development Initiative website. http://www.dataforall.org/dashboard/ophi/index.php/
These are all the regions corresponding to the geocodes found in Kiva's loan_themes_by_region.
There are 718 unique entries, that you can join with any database from Kiva that has either a coordinates or region column.
Columns:
geo: pair of Lat, Lon (from loan_themes_by_region)
City: name of the city (has the most NaN's)
Administrative region: first level of administrative inclusion for the city/location; (the equivalent of county for US)
Sub-national region: second level of administrative inclusion for the geo pair. (like state for US)
Country: name of the country
Thanks to Shane Lynn for the batch geocoding and to Joseph Deferio for reverse geocoding:
https://www.shanelynn.ie/batch-geocoding-in-python-with-google-geocoding-api/
https://github.com/jdeferio/Reverse_Geocode
The MPI datasets you can find on the Oxford website (http://ophi.org.uk/) under Research.
"Citation: Alkire, S. and Kanagaratnam, U. (2018)
“Multidimensional Poverty Index Winter 2017-18: Brief methodological note and results.” Oxford Poverty and Human Development Initiative, University of Oxford, OPHI Methodological Notes 45."
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
In this file there are statistics for a number of variables broken down by Malmö’s different areas over time. Source Unless otherwise stated, the statistics in this database are retrieved from Statistics Sweden’s (SCB) regional database, Skånedatabasen or from Statistics Sweden’s area statistics database (OSDB).The Skåne database and OSDB show data from several different sources that Statistics Sweden has compiled on a geographical level. The statistics only cover persons who are part of the population registered in the population.Therefore, persons without a residence permit, such as asylum seekers, and persons who simply have not registered in the municipality are not included. Statistics Sweden does not provide statistics on which language residents speak, which religion you belong to or what ethnicity or political views you have. Therefore, such data is not available here either. However, the Electoral Authority reports election results per constituency on its website val.se. There are statistics from the last election as well as several previous elections available. Please note, however, that the constituencies do not necessarily follow the division of the city made here. Update The data is updated every spring as Statistics Sweden releases the figures to the municipality. Most variables are available for the year before. However, income and employment data are released with another year’s backlog. Unless otherwise stated, the date of measurement is 31 December of each year. Geographical breakdown Unless otherwise stated, the data is available for Malmö as a whole and broken down into urban areas (5 pieces), districts (10 pieces) and subareas (136 pieces). In addition to these, there is a residual post that contains the people who are not written in a specific place in the municipality, have protected identity and more. These people are also part of the total. In several of the subareas there are no or only a few registered population registers. Therefore, no data are reported for these areas. Examples of such sub-areas are parks such as Pildammsparken and Kroksbäcksparken and industrial areas such as Fosieby Industriområde and Spillepengen. Privacy clearance In order to protect the identity of individuals, the data is confidentially audited. This means that small values are suppressed, i.e. replaced by empty cells. However, the values are included in summaries. In general, the following rules apply: * No statistics are reported for geographical areas with very few housing. * No cells with fewer than 5 individuals are reported. For data classified as sensitive (e.g. income and country of birth), larger values can also be suppressed. * In cases where a subcategory (e.g. a training category) is too small to be accounted for, all categories are often suppressed. Please use the numbers, but use “City Office, Malmö City” as the source.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
In this file there are statistics for a number of variables broken down by Malmö’s different areas over time. Source Unless otherwise stated, the statistics in this database are retrieved from Statistics Sweden’s (SCB) regional database, Skånedatabasen or from Statistics Sweden’s area statistics database (OSDB). The Skåne database and OSDB show data from several different sources that Statistics Sweden has compiled on a geographical level. The statistics only cover persons who are part of the population registered in the population. Therefore, persons without a residence permit, such as asylum seekers, and persons who simply have not registered in the municipality are not included.Statistics Sweden does not provide statistics on which language residents speak, which religion you belong to or what ethnicity or political views you have. Therefore, such data is not available here either. However, the Electoral Authority reports election results per constituency on its website val.se.There are statistics from the last election as well as several previous elections available. Please note, however, that the constituencies do not necessarily follow the division of the city made here. Update The data is updated every spring as Statistics Sweden releases the figures to the municipality. Most variables are available for the year before. However, income and employment data are released with another year’s backlog. Unless otherwise stated, the date of measurement is 31 December of each year. Geographical breakdown Unless otherwise stated, the data is available for Malmö as a whole and broken down into urban areas (5 pieces), districts (10 pieces) and subareas (136 pieces). In addition to these, there is a residual post that contains the people who are not written in a specific place in the municipality, have protected identity and more. These people are also part of the total.In several of the subareas there are no or only a few registered population registers. Therefore, no data are reported for these areas. Examples of such sub-areas are parks such as Pildammsparken and Kroksbäcksparken and industrial areas such as Fosieby Industriområde and Spillepengen. Privacy clearance In order to protect the identity of individuals, the data is confidentially audited. This means that small values are suppressed, i.e. replaced by empty cells.However, the values are included in summaries. In general, the following rules apply: * No statistics are reported for geographical areas with very few housing. * No cells with fewer than 5 individuals are reported. For data classified as sensitive (e.g. income and country of birth), larger values can also be suppressed. * In cases where a subcategory (e.g. a training category) is too small to be accounted for, all categories are often suppressed. Please use the numbers, but use “City Office, Malmö City” as the source.
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
From 25 March 2025, the dataset update frequency has change from monthly to weekly every Tuesday.
We have replaced the .xlsx file resources for all our datasets. This was required due to the API and web page search functionality no longer being supported for .xlsx files on the Data.Gov platform.
ASIC is Australia’s corporate, markets and financial services regulator. ASIC contributes to Australia’s economic reputation and wellbeing by ensuring that Australia's financial markets are fair and transparent, and supported by confident and informed investors and consumers.
The Banned and Disqualified Persons Dataset file on data.gov.au is extracted from ASIC's Banned and Disqualified Registers. This dataset is a point in time snapshot of the Banned and Disqualified Persons Register data. The dataset provides information on persons that are:
It also provides information about persons that have been:
Information provided in this search is taken from the following registers:
Selected data from the registers will be uploaded each week to www.data.gov.au. The data made available will be a snapshot of the register at a point in time. Legislation prescribes the type of information ASIC is allowed to disclose to the public.
There may be multiple instances of identical or similar names in the dataset, with slight differences in address, place of birth, and other details. The data is recorded as it was reported to ASIC and we cannot confirm if these similar records are/are not the same person.
The information in the downloadable dataset includes:
Additional information about Banned and Disqualified Persons can be found via ASIC's website. To view some information you may be charged a fee.
More information about searching ASIC's registers.
This table contains 2394 series, with data for years 1991 - 1991 (not all combinations necessarily have data for all years). This table contains data described by the following dimensions (Not all combinations are available): Geography (1 items: Canada ...), Population group (19 items: Entire cohort; Income adequacy quintile 1 (lowest);Income adequacy quintile 2;Income adequacy quintile 3 ...), Age (14 items: At 25 years; At 30 years; At 40 years; At 35 years ...), Sex (3 items: Both sexes; Females; Males ...), Characteristics (3 items: Life expectancy; High 95% confidence interval; life expectancy; Low 95% confidence interval; life expectancy ...).
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Number of households broken down by type of household. The variable is divided into single household types (including single residents), cohabiting and other households, all with or without children. Children are counted by kinship and not age. Households are household-dwelling units.All persons registered in the same apartment are counted to the same household. For more information on household statistics, see Statistics Sweden’s document “Register-based household statistics”. Data from the Register of Total Population (RTB). In this file there are statistics for a number of variables broken down by Malmö’s different areas over time. Source Unless otherwise stated, the statistics in this database are retrieved from Statistics Sweden’s (SCB) regional database, Skånedatabasen or from Statistics Sweden’s area statistics database (OSDB). The Skåne database and OSDB show data from several different sources that Statistics Sweden has compiled on a geographical level. The statistics only cover persons who are part of the population registered in the population. Therefore, persons without a residence permit, such as asylum seekers, and persons who simply have not registered in the municipality are not included. Statistics Sweden does not provide statistics on which language residents speak, which religion you belong to or what ethnicity or political views you have. Therefore, such data is not available here either. However, the Electoral Authority reports election results per constituency on its website val.se.There are statistics from the last election as well as several previous elections available. Please note, however, that the constituencies do not necessarily follow the division of the city made here. Update The data is updated every spring as Statistics Sweden releases the figures to the municipality. Most variables are available for the year before.However, income and employment data are released with another year’s backlog. Unless otherwise stated, the date of measurement is 31 December of each year. Geographical breakdown Unless otherwise stated, the data is available for Malmö as a whole and broken down into urban areas (5 pieces), districts (10 pieces) and subareas (136 pieces). In addition to these, there is a residual post that contains the people who are not written in a specific place in the municipality, have protected identity and more. These people are also part of the total. In several of the subareas there are no or only a few registered population registers.Therefore, no data are reported for these areas. Examples of such sub-areas are parks such as Pildammsparken and Kroksbäcksparken and industrial areas such as Fosieby Industriområde and Spillepengen. Privacy clearance In order to protect the identity of individuals, the data is confidentially audited. This means that small values are suppressed, i.e. replaced by empty cells.However, the values are included in summaries. In general, the following rules apply: * No statistics are reported for geographical areas with very few housing. * No cells with fewer than 5 individuals are reported. For data classified as sensitive (e.g. income and country of birth), larger values can also be suppressed. * In cases where a subcategory (e.g. a training category) is too small to be accounted for, all categories are often suppressed. Please use the numbers, but use “City Office, Malmö City” as the source.
https://datafinder.stats.govt.nz/license/attribution-4-0-international/https://datafinder.stats.govt.nz/license/attribution-4-0-international/
Dataset contains counts and measures for individuals from the 2013, 2018, and 2023 Censuses. Data is available by statistical area 1.
The variables included in this dataset are for the census usually resident population count (unless otherwise stated). All data is for level 1 of the classification.
The variables for part 2 of the dataset are:
Download lookup file for part 2 from Stats NZ ArcGIS Online or embedded attachment in Stats NZ geographic data service. Download data table (excluding the geometry column for CSV files) using the instructions in the Koordinates help guide.
Footnotes
Te Whata
Under the Mana Ōrite Relationship Agreement, Te Kāhui Raraunga (TKR) will be publishing Māori descent and iwi affiliation data from the 2023 Census in partnership with Stats NZ. This will be available on Te Whata, a TKR platform.
Geographical boundaries
Statistical standard for geographic areas 2023 (updated December 2023) has information about geographic boundaries as of 1 January 2023. Address data from 2013 and 2018 Censuses was updated to be consistent with the 2023 areas. Due to the changes in area boundaries and coding methodologies, 2013 and 2018 counts published in 2023 may be slightly different to those published in 2013 or 2018.
Subnational census usually resident population
The census usually resident population count of an area (subnational count) is a count of all people who usually live in that area and were present in New Zealand on census night. It excludes visitors from overseas, visitors from elsewhere in New Zealand, and residents temporarily overseas on census night. For example, a person who usually lives in Christchurch city and is visiting Wellington city on census night will be included in the census usually resident population count of Christchurch city.
Population counts
Stats NZ publishes a number of different population counts, each using a different definition and methodology. Population statistics – user guide has more information about different counts.
Caution using time series
Time series data should be interpreted with care due to changes in census methodology and differences in response rates between censuses. The 2023 and 2018 Censuses used a combined census methodology (using census responses and administrative data), while the 2013 Census used a full-field enumeration methodology (with no use of administrative data).
Study participation time series
In the 2013 Census study participation was only collected for the census usually resident population count aged 15 years and over.
About the 2023 Census dataset
For information on the 2023 dataset see Using a combined census model for the 2023 Census. We combined data from the census forms with administrative data to create the 2023 Census dataset, which meets Stats NZ's quality criteria for population structure information. We added real data about real people to the dataset where we were confident the people who hadn’t completed a census form (which is known as admin enumeration) will be counted. We also used data from the 2018 and 2013 Censuses, administrative data sources, and statistical imputation methods to fill in some missing characteristics of people and dwellings.
Data quality
The quality of data in the 2023 Census is assessed using the quality rating scale and the quality assurance framework to determine whether data is fit for purpose and suitable for release. Data quality assurance in the 2023 Census has more information.
Concept descriptions and quality ratings
Data quality ratings for 2023 Census variables has additional details about variables found within totals by topic, for example, definitions and data quality.
Disability indicator
This data should not be used as an official measure of disability prevalence. Disability prevalence estimates are only available from the 2023 Household Disability Survey. Household Disability Survey 2023: Final content has more information about the survey.
Activity limitations are measured using the Washington Group Short Set (WGSS). The WGSS asks about six basic activities that a person might have difficulty with: seeing, hearing, walking or climbing stairs, remembering or concentrating, washing all over or dressing, and communicating. A person was classified as disabled in the 2023 Census if there was at least one of these activities that they had a lot of difficulty with or could not do at all.
Using data for good
Stats NZ expects that, when working with census data, it is done so with a positive purpose, as outlined in the Māori Data Governance Model (Data Iwi Leaders Group, 2023). This model states that "data should support transformative outcomes and should uplift and strengthen our relationships with each other and with our environments. The avoidance of harm is the minimum expectation for data use. Māori data should also contribute to iwi and hapū tino rangatiratanga”.
Confidentiality
The 2023 Census confidentiality rules have been applied to 2013, 2018, and 2023 data. These rules protect the confidentiality of individuals, families, households, dwellings, and undertakings in 2023 Census data. Counts are calculated using fixed random rounding to base 3 (FRR3) and suppression of ‘sensitive’ counts less than six, where tables report multiple geographic variables and/or small populations. Individual figures may not always sum to stated totals. Applying confidentiality rules to 2023 Census data and summary of changes since 2018 and 2013 Censuses has more information about 2023 Census confidentiality rules.
Measures
Measures like averages, medians, and other quantiles are calculated from unrounded counts, with input noise added to or subtracted from each contributing value
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
There is a beginning and an end to every life. But when do people die in The Netherlands? Do seasonal factors cause a difference? I might add weather and birth data to this graph to get a more full picture.
Data is per week, per age group (0-65, 65-80, 80+) and sex. Notice that a full year is never exactly 52 weeks, so there are always two halve weeks in de data set. What do you do with this difference? Data is downloaded from the CBS site (see Source).
More explanation (in Dutch) can be found here: https://www.cbs.nl/nl-nl/onze-diensten/methoden/onderzoeksomschrijvingen/korte-onderzoeksbeschrijvingen/bevolkingsstatistiek.
Since the start of 2010, more late death registrations have been counted.
Is it possible to see the small change in measurement from 2010? How many deaths have gone unnoticed before that year without using this method? In which season do more people die? And which group causes this difference? And a more gruesome question: how many people will die next week?
Downloaded from the Dutch Bureau for Statistics (CBS) at 21-12-2019 with their tool Statline. Source: https://opendata.cbs.nl/statline/#/CBS/nl/dataset/70895ned/table?fromstatweb
Copyright (c) Centraal Bureau voor de Statistiek, Den Haag / Heerlen Verveelvoudiging is toegestaan, mits het CBS als bron wordt vermeld. Translation: Can be shared when source is mentioned.
Cover photo by Aron Visuals on Unsplash (https://unsplash.com/@aronvisuals?utm_source=unsplash&utm_medium=referral&utm_content=creditCopyText)
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
There is no scientific consensus on the fundamental question whether the probability distribution of the human life span has a finite endpoint or not and, if so, whether this upper limit changes over time. Our study uses a unique dataset of the ages at death—in days—of all (about 285,000) Dutch residents, born in the Netherlands, who died in the years 1986–2015 at a minimum age of 92 years and is based on extreme value theory, the coherent approach to research problems of this type. Unlike some other studies, we base our analysis on the configuration of thousands of mortality data of old people, not just the few oldest old. We find compelling statistical evidence that there is indeed an upper limit to the life span of men and to that of women for all the 30 years we consider and, moreover, that there are no indications of trends in these upper limits over the last 30 years, despite the fact that the number of people reaching high age (say 95 years) was almost tripling. We also present estimates for the endpoints, for the force of mortality at very high age, and for the so-called perseverance parameter. Supplementary materials for this article, including a standardized description of the materials available for reproducing the work, are available as an online supplement.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Shows in which industry the employed persons living in the area work. The breakdown follows the Standard for Swedish Industrial Classification (SNI) 2007. Categories 0 (unknown), A (Agriculture, hunting and forestry), B (extraction of materials), D (supply of electricity, gas, heating and cooling), E (water supply, etc.), T (household production, etc.) and U (activities of international organisations) have been merged into the category “Others” due to the low number of workers in these industries. The variable also shows the total number of workers living in the area. Only people who work in Sweden are included. The variable is divided by gender. Shown to the population aged 16-74. Based on statistics from Statistics Sweden RAMS (“Register-based labour market statistics”). The statistics include all individuals between 16 and 74 who are registered in Sweden on 31 December. As of 2011, Statistics Sweden has made some changes in how to count people aged 65 or over to get a more consistent assessment of self-employed persons. It is therefore not appropriate to compare statistics further back in time than in 2011. As of the reference year 2019, Statistics Sweden is using a new data source and method for classifying workers in RAMS. The change of source and method means that comparisons of statistics for 2019 and previous reference years must be made with great care. Read more at SCB.se
In this file there are statistics for a number of variables broken down by Malmö’s different areas over time.
Source
Unless otherwise stated, the statistics in this database are retrieved from Statistics Sweden’s (SCB) regional database, Skånedatabasen or from Statistics Sweden’s area statistics database (OSDB). The Skåne database and OSDB show data from several different sources that Statistics Sweden has compiled on a geographical level. The statistics only cover persons who are part of the population registered in the population. Therefore, persons without a residence permit, such as asylum seekers, and persons who simply have not registered in the municipality are not included. Statistics Sweden does not provide statistics on which language residents speak, which religion you belong to or what ethnicity or political views you have. Therefore, such data is not available here either. However, the Electoral Authority reports election results per constituency on its website val.se. There are statistics from the last election as well as several previous elections available. Please note, however, that the constituencies do not necessarily follow the division of the city made here.
Update
The data is updated every spring as Statistics Sweden releases the figures to the municipality. Most variables are available for the year before. However, income and employment data are released with another year’s backlog. Unless otherwise stated, the date of measurement is 31 December of each year.
Geographical breakdown
Unless otherwise stated, the data is available for Malmö as a whole and broken down into urban areas (5 pieces), districts (10 pieces) and subareas (136 pieces). In addition to these, there is a residual post that contains the people who are not written in a specific place in the municipality, have protected identity and more. These people are also part of the total. In several of the subareas there are no or only a few registered population registers. Therefore, no data are reported for these areas. Examples of such sub-areas are parks such as Pildammsparken and Kroksbäcksparken and industrial areas such as Fosieby Industriområde and Spillepengen.
Privacy clearance
In order to protect the identity of individuals, the data is confidentially audited. This means that small values are suppressed, i.e. replaced by empty cells. However, the values are included in summaries. In general, the following rules apply:
API
With the help of the API call https://ckan-malmo.dataplatform.se/api/3/action/resource_search?query=description:malm%C3%B6%20statistik, you get in JSON format all datasets that contain statistical data for Malmö’s areas. In each instance of result/results in the JSON result, for each resource there is an “id” property. The value of “id” can be used to retrieve the data for the respective statistical variable (according to the value of the property “name”). API calls to retrieve the statistical data: * https://ckan-malmo.dataplatform.se/api/action/datastore_search?resource_id=[id-värdet]*
Please use the numbers, but use “City Office, Malmö City” as the source.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
In this file there are statistics for a number of variables broken down by Malmö’s different areas over time. Source Unless otherwise stated, the statistics in this database are retrieved from Statistics Sweden’s (SCB) regional database, Skånedatabasen or from Statistics Sweden’s area statistics database (OSDB). The Skåne database and OSDB show data from several different sources that Statistics Sweden has compiled on a geographical level. The statistics only cover persons who are part of the population registered in the population. Therefore, persons without a residence permit, such as asylum seekers, and persons who simply have not registered in the municipality are not included. Statistics Sweden does not provide statistics on which language residents speak, which religion you belong to or what ethnicity or political views you have. Therefore, such data is not available here either. However, the Electoral Authority reports election results per constituency on its website val.se. There are statistics from the last election as well as several previous elections available. Please note, however, that the constituencies do not necessarily follow the division of the city made here. Update The data is updated every spring as Statistics Sweden releases the figures to the municipality. Most variables are available for the year before. However, income and employment data are released with another year’s backlog. Unless otherwise stated, the date of measurement is 31 December of each year. Geographical breakdown Unless otherwise stated, the data is available for Malmö as a whole and broken down into urban areas (5 pieces), districts (10 pieces) and subareas (136 pieces). In addition to these, there is a residual post that contains the people who are not written in a specific place in the municipality, have protected identity and more. These people are also part of the total. In several of the subareas there are no or only a few registered population registers. Therefore, no data are reported for these areas. Examples of such sub-areas are parks such as Pildammsparken and Kroksbäcksparken and industrial areas such as Fosieby Industriområde and Spillepengen. Privacy clearance In order to protect the identity of individuals, the data is confidentially audited. This means that small values are suppressed, i.e. replaced by empty cells. However, the values are included in summaries. In general, the following rules apply: * No statistics are reported for geographical areas with very few housing. * No cells with fewer than 5 individuals are reported. For data classified as sensitive (e.g. income and country of birth), larger values can also be suppressed. * In cases where a subcategory (e.g. a training category) is too small to be accounted for, all categories are often suppressed. Please use the numbers, but use “City Office, Malmö City” as the source.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Depression has strong negative impacts on how individuals function, leading to the assumption that there is strong negative selection on this trait that should deplete genetic variation and decrease its prevalence in human populations. Yet, depressive symptoms remain common. While there has been a large body of work trying to resolve this paradox by mapping genetic variation of this complex trait, there have been few direct empirical tests of the core assumption that there is consistent negative selection on depression in human populations. Here, we use a unique long-term dataset from the National Health and Nutrition Examination Survey that spans four generational cohorts (Silent Generation: 1928–1945, Baby Boomers: 1946–1964, Generation X: 1965–1980, and Millenials: 1981–1996) to measure both depression scores and fitness components (lifetime sexual partners, pregnancies, and live births) of women from the United States born between 1938–1994. We not only assess fitness consequences of depression across multiple generations to determine whether the strength and direction of selection on depression has changed over time, but we also pair these fitness measurements with mixed models to assess how several important covariates, including age, body mass, education, race/ethnicity, and income might influence this relationship. We found that, overall, selection on depression was positive and the strength of selection changed over time–women reporting higher depression had relatively more sexual partners, pregnancies, and births except during the Silent Generation when selection coefficients neared zero. We also found that depression scores and fitness components differed among generations—Baby Boomers showed the highest severity of depression and the most sexual partners. These results were not changed by the inclusion of covariates in our models. A limitation of this study is that for the Millenials, reproduction has not completed and data for this generation is interrupted by right censoring. Most importantly, our results undermine the common belief that there is consistent negative selection on depression and demonstrate that the relationship between depression and fitness changes between generations, which may explain its maintenance in human populations.
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
This dataset provides Census 2022 estimates for Trans Status or History (7 Groups) in Scotland.
Scotland’s Census included a new question on trans status or history in 2022. This means there is not comparable data for previous censuses.
The question was “Do you consider yourself to be trans, or have a trans history?”. People were asked to tick “No” or “Yes”. People who ticked “Yes” were asked to describe their trans status (for example, non-binary, trans man, trans woman).
Transgender or trans is a term used to describe people whose gender is not the same as the sex they were assigned at birth.
This was a voluntary question for people aged 16 and over.
The quality assurance report can be found here
The State of Giving project, established by the Centre for Civil Society (CCS) at the University of KwaZulu-Natal (UKZN), the Southern African Grantmakers’ Association (SAGA) and the National Development Agency (NDA), was initiated to generate information on and analyse the resource flows to poverty alleviation and development in South Africa. One component of the broader project was a focus on individual-level giving, which involved the design, implementation and analysis of a national sample survey on individual level giving behaviour. It thus speaks to both the urban and rural and the formal and informal dimensions of our social context. The survey collected data on who gives, why and how much they give, as well as what they give and the recipients of their giving.
The sample, a random stratified one comprising 3000 respondents, is representative of all South Africans aged 18 and above.
Individuals
The population of interest in the survey was all South Africans aged 18 and above.
Sample survey data
A random stratified survey sample was drawn by Ross Jennings at S&T. The sample was stratified by race and province at the first level, and then by area (rural/urban/etc.) at the second level. The sample frame comprised 3000 respondents, yielding an error bar of 1.8%. The results are representative of all South Africans aged 18 and above, in all parts of the country, including formal and informal dwellings. Unlike many surveys, the project partners ensured that the rural component of the sample (commonly the most expensive for logistical reasons) was large and did not require heavy weighting (where a small number of respondents have to represent the views of a far larger community).
Randomness was built into the selection of starting points (from which fieldworkers begin their work) - every 5th dwelling was selected, after a randomly selected starting point had been identified - and into the selection of respondents, where the birthday rule was applied. That is, a household roster was completed, all those aged 18 and above were listed, and the householder whose birthday came next was identified as the respondent. Three call-backs were undertaken to interview the selected respondent; if s/he was unavailable, the household was substituted.
A second sample was drawn, specifically to boost the minority religious groups – namely Hindus, Jews and Muslims. They are separately analysed and reported as part of the broader project, since area sampling was used, disallowing us from incorporating them into the national survey dataset.
Face-to-face [f2f]
A set of focus groups were staged across the country in order to inform questionnaire design. Groups were recruited across a range of criteria, including demographic and religious differences, in order to ensure a wide range of views were canvassed. Direct input from focus group participants informed a series of robust design sessions with all the project partners, from which a draft questionnaire was designed. The questionnaire was piloted in two provinces, involving urban and rural respondents and covering all four race groups. The pilot included testing specific questions, and the overall methodological approach, namely our ability to quantify giving. After the pilot results had been assessed, the questionnaire was revised before going into field.
"0" values in some variables Many of the variables have a "0" value in addition to the values for responses, e.g. variables with yes/no responses are coded "0" "1""2". There is no indication that the 0 represents "missing" (only Q75 specifies the use of "0" for none/nobody).
Variable Q9 (Question 9) Q8 lists the number of resident children under the age of 18. Q9 refers to this question with: "of these children aged below 16 living in your household". This should probably be "aged below 18", in line with Q8 The data only reflects children under 16, so the question should probably have been "of these children, how many below the age of 16 are (Q9A) children of the head of the household and (Q9B) children not born to the head of household, i.e. children born to others. It seems though, that Q8 and Q9 should match, with Q8 identifying children and Q9 identifying children of the household head. If specifying 16 rather than 18 in Q9 is an error, then this has been reflected in the data. This means that household members 17-18 years are listed, but the data does not record whether they are children of the household head.
Variable Q21 (Question 21) “What do you think is the most deserving cause that you support or would support if you could?” There are 14 values for Q21 (1-14).According to the report (Everatt, D. and G. Solanki. 2005. A Nation of givers: Social giving amongst South Africans) this and other open-ended questions were later categorised and given numeric codes. However, a codebook was not included with the documentation provided to DataFirst
Variable Q22 (Question 22) “Is there one cause or charity or organisation you would definitely NOT give money to?” There are 14 values for Q22 (1-14). Again, this requires a code list for explanation.
Variable Q29 (Question 29) Q28 deals with the giving of goods/food/clothes. Q29 provides a breakdown of these items, and Q28Q29L lists time/labour as one of these. It seems that Q29L is incorrectly listed as a sub-set of goods/food/clothes. Also, giving time to causes is dealt with extensively in Q30A-Q and Q31A-Q, so this variable seems out of place.
Variable Q39 (Question 36) This concerns the giving of food, goods, or other forms of help to beggars/street children/people asking for help, but the question text does not specifically mention these forms of help, so can be misleading.
Variable Q44 (Question 44) Q44 asks the respondent to complete the sentence "Help the poor because…." There are 8 values for this variable (0-7 and 11). Again, a code list is required to explain these values.
Variable Q59 (Question 59) This question has three coded responses (1-3) so should have three values (or 4, with a “missing” value). There are 12 values for this variable, though (59A-59L). It is possible that this variable has been swopped with Q60 (However, Q60 only has 11 options in the questionnaire)
Variable Q60 (Question 60) The variable from this question only has 4 values, but there are 11 possible responses to this question (60A-60K). This variable could have been swopped with Q59 (In which case, the extra value needs explanation, as Q59 only has 11 options in the questionnaire.
Variables Q67 - Q82 From this point on the order of variables seems wrong, as the responses don't match the number of values listed in the questionnaire. The variables seem to refer to the next question along, e.g. Variable Q67 seems to have data emanating from Question 68, and so on. The data in the revised dataset has been corrected to reflect this.
There is no variable Q83 in the dataset, although there is a question 83 in the questionnaire. This seems to support the above explanation. Data users are requested to provide any additional findings on this that come to light in their research.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Version 2 (18 March 2025) includes a further 356 service itineraries. In addition, 41 entries from the previous version were updated or expanded. Currently the database covers a total of 1,858 Jewish soldiers, 421 wives and 83 children.
ORIGINAL VERSION 1 (18 September 2024)
With more than 1,500 individual entries, this is the inaugural instalment of my research database collated in the framework of the Project Forgotten Soldiers: Jewish Military Experience in the Habsburg Monarchy. This is an open access database, and everyone is welcome to use it according to their own scholarly and personal interests. In 1,189 cases we have official documented records confirming the soldiers were Jewish. In another 313 entries I was able to identify likely Jewish soldiers based on circumstantial evidence cross-referencing names and places of birth, with the presence of confirmed Jewish soldiers drafted into the same units as part of the same recruitment drive. This dataset further includes evidence for 156 spouses and 47 children. While military records do mentions these, their number suggests that the Habsburg army preferred to enlist unmarried men.
The database is structured in a similar way to an official individual entry in the Habsburg military records. These were arranged in tables, with soldiers listed by seniority. Name, place and land of birth are followed by age and religion. This latter rubric allows identifying the bulk of the Jewish soldiers. Also included in the record is marital status, profession (if any), number, names and ages of children (if any), followed by a short summary text of the soldier’s service itinerary. While not always consistent in detail, these texts mention enlistment dates, transfers between units, promotions, desertions, periods as prisoner of war and military awards (if any). I have taken the material from the personal records and added several additional parameters:
The soldiers are entered into the database according to their date of enlistment. This is followed by a colour-coded table showing their years of service. To see the meaning of the different colours employed, scroll to the legend at the end of the dataset.
Following the years of service, we see the date when the soldier left service (final year in service for incomplete service records). When known, the reason the soldier left the army is given (discharge/ death/ desertion etc).
Then come the three most important columns within the table: service record, primary sources and units. At first glance, these columns have only a few letters and numbers, but bring your mouse courser onto the relevant field marked with red triangles. An additional window will then open:
a. Service Record: Shows the entire service record of the soldier arranged by date. I use original German as it appears in the archival records. If you see spelling differences with modern German – they are there for a reason.
b. Primary Sources: Provides the information on all the archival records consulted to reconstruct the service itinerary. The number in the field denotes the number of the archival cartons consulted.
c. Units: Number of units in which a soldier serves. Bringing the cursor on to the field will open their list. Most Jewish soldiers served in the line infantry (IR) and the Military Transport Corps (MFWK or MFK). However, there were also Jewish sharpshooters, cavalrymen, gunners and even a few members of the nascent Austrian Navy.
The next two columns provide entries of the soldier’s conduct and medical condition, which in Habsburg military jargon was referred to rather callously as Defekten. I note the original medical diagnoses verbatim. When possible to identify, I note the modern medical term.
General database-wide parameters are then noted in the next part of the table. Among others, it provides information on enlistment type (conscript/ volunteer?), main branches of service (such as Infantry/ Cavalry/ Artillery), and roles within the military (such as non-commissioned officers/ drummers/ medics).
Concluding this part of the table are columns covering desertions, periods as prisoner of war and awards of the army cannon cross (for veterans of 1813-14) and other military awards.
The last column provides the original German outtake rubric as to how the soldier left service. In special cases, additional service notes are provides on the right.
How to use this dataset
This depends on what you are looking for. Firstly, download the dataset on to your computer via the link provided below. It is a simple Excel file which is easy to work with. If you wish to find out whether one of your ancestors served in the Habsburg army, use a simple keyword search. Please note that in our period there was no single accepted orthography meaning that some letters were used interchangeably (for instance B/P; D/T). There were also various patronymic suffices used in different parts of the monarchy (-witz in German/ -wicz in Polish/ -vits in Hungarian). Habsburg military clerks were mostly German speakers who often recorded the name phonetically. For instance, Jankel/ Jankl/ Jacob/ Jacobus all denote the same name. A Jewish teenager who identified himself as Moische when first reporting to duty, may have stayed so in the military records for decades, even if he was already a non-commissioned officer whose subordinates referred to as Herr Corporal.
If you study the history of concrete Jewish communities, use the keyword search and the filter option to find entries in the database where this locality is mentioned. Some places like Prague and Lublin could be identified effortlessly. In other cases (and see the above point on German-speaking clerks), place names were recorded phonetically. The military authority usually stuck to official Polish names in Galicia, and Hungarian in the Lands of the Crown of St. Stephan. In reality, a Jewish recruit from Transcarpathian Ruthenia could have his place of birth recorded in Hungarian, Romanian or Rusin. When I could not identify the place in question, I marked it with italics. Do you think you identified something I could not? Excellent! Then please write me, and I will correct the entry in the next instalment of this database.
I should stress that, currently, the database is not statistically representative. I have worked chronologically, meaning that there are disproportionally more entries for Jewish soldiers from the Turkish War, the first two Coalition Wars, and the Wars of 1805 and 1809. If you look at some of my other databases (for instance, that of the 1st Line Infantry Regiment 'Kaiser'), you will find least as many Jews who served in the wars of 1813-15. I will cover these in due course. This said, using the filter option of the Excel sheet, you can already make some individual queries. For instance, did Jewish grenadiers meet the minimal height requirement to be eligible for transfer into the elite infantry? (Hint: they did not!) If you are interested in the historical study of nutritional standards, compare the height of the soldiers with their year and place of birth. In my other project, I made calculations of the average height of Habsburg soldiers and I can already reveal that Jewish conscripts were, on average, several centimetres smaller than their non-Jewish comrades drafted in the same annual intake. Whatever stereotypes said, most Jews in the Habsburg Monarchy around 1800 were very poor and the sad fact of malnutrition as a child is reflected in their height as adults.
I should stress that this is a cumulative database. ZENODO has an excellent feature allowing updated versions to supersede earlier files while retaining the same DOI (Digital Object Identifier) and metadata. As my research progresses, I plan to upload new versions of this database bi-annually. This includes not only adding new entries, but also expanding and correcting existing ones. It might well be that the service record of a soldier covered up to 1806 will be brought to a later date, possibly even to his discharge from the army. If you have not found whom you are looking for, or if you want to work with larger samples for your research, visit this page again in a few months’ time. And if you do use this database for scholarly research (by all means, please do), do not forget to cite it as you would cite any other item in your bibliography! If you are a museum professional and you want to employ material from your database to illustrate your exhibitions, you are welcome, but please cite this resource for others to learn. Links to this database will also be appreciated.