39 datasets found

Mortality and Causes of Death 1997-2017 - South Africa
catalog.ihsn.org
datacatalog.ihsn.org
+1more
Updated Jan 19, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statistics South Africa (2021). Mortality and Causes of Death 1997-2017 - South Africa [Dataset]. https://catalog.ihsn.org/catalog/study/ZAF_1997-2017_MCD_v01_M
Explore at:
Dataset updated
Jan 19, 2021
Dataset provided by
Statistics South Africahttp://www.statssa.gov.za/
Department of Home Affairs
Time period covered
1997 - 2017
Area covered
South Africa
Description
Abstract

This cumulative dataset contains statistics on mortality and causes of death in South Africa covering the period 1997-2017. The mortality and causes of death dataset is part of a regular series published by Stats SA, based on data collected through the civil registration system. This dataset is the most recent cumulative round in the series which began with the separately available dataset Recorded Deaths 1996.

The main objective of this dataset is to outline emerging trends and differentials in mortality by selected socio-demographic and geographic characteristics for deaths that occurred in the registered year and over time. Reliable mortality statistics, are the cornerstone of national health information systems, and are necessary for population health assessment, health policy and service planning; and programme evaluation. They are essential for studying the occurrence and distribution of health-related events, their determinants and management of related health problems. These data are particularly critical for monitoring the Sustainable Development Goals (SDGs) and Agenda 2063 which share the same goal for a high standard of living and quality of life, sound health and well-being for all and at all ages. Mortality statistics are also required for assessing the impact of non-communicable diseases (NCD's), emerging infectious diseases, injuries and natural disasters.

Geographic coverage

National coverage

Analysis unit

Individuals

Universe

This dataset is based on information on mortality and causes of death from the South African civil registration system. It covers all death notification forms from the Department of Home Affairs for deaths that occurred in 1997-2017, that reached Stats SA during the 2018/2019 processing phase.

Kind of data

Administrative records data [adm]

Mode of data collection

Other [oth]

Research instrument

The registration of deaths is captured using two instruments: form BI-1663 and form DHA-1663 (Notification/Register of death/stillbirth).

Data appraisal

This cumulative dataset is part of a regular series published by Stats SA and includes all previous rounds in the series (excluding Recorded Deaths 1996). Stats SA only includes one variable to classify the occupation group of the deceased (OccupationGrp) in the current round (1997-2017). Prior to 2016, Stats SA included both occupation group (OccupationGrp) and industry classification (Industry) in all previous rounds. Therefore, DataFirst has made the 1997-2015 cumulative round available as a separately downloadable dataset which includes both occupation group and industry classification of the deceased spanning the years 1997-2015.
Crime statistics - Dataset - data.sa.gov.au
data.sa.gov.au
Updated Oct 9, 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data.sa.gov.au (2017). Crime statistics - Dataset - data.sa.gov.au [Dataset]. https://data.sa.gov.au/data/dataset/crime-statistics
Explore at:
Dataset updated
Oct 9, 2017
Dataset provided by
Government of South Australiahttp://sa.gov.au/
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
South Australia
Description
Suburb-based crime statistics for crimes against the person and crimes against property. The Crime statistics datasets contain all offences against the person and property that were reported to police in that respective financial year. The Family and Domestic Abuse-related offences datasets are a subset of this, in that a separate file is presented for these offences that were flagged as being of a family and domestic abuse nature for that financial year. Consequently the two files for the same financial year must not be added together. Data is point in time.
Z
Effect of suicide rates on life expectancy dataset
data.niaid.nih.gov
zenodo.org
Updated Apr 16, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Filip Zoubek (2021). Effect of suicide rates on life expectancy dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4694269
Explore at:
Dataset updated
Apr 16, 2021
Dataset authored and provided by
Filip Zoubek
License
Attribution-NonCommercial-ShareAlike 3.0 (CC BY-NC-SA 3.0)https://creativecommons.org/licenses/by-nc-sa/3.0/
License information was derived automatically
Description
Effect of suicide rates on life expectancy dataset

Abstract In 2015, approximately 55 million people died worldwide, of which 8 million committed suicide. In the USA, one of the main causes of death is the aforementioned suicide, therefore, this experiment is dealing with the question of how much suicide rates affects the statistics of average life expectancy. The experiment takes two datasets, one with the number of suicides and life expectancy in the second one and combine data into one dataset. Subsequently, I try to find any patterns and correlations among the variables and perform statistical test using simple regression to confirm my assumptions.

Data

The experiment uses two datasets - WHO Suicide Statistics[1] and WHO Life Expectancy[2], which were firstly appropriately preprocessed. The final merged dataset to the experiment has 13 variables, where country and year are used as index: Country, Year, Suicides number, Life expectancy, Adult Mortality, which is probability of dying between 15 and 60 years per 1000 population, Infant deaths, which is number of Infant Deaths per 1000 population, Alcohol, which is alcohol, recorded per capita (15+) consumption, Under-five deaths, which is number of under-five deaths per 1000 population, HIV/AIDS, which is deaths per 1 000 live births HIV/AIDS, GDP, which is Gross Domestic Product per capita, Population, Income composition of resources, which is Human Development Index in terms of income composition of resources, and Schooling, which is number of years of schooling.

LICENSE

THE EXPERIMENT USES TWO DATASET - WHO SUICIDE STATISTICS AND WHO LIFE EXPECTANCY, WHICH WERE COLLEECTED FROM WHO AND UNITED NATIONS WEBSITE. THEREFORE, ALL DATASETS ARE UNDER THE LICENSE ATTRIBUTION-NONCOMMERCIAL-SHAREALIKE 3.0 IGO (https://creativecommons.org/licenses/by-nc-sa/3.0/igo/).

[1] https://www.kaggle.com/szamil/who-suicide-statistics

[2] https://www.kaggle.com/kumarajarshi/life-expectancy-who
d
Annual Report 2021-22 - Dataset - data.sa.gov.au
data.sa.gov.au
Updated Dec 31, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2021). Annual Report 2021-22 - Dataset - data.sa.gov.au [Dataset]. https://data.sa.gov.au/data/dataset/annual-report-2021-22
Explore at:
Dataset updated
Dec 31, 2021
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
South Australia
Description
This dataset contains data relating to child deaths in South Australia, as reported in the Child Death and Serious Injury Review Committee's Annual Report 2021-22.
Z
MultiCaRe: An open-source clinical case dataset for medical image...
data.niaid.nih.gov
Updated Mar 9, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nievas Offidani, Mauro (2025). MultiCaRe: An open-source clinical case dataset for medical image classification and multimodal AI applications [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10079369
Explore at:
Dataset updated
Mar 9, 2025
Dataset authored and provided by
Nievas Offidani, Mauro
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
The dataset contains multi-modal data from over 70,000 open access and de-identified case reports, including metadata, clinical cases, image captions and more than 130,000 images. Images and clinical cases belong to different medical specialties, such as oncology, cardiology, surgery and pathology. The structure of the dataset allows to easily map images with their corresponding article metadata, clinical case, captions and image labels. Details of the data structure can be found in the file data_dictionary.csv.

More than 90,000 patients and 280,000 medical doctors and researchers were involved in the creation of the articles included in this dataset. The citation data of each article can be found in the metadata.parquet file.

Refer to the examples showcased in this GitHub repository to understand how to optimize the use of this dataset.The license of the dataset as a whole is CC BY-NC-SA. However, its individual contents may have less restrictive license types (CC BY, CC BY-NC, CC0). For instance, regarding image filess, 66K of them are CC BY, 32K are CC BY-NC-SA, 32K are CC BY-NC, and 20 of them are CC0.
z
Counts of Dengue reported in SAUDI ARABIA: 1994-1994
zenodo.org
data.niaid.nih.gov
+1more
json, xml, zip
Updated Jun 3, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Willem Van Panhuis; Willem Van Panhuis; Anne Cross; Anne Cross; Donald Burke; Donald Burke (2024). Counts of Dengue reported in SAUDI ARABIA: 1994-1994 [Dataset]. http://doi.org/10.25337/t7/ptycho.v2.0/sa.38362002
Explore at:
json, zip, xmlAvailable download formats
Unique identifier
https://doi.org/10.25337/t7/ptycho.v2.0/sa.38362002
Dataset updated
Jun 3, 2024
Dataset provided by
Project Tycho
Authors
Willem Van Panhuis; Willem Van Panhuis; Anne Cross; Anne Cross; Donald Burke; Donald Burke
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 1994 - Dec 31, 1994
Area covered
Saudi Arabia
Description
Project Tycho datasets contain case counts for reported disease conditions for countries around the world. The Project Tycho data curation team extracts these case counts from various reputable sources, typically from national or international health authorities, such as the US Centers for Disease Control or the World Health Organization. These original data sources include both open- and restricted-access sources. For restricted-access sources, the Project Tycho team has obtained permission for redistribution from data contributors. All datasets contain case count data that are identical to counts published in the original source and no counts have been modified in any way by the Project Tycho team. The Project Tycho team has pre-processed datasets by adding new variables, such as standard disease and location identifiers, that improve data interpretabilty. We also formatted the data into a standard data format.
Each Project Tycho dataset contains case counts for a specific condition (e.g. measles) and for a specific country (e.g. The United States). Case counts are reported per time interval. In addition to case counts, datsets include information about these counts (attributes), such as the location, age group, subpopulation, diagnostic certainty, place of aquisition, and the source from which we extracted case counts. One dataset can include many series of case count time intervals, such as "US measles cases as reported by CDC", or "US measles cases reported by WHO", or "US measles cases that originated abroad", etc.
Depending on the intended use of a dataset, we recommend a few data processing steps before analysis:
Analyze missing data: Project Tycho datasets do not inlcude time intervals for which no case count was reported (for many datasets, time series of case counts are incomplete, due to incompleteness of source documents) and users will need to add time intervals for which no count value is available. Project Tycho datasets do include time intervals for which a case count value of zero was reported.
Separate cumulative from non-cumulative time interval series. Case count time series in Project Tycho datasets can be "cumulative" or "fixed-intervals". Cumulative case count time series consist of overlapping case count intervals starting on the same date, but ending on different dates. For example, each interval in a cumulative count time series can start on January 1st, but end on January 7th, 14th, 21st, etc. It is common practice among public health agencies to report cases for cumulative time intervals. Case count series with fixed time intervals consist of mutually exxclusive time intervals that all start and end on different dates and all have identical length (day, week, month, year). Given the different nature of these two types of case count data, we indicated this with an attribute for each count value, named "PartOfCumulativeCountSeries".
Injured Bicyclists Under 16 Years of Age: Case Data - Dataset -...
data.sa.gov.au
Updated May 28, 2014
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data.sa.gov.au (2014). Injured Bicyclists Under 16 Years of Age: Case Data - Dataset - data.sa.gov.au [Dataset]. https://data.sa.gov.au/data/dataset/injured-bicyclists-under-16-years-of-age-case-data
Explore at:
Dataset updated
May 28, 2014
Dataset provided by
Government of South Australiahttp://sa.gov.au/
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
South Australia
Description
The South Australian injury surveillance data was collected from the casualty services of several metropolitan public hospitals. The exercise was conducted from 1986 through June 2013. The objective was to identify the circumstances associated with hospital-treated injury, in order to better target opportunities for prevention. A number of important safety measures resulted directly from this work. NB. The historical success in capturing cases varied by year.
w
Afrobarometer Survey 1 1999-2000, Merged 7 Country - Botswana, Lesotho,...
microdata.worldbank.org
catalog.ihsn.org
Updated Apr 27, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Institute for Democracy in South Africa (IDASA) (2021). Afrobarometer Survey 1 1999-2000, Merged 7 Country - Botswana, Lesotho, Malawi, Namibia, South Africa, Zambia, Zimbabwe [Dataset]. https://microdata.worldbank.org/index.php/catalog/889
Explore at:
Dataset updated
Apr 27, 2021
Dataset provided by
Michigan State University (MSU)
Ghana Centre for Democratic Development (CDD-Ghana)
Institute for Democracy in South Africa (IDASA)
Time period covered
1999 - 2000
Area covered
Zambia, South Africa, Namibia, Lesotho, Zimbabwe, Africa, Botswana, Malawi
Description
Abstract

Round 1 of the Afrobarometer survey was conducted from July 1999 through June 2001 in 12 African countries, to solicit public opinion on democracy, governance, markets, and national identity. The full 12 country dataset released was pieced together out of different projects, Round 1 of the Afrobarometer survey,the old Southern African Democracy Barometer, and similar surveys done in West and East Africa.

The 7 country dataset is a subset of the Round 1 survey dataset, and consists of a combined dataset for the 7 Southern African countries surveyed with other African countries in Round 1, 1999-2000 (Botswana, Lesotho, Malawi, Namibia, South Africa, Zambia and Zimbabwe). It is a useful dataset because, in contrast to the full 12 country Round 1 dataset, all countries in this dataset were surveyed with the identical questionnaire

Geographic coverage

Botswana Lesotho Malawi Namibia South Africa Zambia Zimbabwe

Analysis unit

Basic units of analysis that the study investigates include: individuals and groups

Kind of data

Sample survey data [ssd]

Sampling procedure

A new sample has to be drawn for each round of Afrobarometer surveys. Whereas the standard sample size for Round 3 surveys will be 1200 cases, a larger sample size will be required in societies that are extremely heterogeneous (such as South Africa and Nigeria), where the sample size will be increased to 2400. Other adaptations may be necessary within some countries to account for the varying quality of the census data or the availability of census maps.

The sample is designed as a representative cross-section of all citizens of voting age in a given country. The goal is to give every adult citizen an equal and known chance of selection for interview. We strive to reach this objective by (a) strictly applying random selection methods at every stage of sampling and by (b) applying sampling with probability proportionate to population size wherever possible. A randomly selected sample of 1200 cases allows inferences to national adult populations with a margin of sampling error of no more than plus or minus 2.5 percent with a confidence level of 95 percent. If the sample size is increased to 2400, the confidence interval shrinks to plus or minus 2 percent.

Sample Universe

The sample universe for Afrobarometer surveys includes all citizens of voting age within the country. In other words, we exclude anyone who is not a citizen and anyone who has not attained this age (usually 18 years) on the day of the survey. Also excluded are areas determined to be either inaccessible or not relevant to the study, such as those experiencing armed conflict or natural disasters, as well as national parks and game reserves. As a matter of practice, we have also excluded people living in institutionalized settings, such as students in dormitories and persons in prisons or nursing homes.

What to do about areas experiencing political unrest? On the one hand we want to include them because they are politically important. On the other hand, we want to avoid stretching out the fieldwork over many months while we wait for the situation to settle down. It was agreed at the 2002 Cape Town Planning Workshop that it is difficult to come up with a general rule that will fit all imaginable circumstances. We will therefore make judgments on a case-by-case basis on whether or not to proceed with fieldwork or to exclude or substitute areas of conflict. National Partners are requested to consult Core Partners on any major delays, exclusions or substitutions of this sort.

Sample Design

The sample design is a clustered, stratified, multi-stage, area probability sample.

To repeat the main sampling principle, the objective of the design is to give every sample element (i.e. adult citizen) an equal and known chance of being chosen for inclusion in the sample. We strive to reach this objective by (a) strictly applying random selection methods at every stage of sampling and by (b) applying sampling with probability proportionate to population size wherever possible.

In a series of stages, geographically defined sampling units of decreasing size are selected. To ensure that the sample is representative, the probability of selection at various stages is adjusted as follows:

The sample is stratified by key social characteristics in the population such as sub-national area (e.g. region/province) and residential locality (urban or rural). The area stratification reduces the likelihood that distinctive ethnic or language groups are left out of the sample. And the urban/rural stratification is a means to make sure that these localities are represented in their correct proportions. Wherever possible, and always in the first stage of sampling, random sampling is conducted with probability proportionate to population size (PPPS). The purpose is to guarantee that larger (i.e., more populated) geographical units have a proportionally greater probability of being chosen into the sample. The sampling design has four stages

A first-stage to stratify and randomly select primary sampling units;

A second-stage to randomly select sampling start-points;

A third stage to randomly choose households;

A final-stage involving the random selection of individual respondents

We shall deal with each of these stages in turn.

STAGE ONE: Selection of Primary Sampling Units (PSUs)

The primary sampling units (PSU's) are the smallest, well-defined geographic units for which reliable population data are available. In most countries, these will be Census Enumeration Areas (or EAs). Most national census data and maps are broken down to the EA level. In the text that follows we will use the acronyms PSU and EA interchangeably because, when census data are employed, they refer to the same unit.

We strongly recommend that NIs use official national census data as the sampling frame for Afrobarometer surveys. Where recent or reliable census data are not available, NIs are asked to inform the relevant Core Partner before they substitute any other demographic data. Where the census is out of date, NIs should consult a demographer to obtain the best possible estimates of population growth rates. These should be applied to the outdated census data in order to make projections of population figures for the year of the survey. It is important to bear in mind that population growth rates vary by area (region) and (especially) between rural and urban localities. Therefore, any projected census data should include adjustments to take such variations into account.

Indeed, we urge NIs to establish collegial working relationships within professionals in the national census bureau, not only to obtain the most recent census data, projections, and maps, but to gain access to sampling expertise. NIs may even commission a census statistician to draw the sample to Afrobarometer specifications, provided that provision for this service has been made in the survey budget.

Regardless of who draws the sample, the NIs should thoroughly acquaint themselves with the strengths and weaknesses of the available census data and the availability and quality of EA maps. The country and methodology reports should cite the exact census data used, its known shortcomings, if any, and any projections made from the data. At minimum, the NI must know the size of the population and the urban/rural population divide in each region in order to specify how to distribute population and PSU's in the first stage of sampling. National investigators should obtain this written data before they attempt to stratify the sample.

Once this data is obtained, the sample population (either 1200 or 2400) should be stratified, first by area (region/province) and then by residential locality (urban or rural). In each case, the proportion of the sample in each locality in each region should be the same as its proportion in the national population as indicated by the updated census figures.

Having stratified the sample, it is then possible to determine how many PSU's should be selected for the country as a whole, for each region, and for each urban or rural locality.

The total number of PSU's to be selected for the whole country is determined by calculating the maximum degree of clustering of interviews one can accept in any PSU. Because PSUs (which are usually geographically small EAs) tend to be socially homogenous we do not want to select too many people in any one place. Thus, the Afrobarometer has established a standard of no more than 8 interviews per PSU. For a sample size of 1200, the sample must therefore contain 150 PSUs/EAs (1200 divided by 8). For a sample size of 2400, there must be 300 PSUs/EAs.

These PSUs should then be allocated proportionally to the urban and rural localities within each regional stratum of the sample. Let's take a couple of examples from a country with a sample size of 1200. If the urban locality of Region X in this country constitutes 10 percent of the current national population, then the sample for this stratum should be 15 PSUs (calculated as 10 percent of 150 PSUs). If the rural population of Region Y constitutes 4 percent of the current national population, then the sample for this stratum should be 6 PSU's.

The next step is to select particular PSUs/EAs using random methods. Using the above example of the rural localities in Region Y, let us say that you need to pick 6 sample EAs out of a census list that contains a total of 240 rural EAs in Region Y. But which 6? If the EAs created by the national census bureau are of equal or roughly equal population size, then selection is relatively straightforward. Just number all EAs consecutively, then make six selections using a table of random numbers. This procedure, known as simple random sampling (SRS), will
w
Dataset of health expenditure and suicide mortality rate of countries per...
workwithdata.com
Updated Apr 9, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Work With Data (2025). Dataset of health expenditure and suicide mortality rate of countries per year in South Africa (Historical) [Dataset]. https://www.workwithdata.com/datasets/countries-yearly?col=country%2Cdate%2Chealth_expenditure_pct_gdp%2Csuicide_rate&f=1&fcol0=country&fop0=%3D&fval0=South+Africa
Explore at:
Dataset updated
Apr 9, 2025
Dataset authored and provided by
Work With Data
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
South Africa
Description
This dataset is about countries per year in South Africa. It has 64 rows. It features 4 columns: country, health expenditure, and suicide mortality rate.
z
Counts of Dengue without warning signs reported in SAUDI ARABIA: 1994-1994
zenodo.org
json, xml, zip
Updated Jun 3, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Willem Van Panhuis; Willem Van Panhuis; Anne Cross; Anne Cross; Donald Burke; Donald Burke (2024). Counts of Dengue without warning signs reported in SAUDI ARABIA: 1994-1994 [Dataset]. http://doi.org/10.25337/t7/ptycho.v2.0/sa.722862003
Explore at:
zip, xml, jsonAvailable download formats
Unique identifier
https://doi.org/10.25337/t7/ptycho.v2.0/sa.722862003
Dataset updated
Jun 3, 2024
Dataset provided by
Project Tycho
Authors
Willem Van Panhuis; Willem Van Panhuis; Anne Cross; Anne Cross; Donald Burke; Donald Burke
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 1994 - Dec 31, 1994
Area covered
Saudi Arabia
Description
Project Tycho datasets contain case counts for reported disease conditions for countries around the world. The Project Tycho data curation team extracts these case counts from various reputable sources, typically from national or international health authorities, such as the US Centers for Disease Control or the World Health Organization. These original data sources include both open- and restricted-access sources. For restricted-access sources, the Project Tycho team has obtained permission for redistribution from data contributors. All datasets contain case count data that are identical to counts published in the original source and no counts have been modified in any way by the Project Tycho team. The Project Tycho team has pre-processed datasets by adding new variables, such as standard disease and location identifiers, that improve data interpretabilty. We also formatted the data into a standard data format.
Each Project Tycho dataset contains case counts for a specific condition (e.g. measles) and for a specific country (e.g. The United States). Case counts are reported per time interval. In addition to case counts, datsets include information about these counts (attributes), such as the location, age group, subpopulation, diagnostic certainty, place of aquisition, and the source from which we extracted case counts. One dataset can include many series of case count time intervals, such as "US measles cases as reported by CDC", or "US measles cases reported by WHO", or "US measles cases that originated abroad", etc.
Depending on the intended use of a dataset, we recommend a few data processing steps before analysis:
Analyze missing data: Project Tycho datasets do not inlcude time intervals for which no case count was reported (for many datasets, time series of case counts are incomplete, due to incompleteness of source documents) and users will need to add time intervals for which no count value is available. Project Tycho datasets do include time intervals for which a case count value of zero was reported.
Separate cumulative from non-cumulative time interval series. Case count time series in Project Tycho datasets can be "cumulative" or "fixed-intervals". Cumulative case count time series consist of overlapping case count intervals starting on the same date, but ending on different dates. For example, each interval in a cumulative count time series can start on January 1st, but end on January 7th, 14th, 21st, etc. It is common practice among public health agencies to report cases for cumulative time intervals. Case count series with fixed time intervals consist of mutually exxclusive time intervals that all start and end on different dates and all have identical length (day, week, month, year). Given the different nature of these two types of case count data, we indicated this with an attribute for each count value, named "PartOfCumulativeCountSeries".
Data from: dopanim: A Dataset of Doppelganger Animals with Noisy Annotations...
zenodo.org
json, zip
Updated Nov 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Marek Herde; Marek Herde; Denis Huseljic; Denis Huseljic; Lukas Rauch; Lukas Rauch; Bernhard Sick; Bernhard Sick (2024). dopanim: A Dataset of Doppelganger Animals with Noisy Annotations from Multiple Humans [Dataset]. http://doi.org/10.5281/zenodo.14016659
Explore at:
json, zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.14016659
Dataset updated
Nov 6, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Marek Herde; Marek Herde; Denis Huseljic; Denis Huseljic; Lukas Rauch; Lukas Rauch; Bernhard Sick; Bernhard Sick
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
Profile

The dopanim dataset features about 15,750 animal images of 15 classes, organized into four groups of doppelganger animals and collected together with ground truth labels from iNaturalist. For approximately 10,500 of these images, 20 humans provided over 52,000 annotations with an accuracy of circa 67%.

Key attributes include the challenging task of classifying doppelganger animals, human-estimated likelihoods per image-annotator pair, and annotator metadata.

The dataset's broad research scope covers noisy label learning, multi-annotator learning, active learning, and learning beyond hard labels.

Further information is given in the associated article and our GitHub repository for using the data.

File Descriptions

task_data.json contains data, e.g., the ground truth class labels, for each image classification task. Thereby, each task record is indexed by the iNaturalist observation index. A description of each record's entries is given in the supplementary material of the associated article.

annotation_data.json contains data, e.g., likelihoods per animal class, for each obtained image annotation. Thereby, each annotation record has a unique identifier. A description of each record's entries is given in the supplementary material of the associated article.

annotator_metadata.json contains metadata, e.g., self-assessed levels of knowledge and interest regarding animals, for each annotator. Thereby, each metadata record is indexed by the anonymous identifier of an annotator. A description of each record's entries is given in the supplementary material of the associated article.

train.zip, valid.zip, and test.zip contain the training, validation, and test images organized into directories of the 15 animal classes.

Licenses

Images and their associated metadata are collected as observations from iNaturalist. Thereby, we constrained the collection to images and metadata with CC0, CC-BY, CC-BY-SA, CC-BY-NC, or CC-BY-NC-SA licenses. The information about these licenses is given by the fields license_code and photo_license_code in each record of task_data.json. The links to each image and observation are given for further reference.

We collected the data in the files annotation_data.json and annotator_metadata.json in an annotation campaign via LabelStudio and distribute them under the license CC-BY-NC 4.0.

Contact

If you have questions or issues relevant to other dataset users, we ask you to create a corresponding issue at our GitHub repository.

In all other cases, you can contact the dataset collectors via the e-mail marek.herde@uni-kassel.de.

Acknowledgements

This work was funded by the ALDeep and CIL projects at the University of Kassel. Moreover, we thank Franz Götz-Hahn for his insightful comments on improving our annotation campaign. Finally, we thank the iNaturalist community for their many observations that help explore our nature's biodiversity and our annotators for their dedicated efforts in making the annotation campaign via LabelStudio possible.

Disclaimer

We carefully selected and composed this dataset's content. If you believe that any of this content violates licensing agreements or infringes on intellectual property rights, please contact us immediately (cf. contact information). In such a case, we will promptly investigate the issue and remove the implicated data records from our dataset if necessary.

Users are responsible for ensuring that their use of the dataset complies with all licenses, applicable laws, regulations, and ethical guidelines. We make no representations or warranties of any kind and accept no responsibility in the case of violations.
o
Most Fatal Cancers in South Africa - Dataset - openAFRICA
open.africa
Updated Oct 22, 2015
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2015). Most Fatal Cancers in South Africa - Dataset - openAFRICA [Dataset]. https://open.africa/dataset/most-fatal-cancers-in-south-africa
Explore at:
Dataset updated
Oct 22, 2015
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
South Africa
Description
Two datasets that explore causes of death due to cancer in South Africa, drawing on data from the Revised Burden of Disease estimates for the Comparative Risk Factor Assessment for South Africa, 2000. The number and percentage of deaths due to cancer by cause are ranked for persons, males and females in the tables below. Lung cancer is the leading cause of cancer in SA accounting for 17% of all cancer deaths. This is followed by oesophagus Ca which accounts for 13%, cervix cancer accounting for 8%, breast cancer accounting for 8% and liver cancer which accounts for 6% of all cancers. Many more males suffer from lung and oesophagus cancer than females.
o
Statistics South Africa - Causes of Death - Dataset - openAFRICA
open.africa
Updated Nov 4, 2015
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2015). Statistics South Africa - Causes of Death - Dataset - openAFRICA [Dataset]. https://open.africa/dataset/statistics-south-africa-causes-of-death
Explore at:
Dataset updated
Nov 4, 2015
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
South Africa
Description
Mortality and causes of death from death notification
Marriages and Divorces 2021 - South Africa
datafirst.uct.ac.za
Updated Mar 4, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statistics South Africa (2023). Marriages and Divorces 2021 - South Africa [Dataset]. https://www.datafirst.uct.ac.za/dataportal/index.php/catalog/933
Explore at:
Dataset updated
Mar 4, 2023
Dataset authored and provided by
Statistics South Africahttp://www.statssa.gov.za/
Time period covered
2021
Area covered
South Africa
Description
Abstract

The Marriages and Divorces (MD) dataset is one of three primary sources of of marriage and divorce statistics in South Africa. Unlike the other two sources (population censuses and household sample surveys), the MD dataset is compiled from administrative data and based on continuous recording (i.e. from civil registration systems and administrative records). Statistics South Africa (Stats SA) regularly publishes a series of data on marriages and divorces, with the first dataset in the series begining in 2006. The most recent dataset in the series is MD 2020.

Marriage data: Data on marriages for citizens and permanent residents are obtained from registered marriage records that are collected through the civil registration systems of the Department of Home Affairs (DHA). South Africa recognises three types of marriages by law: civil marriages, customary marriages and civil unions. Before 2008, marriage data only covered civil marriages. The registration of customary marriages and civil unions began in 2003 and 2007 respectively. However from 2008 onwards, Stats SA began publishing available data on customary marriages and civil unions.

Divorce data: Data on divorces are obtained from various regional courts that deal with divorce matters. The data are based on successful divorce cases that have been issued with a decree of divorce by the Department of Justice and Constitutional Development (DoJCD). Divorce cases come from marriages that were registered in different years as well as divorce cases that were filed in different years but whose divorce decrees were granted in the relevant year of collection.

NOTE: although both the data on marriages and divorces are collected in the same year, the data sets are not linked to each other.

Geographic coverage

The data has national coverage.

Analysis unit

Individuals

Universe

The data covers all civil marriages that were recoreded by the Department of Home Affairs and all divorce applications that were granted by the Department of Justice and Constitutional Development in 2021 in South Africa.

Kind of data

Administrative records

Mode of data collection

Other

Data appraisal

Geography is problematic in this dataset as not all the data files have geographic data. The Civil Marriages and Civil Unions data files include a Province of Registration variable but the Customary Marriages data file does not. There is also no geographical data in the Divorces file. As this data file includes divorce data from only a subset of divorce courts, this lack of geographical information compromises its usability.
R
Bccd Dataset
universe.roboflow.com
zip
Updated Aug 2, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Joseph Nelson (2022). Bccd Dataset [Dataset]. https://universe.roboflow.com/joseph-nelson/bccd/dataset/1
Explore at:
zipAvailable download formats
Dataset updated
Aug 2, 2022
Dataset authored and provided by
Joseph Nelson
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Variables measured
Cells
Description
Overview

This is a dataset of blood cells photos, originally open sourced by cosmicad and akshaylambda.

There are 364 images across three classes: WBC (white blood cells), RBC (red blood cells), and Platelets. There are 4888 labels across 3 classes (and 0 null examples).

Here's a class count from Roboflow's Dataset Health Check:

https://i.imgur.com/BVopW9p.png" alt="BCCD health">

And here's an example image:

https://i.imgur.com/QwyX2aD.png" alt="Blood Cell Example">

Fork this dataset (upper right hand corner) to receive the raw images, or (to save space) grab the 500x500 export.

Use Cases

This is a small scale object detection dataset, commonly used to assess model performance. It's a first example of medical imaging capabilities.

Using this Dataset

We're releasing the data as public domain. Feel free to use it for any purpose.

It's not required to provide attribution, but it'd be nice! :)

About Roboflow

Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.

Developers reduce 50% of their boilerplate code when using Roboflow's workflow, automate annotation quality assurance, save training time, and increase model reproducibility.
Marriages and Divorces 2023 - South Africa
datafirst.uct.ac.za
Updated Mar 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statistics South Africa (2025). Marriages and Divorces 2023 - South Africa [Dataset]. https://www.datafirst.uct.ac.za/dataportal/index.php/catalog/1023
Explore at:
Dataset updated
Mar 27, 2025
Dataset authored and provided by
Statistics South Africahttp://www.statssa.gov.za/
Time period covered
2023
Area covered
South Africa
Description
Abstract

The Marriages and Divorces (MD) dataset is one of three primary sources of of marriage and divorce statistics in South Africa. Unlike the other two sources (population censuses and household sample surveys), the MD dataset is compiled from administrative data and based on continuous recording (i.e. from civil registration systems and administrative records). Statistics South Africa (Stats SA) regularly publishes a series of data on marriages and divorces, with the first dataset in the series begining in 2006. The most recent dataset in the series is MD 2023.

Marriage data: Data on marriages for citizens and permanent residents are obtained from registered marriage records that are collected through the civil registration systems of the Department of Home Affairs (DHA). South Africa recognises three types of marriages by law: civil marriages, customary marriages and civil unions. Before 2008, marriage data only covered civil marriages. The registration of customary marriages and civil unions began in 2003 and 2007 respectively. However from 2008 onwards, Stats SA began publishing available data on customary marriages and civil unions.

Divorce data: Data on divorces are obtained from various regional courts that deal with divorce matters. The data are based on successful divorce cases that have been issued with a decree of divorce by the Department of Justice and Constitutional Development (DoJCD). Divorce cases come from marriages that were registered in different years as well as divorce cases that were filed in different years but whose divorce decrees were granted in the relevant year of collection.

NOTE: although both the data on marriages and divorces are collected in the same year, the data sets are not linked to each other.

Geographic coverage

The data has national coverage.

Analysis unit

Individuals

Universe

The data covers all civil marriages, civil unions and customary marriages that were recorded by the Department of Home Affairs and all divorce applications that were granted by the Department of Justice and Constitutional Development in 2023 in South Africa.

Kind of data

Administrative records

Mode of data collection

Other
Mortality and Causes of Death 2012 - South Africa
microdata.worldbank.org
catalog.ihsn.org
Updated Feb 8, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statistics South Africa (2021). Mortality and Causes of Death 2012 - South Africa [Dataset]. https://microdata.worldbank.org/index.php/catalog/3835
Explore at:
Dataset updated
Feb 8, 2021
Dataset provided by
Statistics South Africahttp://www.statssa.gov.za/
Department of Home Affairs
Time period covered
2012
Area covered
South Africa
Description
Abstract

This dataset contains statistics on deaths in South Africa in 2012. The registration of deaths in South Africa is regulated by the Births and Deaths Registration Act, 51 of 1992. The South African Department of Home Affairs (DHA) is responsible for the registration of deaths in South Africa. The data is collected with two instruments: The death register and the medical certificate in respect of death. The staff of the DHA Registrar of Deaths section fills in the former while the medical practitioner attending to the death completes the latter. Causes of death are coded by the Department of Home Affairs according to the tenth revision of the International Classification of Diseases (ICD-10) ICD-10, as required by the World Health Organization for their member countries. The data is used by the Department of Home Affairs to update the Population Register. The forms are sent to Statistics South Africa (Stats SA) for their use for statistical purposes. From the two forms sent to Stats SA, the following data items of the deceased are extracted: place of residence, place of death, date of death, month and year of registration, sex, marital status, occupation, underlying cause of death, whether or not the death was certified by a medical practitioner, and whether or not the deceased died in a health institution or nursing home. From 1991 death notifications do not require data on population group, and therefore this dataset includes death data for all population groups. This dataset excludes 2012 deaths that were not registered, and late registrations which would not have been available to Stats SA in time for the production of the dataset.

Geographic coverage

National coverage

Analysis unit

Individuals

Universe

The data covers all deaths that occurred in 2012 and registered at the Department of Home Affairs in South Africa.

Kind of data

Administrative records data [adm]

Mode of data collection

Other [oth]

Research instrument

The data is collected with two instruments: the death register and the medical certificate in respect of death.
WikiReaD (Wikipedia Readability Dataset)
zenodo.org
bz2
Updated May 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mykola Trokhymovych; Indira Sen; Martin Gerlach; Mykola Trokhymovych; Indira Sen; Martin Gerlach (2025). WikiReaD (Wikipedia Readability Dataset) [Dataset]. http://doi.org/10.5281/zenodo.11371932
Explore at:
bz2Available download formats
Unique identifier
https://doi.org/10.5281/zenodo.11371932
Dataset updated
May 22, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Mykola Trokhymovych; Indira Sen; Martin Gerlach; Mykola Trokhymovych; Indira Sen; Martin Gerlach
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
Dataset Description:

The dataset contains pairs of encyclopedic articles in 14 languages. Each pair includes the same article in two levels of readability (easy/hard). The pairs are obtained by matching Wikipedia articles (hard) with the corresponding versions from different simplified or children's encyclopedias (easy).

Dataset Details:

Number of Languages: 14

Number of files: 19

Use Case: Training and evaluating readability scoring models for articles within and outside Wikipedia.

Processing details: Text pairs are created by matching articles from Wikipedia with the corresponding article in the simplified/children encyclopedia either via the Wikidata item ID or their page titles. The text of each article is extracted directly from their parsed HTML version.

Files: The dataset consists of independent files for each type of children/simplified encyclopedia and each language (e.g., `

Attribution:

The dataset was compiled from the following sources. The text of the original articles comes from the corresponding language version of Wikipedia. The text of the simplified articles comes from one of the following encyclopedias: Simple English Wikipedia, Vikidia, Klexikon, Txikipedia, or Wikikids.

Below we provide information about the license of the original content as well as the template to generate the link to the original source for a given page (

Wikipedia

Source: https://

License: CC BY-SA 4.0, GFDL

Simple English Wikipedia

Source: https://simple.wikipedia.org/wiki/

License: CC BY-SA 4.0, GFDL

Vikidia

Source: https://

License: CC BY-SA 3.0, GFDL

Klexikon

Source: https://klexikon.zum.de/wiki/

License: CC BY-SA 4.0

Txikipedia

Source: https://eu.wikipedia.org/wiki/Txikipedia:

License: CC BY-SA 4.0, GFDL

Wikikids

Source: https://wikikids.nl/

License: CC BY-SA 3.0

Related paper citation:

@inproceedings{trokhymovych-etal-2024-open, title = "An Open Multilingual System for Scoring Readability of {W}ikipedia", author = "Trokhymovych, Mykola and Sen, Indira and Gerlach, Martin", editor = "Ku, Lun-Wei and Martins, Andre and Srikumar, Vivek", booktitle = "Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)", month = aug, year = "2024", address = "Bangkok, Thailand", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2024.acl-long.342/", doi = "10.18653/v1/2024.acl-long.342", pages = "6296--6311" }
t
INDIGO Change Detection Reference Dataset
researchdata.tuwien.at
jpeg, png, zip
Updated Jun 25, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Benjamin Wild; Benjamin Wild; Geert Verhoeven; Geert Verhoeven; Rafał Muszyński; Rafał Muszyński; Norbert Pfeifer; Norbert Pfeifer (2024). INDIGO Change Detection Reference Dataset [Dataset]. http://doi.org/10.48436/ayj4e-v4864
Explore at:
jpeg, zip, pngAvailable download formats
Unique identifier
https://doi.org/10.48436/ayj4e-v4864
Dataset updated
Jun 25, 2024
Dataset provided by
TU Wien
Authors
Benjamin Wild; Benjamin Wild; Geert Verhoeven; Geert Verhoeven; Rafał Muszyński; Rafał Muszyński; Norbert Pfeifer; Norbert Pfeifer
Description
The INDIGO Change Detection Reference Dataset
Description
This graffiti-centred change detection dataset was developed in the context of INDIGO, a research project focusing on the documentation, analysis and dissemination of graffiti along Vienna's Donaukanal. The dataset aims to support the development and assessment of change detection algorithms.
The dataset was collected from a test site approximately 50 meters in length along Vienna's Donaukanal during 11 days between 2022/10/21 and 2022/12/01. Various cameras with different settings were used, resulting in a total of 29 data collection sessions or "epochs" (see "EpochIDs.jpg" for details). Each epoch contains 17 images generated from 29 distinct 3D models with different textures. In total, the dataset comprises 6,902 unique image pairs, along with corresponding reference change maps. Additionally, exclusion masks are provided to ignore parts of the scene that might be irrelevant, such as the background.
To summarise, the dataset, labelled as "Data.zip," includes the following:
Synthetic Images: These are colour images created within Agisoft Metashape Professional 1.8.4, generated by rendering views from 17 artificial cameras observing 29 differently textured versions of the same 3D surface model.
Change Maps: Binary images that were manually and programmatically generated, using a Python script, from two synthetic graffiti images. These maps highlight the areas where changes have occurred.
Exclusion Masks: Binary images are manually created from synthetic graffiti images to identify "no data" areas or irrelevant ground pixels.
Image Acquisition
Image acquisition involved the use of two different camera setups. The first two datasets (ID 1 and 2; cf. "EpochIDs.jpg") were obtained using a Nikon Z 7II camera with a pixel count of 45.4 MP, paired with a Nikon NIKKOR Z 20 mm lens. For the remaining image datasets (ID 3-29), a triple GoPro setup was employed. This triple setup featured three GoPro cameras, comprising two GoPro HERO 10 cameras and one GoPro HERO 11, all securely mounted within a frame. This triple-camera setup was utilised on nine different days with varying camera settings, resulting in the acquisition of 27 image datasets in total (nine days with three datasets each).
Data Structure
The "Data.zip" file contains two subfolders:
1_ImagesAndChangeMaps: This folder contains the primary dataset. Each subfolder corresponds to a specific epoch. Within each epoch folder resides a subfolder for every other epoch with which a distinct epoch pair can be created. It is important to note that the pairs "Epoch Y and Epoch Z" are equivalent to "Epoch Z and Epoch Y", so the latter combinations are not included in this dataset. Each sub-subfolder, organised by epoch, contains 17 more subfolders, which hold the image data. These subfolders consist of:
Two synthetic images rendered from the same synthetic camera ("X_Y.jpg" and "X_Z.jpg")
The corresponding binary reference change map depicting the graffiti-related differences between the two images ("X_YZ.png"). Black areas denote new graffiti (i.e. "change"), and white denotes "no change". "DataStructure.png" provides a visual explanation concerning the creation of the dataset.

The filenames follow the following pattern:
X - Is the ID number of the synthetic camera. In total, 17 synthetic cameras were placed along the test site
Y - Corresponds to the reference epoch (i.e. the "older epoch")
Z - Corresponds to the "new epoch"

2_ExclusionMasks: This folder contains the binary exclusion masks. They were manually created from synthetic graffiti images and identify "no data" areas or areas considered irrelevant, such as "ground pixels". Two exclusion masks were generated for each of the 17 synthetic cameras:
"groundMasks": depict ground pixels which are usually irrelevant for the detection of graffiti
"noDataMasks": depict "background" for which no data is available.

A detailed dataset description (including detailed explanations of the data creation) is part of a journal paper currently in preparation. The paper will be linked here for further clarification as soon as it is available.
Licensing
Due to the nature of the three image types, this dataset comes with two licenses:
Synthetic images:
These come with an In Copyright license (for the rights usage terms, see https://rightsstatements.org/page/InC/1.0/?language=en).
The copyright lies with:
the Ludwig Boltzmann Gesellschaft (https://d-nb.info/gnd/1024204324)
the TU Wien (https://d-nb.info/gnd/55426-1)
One or more anonymous graffiti creator(s) upon whose work these images are based.
The first two entities are also the licensor of these images.

Change maps and masks:
These are openly licensed via CC BY-SA 4.0 (https://creativecommons.org/licenses/by-sa/4.0)
In this case, the copyright lies with:
the Ludwig Boltzmann Gesellschaft (https://d-nb.info/gnd/1024204324)
the TU Wien (https://d-nb.info/gnd/55426-1)
Both institutes are also the licensor of these images.
Every synthetic image, change map and mask has this licensing information embedded as IPTC photo metadata. In addition, the images' IPTC metadata also provide a short image description, the image creator and the creator's identity (in the form of an ORCiD).
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
If there are any questions, problems or suggestions for the dataset or the description, please do not hesitate to contact the corresponding author, Benjamin Wild.
g
CARMA, South Africa Power Plant Emissions, South Africa, 2000/ 2007/Future
geocommons.com
Updated May 5, 2008
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data (2008). CARMA, South Africa Power Plant Emissions, South Africa, 2000/ 2007/Future [Dataset]. http://geocommons.com/search.html
Explore at:
Dataset updated
May 5, 2008
Dataset provided by
data
CARMA
Description
All the data for this dataset is provided from CARMA: Data from CARMA (www.carma.org) This dataset provides information about Power Plant emissions in South Africa. Power Plant emissions from all power plants in South Africa were obtained by CARMA for the past (2000 Annual Report), the present (2007 data), and the future. CARMA determine data presented for the future to reflect planned plant construction, expansion, and retirement. The dataset provides the name, company, parent company, city, state, lat/lon, and plant id for each individual power plant. Only Power Plants that had a listed longitude and latitude in CARMA's database were mapped. The dataset reports for the three time periods: Intensity: Pounds of CO2 emitted per megawatt-hour of electricity produced. Energy: Annual megawatt-hours of electricity produced. Carbon: Annual carbon dioxide (CO2) emissions. The units are short or U.S. tons. Multiply by 0.907 to get metric tons. Carbon Monitoring for Action (CARMA) is a massive database containing information on the carbon emissions of over 50,000 power plants and 4,000 power companies worldwide. Power generation accounts for 40% of all carbon emissions in the United States and about one-quarter of global emissions. CARMA is the first global inventory of a major, sector of the economy. The objective of CARMA.org is to equip individuals with the information they need to forge a cleaner, low-carbon future. By providing complete information for both clean and dirty power producers, CARMA hopes to influence the opinions and decisions of consumers, investors, shareholders, managers, workers, activists, and policymakers. CARMA builds on experience with public information disclosure techniques that have proven successful in reducing traditional pollutants. Please see carma.org for more information http://carma.org/region/detail/174

Facebook

Twitter

Click to copy link

Link copied

Cite

Statistics South Africa (2021). Mortality and Causes of Death 1997-2017 - South Africa [Dataset]. https://catalog.ihsn.org/catalog/study/ZAF_1997-2017_MCD_v01_M

Mortality and Causes of Death 1997-2017 - South Africa

Explore at:

Dataset updated

Jan 19, 2021

Dataset provided by

Statistics South Africahttp://www.statssa.gov.za/
Department of Home Affairs

Time period covered

1997 - 2017

Area covered

South Africa

Description

Abstract

This cumulative dataset contains statistics on mortality and causes of death in South Africa covering the period 1997-2017. The mortality and causes of death dataset is part of a regular series published by Stats SA, based on data collected through the civil registration system. This dataset is the most recent cumulative round in the series which began with the separately available dataset Recorded Deaths 1996.

The main objective of this dataset is to outline emerging trends and differentials in mortality by selected socio-demographic and geographic characteristics for deaths that occurred in the registered year and over time. Reliable mortality statistics, are the cornerstone of national health information systems, and are necessary for population health assessment, health policy and service planning; and programme evaluation. They are essential for studying the occurrence and distribution of health-related events, their determinants and management of related health problems. These data are particularly critical for monitoring the Sustainable Development Goals (SDGs) and Agenda 2063 which share the same goal for a high standard of living and quality of life, sound health and well-being for all and at all ages. Mortality statistics are also required for assessing the impact of non-communicable diseases (NCD's), emerging infectious diseases, injuries and natural disasters.

Geographic coverage

National coverage

Analysis unit

Individuals

Universe

This dataset is based on information on mortality and causes of death from the South African civil registration system. It covers all death notification forms from the Department of Home Affairs for deaths that occurred in 1997-2017, that reached Stats SA during the 2018/2019 processing phase.

Kind of data

Administrative records data [adm]

Mode of data collection

Other [oth]

Research instrument

The registration of deaths is captured using two instruments: form BI-1663 and form DHA-1663 (Notification/Register of death/stillbirth).

Data appraisal

This cumulative dataset is part of a regular series published by Stats SA and includes all previous rounds in the series (excluding Recorded Deaths 1996). Stats SA only includes one variable to classify the occupation group of the deceased (OccupationGrp) in the current round (1997-2017). Prior to 2016, Stats SA included both occupation group (OccupationGrp) and industry classification (Industry) in all previous rounds. Therefore, DataFirst has made the 1997-2015 cumulative round available as a separately downloadable dataset which includes both occupation group and industry classification of the deceased spanning the years 1997-2015.

Clear search

Close search

Google apps

Main menu

Mortality and Causes of Death 1997-2017 - South Africa

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Mode of data collection

Research instrument

Data appraisal

Crime statistics - Dataset - data.sa.gov.au

Effect of suicide rates on life expectancy dataset

Annual Report 2021-22 - Dataset - data.sa.gov.au

MultiCaRe: An open-source clinical case dataset for medical image...

Counts of Dengue reported in SAUDI ARABIA: 1994-1994

Injured Bicyclists Under 16 Years of Age: Case Data - Dataset -...

Afrobarometer Survey 1 1999-2000, Merged 7 Country - Botswana, Lesotho,...

Abstract

Geographic coverage

Analysis unit

Kind of data

Sampling procedure

Dataset of health expenditure and suicide mortality rate of countries per...

Counts of Dengue without warning signs reported in SAUDI ARABIA: 1994-1994

Data from: dopanim: A Dataset of Doppelganger Animals with Noisy Annotations...

Profile

File Descriptions

Licenses

Contact

Acknowledgements

Disclaimer

Most Fatal Cancers in South Africa - Dataset - openAFRICA

Statistics South Africa - Causes of Death - Dataset - openAFRICA

Marriages and Divorces 2021 - South Africa

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Mode of data collection

Data appraisal

Bccd Dataset

Overview

Use Cases

Using this Dataset

About Roboflow

Marriages and Divorces 2023 - South Africa

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Mode of data collection

Mortality and Causes of Death 2012 - South Africa

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Mode of data collection

Research instrument

WikiReaD (Wikipedia Readability Dataset)

INDIGO Change Detection Reference Dataset

The INDIGO Change Detection Reference Dataset

Description

Image Acquisition

Data Structure

Licensing

CARMA, South Africa Power Plant Emissions, South Africa, 2000/ 2007/Future

Mortality and Causes of Death 1997-2017 - South AfricaSee More Versions

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Mode of data collection

Research instrument

Data appraisal

Mortality and Causes of Death 1997-2017 - South Africa