The 1997 Jordan Population and Family Health Survey (JPFHS) is a national sample survey carried out by the Department of Statistics (DOS) as part of its National Household Surveys Program (NHSP). The JPFHS was specifically aimed at providing information on fertility, family planning, and infant and child mortality. Information was also gathered on breastfeeding, on maternal and child health care and nutritional status, and on the characteristics of households and household members. The survey will provide policymakers and planners with important information for use in formulating informed programs and policies on reproductive behavior and health.
National
Sample survey data
SAMPLE DESIGN AND IMPLEMENTATION
The 1997 JPFHS sample was designed to produce reliable estimates of major survey variables for the country as a whole, for urban and rural areas, for the three regions (each composed of a group of governorates), and for the three major governorates, Amman, Irbid, and Zarqa.
The 1997 JPFHS sample is a subsample of the master sample that was designed using the frame obtained from the 1994 Population and Housing Census. A two-stage sampling procedure was employed. First, primary sampling units (PSUs) were selected with probability proportional to the number of housing units in the PSU. A total of 300 PSUs were selected at this stage. In the second stage, in each selected PSU, occupied housing units were selected with probability inversely proportional to the number of housing units in the PSU. This design maintains a self-weighted sampling fraction within each governorate.
UPDATING OF SAMPLING FRAME
Prior to the main fieldwork, mapping operations were carried out and the sample units/blocks were selected and then identified and located in the field. The selected blocks were delineated and the outer boundaries were demarcated with special signs. During this process, the numbers on buildings and housing units were updated, listed and documented, along with the name of the owner/tenant of the unit or household and the name of the household head. These activities took place between January 7 and February 28, 1997.
Note: See detailed description of sample design in APPENDIX A of the survey report.
Face-to-face
The 1997 JPFHS used two questionnaires, one for the household interview and the other for eligible women. Both questionnaires were developed in English and then translated into Arabic. The household questionnaire was used to list all members of the sampled households, including usual residents as well as visitors. For each member of the household, basic demographic and social characteristics were recorded and women eligible for the individual interview were identified. The individual questionnaire was developed utilizing the experience gained from previous surveys, in particular the 1983 and 1990 Jordan Fertility and Family Health Surveys (JFFHS).
The 1997 JPFHS individual questionnaire consists of 10 sections: - Respondent’s background - Marriage - Reproduction (birth history) - Contraception - Pregnancy, breastfeeding, health and immunization - Fertility preferences - Husband’s background, woman’s work and residence - Knowledge of AIDS - Maternal mortality - Height and weight of children and mothers.
Fieldwork and data processing activities overlapped. After a week of data collection, and after field editing of questionnaires for completeness and consistency, the questionnaires for each cluster were packaged together and sent to the central office in Amman where they were registered and stored. Special teams were formed to carry out office editing and coding.
Data entry started after a week of office data processing. The process of data entry, editing, and cleaning was done by means of the ISSA (Integrated System for Survey Analysis) program DHS has developed especially for such surveys. The ISSA program allows data to be edited while being entered. Data entry was completed on November 14, 1997. A data processing specialist from Macro made a trip to Jordan in November and December 1997 to identify problems in data entry, editing, and cleaning, and to work on tabulations for both the preliminary and final report.
A total of 7,924 occupied housing units were selected for the survey; from among those, 7,592 households were found. Of the occupied households, 7,335 (97 percent) were successfully interviewed. In those households, 5,765 eligible women were identified, and complete interviews were obtained with 5,548 of them (96 percent of all eligible women). Thus, the overall response rate of the 1997 JPFHS was 93 percent. The principal reason for nonresponse among the women was the failure of interviewers to find them at home despite repeated callbacks.
Note: See summarized response rates by place of residence in Table 1.1 of the survey report.
The estimates from a sample survey are subject to two types of errors: nonsampling errors and sampling errors. Nonsampling errors are the result of mistakes made in implementing data collection and data processing (such as failure to locate and interview the correct household, misunderstanding questions either by the interviewer or the respondent, and data entry errors). Although during the implementation of the 1997 JPFHS numerous efforts were made to minimize this type of error, nonsampling errors are not only impossible to avoid but also difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The respondents selected in the 1997 JPFHS constitute only one of many samples that could have been selected from the same population, given the same design and expected size. Each of those samples would have yielded results differing somewhat from the results of the sample actually selected. Sampling errors are a measure of the variability among all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, since the 1997 JDHS-II sample resulted from a multistage stratified design, formulae of higher complexity had to be used. The computer software used to calculate sampling errors for the 1997 JDHS-II was the ISSA Sampling Error Module, which uses the Taylor linearization method of variance estimation for survey estimates that are means or proportions. The Jackknife repeated replication method is used for variance estimation of more complex statistics, such as fertility and mortality rates.
Note: See detailed estimate of sampling error calculation in APPENDIX B of the survey report.
Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Completeness of reporting - Births by calendar years - Reporting of age at death in days - Reporting of age at death in months
Note: See detailed tables in APPENDIX C of the survey report.
The 2005 Republic of Palau Census of Population and Housing will be used to give a snapshot of Republic of Palau's population and housing at the mid-point of the decade. This Census is also important because it measures the population at the beginning of the implementation of the Compact of Free Association. The information collected in the census is needed to plan for the needs of the population. The government uses the census figures to allocate funds for public services in a wide variety of areas, such as education, housing, and job training. The figures also are used by private businesses, academic institutions, local organizations, and the public in general to understand who we are and what our situation is, in order to prepare better for our future needs.
The fundamental purpose of a census is to provide information on the size, distribution and characteristics of a country's population. The census data are used for policymaking, planning and administration, as well as in management and evaluation of programmes in education, labour force, family planning, housing, health, transportation and rural development. A basic administrative use is in the demarcation of constituencies and allocation of representation to governing bodies. The census is also an invaluable resource for research, providing data for scientific analysis of the composition and distribution of the population and for statistical models to forecast its future growth. The census provides business and industry with the basic data they need to appraise the demand for housing, schools, furnishings, food, clothing, recreational facilities, medical supplies and other goods and services.
A hierarchical geographic presentation shows the geographic entities in a superior/subordinate structure in census products. This structure is derived from the legal, administrative, or areal relationships of the entities. The hierarchical structure is depicted in report tables by means of indentation. The following structure is used for the 2005 Census of the Republic of Palau:
Republic of Palau State Hamlet/Village Enumeration District Block
Individuals Families Households General Population
The Census covered all the households and respective residents in the entire country.
Census/enumeration data [cen]
Not applicable to a full enumeration census.
Face-to-face [f2f]
The 2005 Palau Census of Population and Housing comprises three parts: 1. Housing - one form for each household 2. Population - one for for each member of the household 3. People who have left home - one form for each household.
Full scale processing and editing activiities comprised eight separate sessions either with or separately but with remote guidance of the U.S. Census Bureau experts to finalize all datasets for publishing stage.
Processing operation was handled with care to produce a set of data that describes the population as clearly and accurately as possible. To meet this objective, questionnaires were reviewed and edited during field data collection operations by crew leaders for consistency, completeness, and acceptability. Questionnaires were also reviewed by census clerks in the census office for omissions, certain inconsistencies, and population coverage. For example, write-in entries such as "Don't know" or "NA" were considered unacceptable in certain quantities and/or in conjunction with other data omissions.
As a result of this review operation, a telephone or personal visit follow-up was made to obtain missing information. Potential coverage errors were included in the follow-up, as well as questionnaires with omissions or inconsistencies beyond the completeness and quality tolerances specified in the review procedures.
Subsequent to field operations, remaining incomplete or inconsistent information on the questionnaires was assigned using imputation procedures during the final automated edit of the collected data. Allocations, or computer assignments of acceptable data in place of unacceptable entries or blanks, were needed most often when an entry for a given item was lacking or when the information reported for a person or housing unit on that item was inconsistent with other information for that same person or housing unit. As in previous censuses, the general procedure for changing unacceptable entries was to assign an entry for a person or housing unit that was consistent with entries for persons or housing units with similar characteristics. The assignment of acceptable data in lace of blanks or unacceptable entries enhanced the usefulness of the data.
Another way to make corrections during the computer editing process is substitution. Substitution is the assignment of a full set of characteristics for a person or housing unit. Because of the detailed field operations, substitution was not needed for the 2005 Census.
Sampling Error is not applicable to full enumeration censuses.
In any large-scale statistical operation, such as the 2005 Census of the Republic of Palau, human- and machine-related errors were anticipated. These errors are commonly referred to as nonsampling errors. Such errors include not enumerating every household or every person in the population, not obtaining all required information form the respondents, obtaining incorrect or inconsistent information, and recording information incorrectly. In addition, errors can occur during the field review of the enumerators' work, during clerical handling of the census questionnaires, or during the electronic processing of the questionnaires.
To reduce various types of nonsampling errors, a number of techniques were implemented during the planning, data collection, and data processing activities. Quality assurance methods were used throughout the data collection and processing phases of the census to improve the quality of the data.
In 1800, the region of Germany was not a single, unified nation, but a collection of decentralized, independent states, bound together as part of the Holy Roman Empire. This empire was dissolved, however, in 1806, during the Revolutionary and Napoleonic eras in Europe, and the German Confederation was established in 1815. Napoleonic reforms led to the abolition of serfdom, extension of voting rights to property-owners, and an overall increase in living standards. The population grew throughout the remainder of the century, as improvements in sanitation and medicine (namely, mandatory vaccination policies) saw child mortality rates fall in later decades. As Germany industrialized and the economy grew, so too did the argument for nationhood; calls for pan-Germanism (the unification of all German-speaking lands) grew more popular among the lower classes in the mid-1800s, especially following the revolutions of 1948-49. In contrast, industrialization and poor harvests also saw high unemployment in rural regions, which led to waves of mass migration, particularly to the U.S.. In 1886, the Austro-Prussian War united northern Germany under a new Confederation, while the remaining German states (excluding Austria and Switzerland) joined following the Franco-Prussian War in 1871; this established the German Empire, under the Prussian leadership of Emperor Wilhelm I and Chancellor Otto von Bismarck. 1871 to 1945 - Unification to the Second World War The first decades of unification saw Germany rise to become one of Europe's strongest and most advanced nations, and challenge other world powers on an international scale, establishing colonies in Africa and the Pacific. These endeavors were cut short, however, when the Austro-Hungarian heir apparent was assassinated in Sarajevo; Germany promised a "blank check" of support for Austria's retaliation, who subsequently declared war on Serbia and set the First World War in motion. Viewed as the strongest of the Central Powers, Germany mobilized over 11 million men throughout the war, and its army fought in all theaters. As the war progressed, both the military and civilian populations grew increasingly weakened due to malnutrition, as Germany's resources became stretched. By the war's end in 1918, Germany suffered over 2 million civilian and military deaths due to conflict, and several hundred thousand more during the accompanying influenza pandemic. Mass displacement and the restructuring of Europe's borders through the Treaty of Versailles saw the population drop by several million more.
Reparations and economic mismanagement also financially crippled Germany and led to bitter indignation among many Germans in the interwar period; something that was exploited by Adolf Hitler on his rise to power. Reckless printing of money caused hyperinflation in 1923, when the currency became so worthless that basic items were priced at trillions of Marks; the introduction of the Rentenmark then stabilized the economy before the Great Depression of 1929 sent it back into dramatic decline. When Hitler became Chancellor of Germany in 1933, the Nazi government disregarded the Treaty of Versailles' restrictions and Germany rose once more to become an emerging superpower. Hitler's desire for territorial expansion into eastern Europe and the creation of an ethnically-homogenous German empire then led to the invasion of Poland in 1939, which is considered the beginning of the Second World War in Europe. Again, almost every aspect of German life contributed to the war effort, and more than 13 million men were mobilized. After six years of war, and over seven million German deaths, the Axis powers were defeated and Germany was divided into four zones administered by France, the Soviet Union, the UK, and the U.S.. Mass displacement, shifting borders, and the relocation of peoples based on ethnicity also greatly affected the population during this time. 1945 to 2020 - Partition and Reunification In the late 1940s, cold war tensions led to two distinct states emerging in Germany; the Soviet-controlled east became the communist German Democratic Republic (DDR), and the three western zones merged to form the democratic Federal Republic of Germany. Additionally, Berlin was split in a similar fashion, although its location deep inside DDR territory created series of problems and opportunities for the those on either side. Life quickly changed depending on which side of the border one lived. Within a decade, rapid economic recovery saw West Germany become western Europe's strongest economy and a key international player. In the east, living standards were much lower, although unemployment was almost non-existent; internationally, East Germany was the strongest economy in the Eastern Bloc (after the USSR), though it eventually fell behind the West by the 1970s. The restriction of movement between the two states also led to labor shortages in t...
The 2011 Population and Housing Census is the third national Census to be conducted in Namibia after independence. The first was conducted 1991 followed by the 2001 Census. Namibia is therefore one of the countries in sub-Saharan Africa that has participated in the 2010 Round of Censuses and followed the international best practice of conducting decennial Censuses, each of which attempts to count and enumerate every person and household in a country every ten years. Surveys, by contrast, collect data from samples of people and/or households.
Censuses provide reliable and critical data on the socio-economic and demographic status of any country. In Namibia, Census data has provided crucial information for development planning and programme implementation. Specifically, the information has assisted in setting benchmarks, formulating policy and the evaluation and monitoring of national development programmes including NDP4, Vision 2030 and several sector programmes. The information has also been used to update the national sampling frame which is used to select samples for household-based surveys, including labour force surveys, demographic and health surveys, household income and expenditure surveys. In addition, Census information will be used to guide the demarcation of Namibia's administrative boundaries where necessary.
At the international level, Census information has been used extensively in monitoring progress towards Namibia's achievement of international targets, particularly the Millennium Development Goals (MDGs).
The latest and most comprehensive Census was conducted in August 2011. Preparations for the Census started in the 2007/2008 financial year under the auspices of the then Central Bureau of Statistics (CBS) which was later transformed into the Namibia Statistics Agency (NSA). The NSA was established under the Statistics Act No. 9 of 2011, with the legal mandate and authority to conduct population Censuses every 10 years. The Census was implemented in three broad phases; pre-enumeration, enumeration and post enumeration.
During the first pre-enumeration phase, activities accomplished including the preparation of a project document, establishing Census management and technical committees, and establishing the Census cartography unit which demarcated the Enumeration Areas (EAs). Other activities included the development of Census instruments and tools, such as the questionnaires, manuals and field control forms.
Field staff were recruited, trained and deployed during the initial stages of the enumeration phase. The actual enumeration exercise was undertaken over a period of about three weeks from 28 August to 15 September 2011, while 28 August 2011 was marked as the reference period or 'Census Day'.
Great efforts were made to check and ensure that the Census data was of high quality to enhance its credibility and increase its usage. Various quality controls were implemented to ensure relevance, timeliness, accuracy, coherence and proper data interpretation. Other activities undertaken to enhance quality included the demarcation of the country into small enumeration areas to ensure comprehensive coverage; the development of structured Census questionnaires after consultat.The post-enumeration phase started with the sending of completed questionnaires to Head Office and the preparation of summaries for the preliminary report, which was published in April 2012. Processing of the Census data began with manual editing and coding, which focused on the household identification section and un-coded parts of the questionnaire. This was followed by the capturing of data through scanning. Finally, the data were verified and errors corrected where necessary. This took longer than planned due to inadequate technical skills.
National coverage
Households and persons
The sampling universe is defined as all households (private and institutions) from 2011 Census dataset.
Census/enumeration data [cen]
Sample Design
The stratified random sample was applied on the constituency and urban/rural variables of households list from Namibia 2011 Population and Housing Census for the Public Use Microdata Sample (PUMS) file. The sampling universe is defined as all households (private and institutions) from 2011 Census dataset. Since urban and rural are very important factor in the Namibia situation, it was then decided to take the stratum at the constituency and urban/rural levels. Some constituencies have very lower households in the urban or rural, the office therefore decided for a threshold (low boundary) for sampling within stratum. Based on data analysis, the threshold for stratum of PUMS file is 250 households. Thus, constituency and urban/rural areas with less than 250 households in total were included in the PUMS file. Otherwise, a simple random sampling (SRS) at a 20% sample rate was applied for each stratum. The sampled households include 93,674 housing units and 418,362 people.
Sample Selection
The PUMS sample is selected from households. The PUMS sample of persons in households is selected by keeping all persons in PUMS households. Sample selection process is performed using Census and Survey Processing System (CSPro).
The sample selection program first identifies the 7 census strata with less than 250 households and the households (private and institutions) with more than 50 people. The households in these areas and with this large size are all included in the sample. For the other households, the program randomly generates a number n from 0 to 4. Out of every 5 households, the program selects the nth household to export to the PUMS data file, creating a 20 percent sample of households. Private households and institutions are equally sampled in the PUMS data file.
Note: The 7 census strata with less than 250 households are: Arandis Constituency Rural, Rehoboth East Urban Constituency Rural, Walvis Bay Rural Constituency Rural, Mpungu Constituency Urban, Etayi Constituency Urban, Kalahari Constituency Urban, and Ondobe Constituency Urban.
Face-to-face [f2f]
The following questionnaire instruments were used for the Namibia 2011 Population and and Housing Census:
Form A (Long Form): For conventional households and residential institutions
Form B1 (Short Form): For special population groups such as persons in transit (travellers), police cells, homeless and off-shore populations
Form B2 (Short Form): For hotels/guesthouses
Form B3 (Short Form): For foreign missions/diplomatic corps
Data editing took place at a number of stages throughout the processing, including: a) During data collection in the field b) Manual editing and coding in the office c) During data entry (Primary validation/editing) Structure checking and completeness using Structured Query Language (SQL) program d) Secondary editing: i. Imputations of variables ii. Structural checking in Census and Survey Processing System (CSPro) program
Sampling Error The standard errors of survey estimates are needed to evaluate the precision of the survey estimation. The statistical software package such as SPSS or SAS can accurately estimate the mean and variance of estimates from the survey. SPSS or SAS software package makes use of the Taylor series approach in computing the variance.
Data quality Great efforts were made to check and ensure that the Census data was of high quality to enhance its credibility and increase its usage. Various quality controls were implemented to ensure relevance, timeliness, accuracy, coherence and proper data interpretation. Other activities undertaken to enhance quality included the demarcation of the country into small enumeration areas to ensure comprehensive coverage; the development of structured Census questionnaires after consultation with government ministries, university expertise and international partners; the preparation of detailed supervisors' and enumerators' instruction manuals to guide field staff during enumeration; the undertaking of comprehensive publicity and advocacy programmes to ensure full Government support and cooperation from the general public; the testing of questionnaires and other procedures; the provision of adequate training and undertaking of intensive supervision using four supervisory layers; the editing of questionnaires at field level; establishing proper mechanisms which ensured that all completed questionnaires were properly accounted for; ensuring intensive verification, validating all information and error corrections; and developing capacity in data processing with support from the international community.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
All cities with a population > 1000 or seats of adm div (ca 80.000)Sources and ContributionsSources : GeoNames is aggregating over hundred different data sources. Ambassadors : GeoNames Ambassadors help in many countries. Wiki : A wiki allows to view the data and quickly fix error and add missing places. Donations and Sponsoring : Costs for running GeoNames are covered by donations and sponsoring.Enrichment:add country name
https://datafinder.stats.govt.nz/license/attribution-4-0-international/https://datafinder.stats.govt.nz/license/attribution-4-0-international/
Dataset shows an individual’s statistical area 3 (SA3) of usual residence and the SA3 of their place of study, for the census usually resident population count who are studying (part time or full time), by main means of travel to education from the 2018 and 2023 Censuses.
The main means of travel to education categories are:
Main means of travel to education is the usual method a person used to travel the longest distance to their place of study.
Educational institution address is the physical location of the individual’s place of study. Educational institutions include early childhood education, primary school, secondary school, and tertiary education institutions. For individuals who study at home, their educational institution address is the same as their usual residence address.
Educational institution address is coded to the most detailed geography possible from the available information. This dataset only includes travel to education information for individuals whose educational institution address is available at SA3 level. The sum of the counts for each region in this dataset may not equal the census usually resident population count who are studying (part time or full time) for that region. Educational institution address – 2023 Census: Information by concept has more information.
This dataset can be used in conjunction with the following spatial files by joining on the SA3 code values:
Download data table using the instructions in the Koordinates help guide.
Footnotes
Geographical boundaries
Statistical standard for geographic areas 2023 (updated December 2023) has information about geographic boundaries as of 1 January 2023. Address data from 2013 and 2018 Censuses was updated to be consistent with the 2023 areas. Due to the changes in area boundaries and coding methodologies, 2013 and 2018 counts published in 2023 may be slightly different to those published in 2013 or 2018.
Subnational census usually resident population
The census usually resident population count of an area (subnational count) is a count of all people who usually live in that area and were present in New Zealand on census night. It excludes visitors from overseas, visitors from elsewhere in New Zealand, and residents temporarily overseas on census night. For example, a person who usually lives in Christchurch city and is visiting Wellington city on census night will be included in the census usually resident population count of Christchurch city.
Population counts
Stats NZ publishes a number of different population counts, each using a different definition and methodology. Population statistics – user guide has more information about different counts.
Caution using time series
Time series data should be interpreted with care due to changes in census methodology and differences in response rates between censuses. The 2023 and 2018 Censuses used a combined census methodology (using census responses and administrative data).
Educational institution address time series
Educational institution address time series data should be interpreted with care at lower geographic levels, such as statistical area 2 (SA2). Methodological improvements in 2023 Census resulted in greater data accuracy, including a greater proportion of people being counted at lower geographic areas compared to the 2018 Census. Educational institution address – 2023 Census: Information by concept has more information.
Rows excluded from the dataset
Rows show SA3 of usual residence by SA3 of educational institution address. Rows with a total population count of less than six have been removed to reduce the size of the dataset, given only a small proportion of SA3-SA3 combinations have commuter flows.
About the 2023 Census dataset
For information on the 2023 dataset see Using a combined census model for the 2023 Census. We combined data from the census forms with administrative data to create the 2023 Census dataset, which meets Stats NZ's quality criteria for population structure information. We added real data about real people to the dataset where we were confident the people who hadn’t completed a census form (which is known as admin enumeration) will be counted. We also used data from the 2018 and 2013 Censuses, administrative data sources, and statistical imputation methods to fill in some missing characteristics of people and dwellings.
Data quality
The quality of data in the 2023 Census is assessed using the quality rating scale and the quality assurance framework to determine whether data is fit for purpose and suitable for release. Data quality assurance in the 2023 Census has more information.
Quality rating of a variable
The quality rating of a variable provides an overall evaluation of data quality for that variable, usually at the highest levels of classification. The quality ratings shown are for the 2023 Census unless stated. There is variability in the quality of data at smaller geographies. Data quality may also vary between censuses, for subpopulations, or when cross tabulated with other variables or at lower levels of the classification. Data quality ratings for 2023 Census variables has more information on quality ratings by variable.
Main means of travel to education quality rating
Main means of travel to education is rated as moderate quality.
Main means of travel to education – 2023 Census: Information by concept has more information, for example, definitions and data quality.
Educational institution address quality rating
Educational institution address is rated as moderate quality.
Educational institution address – 2023 Census: Information by concept has more information, for example, definitions and data quality.
Using data for good
Stats NZ expects that, when working with census data, it is done so with a positive purpose, as outlined in the Māori Data Governance Model (Data Iwi Leaders Group, 2023). This model states that "data should support transformative outcomes and should uplift and strengthen our relationships with each other and with our environments. The avoidance of harm is the minimum expectation for data use. Māori data should also contribute to iwi and hapū tino rangatiratanga”.
Confidentiality
The 2023 Census confidentiality rules have been applied to 2013, 2018, and 2023 data. These rules protect the confidentiality of individuals, families, households, dwellings, and undertakings in 2023 Census data. Counts are calculated using fixed random rounding to base 3 (FRR3) and suppression of ‘sensitive’ counts less than six, where tables report multiple geographic variables and/or small populations. Individual figures may not always sum to stated totals. Applying confidentiality rules to 2023 Census data and summary of changes since 2018 and 2013 Censuses has more information about 2023 Census confidentiality rules.
Percentages
To calculate percentages, divide the figure for the category of interest by the figure for ‘Total stated’ where this applies.
Symbol
-999 Confidential
Inconsistencies in definitions
Please note that there may be differences in definitions between census classifications and those used for other data collections.
The 2010 Rwanda Demographic and Health Survey (RDHS) is designed to provide data for monitoring the population and health situation in Rwanda. The 2010 RDHS is the fifth Demographic and Health Survey to be conducted in Rwanda. The objective of the survey is to provide up-to-date information on fertility, family planning, childhood mortality, nutrition, maternal and child health, domestic violence, malaria, maternal mortality, awareness and behavior regarding HIV/AIDS, HIV prevalence, malaria prevalence, and anemia prevalence. A nationally representative sample of 13,671 women, age 15–49 from 12,540 surveyed households, and 6,329 men, age 15–59 from half of these households, were interviewed. This represents a response rate of 99 percent for women and 99 percent for men. The sample provides estimates at the national and provincial levels.
The main objectives of the 2010 RDHS were to: - Collect data at the national level to facilitate calculation of essential demographic rates, especially rates for fertility and infant and child mortality, and to analyze the direct and indirect factors that determine levels and trends in fertility and child mortality - Measure the levels of knowledge of contraceptive practices among women - Collect data on family health, including immunization practices; prevalence and treatment of diarrhea, acute upper respiratory infections, fever and/or convulsions among children under age 5; antenatal visits; and assistance at delivery - Collect data on the prevention and treatment of malaria, in particular the possession and use of bed nets among children under 5 and among women and pregnant women - Collect data on nutritional practices of children, including breastfeeding - Collect data on the knowledge and attitudes of men and women concerning sexually transmitted infections (STIs) and acquired immune deficiency syndrome (AIDS) and evaluate recent behavioral changes with regard to condom use - Collect data for the estimation of adult mortality and maternal mortality at the national level - Take anthropometric measurements in half of surveyed households in order to evaluate the nutritional status of children, men, and women - Conduct confidential testing for malaria parasitemia using Rapid Diagnostic Testing in half of the surveyed households and anonymous blood smear testing at the National Reference Laboratory - Collect dried blood spots (from finger pricks) for anonymous HIV testing at the National Reference Laboratory in half of surveyed households - Measure hemoglobin level (by finger prick) for anemia of surveyed respondents in half of surveyed households.
National. The sample provides estimates at the national and provincial levels.
Household, adult woman, adult man
Sample survey data
The sample for the 2010 RDHS was designed to provide population and health indicator estimates for the country as a whole and for urban and rural areas in particular. Survey estimates are also reported for the provinces (South, West, North, and East) and for the City of Kigali. The results presented in this report show key indicators that correspond to these provinces and the City of Kigali.
A representative sample of 12,792 households was selected for the 2010 RDHS. The sample was selected in two stages. In the first stage, 492 villages (also known as clusters or enumeration areas) were selected with probability proportional to the village size. The village size is the number of households residing in the village. Then, a complete mapping and listing of all households existing in the selected villages was conducted. The resulting lists of households served as the sampling frame for the second stage of sample selection. Households were systematically selected from those lists for participation in the survey.
All women age 15-49 who were either permanent residents of the household or visitors present in the household on the night before the survey were eligible to be interviewed. In addition, in a subsample of half of all households selected for the survey, all men age 15-59 were eligible to be interviewed if they were either permanent residents or visitors present in the household on the night before the survey.
SAMPLING FRAME
The sampling frame used for the 2010 RDHS is the preparatory frame for the Rwanda General Population and Housing Census (RGPH), which will be conducted in 2012. Provided by the National Institute of Statistics of Rwanda (NISR), the sampling frame is a complete list of natural villages covering the entire country. Though it is preferable to work with a frame consisting of enumeration areas (EAs) because the natural villages are too variable in size, an EA frame is not available at the time of sampling design. The sampling frame that was available is the list of 14,837 natural villages, which contains the administrative characteristics for each village and village population. The village population comes from the national ID card project carried out in 2007-08, which may be under estimated compared with the population projection conducted in 2009 by NISR.
Rwanda's administrative units were reformed in 2006, so the country is currently divided into 5 provinces; 30 districts, 417 sectors, and 14,837 villages.The average village size is 610 residents, which is equivalent to 133 households. The sizes of the districts are quite homogeneous, varying from 2.7 percent to 4.4 percent. There is no urban-rural specification in the sampling frame because the urban-rural definition has not been released by the Ministry of Local Administration (MINALOC). It was expected that the urban-rural definition of the sampled villages will be determined during the data collection or in the office once the MINALOC releases the definition.
Face-to-face
Three questionnaires were used for the 2010 RDHS: the Household Questionnaire, the Woman’s Questionnaire, and the Man’s Questionnaire. They are based on questionnaires developed by the worldwide Demographic and Health Surveys (DHS) program and on questionnaires used during the 2005 RDHS and 2007-08 RIDHS surveys. To reflect relevant issues in population and health in Rwanda, the questionnaires were adapted during a series of technical meetings with various stakeholders from government ministries and agencies, nongovernmental organizations, and international donors. The questionnaires were translated from English and French into Kinyarwanda.
The Household Questionnaire was used to list all the usual members and visitors in the selected households as well as to identify women and men eligible for individual interviews. Basic information was collected on the characteristics of each person listed, including age, sex, education, and relationship to the head of household. For children under 18, survival status of the parents was determined. The Household Questionnaire also collected information on the following: - Dwelling characteristics - Utilization of health services and health expenditures for recent illness and injury - Possession of iodized salt - Possession and utilization of mosquito nets - Height and weight of women and children - Hemoglobin measurement of women and children - Blood collection from women and children for rapid test and laboratory testing of malaria - Blood collection from women and men for laboratory testing for HIV
The Woman’s Questionnaire was used to collect information from all women age 15-49 and was organized by the following sections: - Respondent background characteristics - Reproduction, including a complete birth and death history of respondents’ children and information on abortion - Contraception - Pregnancy and postnatal care - Child’s immunization, health, and nutrition - Marriage and sexual activity - Fertility preferences - Husband’s background and woman’s work - HIV/AIDS and other sexually transmitted infections - Other health issues - Adult mortality - Relationship in the household
The Man’s Questionnaire was administered to all men age 15-59 living in every other household in the RDHS sample. The Man’s Questionnaire collected much of the same information as the Woman’s Questionnaire but was shorter because it did not contain a detailed reproductive history or questions on maternal and child health or nutrition.
An instruction manual was also developed to support standardized data collection. All data collection instruments were pretested in June-July 2010. The observations and experiences gathered from the pretest were used to improve the instruments for the main survey data collection.
Data entry began on November 1, 2010, almost one month after the survey was launched in the field. Data were entered by a team of 15 data processing personnel recruited and trained for this task. They were assisted during these operations by 4 data verification and codification officers and 2 receptionists. Completed questionnaires were periodically brought in from the field to the National Institute of Statistics headquarters, where assigned agents checked them and coded the open-ended questions. Next, the questionnaires were sent to the data entry facility and the blood samples (DBS and malaria slides) were sent to the NRL to be screened for HIV. Data were entered using CSPro, a program developed jointly by the United States Census Bureau, the ORC Macro MEASURE DHS+ program, and Serpro S.A. Processing the data concurrently with data collection allowed for regular monitoring of teams’ performance and data quality. Field check tables were regularly generated during data processing to check
https://datafinder.stats.govt.nz/license/attribution-4-0-international/https://datafinder.stats.govt.nz/license/attribution-4-0-international/
Dataset contains counts and measures for individuals from the 2013, 2018, and 2023 Censuses. Data is available by statistical area 1.
The variables included in this dataset are for the census usually resident population count (unless otherwise stated). All data is for level 1 of the classification.
The variables for part 2 of the dataset are:
Download lookup file for part 2 from Stats NZ ArcGIS Online or embedded attachment in Stats NZ geographic data service. Download data table (excluding the geometry column for CSV files) using the instructions in the Koordinates help guide.
Footnotes
Te Whata
Under the Mana Ōrite Relationship Agreement, Te Kāhui Raraunga (TKR) will be publishing Māori descent and iwi affiliation data from the 2023 Census in partnership with Stats NZ. This will be available on Te Whata, a TKR platform.
Geographical boundaries
Statistical standard for geographic areas 2023 (updated December 2023) has information about geographic boundaries as of 1 January 2023. Address data from 2013 and 2018 Censuses was updated to be consistent with the 2023 areas. Due to the changes in area boundaries and coding methodologies, 2013 and 2018 counts published in 2023 may be slightly different to those published in 2013 or 2018.
Subnational census usually resident population
The census usually resident population count of an area (subnational count) is a count of all people who usually live in that area and were present in New Zealand on census night. It excludes visitors from overseas, visitors from elsewhere in New Zealand, and residents temporarily overseas on census night. For example, a person who usually lives in Christchurch city and is visiting Wellington city on census night will be included in the census usually resident population count of Christchurch city.
Population counts
Stats NZ publishes a number of different population counts, each using a different definition and methodology. Population statistics – user guide has more information about different counts.
Caution using time series
Time series data should be interpreted with care due to changes in census methodology and differences in response rates between censuses. The 2023 and 2018 Censuses used a combined census methodology (using census responses and administrative data), while the 2013 Census used a full-field enumeration methodology (with no use of administrative data).
Study participation time series
In the 2013 Census study participation was only collected for the census usually resident population count aged 15 years and over.
About the 2023 Census dataset
For information on the 2023 dataset see Using a combined census model for the 2023 Census. We combined data from the census forms with administrative data to create the 2023 Census dataset, which meets Stats NZ's quality criteria for population structure information. We added real data about real people to the dataset where we were confident the people who hadn’t completed a census form (which is known as admin enumeration) will be counted. We also used data from the 2018 and 2013 Censuses, administrative data sources, and statistical imputation methods to fill in some missing characteristics of people and dwellings.
Data quality
The quality of data in the 2023 Census is assessed using the quality rating scale and the quality assurance framework to determine whether data is fit for purpose and suitable for release. Data quality assurance in the 2023 Census has more information.
Concept descriptions and quality ratings
Data quality ratings for 2023 Census variables has additional details about variables found within totals by topic, for example, definitions and data quality.
Disability indicator
This data should not be used as an official measure of disability prevalence. Disability prevalence estimates are only available from the 2023 Household Disability Survey. Household Disability Survey 2023: Final content has more information about the survey.
Activity limitations are measured using the Washington Group Short Set (WGSS). The WGSS asks about six basic activities that a person might have difficulty with: seeing, hearing, walking or climbing stairs, remembering or concentrating, washing all over or dressing, and communicating. A person was classified as disabled in the 2023 Census if there was at least one of these activities that they had a lot of difficulty with or could not do at all.
Using data for good
Stats NZ expects that, when working with census data, it is done so with a positive purpose, as outlined in the Māori Data Governance Model (Data Iwi Leaders Group, 2023). This model states that "data should support transformative outcomes and should uplift and strengthen our relationships with each other and with our environments. The avoidance of harm is the minimum expectation for data use. Māori data should also contribute to iwi and hapū tino rangatiratanga”.
Confidentiality
The 2023 Census confidentiality rules have been applied to 2013, 2018, and 2023 data. These rules protect the confidentiality of individuals, families, households, dwellings, and undertakings in 2023 Census data. Counts are calculated using fixed random rounding to base 3 (FRR3) and suppression of ‘sensitive’ counts less than six, where tables report multiple geographic variables and/or small populations. Individual figures may not always sum to stated totals. Applying confidentiality rules to 2023 Census data and summary of changes since 2018 and 2013 Censuses has more information about 2023 Census confidentiality rules.
Measures
Measures like averages, medians, and other quantiles are calculated from unrounded counts, with input noise added to or subtracted from each contributing value
This statistic shows the 20 countries with the highest population growth rate in 2024. In SouthSudan, the population grew by about 4.65 percent compared to the previous year, making it the country with the highest population growth rate in 2024. The global population Today, the global population amounts to around 7 billion people, i.e. the total number of living humans on Earth. More than half of the global population is living in Asia, while one quarter of the global population resides in Africa. High fertility rates in Africa and Asia, a decline in the mortality rates and an increase in the median age of the world population all contribute to the global population growth. Statistics show that the global population is subject to increase by almost 4 billion people by 2100. The global population growth is a direct result of people living longer because of better living conditions and a healthier nutrition. Three out of five of the most populous countries in the world are located in Asia. Ultimately the highest population growth rate is also found there, the country with the highest population growth rate is Syria. This could be due to a low infant mortality rate in Syria or the ever -expanding tourism sector.
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
On 6/28/2023, data on cases by vaccination status will be archived and will no longer update.
A. SUMMARY This dataset represents San Francisco COVID-19 positive confirmed cases by vaccination status over time, starting January 1, 2021. Cases are included on the date the positive test was collected (the specimen collection date). Cases are counted in three categories: (1) all cases; (2) unvaccinated cases; and (3) completed primary series cases.
All cases: Includes cases among all San Francisco residents regardless of vaccination status.
Unvaccinated cases: Cases are considered unvaccinated if their positive COVID-19 test was before receiving any vaccine. Cases that are not matched to a COVID-19 vaccination record are considered unvaccinated.
Completed primary series cases: Cases are considered completed primary series if their positive COVID-19 test was 14 days or more after they received their 2nd dose in a 2-dose COVID-19 series or the single dose of a 1-dose vaccine. These are also called “breakthrough cases.”
On September 12, 2021, a new case definition of COVID-19 was introduced that includes criteria for enumerating new infections after previous probable or confirmed infections (also known as reinfections). A reinfection is defined as a confirmed positive PCR lab test more than 90 days after a positive PCR or antigen test. The first reinfection case was identified on December 7, 2021.
Data is lagged by eight days, meaning the most recent specimen collection date included is eight days prior to today. All data updates daily as more information becomes available.
B. HOW THE DATASET IS CREATED Case information is based on confirmed positive laboratory tests reported to the City. The City then completes quality assurance and other data verification processes. Vaccination data comes from the California Immunization Registry (CAIR2). The California Department of Public Health runs CAIR2. Individual-level case and vaccination data are matched to identify cases by vaccination status in this dataset. Case records are matched to vaccine records using first name, last name, date of birth, phone number, and email address.
We include vaccination records from all nine Bay Area counties in order to improve matching rates. This allows us to identify breakthrough cases among people who moved to the City from other Bay Area counties after completing their vaccine series. Only cases among San Francisco residents are included.
C. UPDATE PROCESS Updates automatically at 08:00 AM Pacific Time each day.
D. HOW TO USE THIS DATASET Total San Francisco population estimates can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS). To identify total San Francisco population estimates, filter the view on “demographic_category_label” = “all ages”.
Population estimates by vaccination status are derived from our publicly reported vaccination counts, which can be found at COVID-19 Vaccinations Given to SF Residents Over Time.
The dataset includes new cases, 7-day average new cases, new case rates, 7-day average new case rates, percent of total cases, and 7-day average percent of total cases for each vaccination category.
New cases are the count of cases where the positive tests were collected on that specific specimen collection date. The 7-day rolling average shows the trend in new cases. The rolling average is calculated by averaging the new cases for a particular day with the prior 6 days.
New case rates are the count of new cases per 100,000 residents in each vaccination status group. The 7-day rolling average shows the trend in case rates. The rolling average is calculated by averaging the case rate for a particular day with the prior six days. Percent of total new cases shows the percent of all cases on each day that were among a particular vaccination status.
Here is more information on how each case rate is calculated:
The case rate for all cases is equal to the number of new cases among all residents divided by the estimated total resident population.
Unvaccinated case rates are equal to the number of new cases among unvaccinated residents divided by the estimated number of unvaccinated residents. The estimated number of unvaccinated residents is calculated by subtracting the number of residents that have received at least one dose of a vaccine from the total estimated resident population.
Completed primary series case rates are equal to the number of new cases among completed primary series residents divided by the estimated number of completed primary series residents. The estimated number of completed primary series residents is calculated by taking the number of residents who have completed their primary series over time and adding a 14-day delay to the “date_administered” column, to align with the definition of “Completed primary series cases” above.
E. CHANGE LOG
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Population, female (% of total population) in World was reported at 49.71 % in 2023, according to the World Bank collection of development indicators, compiled from officially recognized sources. World - Population, female (% of total) - actual values, historical data, forecasts and projections were sourced from the World Bank on May of 2025.
This graph shows the population of the U.S. by race and ethnic group from 2000 to 2023. In 2023, there were around 21.39 million people of Asian origin living in the United States. A ranking of the most spoken languages across the world can be accessed here. U.S. populationCurrently, the white population makes up the vast majority of the United States’ population, accounting for some 252.07 million people in 2023. This ethnicity group contributes to the highest share of the population in every region, but is especially noticeable in the Midwestern region. The Black or African American resident population totaled 45.76 million people in the same year. The overall population in the United States is expected to increase annually from 2022, with the 320.92 million people in 2015 expected to rise to 341.69 million people by 2027. Thus, population densities have also increased, totaling 36.3 inhabitants per square kilometer as of 2021. Despite being one of the most populous countries in the world, following China and India, the United States is not even among the top 150 most densely populated countries due to its large land mass. Monaco is the most densely populated country in the world and has a population density of 24,621.5 inhabitants per square kilometer as of 2021. As population numbers in the U.S. continues to grow, the Hispanic population has also seen a similar trend from 35.7 million inhabitants in the country in 2000 to some 62.65 million inhabitants in 2021. This growing population group is a significant source of population growth in the country due to both high immigration and birth rates. The United States is one of the most racially diverse countries in the world.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The minor SNP allele frequency (MAF) is given by locus within and across all population samples. The overall North American population mean (μ) and variance (σ) as well as observed (HO) and expected heterozygosity (HE) are given for each locus. Lastly, locus specific FST, FIS, and FIT estimates among populations are given.
Globally, about 25 percent of the population is under 15 years of age and 10 percent is over 65 years of age. Africa has the youngest population worldwide. In Sub-Saharan Africa, more than 40 percent of the population is below 15 years, and only three percent are above 65, indicating the low life expectancy in several of the countries. In Europe, on the other hand, a higher share of the population is above 65 years than the population under 15 years. Fertility rates The high share of children and youth in Africa is connected to the high fertility rates on the continent. For instance, South Sudan and Niger have the highest population growth rates globally. However, about 50 percent of the world’s population live in countries with low fertility, where women have less than 2.1 children. Some countries in Europe, like Latvia and Lithuania, have experienced a population decline of one percent, and in the Cook Islands, it is even above two percent. In Europe, the majority of the population was previously working-aged adults with few dependents, but this trend is expected to reverse soon, and it is predicted that by 2050, the older population will outnumber the young in many developed countries. Growing global population As of 2025, there are 8.1 billion people living on the planet, and this is expected to reach more than nine billion before 2040. Moreover, the global population is expected to reach 10 billions around 2060, before slowing and then even falling slightly by 2100. As the population growth rates indicate, a significant share of the population increase will happen in Africa.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the population of United States by gender across 18 age groups. It lists the male and female population in each age group along with the gender ratio for United States. The dataset can be utilized to understand the population distribution of United States by gender and age. For example, using this dataset, we can identify the largest age group for both Men and Women in United States. Additionally, it can be used to see how the gender ratio changes from birth to senior most age group and male to female ratio across each age group for United States.
Key observations
Largest age group (population): Male # 25-29 years (11.59 million) | Female # 25-29 years (11.26 million). Source: U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates.
Age groups:
Scope of gender :
Please note that American Community Survey asks a question about the respondents current sex, but not about gender, sexual orientation, or sex at birth. The question is intended to capture data for biological sex, not gender. Respondents are supposed to respond with the answer as either of Male or Female. Our research and this dataset mirrors the data reported as Male and Female for gender distribution analysis.
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.
Custom data
If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.
Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.
This dataset is a part of the main dataset for United States Population by Gender. You can refer the same here
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
This dataset provides code and example data for simulating specimen collections of flowering plants across North America, and for developing phenological predictions of population-level flowering onset and termination for these data. It further presents code for assessing the accuracy of these predictions relaticve to known (simulated) population-level flowering dates at the location of each collection. Methods Creating a reference dataset: generating sample locations representing known population-level phenological distributions and individual phenological parameters We simulated phenological data for 1200 hypothetical “species” in the coterminous USA that varied in the attributes of their individual- and population-level flowering phenology. For each of these simulated species, we selected 1000 locations within the continental United States, each representing a local population observed during a single year from which a simulated specimen was later obtained. The coordinates for each location, year, and associated mean annual temperature in the year of collection were randomly selected without replacement from 4-km2 PRISM pixels (PRISM Climate Group 2011) between the years 1901 to 2020, and were restricted to locations with 1991–2020 temperature normals of 1–20 °C and mean annual precipitation normals for the same period of 60–3800 mm. Each species generated this way was assigned a series of attributes defining its individual- and population-level flowering phenology. The peak flowering date of an individual was assumed to coincide with its mean flowering date. We then defined a linear equation describing the relationship between the mean date of peak flowering among individuals within a population and local temperature conditions. Each species was assigned a median population flowering DOY of 50 at 0˚C (i.e., the intercept) as well as a phenological responsiveness (i.e., slope) of median flowering DOY to mean annual temperature: advancing by 1, 4, or 8 days per increase in °C. Next, we assigned each species a low or high magnitude of intrapopulation variation in phenological timing (i.e., in peak flowering DOYs) among individuals (based on normal distributions with standard deviations (σ) of either 10 or 30 days), representing the magnitude of variation in the flowering times of early- to late-flowering individuals within each local population. Then, each species was assigned a short, moderate, or long duration of the flowering period by each individual within each population (15, 30, or 60 days, representing the duration of time each individual plant was in flower. Fifty species were simulated for each of these 18 combinations of phenological responsiveness, flowering duration, and intrapopulation variation in phenological timing. To accommodate the possibility that the magnitude of variation in phenological timing within a population could depend on local climate conditions, we also simulated 50 species with temperature-sensitive intrapopulation phenological variation (σ) ranging from 10 to 30 days. For these species, σ of the DOY among individuals in a given population increased by 1 day for every 1 °C increase in the mean annual temperature of its location. For these simulated species, individual flowering duration was fixed at 30 days. Additionally, to accommodate the possibility that individual flowering durations could exhibit linear relationships with local climate conditions, we also simulated 50 species that exhibited individual-level variation in flowering duration resulting from changes in temperature (increasing by 1 day per °C increase in mean annual temperature, and ranging from 10 days to 30 days). For these species, the degree of intrapopulation variation in peak flowering dates was held constant at σ = 30 days (i.e., high intrapopulation variation).
Calculation of population-level onset, median, and termination dates of flowering For each population of each species described above, we calculated a distribution of individual-level peak flowering dates—assumed to be normally distributed (Clark and Thompson 2011)—based on the flowering attributes of the species and the temperature conditions corresponding to its site and year of observation. First, we calculated the median flowering DOY at the location and year from which each specimen was collected based on its pre-defined intercept and phenological responsiveness to mean annual temperature (i.e., 1, 4, and 8 days per °C). Then, we obtained the standard deviation of each local population (i.e., its degree of intrapopulation variation in flowering dates) based on the flowering attributes of the simulated species as outlined above. Next, we arbitrarily defined population-level flowering onset DOYs for each population and year as the 10th percentile of a normally distributed population whose mean and standard deviation we obtained in the previous steps (i.e., the DOYs by which the first 10% of individuals in a local population at a given location and year would have reached their median flowering dates). Similarly, the population-level flowering termination dates were calculated as the 90th percentile of a normally distributed population with the same characteristics as described above (i.e., the DOYs by which all but 10% of individuals in a local population at a given location and year would have reached their peak (or mean) flowering dates). Through this process, we obtained a sample of 1000 annual population-level distributions of flowering dates for each of 1200 hypothetical species. For each of these populations, the quantiles of their flowering distribution—representing the nth individual reaching peak flowering within a population—were known a priori, representing a benchmark against which to compare estimates derived from simulated specimen data.
Simulating randomly selected (unbiased) phenological snapshots from pre-defined populations For each species, we then generated simulated specimens by: (1) randomly selecting an individual within each population and (2) selecting a random DOY within its individual-level flowering period that emulated the phenological snapshot provided by real herbarium specimens. Specifically, using the distribution of peak flowering dates of each population, we selected an individual at random. From its peak flowering date, we then obtained onset and termination dates by subtracting (for flowering onset) or adding (for flowering termination) half the individual’s flowering duration for that species to the sampled date of peak flowering. To simulate a phenological snapshot for that individual, we then randomly selected a DOY between the onset and termination of that individual’s flowering period. As a result, the simulated datum represented a simulated herbarium specimen generated accounting for uncertainty in both the timing of the individual relative to its source population, and in the timing of the collection relative to the onset and termination of that individual’s flowering period. This procedure was repeated across all locations for each simulated species, generating 1000 data points (i.e., simulated specimens or phenological snapshots) per species.
Simulating biases in collection effort across population-level flowering periods To simulate biases towards collection of specimens during the early or late portion of their local population-level flowering displays, we selected an individual at random within each population and year using both left- and right-skewed normal probability distributions. These distributions were constructed by modulating the parameter α in the python package scipy.stats.skewnorm v1.10.1 (Azzalini and Capitanio 1998), such that if the underlying plant population was treated as exhibiting a normal distribution (α = 0), samples were collected from that population with a left-skewed (α = -1.0) or right-skewed (α = -1.0) probability distribution. Once an individual was selected from these skewed distributions, the timing of sample collection from within the individual flowering durations of these ‘specimens’ were generated using similar methods as unbiased specimens. We then determined the accuracy of the model predictions generated from datasets exhibiting biased and unbiased sampling of local populations by comparing predicted population-level flowering onset and termination dates with the actual (i.e., known, simulated) flowering dates that were produced using a normal distribution. To minimize computation time, population-level biases were examined only for the subset of species for which phenological responsiveness to mean annual temperature equaled 4 days/˚C (representing moderate responsiveness to climate stimuli), intrapopulation variation was high (σ = 30), and individual flowering duration was moderate (30 days).
Simulating biases in the timing of collection within flowering periods of individuals In addition to biases towards collection of early or late individuals within a local population, botanists may also preferentially collect individuals from the early or late portion of their individual flowering period (i.e., individual collection bias). In some cases, collectors may preferentially collect individuals that are proximate to their peak flowering date because this is when the most flowers are displayed. In other cases, collectors may preferentially collect specimens that have only recently begun to flower, when floral structures may exhibit less damage from inclement weather or herbivores, or proximate to flowering termination in cases where the collector prefers specimens that include both flowers and fruits. Accordingly, for each population of each species, we simulated DOYs within each individual’s flowering period both at random (i.e., without bias) and with three different types of bias. Unbiased collections were simulated by selecting a random date chosen uniformly within the flowering period
As of February 2025, 5.56 billion individuals worldwide were internet users, which amounted to 67.9 percent of the global population. Of this total, 5.24 billion, or 63.9 percent of the world's population, were social media users. Global internet usage Connecting billions of people worldwide, the internet is a core pillar of the modern information society. Northern Europe ranked first among worldwide regions by the share of the population using the internet in 20254. In The Netherlands, Norway and Saudi Arabia, 99 percent of the population used the internet as of February 2025. North Korea was at the opposite end of the spectrum, with virtually no internet usage penetration among the general population, ranking last worldwide. Eastern Asia was home to the largest number of online users worldwide – over 1.34 billion at the latest count. Southern Asia ranked second, with around 1.2 billion internet users. China, India, and the United States rank ahead of other countries worldwide by the number of internet users. Worldwide internet user demographics As of 2024, the share of female internet users worldwide was 65 percent, five percent less than that of men. Gender disparity in internet usage was bigger in African countries, with around a ten percent difference. Worldwide regions, like the Commonwealth of Independent States and Europe, showed a smaller usage gap between these two genders. As of 2024, global internet usage was higher among individuals between 15 and 24 years old across all regions, with young people in Europe representing the most significant usage penetration, 98 percent. In comparison, the worldwide average for the age group 15–24 years was 79 percent. The income level of the countries was also an essential factor for internet access, as 93 percent of the population of the countries with high income reportedly used the internet, as opposed to only 27 percent of the low-income markets.
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
This data set includes generalized structural equation models (SEM) for the parental populations and hybrid populations in Stata 18, code for figures in R and data in CSV format. We used SEM over linear regression in order to also generate estimates of the errors and unexplained variance of the midpoint value of the parents. First, we encoded sex, nuclear genome composition, collection cohort, days in lab after collection, and arena as variables in both models. For the hybrid model, we also added terms for mitochondrial genome and interaction terms between sex and nuclear genome composition because we observed a different trend in sex differences for one population. Separate SEM equations were necessary since parental populations were perfectly colinear with mitochondrial type, whereas F1 hybrids were not. Since observations were grouped by arena and trial, we utilized a clustered standard error calculator when combining both models to allow for different variance between populations. After, we tested the linear combinations of the means to determine if F1 hybrids performed significantly different from the midpoint value of the two parental populations. By using the estimates from the SEM, we were able to include the estimated error from the models into the equation as well. Lastly, we tested the linear combinations of the means to determine if the reciprocal crosses of F1 hybrids performed significantly differently from each other. Methods Copepods were collected from four locations along the California coastline in December 2019 and August 2021 under collection permits S-192510001-19262-001 and S-192510001-21127-001. All copepods were then transported to the University of North Carolina at Chapel Hill where they were maintained for experiments. Gravid females from each population were transferred to petri dishes with artificial sea water (ASW) and stored in a 12L:12D light cycle at 20°C for at least one month (equivalent to one generation) prior to testing. Salinity was adjusted to 35 ppt, and ground commercial fish flakes, which served as food, were added ad libitum every other week. F1 hybrids were generated by placing virgin females of one population with virgin males from a second population. Copepods were grouped by sex and population then placed in aggregates of 5 into arenas of a water bath. Temperature was maintained at 20°C by a cooling tower that circulated chilled water through the water bath, and the water bath was kept inside a custom-built glove box. An air stone placed in the water bath was used to deliver gas into the water continuously. Nitrogen gas was delivered to the water bath until the dissolved oxygen level reached 0.05 mg/L or less and maintained for 20 hours, then atmospheric air was delivered for 10 hours. At the end of the assay, copepods were prodded gently with a needle probe to prompt a swimming response, and, if copepods did not swim away, they were assumed to be dead. After each assay, all copepods were sacrificed, individually lysed, and genotyped. After lysis, a portion of the mitochondrial DNA was amplified via polymerase chain reaction (PCR) using population specific primers. Any confirmed heteroplasmic individuals were excluded from further analysis.
Estimated number of persons by quarter of a year and by year, Canada, provinces and territories.
Over the past 23 years, there were constantly more men than women living on the planet. Of the 8.06 billion people living on the Earth in 2023, 4.05 billion were men and 4.01 billion were women. One-quarter of the world's total population in 2024 was below 15 years.
The 1997 Jordan Population and Family Health Survey (JPFHS) is a national sample survey carried out by the Department of Statistics (DOS) as part of its National Household Surveys Program (NHSP). The JPFHS was specifically aimed at providing information on fertility, family planning, and infant and child mortality. Information was also gathered on breastfeeding, on maternal and child health care and nutritional status, and on the characteristics of households and household members. The survey will provide policymakers and planners with important information for use in formulating informed programs and policies on reproductive behavior and health.
National
Sample survey data
SAMPLE DESIGN AND IMPLEMENTATION
The 1997 JPFHS sample was designed to produce reliable estimates of major survey variables for the country as a whole, for urban and rural areas, for the three regions (each composed of a group of governorates), and for the three major governorates, Amman, Irbid, and Zarqa.
The 1997 JPFHS sample is a subsample of the master sample that was designed using the frame obtained from the 1994 Population and Housing Census. A two-stage sampling procedure was employed. First, primary sampling units (PSUs) were selected with probability proportional to the number of housing units in the PSU. A total of 300 PSUs were selected at this stage. In the second stage, in each selected PSU, occupied housing units were selected with probability inversely proportional to the number of housing units in the PSU. This design maintains a self-weighted sampling fraction within each governorate.
UPDATING OF SAMPLING FRAME
Prior to the main fieldwork, mapping operations were carried out and the sample units/blocks were selected and then identified and located in the field. The selected blocks were delineated and the outer boundaries were demarcated with special signs. During this process, the numbers on buildings and housing units were updated, listed and documented, along with the name of the owner/tenant of the unit or household and the name of the household head. These activities took place between January 7 and February 28, 1997.
Note: See detailed description of sample design in APPENDIX A of the survey report.
Face-to-face
The 1997 JPFHS used two questionnaires, one for the household interview and the other for eligible women. Both questionnaires were developed in English and then translated into Arabic. The household questionnaire was used to list all members of the sampled households, including usual residents as well as visitors. For each member of the household, basic demographic and social characteristics were recorded and women eligible for the individual interview were identified. The individual questionnaire was developed utilizing the experience gained from previous surveys, in particular the 1983 and 1990 Jordan Fertility and Family Health Surveys (JFFHS).
The 1997 JPFHS individual questionnaire consists of 10 sections: - Respondent’s background - Marriage - Reproduction (birth history) - Contraception - Pregnancy, breastfeeding, health and immunization - Fertility preferences - Husband’s background, woman’s work and residence - Knowledge of AIDS - Maternal mortality - Height and weight of children and mothers.
Fieldwork and data processing activities overlapped. After a week of data collection, and after field editing of questionnaires for completeness and consistency, the questionnaires for each cluster were packaged together and sent to the central office in Amman where they were registered and stored. Special teams were formed to carry out office editing and coding.
Data entry started after a week of office data processing. The process of data entry, editing, and cleaning was done by means of the ISSA (Integrated System for Survey Analysis) program DHS has developed especially for such surveys. The ISSA program allows data to be edited while being entered. Data entry was completed on November 14, 1997. A data processing specialist from Macro made a trip to Jordan in November and December 1997 to identify problems in data entry, editing, and cleaning, and to work on tabulations for both the preliminary and final report.
A total of 7,924 occupied housing units were selected for the survey; from among those, 7,592 households were found. Of the occupied households, 7,335 (97 percent) were successfully interviewed. In those households, 5,765 eligible women were identified, and complete interviews were obtained with 5,548 of them (96 percent of all eligible women). Thus, the overall response rate of the 1997 JPFHS was 93 percent. The principal reason for nonresponse among the women was the failure of interviewers to find them at home despite repeated callbacks.
Note: See summarized response rates by place of residence in Table 1.1 of the survey report.
The estimates from a sample survey are subject to two types of errors: nonsampling errors and sampling errors. Nonsampling errors are the result of mistakes made in implementing data collection and data processing (such as failure to locate and interview the correct household, misunderstanding questions either by the interviewer or the respondent, and data entry errors). Although during the implementation of the 1997 JPFHS numerous efforts were made to minimize this type of error, nonsampling errors are not only impossible to avoid but also difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The respondents selected in the 1997 JPFHS constitute only one of many samples that could have been selected from the same population, given the same design and expected size. Each of those samples would have yielded results differing somewhat from the results of the sample actually selected. Sampling errors are a measure of the variability among all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, since the 1997 JDHS-II sample resulted from a multistage stratified design, formulae of higher complexity had to be used. The computer software used to calculate sampling errors for the 1997 JDHS-II was the ISSA Sampling Error Module, which uses the Taylor linearization method of variance estimation for survey estimates that are means or proportions. The Jackknife repeated replication method is used for variance estimation of more complex statistics, such as fertility and mortality rates.
Note: See detailed estimate of sampling error calculation in APPENDIX B of the survey report.
Data Quality Tables - Household age distribution - Age distribution of eligible and interviewed women - Completeness of reporting - Births by calendar years - Reporting of age at death in days - Reporting of age at death in months
Note: See detailed tables in APPENDIX C of the survey report.