Chapter 18. Interpreting disaggregated health data
Overview
Disaggregated health data, which present health data by population subgroups, serve as a starting point for understanding the patterns and trends of inequalities in populations. The visual inspection of disaggregated data provides a cursory way to get a sense of the direction and magnitude of inequalities for a given indicator and time point. This can be done readily for binary dimensions of inequality groupings to identify which subgroup is performing better and which is performing worse, and to assess the extent of the gap between the two. When data are available for a large number of subgroups, displaying data graphically can facilitate the process of data inspection, because it can reveal patterns more easily than trying to make sense of multiple data points across columns and rows of a table.
Disaggregated data can convey information about inequalities in a straightforward and transparent manner, although certain issues related to their interpretation emerge upon close inspection. How can patterns in disaggregated data be described? With what degree of certainty do estimates represent their respective subgroups? When are the disaggregated estimates significantly different? What share of the population is captured in each subgroup – and what are the implications when the population share is uneven across subgroups? What considerations are required when comparing sets of disaggregated data across health topics, between populations (as part of benchmarking), and over time (as part of assessing inequality trends)? Consideration of these questions lends more robust insights into the data and nuances their interpretation.
This chapter presents strategies and fundamental considerations for interpreting disaggregated data. The objective is to facilitate a rigorous understanding of the conclusions derived from inspecting and comparing disaggregated data. It includes discussions on describing characteristic patterns in disaggregated data, accounting for underlying qualities and issues related to subgroup data, and generating valid comparisons of disaggregated data. A detailed understanding of the preparation of disaggregated data and their interpretation (covered in Chapter 17 and in this chapter, respectively) is a helpful precursor for the use of summary measures of health inequality (see Chapter 19-22). Reporting disaggregated data and summary measures is covered in Chapter 23.
Characteristic patterns in disaggregated data
For subgroups that are ordered (i.e. where they can be logically ranked, such as with age, economic status or education), describing characteristic patterns across disaggregated data can be a compelling way to assess and report disaggregated health data (1). Alongside each of these patterns, a corresponding response or intervention can also be proposed as a general starting point for further action (see Annex 10). See Chapter 8 for more about equity-oriented policy-making.
Figure 18.1 demonstrates four patterns of inequality in a health service coverage indicator (measured as percentage) across wealth quintiles, using hypothetical data. In this type of figure (known as an equiplot), the health indicator estimate is plotted on the bottom axis, and the subgroups are represented by coloured circles. The four characteristic patterns are labelled to the left. The top row shows complete coverage. All quintiles have around 100% coverage, indicating universal coverage of this health service with almost no inequality. A response to this situation is continued monitoring to ensure coverage remains high for all. The second row demonstrates a pattern of marginal exclusion, where the poorest quintile has much lower coverage than the four richer quintiles. An appropriate response here may involve targeting the poorest subgroup. The third row is a queuing or linear pattern, whereby there are increases in coverage across each of the quintiles. A combination (or gradient) approach, with differentiated targeting across the population subgroups may be warranted. The bottom row illustrates the mass deprivation pattern, where most of the population – that is, all but the richest quintile – has low levels of coverage. In this scenario, a population-level response may be required to reach all or most of the population. For examples of these characteristic patterns of inequality in countries, see Box 18.1.
FIGURE 18.1. Illustration of patterns of inequality in health service coverage across wealth quintiles
Horizontal lines show the range between the lowest and highest subgroup estimates.
BOX 18.1. Examples of characteristic patterns of inequality in disaggregated data
The following example shows characteristic patterns of inequality observed across wealth quintiles for deliveries in a health facility for nine selected countries in the WHO African Region with Demographic and Health Surveys (DHS) or Multiple Indicator Cluster Surveys (MICS) data available from 2019 or 2020 (Figure 18.2).
A queuing pattern was evident in the Central African Republic and Ethiopia, where the percentage of deliveries in a health facility increased in a stepwise pattern across wealth quintiles. Senegal and Zimbabwe demonstrated a marginal exclusion pattern, where the percentage of health facility deliveries was considerably lower in the poorest quintile compared with the four richer quintiles. Mass deprivation was observed in Chad and Guinea-Bissau, with a considerably higher percentage of health facility deliveries in the richest quintile compared with the four poorer quintiles. A universal pattern of high percentage of health facility deliveries was reported in Algeria, Malawi and Sao Tome and Principe.
FIGURE 18.2. Deliveries in a health facility, by economic status, nine selected countries in the WHO African Region
Horizontal lines show the range between the lowest and highest subgroup estimates.
Source: derived from the WHO Health Inequality Data Repository Reproductive, Maternal, Newborn and Child Health dataset (2), with data sourced from the Demographic and Health Surveys in 2019 (Ethiopia, Senegal) and from Multiple Indicator Cluster Surveys in 2019 (Algeria, Central African Republic, Chad, Guinea-Bissau, Sao Tome and Principe, Zimbabwe) and 2020 (Malawi).
Measures of uncertainty and significance
Measures of uncertainty for point estimates derived from surveys, such as 95% confidence intervals or standard errors, provide information about the reliability of the estimate. They can also be used to assess whether there are statistically significant differences between population subgroups and to help determine whether the results are meaningful. Figure 18.3, for example, shows 2016 estimates for stunting prevalence in children aged under five years in Paraguay disaggregated by wealth quintiles, with corresponding 95% confidence intervals (indicated by horizontal lines). Taking 95% confidence intervals into account, the data show a marginal exclusion pattern. The highest stunting prevalence was in the poorest quintile, with a large overlap in the 95% confidence intervals for the three middle quintiles (quintiles 2–4), and minimal overlap for the two richest quintiles (quintiles 4 and 5).
FIGURE 18.3. Stunting prevalence in children aged under five years, by economic status, Paraguay
Horizontal lines show 95% confidence intervals around the point estimates.
Source: derived from the WHO Health Inequality Data Repository Child Malnutrition dataset (2), with data sourced from the 2016 Multiple Indicator Cluster Surveys.
The mathematical calculation of measures of uncertainty takes survey sample size into account. Subgroup estimates based on smaller sample sizes tend to have more uncertainty, whereas estimates based on larger sample sizes tend to demonstrate lower uncertainty.
The level of uncertainty surrounding an estimate considers whether a comparison between two values is statistically significant (e.g. measured using a P value). Subgroup estimates based on larger sample sizes are more likely to yield statistically significant results. There may be cases where small differences in disaggregated estimates show statistical significance solely because they are based on a large sample size.
When inspecting and interpreting point estimates, a distinction can be made between statistical significance and public health significance. Estimates derived from large samples may prove to be statistically different mathematically, but in public health this difference may not be meaningful. For example, the 2020 DHS in India reported a statistical difference between demand for family planning satisfied (use of modern and traditional methods) in urban and rural areas. The coverage in urban areas was 89.2% (95% confidence interval (CI) 88.8–89.5%), and the coverage in rural areas was 86.9% (95% CI 86.7–87.1%) (2). In terms of public health policies, programmes and practices, however, the difference of 2.3 percentage points likely bears little importance. Nevertheless, this does not mean that sample size and uncertainty measures should be ignored when reporting data. Rather, there is a need to ensure point estimates do not lead to false conclusions and misinformed policy. This includes considering whether the confidence intervals of the point estimates are narrow enough to allow for meaningful conclusions about inequality. In cases where no meaningful conclusions can be drawn, point estimates for indicators in population subgroups should be presented with the necessary caveats to avoid confusion and misinformation. See Chapter 23 for more discussion about reporting the results of health inequality monitoring.
Ecological fallacy
An ecological fallacy is a misinterpretation that occurs because the characteristics of a group are attributed to an individual. Health indicators and dimensions of inequality measured at an aggregate level, such as by district or household, do not necessarily reflect the situation for all individuals within the group. For instance, if richer districts are found to have a higher prevalence of road traffic injuries than poorer districts, it would be erroneous to use these data to draw the conclusion that road traffic injuries are more prevalent among richer individuals – although the data could be used to help inform targeted interventions in richer districts.
An ecological fallacy refers to an erroneous inference that may occur because an association observed between variables on an aggregate level does not necessarily represent or reflect the association that exists at an individual level (3).
Household measures can mask inequalities within households. Care should be taken to avoid drawing conclusions that rely on the extrapolation of characteristics about individuals from household-level data. For example, for the purposes of health inequality monitoring, economic status is commonly measured as household wealth using asset-based indices. Household members, however, may not have equal access to assets and income due to their age, gender or other factors. An interpretation of data disaggregated by household wealth, therefore, would be more accurately expressed as “women from richer households are more likely to access health services” than “rich women are more likely to access health services”.
Resolution issues
Resolution issues arise when interpreting and comparing between sets of disaggregated data based on variable numbers of subgroups. A larger number of groupings will usually capture more heterogeneity (i.e. variation), especially when comparing between subgroups that reflect the extreme ends of an ordered inequality dimension (4, 5). Conversely, a smaller number of subgroups for a dimension of inequality will generally capture less heterogeneity between the subgroups.
To illustrate this issue, consider the case of economic status. This dimension of inequality is commonly categorized using deciles, quintiles or two subgroups:
Deciles (10 subgroups) each contain about 10% of the population.
Quintiles (five subgroups) each contain about 20% of the population.
Two subgroups may be formed from the richest 60% and the poorest 40%, or the richest 10% and the poorest 40% (known as the Palma ratio).
For a given population, observing disaggregated data and making comparisons between the two subgroups at the extremes – the richest and the poorest – leads to different conclusions about inequality, depending on the number of subgroups. Dividing the population into deciles means the comparisons are made between the richest 10% and the poorest 10%. Having 10 subgroups of economic status captures more of the extreme wealth and extreme poverty than having five subgroups (quintiles, where comparisons capture the richest 20% and the poorest 20%) or two subgroups (where comparisons capture the richest 60% and the poorest 40%). Box 18.2 demonstrates the use of economic status deciles versus quintiles to show inequality in births attended by skilled health personnel in Bangladesh.
BOX 18.2. Applying economic status deciles versus quintiles
Figure 18.5 shows data from Bangladesh about the percentage of births attended by skilled health personnel, disaggregated by economic status. Each of the green dots represents one subgroup. On the top, economic status is categorized as deciles. The coverage is 26% in the poorest subgroup and 92% in the richest subgroup. On the bottom, economic status is categorized as quintiles, each consisting of 20% of the population. Here, the range of coverage between the poorest and richest is smaller – 32% coverage in the poorest subgroup and 86% in the richest subgroup. The range of values, therefore, is larger when economic status is categorized as 10 rather than five subgroups.
Figure 18.5. Births attended by skilled health personnel, by economic status, Bangladesh
Horizontal lines show the range between the lowest and highest subgroup estimates.
Source: derived from the WHO Health Inequality Data Repository Reproductive, Maternal, Newborn and Child Health dataset (2), with data sourced from the 2019 Multiple Indicator Cluster Surveys. Data are based on two years prior to the survey.
The numbers of subgroup categories should be harmonized when comparing between situations of inequality.
Due to resolution issues, comparisons between dimensions of inequality that are categorized based on variable numbers of subgroups can be misleading and generally should be avoided. Similarly, attention to resolution issues is required when comparing between different dimensions of inequality for a given indicator, comparing disaggregated estimates over time, and comparing between countries. It would not be valid to make comparisons of within-country inequality if economic status is categorized as quintiles in one country and as deciles in another country. When benchmarking across countries, it may be valid, however, to compare within-country wealth-related inequality if economic status is categorized consistently in all countries.
References
1. Victora CG, Joseph G, Silva ICM, Maia FS, Vaughan JP, Barros FC, et al. The inverse equity hypothesis: analyses of institutional deliveries in 286 national surveys. Am J Public Health. 2018;108(4):464–471. doi:10.2105/AJPH.2017.304277.
2. Health Inequality Data Repository. Geneva: World Health Organization (https://www.who.int/data/inequality-monitor/data, accessed 15 September 2024).
3. Porta M. A dictionary of epidemiology. Oxford: Oxford University Press; 2016.
4. Wong KLM, Restrepo-Méndez MC, Barros AJD, Victora CG. Socioeconomic inequalities in skilled birth attendance and child stunting in selected low and middle income countries: wealth quintiles or deciles? PLoS One. 2017;12(5):e0174823. doi:10.1371/journal.pone.0174823.
5. Flores-Quispe MDP, Restrepo-Méndez MC, Maia MFS, Ferreira LZ, Wehrmeister FC. Trends in socioeconomic inequalities in stunting prevalence in Latin America and the Caribbean countries: differences between quintiles and deciles. Int J Equity Health. 2019;18(1):156. doi:10.1186/s12939-019-1046-7.